Xinyuan Qian

Orcid: 0000-0002-9511-6713

According to our database1, Xinyuan Qian authored at least 66 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Efficient and Privacy-Preserving Outsourcing of Gradient Boosting Decision Tree Inference.
IEEE Trans. Serv. Comput., 2024

Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech.
IEEE Trans. Multim., 2024

Decentralized Multi-Client Functional Encryption for Inner Product With Applications to Federated Learning.
IEEE Trans. Dependable Secur. Comput., 2024

M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing.
Pattern Recognit. Lett., 2024

I2TTS: Image-indicated Immersive Text-to-speech Synthesis with Spatial Perception.
CoRR, 2024

SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model.
CoRR, 2024

pTSE-T: Presentation Target Speaker Extraction using Unaligned Text Cues.
CoRR, 2024

OnePath: Efficient and Privacy-Preserving Decision Tree Inference in the Cloud.
CoRR, 2024

Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection.
CoRR, 2024

Mamba in Speech: Towards an Alternative to Self-Attention.
CoRR, 2024

Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention.
CoRR, 2024

MMAL: Multi-Modal Analytic Learning for Exemplar-Free Audio-Visual Class Incremental Tasks.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ListenFormer: Responsive Listening Head Generation with Non-autoregressive Transformers.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Privacy-Preserving Data Evaluation via Functional Encryption, Revisited.
Proceedings of the IEEE INFOCOM 2024, 2024

SecSCS: A User-Centric Secure Smart Camera System Based on Blockchain.
Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024

An Efficient and Secure Privacy-Preserving Federated Learning Via Lattice-Based Functional Encryption.
Proceedings of the IEEE International Conference on Communications, 2024

GLMB 3D Speaker Tracking with Video-Assisted Multi-Channel Audio Optimization Functions.
Proceedings of the IEEE International Conference on Acoustics, 2024

Visually Guided Binaural Audio Generation with Cross-Modal Consistency.
Proceedings of the IEEE International Conference on Acoustics, 2024

LOCSELECT: Target Speaker Localization with an Auditory Selective Hearing Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2024

Text-Queried Target Sound Event Localization.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
ESA-FedGNN: Efficient secure aggregation for federated graph neural networks.
Peer Peer Netw. Appl., March, 2023

Speech-Oriented Sparse Attention Denoising for Voice User Interface Toward Industry 5.0.
IEEE Trans. Ind. Informatics, 2023

Neural-Free Attention for Monaural Speech Enhancement Toward Voice User Interface for Consumer Electronics.
IEEE Trans. Consumer Electron., 2023

A Time-Frequency Attention Module for Neural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Device Features Based on Linear Transformation With Parallel Training Data for Replay Speech Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Audio-Visual Cross-Attention Network for Robotic Speaker Tracking.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

L$^{3}$ F-TOUCH: A Wireless GelSight With Decoupled Tactile and Three-Axis Force Sensing.
IEEE Robotics Autom. Lett., 2023

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions.
CoRR, 2023

Audio Visual Speaker Localization from EgoCentric Views.
CoRR, 2023

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Miniaturised Camera-based Multi-Modal Tactile Sensor.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Toward Efficient and End-to-End Privacy-Preserving Distributed Gradient Boosting Decision Trees.
Proceedings of the IEEE International Conference on Communications, 2023

Ripple Sparse Self-Attention for Monaural Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Convolution for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Stream Attention Based U-Net for L3DAS23 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Audio-Visual Tracking of Concurrent Speakers.
IEEE Trans. Multim., 2022

Deep Audio-Visual Beamforming for Speaker Localization.
IEEE Signal Process. Lett., 2022

Speaker Extraction With Co-Speech Gestures Cue.
IEEE Signal Process. Lett., 2022

Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception.
CoRR, 2022

Iterative Sound Source Localization for Unknown Number of Sources.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CryptoFE: Practical and Privacy-Preserving Federated Learning via Functional Encryption.
Proceedings of the IEEE Global Communications Conference, 2022

Fast Secure Aggregation for Privacy-Preserving Federated Learning.
Proceedings of the IEEE Global Communications Conference, 2022

2021
Three-Dimensional Speaker Localization: Audio-Refined Visual Scaling Factor Estimation.
IEEE Signal Process. Lett., 2021

Efficient secret key reusing attribute-based encryption from lattices.
IACR Cryptol. ePrint Arch., 2021

SLoClas: A Database for Joint Sound Localization and Classification.
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

GCC-PHAT with Speech-oriented Attention for Robotic Sound Source Localization.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Multi-Target DoA Estimation with an Audio-Visual Fusion Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Efficient Ciphertext Policy Attribute-Based Encryption Scheme from Lattices and Its Implementation.
Proceedings of the 6th IEEE International Conference on Computer and Communication Systems, 2021

2020
Audio-Visual Multi-Speaker Tracking Based on the GLMB Framework.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Multi-Speaker Tracking From an Audio-Visual Sensing Device.
IEEE Trans. Multim., 2019

LOCATA challenge: speaker localization with a planar array.
CoRR, 2019

Accurate Target Annotation in 3D from Multimodal Streams.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
3D Mouth Tracking from a Compact Microphone Array Co-Located with a camera.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
3D audio-visual speaker tracking with an adaptive particle filter.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
An Antivibration Time-Delay Integration CMOS Image Sensor With Online Deblurring Algorithm.
IEEE Trans. Circuits Syst. Video Technol., 2016

A Global-Shutter Centroiding Measurement CMOS Image Sensor With Star Region SNR Improvement for Star Trackers.
IEEE Trans. Circuits Syst. Video Technol., 2016

2015
A time delay integration CMOS image sensor with online deblurring algorithm.
Proceedings of the VLSI Design, Automation and Test, 2015

An 8-stage time delay integration CMOS image sensor with on-chip polarization pixels.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

2014
Design and characterization of radiation-tolerant CMOS 4T Active Pixel Sensors.
Proceedings of the 2014 International Symposium on Integrated Circuits (ISIC), 2014

Profile driven dataflow optimisation of mean shift visual tracking.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

A dual-exposure in-pixel charge subtraction CTIA CMOS image sensor for centroid measurement in star trackers.
Proceedings of the 2014 IEEE Asia Pacific Conference on Circuits and Systems, 2014

2012
A Time-Delay-Integration CMOS image sensor with pipelined charge transfer architecture.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012


  Loading...