Sining Sun

Orcid: 0000-0002-2642-5096

According to our database, Sining Sun authored at least 32 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.







Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study.
CoRR, 2024

Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Key Frame Mechanism for Efficient Conformer Based End-to-End Speech Recognition.
IEEE Signal Process. Lett., 2023

Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition.
CoRR, 2023

Two Stage Contextual Word Filtering for Context Bias in Unified Streaming and Non-streaming Transducer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

DCCRN-KWS: An Audio Bias Based Model for Noise Robust Small-Footprint Keyword Spotting.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Conversational Speech Recognition by Learning Conversation-Level Characteristics.
Proceedings of the IEEE International Conference on Acoustics, 2022

Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition.
CoRR, 2021

Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Streaming Transformer Based ASR Under a Framework of Self-Supervised Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-head Monotonic Chunkwise Attention For Online Speech Recognition.
CoRR, 2020

Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Adversarial Regularization for End-to-End Robust Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Adversarial Examples for Improving End-to-end Attention-based Small-footprint Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2019

Domain Adversarial Training for Improving Keyword Spotting Performance of ESL Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

An Attention-based Neural Network Approach for Single Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019

Virtual Adversarial Training for DS-CNN Based Small-Footprint Keyword Spotting.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

A Robust Nonlinear Microphone Array Postfilter for Noise Reduction.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Investigating Generative Adversarial Networks Based Speech Dereverberation for Robust Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Training Augmentation with Adversarial Examples for Robust Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Probability Weighted Beamformer for Noise Robust ASR.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Domain Adversarial Training for Accented Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An unsupervised deep domain adaptation approach for robust speech recognition.
Neurocomputing, 2017

Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

The NNI Query-by-Example System for MediaEval 2015.
Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
