Qiushi Zhu
Orcid: 0000-0002-1196-7781
According to our database1,
Qiushi Zhu
authored at least 26 papers
between 2015 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning.
IEEE Trans. Multim., 2024
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.
CoRR, 2024
An Experimental Comparison of Noise-Robust Text-To-Speech Synthesis Systems Based On Self-Supervised Representation.
Proceedings of the IEEE International Conference on Acoustics, 2024
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
CoRR, 2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Robust Data2VEC: Noise-Robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
CoRR, 2022
CoRR, 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition.
CoRR, 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text A Complementary Joint Training Approach Using Unpaired Speech and Text.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
A Noise-Robust Self-Supervised Pre-Training Model Based Speech Representation Learning for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Supervised and Self-Supervised Pretraining Based Covid-19 Detection Using Acoustic Breathing/Cough/Speech Signals.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2017
Approximate evaluation of average downtime under an integrated approach of opportunistic maintenance for multi-component systems.
Comput. Ind. Eng., 2017
2015
A condition-based maintenance policy for multi-component systems with a high maintenance setup cost.
OR Spectr., 2015