Shujie Liu
Affiliations:- Microsoft Research Asia, Beijing, China
According to our database1,
Shujie Liu
authored at least 157 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning.
IEEE Trans. Multim., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation.
CoRR, 2024
CoRR, 2024
CoRR, 2024
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment.
CoRR, 2024
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers.
CoRR, 2024
CoRR, 2024
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
CoRR, 2024
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
2023
Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction.
CoRR, 2023
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation.
CoRR, 2023
CoRR, 2023
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
GRAVO: Learning to Generate Relevant Audio from Visual Features with Noisy Online Videos.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the International Conference on Machine Learning, 2023
Robust Data2VEC: Noise-Robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023
DATA2VEC-SG: Improving Self-Supervised Learning Representations for Speech Generation Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
IEEE J. Sel. Top. Signal Process., 2022
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers.
CoRR, 2022
CoRR, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Improving Self-Supervised Learning for Speech Recognition with Intermediate Layer Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2022
Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2022
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training.
Proceedings of the IEEE International Conference on Acoustics, 2022
Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Developing Real-Time Streaming Transformer Transducer for Speech Recognition on Large-Scale Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2021
Don't Shoot Butterfly with Rifles: Multi-Channel Continuous Speech Separation with Early Exit Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2021
Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
IEEE Trans. Knowl. Data Eng., 2020
A Hierarchical Clustering Approach to Fuzzy Semantic Representation of Rare Words in Neural Machine Translation.
IEEE Trans. Fuzzy Syst., 2020
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer.
CoRR, 2020
MoBoAligner: A Neural Alignment Model for Non-Autoregressive TTS with Monotonic Boundary Search.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Accelerating Transformer Decoding via a Hybrid of Self-attention and Recurrent Neural Network.
CoRR, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
CoRR, 2018
Proceedings of the Natural Language Processing and Chinese Computing, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the Natural Language Processing and Chinese Computing, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Implicit Distortion and Fertility Models for Attention-based Encoder-Decoder NMT Model.
CoRR, 2016
Improving Attention Modeling with Implicit Distortion and Fertility for Machine Translation.
Proceedings of the COLING 2016, 2016
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2015
Comput. Linguistics, 2015
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
2014
Woodpecker: An Automatic Methodology for Machine Translation Diagnosis with Rich Linguistic Knowledge.
J. Inf. Sci. Eng., 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
Collective Corpus Weighting and Phrase Scoring for SMT Using Graph-Based Random Walk.
Proceedings of the Natural Language Processing and Chinese Computing, 2013
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
2012
A novel 3D video transcoding scheme for adaptive 3D video transmission to heterogeneous terminals.
ACM Trans. Multim. Comput. Commun. Appl., 2012
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
2011
IEEE Trans. Broadcast., 2011
Proceedings of the Network and Operating System Support for Digital Audio and Video, 2011
Proceedings of Machine Translation Summit XIII: Papers, 2011
Proceedings of Machine Translation Summit XIII: Papers, 2011
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011
2010
Proceedings of the Visual Communications and Image Processing 2010, 2010
Proceedings of the 18th International Conference on Multimedia 2010, 2010
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010
A novel prioritized spatial multiplexing for MIMO wireless system with application to H.264 SVC video.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Proceedings of the International Conference on Image Processing, 2010
Improved Discriminative ITG Alignment using Hierarchical Phrase Pairs and Semi-supervised Training.
Proceedings of the COLING 2010, 2010
Proceedings of the ACL 2010, 2010
2009
Proceedings of the 2009 Picture Coding Symposium, 2009
2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Diagnostic Evaluation of Machine Translation Systems Using Automatically Constructed Linguistic Check-Points.
Proceedings of the COLING 2008, 2008