Rui Liu
Orcid: 0000-0003-4524-7413Affiliations:
- Inner Mongolia University, College of Computer Science, Hohhot, China
According to our database1,
Rui Liu
authored at least 59 papers
between 2016 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection.
Inf. Fusion, May, 2024
J. Intell. Fuzzy Syst., March, 2024
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Text-to-Speech for Low-Resource Agglutinative Language With Morphology-Aware Language Model Pre-Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Contrastive Learning Based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition With Missing Modalities.
IEEE Trans. Affect. Comput., 2024
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech.
CoRR, 2024
CoRR, 2024
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling.
CoRR, 2024
FluentEditor+: Text-based Speech Editing by Modeling Local Hierarchical Acoustic Smoothness and Global Prosody Consistency.
CoRR, 2024
Leveraging Retrieval Augment Approach for Multimodal Emotion Recognition Under Missing Modalities.
CoRR, 2024
CoRR, 2024
Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset.
CoRR, 2024
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
MRAC'24 Track 2: 2nd International Workshop on Multimodal and Responsible Affective Computing.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Pre-training Language Model for Mongolian with Agglutinative Linguistic Knowledge Injection.
Proceedings of the International Joint Conference on Neural Networks, 2024
Multi-Perspective Transfer Learning for Automatic MOS Prediction of Low Resource Language.
Proceedings of the International Conference on Asian Language Processing, 2024
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Realistic Incomplete Data Scenarios.
CoRR, 2023
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency.
CoRR, 2023
CoRR, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
IEEE Internet Things J., 2022
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis.
CoRR, 2022
A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
CoRR, 2022
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
Proceedings of the Neural Information Processing - 29th International Conference, 2022
Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.
Proceedings of the International Conference on Asian Language Processing, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
FastTalker: A neural text-to-speech architecture with shallow and group autoregression.
Neural Networks, 2021
StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis.
CoRR, 2021
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Mongolian emotional speech synthesis based on transfer learning and emotional embedding.
Proceedings of the International Conference on Asian Language Processing, 2021
Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021
2020
IEEE Signal Process. Lett., 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features.
Proceedings of the Neural Information Processing - 26th International Conference, 2019
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019
2018
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 27th International Conference on Computational Linguistics, 2018
2016
Proceedings of the 2016 International Conference on Asian Language Processing, 2016