Rui Liu

Orcid: 0000-0003-4524-7413

Affiliations:

Inner Mongolia University, College of Computer Science, Hohhot, China

According to our database¹, Rui Liu authored at least 59 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

2016

2017

2018

2019

2020

2021

2022

2023

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection.

[BibT_eX]

[DOI]

Rui Liu

Jinhua Zhang

Guanglai Gao

Inf. Fusion, May, 2024

Modified suppressed relative entropy fuzzy c-means clustering algorithm.

[BibT_eX]

[DOI]

J. Intell. Fuzzy Syst., March, 2024

Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Text-to-Speech for Low-Resource Agglutinative Language With Morphology-Aware Language Model Pre-Training.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Contrastive Learning Based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition With Missing Modalities.

[BibT_eX]

[DOI]

IEEE Trans. Affect. Comput., 2024

Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech.

[BibT_eX]

[DOI]

CoRR, 2024

Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech.

[BibT_eX]

[DOI]

Shuwei He

Rui Liu

Haizhou Li

CoRR, 2024

Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

FluentEditor+: Text-based Speech Editing by Modeling Local Hierarchical Acoustic Smoothness and Global Prosody Consistency.

[BibT_eX]

[DOI]

CoRR, 2024

Leveraging Retrieval Augment Approach for Multimodal Emotion Recognition Under Missing Modalities.

[BibT_eX]

[DOI]

CoRR, 2024

Open-vocabulary Multimodal Emotion Recognition: Dataset, Metric, and Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

MCDubber: Multimodal Context-Aware Expressive Video Dubbing.

[BibT_eX]

[DOI]

CoRR, 2024

Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset.

[BibT_eX]

[DOI]

CoRR, 2024

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

MRAC'24 Track 2: 2nd International Workshop on Multimodal and Responsible Affective Computing.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

Generative Expressive Conversational Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Pre-training Language Model for Mongolian with Agglutinative Linguistic Knowledge Injection.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2024

Multi-Perspective Transfer Learning for Automatic MOS Prediction of Low Resource Language.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Asian Language Processing, 2024

Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Distributed Sensor Selection for Speech Enhancement With Acoustic Sensor Networks.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Realistic Incomplete Data Scenarios.

[BibT_eX]

[DOI]

CoRR, 2023

FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency.

[BibT_eX]

[DOI]

CoRR, 2023

Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech.

[BibT_eX]

[DOI]

Rui Liu

Bin Liu

Haizhou Li

CoRR, 2023

MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.

[BibT_eX]

[DOI]

CoRR, 2023

Explicit Intensity Control for Accented Text-to-speech.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Decoding Knowledge Transfer for Neural Text-to-Speech Training.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Emotional voice conversion: Theory, databases and ESD.

[BibT_eX]

[DOI]

Speech Commun., 2022

Multistage Deep Transfer Learning for EmIoT-Enabled Human-Computer Interaction.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2022

FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, 2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.

[BibT_eX]

[DOI]

CoRR, 2022

Controllable Accented Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, 2022

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 29th International Conference, 2022

Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Asian Language Processing, 2022

2021

Expressive TTS Training With Frame and Style Reconstruction Loss.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

FastTalker: A neural text-to-speech architecture with shallow and group autoregression.

[BibT_eX]

[DOI]

Neural Networks, 2021

StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis.

[BibT_eX]

[DOI]

Rui Liu

Berrak Sisman

Haizhou Li

CoRR, 2021

Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability.

[BibT_eX]

[DOI]

Rui Liu

Berrak Sisman

Haizhou Li

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Graphspeech: Syntax-Aware Graph Attention Network for Neural Speech Synthesis.

[BibT_eX]

[DOI]

Rui Liu

Berrak Sisman

Haizhou Li

Proceedings of the IEEE International Conference on Acoustics, 2021

Mongolian emotional speech synthesis based on transfer learning and emotional embedding.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Asian Language Processing, 2021

SUTD-NUS System for Blizzard Challenge 2021.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021

2020

Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2020

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Teacher-Student Training For Robust Tacotron-Based TTS.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features.

[BibT_eX]

[DOI]

Rui Liu

Feilong Bao

Guanglai Gao

Proceedings of the Neural Information Processing - 26th International Conference, 2019

The IMU speech synthesis entry for Blizzard Challenge 2019.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019

2018

Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

End-to-End Mongolian Text-to-Speech System.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

2016

Mongolian prosodic phrase prediction using suffix segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Rui Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...