Guanglai Gao
Orcid: 0009-0005-5513-1192
According to our database1,
Guanglai Gao
authored at least 153 papers
between 2006 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
The image and ground truth dataset of Mongolian movable-type newspapers for text recognition.
Int. J. Document Anal. Recognit., June, 2024
Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection.
Inf. Fusion, May, 2024
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Text-to-Speech for Low-Resource Agglutinative Language With Morphology-Aware Language Model Pre-Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Leveraging Retrieval Augment Approach for Multimodal Emotion Recognition Under Missing Modalities.
CoRR, 2024
Mitigating Heterogeneity among Factor Tensors via Lie Group Manifolds for Tensor Decomposition Based Temporal Knowledge Graph Embedding.
CoRR, 2024
L<sup>2</sup>GC: Lorentzian Linear Graph Convolutional Networks For Node Classification.
CoRR, 2024
Proceedings of the Natural Language Processing and Chinese Computing, 2024
Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
Leveraging Contrastive Learning and Self-Training for Multimodal Emotion Recognition with Limited Labeled Samples.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
Pre-training Language Model for Mongolian with Agglutinative Linguistic Knowledge Injection.
Proceedings of the International Joint Conference on Neural Networks, 2024
Improving End-to-End Speech Recognition Through Conditional Cross-Modal Knowledge Distillation with Language Model.
Proceedings of the International Joint Conference on Neural Networks, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024
Cross-Attention-Guided Wavenet for Mel Spectrogram Reconstruction in The ICASSP 2024 Auditory EEG Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024
Multi-Perspective Transfer Learning for Automatic MOS Prediction of Low Resource Language.
Proceedings of the International Conference on Asian Language Processing, 2024
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024
Exploring the Synergy of Dual-path Encoder and Alignment Module for Better Graph-to-Text Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
EpLSA: Synergy of Expert-prefix Mixtures and Task-Oriented Latent Space Adaptation for Diverse Generative Reasoning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Learning Low-dimensional Multi-domain Knowledge Graph Embedding via Dual Archimedean Spirals.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
A Comparative Study on Selecting Acoustic Modeling Units for WFST-based Mongolian Speech Recognition.
ACM Trans. Asian Low Resour. Lang. Inf. Process., October, 2023
IEEE Signal Process. Lett., 2023
Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Realistic Incomplete Data Scenarios.
CoRR, 2023
CoRR, 2023
Proceedings of the PRICAI 2023: Trends in Artificial Intelligence, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023
Traditional Mongolian-to-Cyrillic Mongolian Conversion Method Based on the Combination of Rules and Transformer.
Proceedings of the 9th IEEE International Conference on Cloud Computing and Intelligent Systems, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis.
CoRR, 2022
A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
CoRR, 2022
MNASR: A Free Speech Corpus For Mongolian Speech Recognition And Accompanied Baselines.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022
Proceedings of the Natural Language Processing and Chinese Computing, 2022
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
Proceedings of the Neural Information Processing - 29th International Conference, 2022
Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.
Proceedings of the International Conference on Asian Language Processing, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Soft-BAC: Soft Bidirectional Alignment Cost for End-to-End Automatic Speech Recognition.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021
Panoptic-DLA: Document Layout Analysis of Historical Newspapers Based on Proposal-Free Panoptic Segmentation Model.
Proceedings of the Knowledge Science, Engineering and Management, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Mongolian emotional speech synthesis based on transfer learning and emotional embedding.
Proceedings of the International Conference on Asian Language Processing, 2021
2020
IEEE Signal Process. Lett., 2020
IEEE Access, 2020
Proceedings of the Web Information Systems Engineering - WISE 2020, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Online Handwritten Mongolian Character Recognition using CMA-MOHR and Coordinate Processing.
Proceedings of the International Conference on Asian Language Processing, 2020
Incorporating Inner-word and Out-word Features for Mongolian Morphological Segmentation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Proceedings of the Machine Translation - 16th China Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
Neural Process. Lett., 2019
An End-to-End Preprocessor Based on Adversiarial Learning for Mongolian Historical Document OCR.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019
Proceedings of the Natural Language Processing and Chinese Computing, 2019
Research on Khalkha Dialect Mongolian Speech Recognition Acoustic Model Based on Weight Transfer.
Proceedings of the Natural Language Processing and Chinese Computing, 2019
Proceedings of the Natural Language Processing and Chinese Computing, 2019
Proceedings of the Knowledge Science, Engineering and Management, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the Neural Information Processing - 26th International Conference, 2019
Learning an Adversarial Network for Speech Enhancement Under Extremely Low Signal-to-Noise Ratio Condition.
Proceedings of the Neural Information Processing - 26th International Conference, 2019
Proceedings of the Neural Information Processing - 26th International Conference, 2019
Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features.
Proceedings of the Neural Information Processing - 26th International Conference, 2019
A Holistic Recognition Approach for Woodblock-Print Mongolian Words Based on Convolutional Neural Network.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Improving Text Image Resolution using a Deep Generative Adversarial Network for Optical Character Recognition.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Proceedings of the International Conference on Asian Language Processing, 2019
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019
Proceedings of the Web and Big Data - Third International Joint Conference, 2019
Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018
Proceedings of the Natural Language Processing and Chinese Computing, 2018
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents.
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Convolutional Neural Network for Machine-Printed Traditional Mongolian Font Recognition.
Proceedings of the Neural Information Processing - 25th International Conference, 2018
Proceedings of the Neural Information Processing - 25th International Conference, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Research on Transfer Learning for Khalkha Mongolian Speech Recognition Based on TDNN.
Proceedings of the 2018 International Conference on Asian Language Processing, 2018
Proceedings of the 27th International Conference on Computational Linguistics, 2018
2017
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.
CoRR, 2017
Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Proceedings of the Natural Language Processing and Chinese Computing, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017
Using Word Mover's Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Representing word image using visual word embeddings and RNN for keyword spotting on historical document images.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Segmentation-Free Printed Traditional Mongolian OCR Using Sequence to Sequence with Attention Model.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017
2016
A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Neurocomputing, 2016
Int. J. Document Anal. Recognit., 2016
Intell. Data Anal., 2016
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016
Proceedings of the 28th IEEE International Conference on Tools with Artificial Intelligence, 2016
LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016
A Connection Reduced Network for Similar Handwritten Chinese Character Discrimination.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 International Conference on Asian Language Processing, 2016
Proceedings of the 2016 International Conference on Asian Language Processing, 2016
Proceedings of the COLING 2016, 2016
2015
计算机科学, 2015
一种融合语义距离的最近邻图像标注方法 (Combination of Nearest Neighbor with Semantic Distance for Image Annotation).
计算机科学, 2015
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015
Proceedings of the Neural Information Processing - 22nd International Conference, 2015
Enhancing the Mongolian Historical Document Recognition System with Multiple Knowledge-Based Strategies.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015
A multiple instances approach to improving keyword spotting on historical Mongolian document images.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015
A pairwise algorithm for pitch estimation and speech separation using deep stacking network.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 International Conference on Asian Language Processing, 2015
Proceedings of the 2015 International Conference on Asian Language Processing, 2015
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015
2014
J. Comput., 2014
Recognizing Boundaries in Wireless Sensor Networks Based on Local Connectivity Information.
Int. J. Distributed Sens. Networks, 2014
Int. J. Document Anal. Recognit., 2014
Proceedings of the 22nd European Signal Processing Conference, 2014
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014
2013
Appl. Math. Comput., 2013
Proceedings of the Natural Language Processing and Chinese Computing, 2013
Proceedings of the Intelligent Computing Theories - 9th International Conference, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 International Conference on Asian Language Processing, 2013
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013
2012
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012
Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics, 2012
Proceedings of the Pattern Recognition - Chinese Conference, 2012
2011
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011
Proceedings of the First Asian Conference on Pattern Recognition, 2011
2010
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010
2008
Proceedings of the 2nd International Workshop on Evaluating Information Access, 2008
Proceedings of the International Conference on Computer Science and Software Engineering, 2008
2006
Proceedings of the Computational Intelligence, 2006