Guanglai Gao

Orcid: 0009-0005-5513-1192

According to our database1, Guanglai Gao authored at least 153 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The image and ground truth dataset of Mongolian movable-type newspapers for text recognition.
Int. J. Document Anal. Recognit., June, 2024

Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection.
Inf. Fusion, May, 2024

Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Text-to-Speech for Low-Resource Agglutinative Language With Morphology-Aware Language Model Pre-Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Leveraging Retrieval Augment Approach for Multimodal Emotion Recognition Under Missing Modalities.
CoRR, 2024

MCDubber: Multimodal Context-Aware Expressive Video Dubbing.
CoRR, 2024

Mitigating Heterogeneity among Factor Tensors via Lie Group Manifolds for Tensor Decomposition Based Temporal Knowledge Graph Embedding.
CoRR, 2024

L<sup>2</sup>GC: Lorentzian Linear Graph Convolutional Networks For Node Classification.
CoRR, 2024

GSEA: Global Structure-Aware Graph Neural Networks for Entity Alignment.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

Leveraging Contrastive Learning and Self-Training for Multimodal Emotion Recognition with Limited Labeled Samples.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

Pre-training Language Model for Mongolian with Agglutinative Linguistic Knowledge Injection.
Proceedings of the International Joint Conference on Neural Networks, 2024

Improving End-to-End Speech Recognition Through Conditional Cross-Modal Knowledge Distillation with Language Model.
Proceedings of the International Joint Conference on Neural Networks, 2024

MEMix: Improving HMER with Diverse Formula Structure Augmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

STAR: Syntax- and Topic-Aware Role Dialogue Summarization.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Cross-Attention-Guided Wavenet for Mel Spectrogram Reconstruction in The ICASSP 2024 Auditory EEG Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-Perspective Transfer Learning for Automatic MOS Prediction of Low Resource Language.
Proceedings of the International Conference on Asian Language Processing, 2024

Fully Hyperbolic Rotation for Knowledge Graph Embedding.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Exploring the Synergy of Dual-path Encoder and Alignment Module for Better Graph-to-Text Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

EpLSA: Synergy of Expert-prefix Mixtures and Task-Oriented Latent Space Adaptation for Diverse Generative Reasoning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

L\²GC: Lorentzian Linear Graph Convolutional Networks for Node Classification.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Learning Low-dimensional Multi-domain Knowledge Graph Embedding via Dual Archimedean Spirals.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
A Comparative Study on Selecting Acoustic Modeling Units for WFST-based Mongolian Speech Recognition.
ACM Trans. Asian Low Resour. Lang. Inf. Process., October, 2023

Noise-Separated Adaptive Feature Distillation for Robust Speech Recognition.
IEEE Signal Process. Lett., 2023

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Realistic Incomplete Data Scenarios.
CoRR, 2023

MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.
CoRR, 2023

Few-Shot Table-to-Text Generation with Structural Bias Attention.
Proceedings of the PRICAI 2023: Trends in Artificial Intelligence, 2023

Explicit Intensity Control for Accented Text-to-speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities.
Proceedings of the IEEE International Conference on Acoustics, 2023

TableSF: A Structural Bias Framework for Table-To-Text Generation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

Traditional Mongolian-to-Cyrillic Mongolian Conversion Method Based on the Combination of Rules and Transformer.
Proceedings of the 9th IEEE International Conference on Cloud Computing and Intelligent Systems, 2023

How Well Apply Simple MLP to Incomplete Utterance Rewriting?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

TeAST: Temporal Knowledge Graph Embedding via Archimedean Spiral Timeline.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Decoding Knowledge Transfer for Neural Text-to-Speech Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis.
CoRR, 2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
CoRR, 2022

Controllable Accented Text-to-Speech Synthesis.
CoRR, 2022

MNASR: A Free Speech Corpus For Mongolian Speech Recognition And Accompanied Baselines.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

QuatSE: Spherical Linear Interpolation of Quaternion for Knowledge Graph Embeddings.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.
Proceedings of the International Conference on Asian Language Processing, 2022

2021
Expressive TTS Training With Frame and Style Reconstruction Loss.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Guided Training: A Simple Method for Single-channel Speaker Separation.
CoRR, 2021

Soft-BAC: Soft Bidirectional Alignment Cost for End-to-End Automatic Speech Recognition.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Panoptic-DLA: Document Layout Analysis of Historical Newspapers Based on Proposal-Free Panoptic Segmentation Model.
Proceedings of the Knowledge Science, Engineering and Management, 2021

Joint Alignment Learning-Attention Based Model for Grapheme-to-Phoneme Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2021

Mongolian emotional speech synthesis based on transfer learning and emotional embedding.
Proceedings of the International Conference on Asian Language Processing, 2021

2020
Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS.
IEEE Signal Process. Lett., 2020

Context-Driven Image Caption With Global Semantic Relations of the Named Entities.
IEEE Access, 2020

Topic Analysis by Exploring Headline Information.
Proceedings of the Web Information Systems Engineering - WISE 2020, 2020

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Sub-Band Knowledge Distillation Framework for Speech Enhancement.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Dataless Text Classification with Pseudo Topic Representation.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

An Edge Information and Mask Shrinking Based Image Inpainting Approach.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Snr-Based Teachers-Student Technique For Speech Enhancement.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Beamformed Feature for Learning-based Dual-channel Speech Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Teacher-Student Training For Robust Tacotron-Based TTS.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Online Handwritten Mongolian Character Recognition using CMA-MOHR and Coordinate Processing.
Proceedings of the International Conference on Asian Language Processing, 2020

Incorporating Inner-word and Out-word Features for Mongolian Morphological Segmentation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

MTNER: A Corpus for Mongolian Tourism Named Entity Recognition.
Proceedings of the Machine Translation - 16th China Conference, 2020

Robust Speech Dereverberation Based on WPE and Deep Learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Learning Morpheme Representation for Mongolian Named Entity Recognition.
Neural Process. Lett., 2019

An End-to-End Preprocessor Based on Adversiarial Learning for Mongolian Historical Document OCR.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

End-to-End Model for Offline Handwritten Mongolian Word Recognition.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Research on Khalkha Dialect Mongolian Speech Recognition Acoustic Model Based on Weight Transfer.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

A Context-Free Spelling Correction Method for Classical Mongolian.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

An Automatic Spelling Correction Method for Classical Mongolian.
Proceedings of the Knowledge Science, Engineering and Management, 2019

Neural Morphological Segmentation Model for Mongolian.
Proceedings of the International Joint Conference on Neural Networks, 2019

A Natural Scene Text Extraction Approach Based on Generative Adversarial Learning.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Learning an Adversarial Network for Speech Enhancement Under Extremely Low Signal-to-Noise Ratio Condition.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Morphological Knowledge Guided Mongolian Constituent Parsing.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

A Holistic Recognition Approach for Woodblock-Print Mongolian Words Based on Convolutional Neural Network.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Improving Text Image Resolution using a Deep Generative Adversarial Network for Optical Character Recognition.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Woodblock-Printing Mongolian Words Recognition by Bi-LSTM with Attention Mechanism.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Sub-Word Based Mongolian Offline Handwriting Recognition.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Research on New Event Detection Methods for Mongolian News.
Proceedings of the International Conference on Asian Language Processing, 2019

An Attention-Based Approach for Mongolian News Named Entity Recognition.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

The IMU speech synthesis entry for Blizzard Challenge 2019.
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019

Pseudo Topic Analysis for Boosting Pseudo Relevance Feedback.
Proceedings of the Web and Big Data - Third International Joint Conference, 2019

Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Mongolian Grapheme to Phoneme Conversion by Using Hybrid Approach.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Convolutional Neural Network for Machine-Printed Traditional Mongolian Font Recognition.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Training Supervised Speech Separation System to Improve STOI and PESQ Directly.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Research on Transfer Learning for Khalkha Mongolian Speech Recognition Based on TDNN.
Proceedings of the 2018 International Conference on Asian Language Processing, 2018

A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.
CoRR, 2017

Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Research on Mongolian Speech Recognition Based on FSMN.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Multi-Target Ensemble Learning for Monaural Speech Separation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Pseudo-Based Relevance Analysis for Information Retrieval.
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017

Using Word Mover's Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Segmentation-Free Printed Traditional Mongolian OCR Using Sequence to Sequence with Attention Model.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Supervised Feature Learning via Within-Class Reconstruction.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Language Model for Mongolian Polyphone Proofreading.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

2016
A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Nonlinear discriminant analysis based on vanishing component analysis.
Neurocomputing, 2016

A knowledge-based recognition system for historical Mongolian documents.
Int. J. Document Anal. Recognit., 2016

A spatial-temporal trajectory clustering algorithm for eye fixations identification.
Intell. Data Anal., 2016

Cyrillic Mongolian Named Entity Recognition with Rich Features.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Mongolian Named Entity Recognition with Bidirectional Recurrent Neural Networks.
Proceedings of the 28th IEEE International Conference on Tools with Artificial Intelligence, 2016

LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

A Connection Reduced Network for Similar Handwritten Chinese Character Discrimination.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

DNN-HMM for Large Vocabulary Mongolian Offline Handwriting Recognition.
Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

A novel image classifier based on Gaussian mixture language model.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Convolutional neural network for robust pitch determination.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Comparison on Neural Network based acoustic model in Mongolian speech recognition.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Mongolian prosodic phrase prediction using suffix segmentation.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Mongolian Named Entity Recognition System with Rich Features.
Proceedings of the COLING 2016, 2016

2015
基于最大边缘相关的伪相关反馈方法 (Pseudo Relevance Feedback Based on Maximal Marginal Relevance).
计算机科学, 2015

一种融合语义距离的最近邻图像标注方法 (Combination of Nearest Neighbor with Semantic Distance for Image Annotation).
计算机科学, 2015

Mongolian Inflection Suffix Processing in NLP: A Case Study.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

Nearest Neighbor with Multi-feature Metric for Image Annotation.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Enhancing the Mongolian Historical Document Recognition System with Multiple Knowledge-Based Strategies.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

A multiple instances approach to improving keyword spotting on historical Mongolian document images.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

A pairwise algorithm for pitch estimation and speech separation using deep stacking network.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Document summarization based on semantic representations.
Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Mongolian Named Entity Recognition using suffixes segmentation.
Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Mongolian Speech Recognition Based on Deep Neural Networks.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

2014
A Semantic Distance Based Nearest Neighbor Method for Image Annotation.
J. Comput., 2014

Recognizing Boundaries in Wireless Sensor Networks Based on Local Connectivity Information.
Int. J. Distributed Sens. Networks, 2014

A keyword retrieval system for historical Mongolian document images.
Int. J. Document Anal. Recognit., 2014

Missing feature reconstruction methods for robust speaker identification.
Proceedings of the 22nd European Signal Processing Conference, 2014

Character Segmentation for Classical Mongolian Words in Historical Documents.
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014

2013
Fractal property of generalized M-set with rational number exponent.
Appl. Math. Comput., 2013

Language Model for Cyrillic Mongolian to Traditional Mongolian Conversion.
Proceedings of the Natural Language Processing and Chinese Computing, 2013

Word Spotting Application in Historical Mongolian Document Images.
Proceedings of the Intelligent Computing Theories - 9th International Conference, 2013

Segmentation-based Mongolian LVCSR approach.
Proceedings of the IEEE International Conference on Acoustics, 2013

Dependency Parsing for Traditional Mongolian.
Proceedings of the 2013 International Conference on Asian Language Processing, 2013

Development of Traditional Mongolian Dependency Treebank.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013

2012
IMU @ ImageCLEF 2012.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

Hidden Markov Model for Term Weighting in Verbose Queries.
Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics, 2012

The Research on Mongolian Spoken Term Detection Based on Confusion Network.
Proceedings of the Pattern Recognition - Chinese Conference, 2012

2011
A Method for Removing Inflectional Suffixes in Word Spotting of Mongolian Kanjur.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Classical Mongolian Words Recognition in Historical Document.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Acoustic model topology optimization using evolutionary methods.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010
IMU Experiment in IR4QA at NTCIR-8.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

2008
A First Investigation on Mongolian Information Retrieval.
Proceedings of the 2nd International Workshop on Evaluating Information Access, 2008

An Application of Neuro-fuzzy System in Remote Sensing Image Classification.
Proceedings of the International Conference on Computer Science and Software Engineering, 2008

2006
A Mongolian Speech Recognition System Based on HMM.
Proceedings of the Computational Intelligence, 2006


  Loading...