Guanglai Gao

Xinlan Ma

Proceedings of the Natural Language Processing and Chinese Computing, 2022

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.

Proceedings of the Neural Information Processing - 29th International Conference, 2022

Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2022

End-to-End Large-Scale Image Retrieval Network with Convolution and Vision Transformers.

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.

Proceedings of the International Conference on Asian Language Processing, 2022

2021

Expressive TTS Training With Frame and Style Reconstruction Loss.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Guided Training: A Simple Method for Single-channel Speaker Separation.

CoRR, 2021

Soft-BAC: Soft Bidirectional Alignment Cost for End-to-End Automatic Speech Recognition.

Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Panoptic-DLA: Document Layout Analysis of Historical Newspapers Based on Proposal-Free Panoptic Segmentation Model.

Min Lu

Proceedings of the Knowledge Science, Engineering and Management, 2021

Joint Alignment Learning-Attention Based Model for Grapheme-to-Phoneme Conversion.

Proceedings of the IEEE International Conference on Acoustics, 2021

Mongolian emotional speech synthesis based on transfer learning and emotional embedding.

Proceedings of the International Conference on Asian Language Processing, 2021

2020

Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS.

IEEE Signal Process. Lett., 2020

Context-Driven Image Caption With Global Semantic Relations of the Named Entities.

Yun Jing

Zhiwei Xu

IEEE Access, 2020

Topic Analysis by Exploring Headline Information.

Proceedings of the Web Information Systems Engineering - WISE 2020, 2020

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss.

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Sub-Band Knowledge Distillation Framework for Speech Enhancement.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Dataless Text Classification with Pseudo Topic Representation.

Qi Chen

Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

An Edge Information and Mask Shrinking Based Image Inpainting Approach.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

SNR-Based Teachers-Student Technique For Speech Enhancement.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Beamformed Feature for Learning-based Dual-channel Speech Separation.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Teacher-Student Training For Robust Tacotron-Based TTS.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Online Handwritten Mongolian Character Recognition using CMA-MOHR and Coordinate Processing.

Fan Yang

Proceedings of the International Conference on Asian Language Processing, 2020

Incorporating Inner-word and Out-word Features for Mongolian Morphological Segmentation.

Proceedings of the 28th International Conference on Computational Linguistics, 2020

MTNER: A Corpus for Mongolian Tourism Named Entity Recognition.

Proceedings of the Machine Translation - 16th China Conference, 2020

Robust Speech Dereverberation Based on WPE and Deep Learning.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Learning Morpheme Representation for Mongolian Named Entity Recognition.

Neural Process. Lett., 2019

An End-to-End Preprocessor Based on Adversiarial Learning for Mongolian Historical Document OCR.

Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

End-to-End Model for Offline Handwritten Mongolian Word Recognition.

Proceedings of the Natural Language Processing and Chinese Computing, 2019

Research on Khalkha Dialect Mongolian Speech Recognition Acoustic Model Based on Weight Transfer.

Proceedings of the Natural Language Processing and Chinese Computing, 2019

A Context-Free Spelling Correction Method for Classical Mongolian.

Min Lu

Proceedings of the Natural Language Processing and Chinese Computing, 2019

An Automatic Spelling Correction Method for Classical Mongolian.

Proceedings of the Knowledge Science, Engineering and Management, 2019

Neural Morphological Segmentation Model for Mongolian.

Proceedings of the International Joint Conference on Neural Networks, 2019

A Natural Scene Text Extraction Approach Based on Generative Adversarial Learning.

Proceedings of the Neural Information Processing - 26th International Conference, 2019

Learning an Adversarial Network for Speech Enhancement Under Extremely Low Signal-to-Noise Ratio Condition.

Proceedings of the Neural Information Processing - 26th International Conference, 2019

Morphological Knowledge Guided Mongolian Constituent Parsing.

Proceedings of the Neural Information Processing - 26th International Conference, 2019

Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features.

Rui Liu

Proceedings of the Neural Information Processing - 26th International Conference, 2019

A Holistic Recognition Approach for Woodblock-Print Mongolian Words Based on Convolutional Neural Network.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Improving Text Image Resolution using a Deep Generative Adversarial Network for Optical Character Recognition.

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Woodblock-Printing Mongolian Words Recognition by Bi-LSTM with Attention Mechanism.

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Sub-Word Based Mongolian Offline Handwriting Recognition.

Daoerji Fan

Huijuan Wu

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Research on New Event Detection Methods for Mongolian News.

Shijie Wang

Proceedings of the International Conference on Asian Language Processing, 2019

An Attention-Based Approach for Mongolian News Named Entity Recognition.

Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

The IMU speech synthesis entry for Blizzard Challenge 2019.

Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019

Pseudo Topic Analysis for Boosting Pseudo Relevance Feedback.

Proceedings of the Web and Big Data - Third International Joint Conference, 2019

Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech.

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.

Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Mongolian Grapheme to Phoneme Conversion by Using Hybrid Approach.

Proceedings of the Natural Language Processing and Chinese Computing, 2018

Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents.

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Convolutional Neural Network for Machine-Printed Traditional Mongolian Font Recognition.

Proceedings of the Neural Information Processing - 25th International Conference, 2018

Mongolian Word Segmentation Based on Three Character Level Seq2Seq Models.

Proceedings of the Neural Information Processing - 25th International Conference, 2018

Training Supervised Speech Separation System to Improve STOI and PESQ Directly.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Research on Transfer Learning for Khalkha Mongolian Speech Recognition Based on TDNN.

Proceedings of the 2018 International Conference on Asian Language Processing, 2018

A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.

Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017

Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.

CoRR, 2017

Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images.

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Research on Mongolian Speech Recognition Based on FSMN.

Proceedings of the Natural Language Processing and Chinese Computing, 2017

Multi-Target Ensemble Learning for Monaural Speech Separation.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Pseudo-Based Relevance Analysis for Information Retrieval.

Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017

Using Word Mover's Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images.

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images.

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Segmentation-Free Printed Traditional Mongolian OCR Using Sequence to Sequence with Attention Model.

Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Supervised Feature Learning via Within-Class Reconstruction.

Yunxue Shao

Jiantao Zhou

Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Language Model for Mongolian Polyphone Proofreading.

Min Lu

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

2016

A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Nonlinear discriminant analysis based on vanishing component analysis.

Yunxue Shao

Chunheng Wang

Neurocomputing, 2016

A knowledge-based recognition system for historical Mongolian documents.

Int. J. Document Anal. Recognit., 2016

A spatial-temporal trajectory clustering algorithm for eye fixations identification.

Intell. Data Anal., 2016

Cyrillic Mongolian Named Entity Recognition with Rich Features.

Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Mongolian Named Entity Recognition with Bidirectional Recurrent Neural Networks.

Proceedings of the 28th IEEE International Conference on Tools with Artificial Intelligence, 2016

LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents.

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

A Connection Reduced Network for Similar Handwritten Chinese Character Discrimination.

Yunxue Shao

Chunheng Wang

Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

DNN-HMM for Large Vocabulary Mongolian Offline Handwriting Recognition.

Daoerji Fan

Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition, 2016

A novel image classifier based on Gaussian mixture language model.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Convolutional neural network for robust pitch determination.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Comparison on Neural Network based acoustic model in Mongolian speech recognition.

Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Mongolian prosodic phrase prediction using suffix segmentation.

Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Mongolian Named Entity Recognition System with Rich Features.

Proceedings of the COLING 2016, 2016

2015

基于最大边缘相关的伪相关反馈方法 (Pseudo Relevance Feedback Based on Maximal Marginal Relevance).

计算机科学 (Computer Science), 2015

一种融合语义距离的最近邻图像标注方法 (Combination of Nearest Neighbor with Semantic Distance for Image Annotation).

Jianyun Nie

计算机科学 (Computer Science), 2015

Mongolian Inflection Suffix Processing in NLP: A Case Study.

Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

Nearest Neighbor with Multi-feature Metric for Image Annotation.

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Enhancing the Mongolian Historical Document Recognition System with Multiple Knowledge-Based Strategies.

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

A multiple instances approach to improving keyword spotting on historical Mongolian document images.

Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

A pairwise algorithm for pitch estimation and speech separation using deep stacking network.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Document summarization based on semantic representations.

Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Mongolian Named Entity Recognition using suffixes segmentation.

Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Mongolian Speech Recognition Based on Deep Neural Networks.

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

2014

A Semantic Distance Based Nearest Neighbor Method for Image Annotation.

Jian-Yun Nie

J. Comput., 2014

Recognizing Boundaries in Wireless Sensor Networks Based on Local Connectivity Information.

Int. J. Distributed Sens. Networks, 2014

A keyword retrieval system for historical Mongolian document images.

Int. J. Document Anal. Recognit., 2014

Missing feature reconstruction methods for robust speaker identification.

Proceedings of the 22nd European Signal Processing Conference, 2014

Character Segmentation for Classical Mongolian Words in Historical Documents.

Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014

2013

Fractal property of generalized M-set with rational number exponent.

Appl. Math. Comput., 2013

Language Model for Cyrillic Mongolian to Traditional Mongolian Conversion.

Proceedings of the Natural Language Processing and Chinese Computing, 2013

Word Spotting Application in Historical Mongolian Document Images.

Proceedings of the Intelligent Computing Theories - 9th International Conference, 2013

Segmentation-based Mongolian LVCSR approach.

Proceedings of the IEEE International Conference on Acoustics, 2013

Dependency Parsing for Traditional Mongolian.

Xueliang Yan

Proceedings of the 2013 International Conference on Asian Language Processing, 2013

Development of Traditional Mongolian Dependency Treebank.

Xueliang Yan

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013

2012

IMU @ ImageCLEF 2012.

Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

Hidden Markov Model for Term Weighting in Verbose Queries.

Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics, 2012

The Research on Mongolian Spoken Term Detection Based on Confusion Network.

Proceedings of the Pattern Recognition - Chinese Conference, 2012

2011

A Method for Removing Inflectional Suffixes in Word Spotting of Mongolian Kanjur.

Yulai Bao

Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Classical Mongolian Words Recognition in Historical Document.

Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Acoustic model topology optimization using evolutionary methods.

Xirimo Bao

Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010

IMU Experiment in IR4QA at NTCIR-8.

Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

2008

A First Investigation on Mongolian Information Retrieval.

Proceedings of the 2nd International Workshop on Evaluating Information Access, 2008

An Application of Neuro-fuzzy System in Remote Sensing Image Classification.

Wu Wei