Hui Zhang

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Features Representation of Word Image for Keyword Spotting in Historical Mongolian Document Images.

[BibT_eX]

[DOI]

Jing Zhang

Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Multi-Task Learning Based Traditional Mongolian Words Recognition.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 27th International Conference, 2020

2019

A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting.

[BibT_eX]

[DOI]

CoRR, 2019

End-to-End Model for Offline Handwritten Mongolian Word Recognition.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing and Chinese Computing, 2019

An Automatic Spelling Correction Method for Classical Mongolian.

[BibT_eX]

[DOI]

Proceedings of the Knowledge Science, Engineering and Management, 2019

Investigation of Cost Function for Supervised Monaural Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-Noise Ratio Condition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Woodblock-Printing Mongolian Words Recognition by Bi-LSTM with Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Supervised Speech Enhancement with Real Spectrum Approximation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting.

[BibT_eX]

[DOI]

Yanke Kang

Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement.

[BibT_eX]

[DOI]

Tianjiao Xu

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

End-to-End Mongolian Text-to-Speech System.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation.

[BibT_eX]

[DOI]

Yun Liu

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Training Supervised Speech Separation System to Improve STOI and PESQ Directly.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017

Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.

[BibT_eX]

[DOI]

CoRR, 2017

Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Multi-Target Ensemble Learning for Monaural Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Using Word Mover's Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 24th International Conference, 2017

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Segmentation-Free Printed Traditional Mongolian OCR Using Sequence to Sequence with Attention Model.

[BibT_eX]

[DOI]

Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

2016

A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Convolutional neural network for robust pitch determination.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Comparison on Neural Network based acoustic model in Mongolian speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Asian Language Processing, 2016

2015

A pairwise algorithm for pitch estimation and speech separation using deep stacking network.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Document summarization based on semantic representations.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Mongolian Speech Recognition Based on Deep Neural Networks.

[BibT_eX]

[DOI]

Feilong Bao

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

2014

Deep stacking networks with time series for speech separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Missing feature reconstruction methods for robust speaker identification.

[BibT_eX]

[DOI]