Hui Zhang
Affiliations:- Inner Mongolia University, College of Computer Science, Inner Mongolia Key Laboratory of Mongolian Information Processing Technology, Hohhot, China
According to our database1,
Hui Zhang
authored at least 49 papers
between 2014 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2024
The image and ground truth dataset of Mongolian movable-type newspapers for text recognition.
Int. J. Document Anal. Recognit., June, 2024
Innovative Directional Encoding in Speech Processing: Leveraging Spherical Harmonics Injection for Multi-Channel Speech Enhancement.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Efficient Multi-Channel Speech Enhancement with Spherical Harmonics Injection for Directional Encoding.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement.
CoRR, 2023
PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch Interactions for Multi-Channel Speech Enhancement.
CoRR, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
MNASR: A Free Speech Corpus For Mongolian Speech Recognition And Accompanied Baselines.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Alleviating the Loss-Metric Mismatch in Supervised Single-Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022
Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Soft-BAC: Soft Bidirectional Alignment Cost for End-to-End Automatic Speech Recognition.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Deep Features Representation of Word Image for Keyword Spotting in Historical Mongolian Document Images.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the Neural Information Processing - 27th International Conference, 2020
2019
CoRR, 2019
Proceedings of the Natural Language Processing and Chinese Computing, 2019
Proceedings of the Knowledge Science, Engineering and Management, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-Noise Ratio Condition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents.
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 27th International Conference on Computational Linguistics, 2018
2017
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.
CoRR, 2017
Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Using Word Mover's Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Representing word image using visual word embeddings and RNN for keyword spotting on historical document images.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Segmentation-Free Printed Traditional Mongolian OCR Using Sequence to Sequence with Attention Model.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017
2016
A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 International Conference on Asian Language Processing, 2016
2015
A pairwise algorithm for pitch estimation and speech separation using deep stacking network.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 International Conference on Asian Language Processing, 2015
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015
2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the 22nd European Signal Processing Conference, 2014