Maokui He

Orcid: 0000-0002-3772-0939

According to our database1, Maokui He authored at least 19 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Quality-Aware End-to-End Audio-Visual Neural Speaker Diarization.
CoRR, 2024

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

The USTC System for Cadenza 2024 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge.
CoRR, 2023

AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unsupervised Adaptation with Quality-Aware Masking to Improve Target-Speaker Voice Activity Detection for Speaker Diarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Semi-Supervised Multi-Channel Speaker Diarization With Cross-Channel Attention.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Online Speaker Diarization with Core Samples Selection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Audio-Visual Neural Speaker Diarization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

Online Neural Speaker Diarization with Core Samples.
Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022

2021
USTC-NELSLIP System Description for DIHARD-III Challenge.
CoRR, 2021

Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Scenario-Dependent Speaker Diarization for DIHARD-III Challenge.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Target-Speaker Voice Activity Detection with Improved i-Vector Estimation for Unknown Number of Speaker.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2018
A Novel Training Strategy Using Dynamic Data Generation for Deep Neural Network Based Speech Enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018


  Loading...