AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark.

[BibT_eX]

[DOI]

Gaobin Yang

Jun Du

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unsupervised Adaptation with Quality-Aware Masking to Improve Target-Speaker Voice Activity Detection for Speaker Diarization.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition.

[BibT_eX]

[DOI]

Sabato Marco Siniscalchi

Proceedings of the IEEE International Conference on Acoustics, 2023

Semi-Supervised Multi-Channel Speaker Diarization With Cross-Channel Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Online Speaker Diarization with Core Samples Selection.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Audio-Visual Neural Speaker Diarization.

[BibT_eX]

[DOI]

Mao-Kui He

Jun Du

Chin-Hui Lee

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Online Neural Speaker Diarization with Core Samples.

[BibT_eX]

[DOI]

Yanyan Yue

Jun Du

Maokui He

Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022

2021

USTC-NELSLIP System Description for DIHARD-III Challenge.

[BibT_eX]

[DOI]

CoRR, 2021

Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Scenario-Dependent Speaker Diarization for DIHARD-III Challenge.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Target-Speaker Voice Activity Detection with Improved i-Vector Estimation for Unknown Number of Speaker.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2018

A Novel Training Strategy Using Dynamic Data Generation for Deep Neural Network Based Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Maokui He

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...