Hengshun Zhou

ORCID: 0000-0001-7878-6531

According to our database¹, Hengshun Zhou authored at least 15 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

Phoneme-Level Contrastive Learning for User-Defined Keyword Spotting with Flexible Enrollment.

CoRR, 2024

Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge.

CoRR, 2023

Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis.

Sabato Marco Siniscalchi

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning.

Proceedings of the IEEE International Conference on Acoustics, 2022

The First Multimodal Information Based Speech Processing (MISP) Challenge: Data, Tasks, Baselines and Results.

Sabato Marco Siniscalchi

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Speech Emotion Recognition Based on Acoustic Segment Model.

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

High-Resolution Attention Network with Acoustic Segment Model for Acoustic Scene Classification.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition.

Proceedings of the International Conference on Multimodal Interaction, 2019

2018

An Investigation of Transfer Learning Mechanism for Acoustic Scene Classification.

Hengshun Zhou

Xue Bai

Jun Du

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
