2025
Dynamic prompting class distribution optimization for semi-supervised sound event detection.
Frontiers Inf. Technol. Electron. Eng., April, 2025

Development of a centrosome amplification-associated signature in kidney renal clear cell carcinoma based on multiple machine learning models.
Comput. Biol. Chem., 2025

Global Enhanced Frame Prompt Tuning for Sound Event Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
A novel conversational hierarchical attention network for speech emotion recognition in dyadic conversation.
Multim. Tools Appl., June, 2024

On Local Temporal Embedding for Semi-Supervised Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Leveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting.
Eng. Appl. Artif. Intell., 2024

TF-DiffuSE: Time-Frequency Prior-Conditioned Diffusion Model for Speech Enhancement.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

On Learning Frequency-Instance Correlations by Model-Agnostic Training for Synthetic Speech Detection.
Proceedings of the Asian Conference on Machine Learning, 2024

2023
An efficient speech emotion recognition based on a dual-stream CNN-transformer fusion network.
Int. J. Speech Technol., 2023

TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Joint-Former: Jointly Regularized and Locally Down-sampled Conformer for Semi-supervised Sound Event Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Weakly Supervised Sentiment-Specific Region Discovery for VSA.
Comput. J., 2022

Adaptive Hierarchical Pooling for Weakly-supervised Sound Event Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Efficient Monaural Speech Separation with Multiscale Time-Delay Sampling.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Learning to disentangle emotion factors for facial expression recognition in the wild.
Int. J. Intell. Syst., 2021

Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
环境辅助的多任务混合声音事件检测方法 (Environment-assisted Multi-task Learning for Polyphonic Acoustic Event Detection).
计算机科学, 2020

2019
On Learning Disentangled Representation for Acoustic Event Detection.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

2009
A New Decomposition Algorithm of DCT-IV/DST-IV for Realizing Fast IMDCT Computation.
IEEE Signal Process. Lett., 2009