Dynamic prompting class distribution optimization for semi-supervised sound event detection.
Frontiers Inf. Technol. Electron. Eng., April, 2025
Development of a centrosome amplification-associated signature in kidney renal clear cell carcinoma based on multiple machine learning models.
Comput. Biol. Chem., 2025
Global Enhanced Frame Prompt Tuning for Sound Event Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
A novel conversational hierarchical attention network for speech emotion recognition in dyadic conversation.
Multim. Tools Appl., June, 2024
On Local Temporal Embedding for Semi-Supervised Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Leveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting.
Eng. Appl. Artif. Intell., 2024
TF-DiffuSE: Time-Frequency Prior-Conditioned Diffusion Model for Speech Enhancement.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
On Learning Frequency-Instance Correlations by Model-Agnostic Training for Synthetic Speech Detection.
Proceedings of the Asian Conference on Machine Learning, 2024
An efficient speech emotion recognition based on a dual-stream CNN-transformer fusion network.
Int. J. Speech Technol., 2023
TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Joint-Former: Jointly Regularized and Locally Down-sampled Conformer for Semi-supervised Sound Event Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Weakly Supervised Sentiment-Specific Region Discovery for VSA.
Comput. J., 2022
Adaptive Hierarchical Pooling for Weakly-supervised Sound Event Detection.
Proceedings of the 30th ACM International Conference on Multimedia (MM '22), Lisboa, Portugal, October 10, 2022
Efficient Monaural Speech Separation with Multiscale Time-Delay Sampling.
Proceedings of the IEEE International Conference on Acoustics, 2022
Learning to disentangle emotion factors for facial expression recognition in the wild.
Int. J. Intell. Syst., 2021
Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection.
Proceedings of the ACM Multimedia Conference (MM '21), Virtual Event, China, October 20, 2021
Environment-assisted Multi-task Learning for Polyphonic Acoustic Event Detection (in Chinese).
Computer Science (计算机科学), 2020
On Learning Disentangled Representation for Acoustic Event Detection.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
A New Decomposition Algorithm of DCT-IV/DST-IV for Realizing Fast IMDCT Computation.
IEEE Signal Process. Lett., 2009