2025

Dynamic prompting class distribution optimization for semi-supervised sound event detection.

[DOI]

Frontiers Inf. Technol. Electron. Eng., April, 2025

Development of a centrosome amplification-associated signature in kidney renal clear cell carcinoma based on multiple machine learning models.

[DOI]

Comput. Biol. Chem., 2025

Global Enhanced Frame Prompt Tuning for Sound Event Detection.

[DOI]

Shiyu Yu

Lijian Gao

Qirong Mao

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

A novel conversational hierarchical attention network for speech emotion recognition in dyadic conversation.

[DOI]

Multim. Tools Appl., June, 2024

On Local Temporal Embedding for Semi-Supervised Sound Event Detection.

[DOI]

Lijian Gao

Qirong Mao

Ming Dong

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Leveraging Contrastive Language-Image Pre-Training and Bidirectional Cross-attention for Multimodal Keyword Spotting.

[DOI]

Eng. Appl. Artif. Intell., 2024

TF-DiffuSE: Time-Frequency Prior-Conditioned Diffusion Model for Speech Enhancement.

[DOI]

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

On Learning Frequency-Instance Correlations by Model-Agnostic Training for Synthetic Speech Detection.

[DOI]

Proceedings of the Asian Conference on Machine Learning, 2024

2023

An efficient speech emotion recognition based on a dual-stream CNN-transformer fusion network.

[DOI]

Mohammed Tellai

Lijian Gao

Qirong Mao

Int. J. Speech Technol., 2023

TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting.

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Joint-Former: Jointly Regularized and Locally Down-sampled Conformer for Semi-supervised Sound Event Detection.

[DOI]

Lijian Gao

Qirong Mao

Ming Dong

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Weakly Supervised Sentiment-Specific Region Discovery for VSA.

[DOI]

Comput. J., 2022

Adaptive Hierarchical Pooling for Weakly-supervised Sound Event Detection.

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Efficient Monaural Speech Separation with Multiscale Time-Delay Sampling.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Learning to disentangle emotion factors for facial expression recognition in the wild.

[DOI]

Int. J. Intell. Syst., 2021

Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection.

[DOI]

Miguel Fabián Romero Rondón

Ujjwal Sharma

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

环境辅助的多任务混合声音事件检测方法 (Environment-assisted Multi-task Learning for Polyphonic Acoustic Event Detection).

[DOI]

Lijian Gao

Qirong Mao

计算机科学, 2020

2019

On Learning Disentangled Representation for Acoustic Event Detection.

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

2009

A New Decomposition Algorithm of DCT-IV/DST-IV for Realizing Fast IMDCT Computation.

[DOI]

IEEE Signal Process. Lett., 2009