Mengyue Wu

Weidi Xie

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning.

[BibT_eX]

[DOI]

Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Semantic-Enhanced Supervised Contrastive Learning.

[BibT_eX]

[DOI]

Pingyue Zhang

Proceedings of the IEEE International Conference on Acoustics, 2024

A Detailed Audio-Text Data Simulation Pipeline Using Single-Event Sounds.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Audio Generation Diversity with Visual Information.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Phonetic and Lexical Discovery of Canine Vocalization.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Automatic Reconstruction of Ancient Chinese Pronunciations.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Multi-Label Supervised Contrastive Learning.

[BibT_eX]

[DOI]

Pingyue Zhang

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2023

PsyEval: A Comprehensive Large Language Model Evaluation Benchmark for Mental Health.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Lexical Analysis of Dog Vocalizations via Online Videos.

[BibT_eX]

[DOI]

CoRR, 2023

Does My Dog "Speak" Like Me? The Acoustic Correlation between Pet Dogs and Their Human Owners.

[BibT_eX]

[DOI]

CoRR, 2023

A Large-scale Dataset for Audio-Language Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Improving Audio Caption Fluency with Automatic Error Correction.

[BibT_eX]

[DOI]

CoRR, 2023

LLM-empowered Chatbots for Psychiatrist and Patient Simulation: Application and Evaluation.

[BibT_eX]

[DOI]

CoRR, 2023

BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

ReCLR: Reference-Enhanced Contrastive Learning of Audio Representation for Depression Detection.

[BibT_eX]

[DOI]

Pingyue Zhang

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhance Temporal Relations in Audio Captioning with Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Investigating Pooling Strategies and Loss Functions for Weakly-Supervised Text-to-Audio Grounding via Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Diverse and Vivid Sound Generation from Text Descriptions.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation.

[BibT_eX]

[DOI]

Zhiling Zhang

Kenny Q. Zhu

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Detection of Multiple Mental Disorders from Social Media with Two-Stream Psychiatric Experts.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Transcribing Vocal Communications of Domestic Shiba lnu Dogs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

DialogZoo: Large-Scale Dialog-Oriented Task Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Symptom Identification for Interpretable Detection of Multiple Mental Disorders.

[BibT_eX]

[DOI]

CoRR, 2022

A Comprehensive Survey of Automated Audio Captioning.

[BibT_eX]

[DOI]

CoRR, 2022

Psychiatric Scale Guided Risky Post Screening for Early Detection of Depression.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Can Audio Captions Be Evaluated With Image Caption Metrics?

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Climate and Weather: Inspecting Depression Detection via Emotion Recognition.

[BibT_eX]

[DOI]

Wen Wu

Proceedings of the IEEE International Conference on Acoustics, 2022

Audio-Text Retrieval in Context.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Navigating Audio-Visual Event Detection Across Mismatched Modalities.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Category-Adapted Sound Event Enhancement with Weakly Labeled Data.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Symptom Identification for Interpretable Detection of Multiple Mental Disorders on Social Media.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Towards Duration Robust Weakly Supervised Sound Event Detection.

[BibT_eX]

[DOI]

Heinrich Dinkel