2025

Pushing the Frontiers of Self-Distillation Prototypes Network with Dimension Regularization and Score Normalization.

[DOI]

Yafeng Chen

Chong Deng

CoRR, May, 2025

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction.

[DOI]

CoRR, January, 2025

Exploring Text-Queried Sound Event Detection with Audio Source Separation.

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization.

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Self-Distillation Prototypes Network: Learning Robust Speaker Representations without Supervision.

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts.

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Strong Consistency of Spectral Clustering for the Sparse Degree-Corrected Hypergraph Stochastic Block Model.

[DOI]

Chong Deng

Xin-Jian Xu

Shihui Ying

IEEE Trans. Inf. Theory, 2024

Lightweight Detection Methods for Insulator Self-Explosion Defects.

[DOI]

Sensors, 2024

CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models.

[DOI]

CoRR, 2024

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation.

[DOI]

CoRR, 2024

Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts.

[DOI]

CoRR, 2024

Multimodal Fusion and Coherence Modeling for Video Topic Segmentation.

[DOI]

CoRR, 2024

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs.

[DOI]

CoRR, 2024

Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers.

[DOI]

CoRR, 2024

Loss Masking Is Not Needed In Decoder-Only Transformer For Discrete-Token-Based ASR.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Sliding Mode Control Model of Two-phase Hybrid Stepping Motor Based on Improved Harris Hawks Optimization Algorithm.

[DOI]

Ming Liu

Chong Deng

Proceedings of the International Conference on Advanced Robotics and Mechatronics, 2024

2023

Improving BERT with Hybrid Pooling Network and Drop Mask.

[DOI]

CoRR, 2023

Hyperlink prediction via local random walks and Jensen-Shannon divergence.

[DOI]

Xin-Jian Xu

Chong Deng

Li-Jie Zhang

CoRR, 2023

MUG: A General Meeting Understanding and Generation Benchmark.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG).

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Weighted Sampling for Masked Language Modeling.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Meeting Action Item Detection with Regularized Context Modeling.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling.

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings.

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2018

LCQMC: A Large-scale Chinese Question Matching Corpus.

[DOI]

Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017

Optical flow-based face tracking in <i>The Mummy</i>.

[DOI]

Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2017