Cunhang Fan

Orcid: 0000-0001-6318-8803

According to our database1, Cunhang Fan authored at least 68 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multimodal Cross-Lingual Summarization for Videos: A Revisit in Knowledge Distillation Induced Triple-Stage Training Method.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis.
Knowl. Based Syst., January, 2024

TaBE: Decoupling spatial and spectral processing with Taylor's unfolding method in the beamspace domain for multi-channel speech enhancement.
Inf. Fusion, January, 2024

ICaps-ResLSTM: Improved capsule network and residual LSTM for EEG emotion recognition.
Biomed. Signal Process. Control., January, 2024

Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Multi-Level Information Aggregation Based Graph Attention Networks Towards Fake Speech Detection.
IEEE Signal Process. Lett., 2024

Dynamic Ensemble Teacher-Student Distillation Framework for Light-Weight Fake Audio Detection.
IEEE Signal Process. Lett., 2024

SceneFake: An initial dataset and benchmarks for scene fake audio detection.
Pattern Recognit., 2024

DGSD: Dynamical graph self-distillation for EEG-based auditory spatial attention detection.
Neural Networks, 2024

Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection.
Neural Networks, 2024

DuaPIN: Auxiliary task enhanced dual path interaction network for civil court view generation.
Knowl. Based Syst., 2024

Mitigating Gender Bias in Code Large Language Models via Model Editing.
CoRR, 2024

LiSenNet: Lightweight Sub-band and Dual-Path Modeling for Real-Time Speech Enhancement.
CoRR, 2024

Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging.
CoRR, 2024

Frequency-mix Knowledge Distillation for Fake Speech Detection.
CoRR, 2024

RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection.
CoRR, 2024

Light-weight residual convolution-based capsule network for EEG emotion recognition.
Adv. Eng. Informatics, 2024

Bilateral Masking with prompt for Knowledge Graph Completion.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

MSFNet: Multi-Scale Fusion Network for Brain-Controlled Speaker Extraction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

A Debiased Domain Adaptation Framework with Minimum Class Confusion for Motor Imagery Decoding.
Proceedings of the International Joint Conference on Neural Networks, 2024

DBPNet: Dual-Branch Parallel Network with Temporal-Frequency Fusion for Auditory Attention Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Dual-View Multimodal Interaction in Multimodal Sentiment Analysis.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Progressive Distillation Based on Masked Generation Feature Method for Knowledge Graph Completion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Subband fusion of complex spectrogram for fake speech detection.
Speech Commun., November, 2023

CompNet: Complementary network for single-channel speech enhancement.
Neural Networks, November, 2023

Transfer knowledge for punctuation prediction via adversarial training.
Speech Commun., April, 2023

Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection.
CoRR, 2023

Multi-perspective Information Fusion Res2Net with RandomSpecmix for Fake Speech Detection.
CoRR, 2023

Exploring the Power of Cross-Contextual Large Language Model in Mimic Emotion Prediction.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Multimodal Cross-Lingual Features and Weight Fusion for Cross-Cultural Humor Detection.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Learning From Yourself: A Self-Distillation Method For Fake Speech Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-perspective Information Fusion Res2Net with Random Specmix for Fake Speech Detection.
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

Mixed Emotion Recognition Based on EEG Signals.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Dynamic Domain Adaptation for Class-Aware Cross-Subject and Cross-Session EEG Emotion Recognition.
IEEE J. Biomed. Health Informatics, 2022

SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection.
CoRR, 2022

ADD 2022: the First Audio Deep Synthesis Detection Challenge.
CoRR, 2022

AHRNN: Attention-Based Hybrid Robust Neural Network for emotion recognition.
Cogn. Comput. Syst., 2022

Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features.
Proceedings of the DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, 2022

Fully Automated End-to-End Fake Audio Detection.
Proceedings of the DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, 2022

DDAM '22: 1st International Workshop on Deepfake Detection for Audio Multimedia.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

ADD 2022: the first Audio Deep Synthesis Detection Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Two Heads are Better Than One: A Two-Stage Complex Spectral Mapping Approach for Monaural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

MS-MDA: Multisource Marginal Distribution Adaptation for Cross-subject and Cross-session EEG Emotion Recognition.
CoRR, 2021

Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Spectrograms Fusion-based End-to-end Robust Automatic Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
A Public Chinese Dataset for Language Model Adaptation.
J. Signal Process. Syst., 2020

End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Deep imitator: Handwriting calligraphy imitation via deep attention networks.
Pattern Recognit., 2020

Dynamic Attention Based Generative Adversarial Network with Phase Post-Processing for Speech Enhancement.
CoRR, 2020

Simultaneous Denoising and Dereverberation Using Deep Embedding Features.
CoRR, 2020

Adversarial Transfer Learning for Punctuation Restoration.
CoRR, 2020

Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method.
CoRR, 2020

Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features.
CoRR, 2020

Focal Loss for Punctuation Prediction.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Recursive Network with Dynamic Attention for Monaural Speech Enhancement.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Gated Recurrent Fusion of Spatial and Spectral Features for Multi-Channel Speech Separation with Deep Embedding Representations.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

AMINN: Attention-Based Multi-Information Neural Network for Emotion Recognition.
Proceedings of the ICCPR 2020: 9th International Conference on Computing and Pattern Recognition, Xiamen, China, October 30, 2020

2019
Automatic Depression Level Detection via ℓ<sub>p</sub>-Norm Pooling.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

CLMAD: A Chinese Language Model Adaptation Dataset.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018


  Loading...