Cong Han

Orcid: 0009-0003-6516-1139

According to our database, Cong Han authored at least 52 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Probability-guaranteed distributed set-membership filtering over sensor networks: A stochastic communication protocol case.
Syst. Control. Lett., 2024

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion.
CoRR, 2024

Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis.
CoRR, 2024

Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation.
CoRR, 2024

Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience.
CoRR, 2024

Enhancing Baidu Multimodal Advertisement with Chinese Text-to-Image Generation via Bilingual Alignment and Caption Synthesis.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

BIVL-Net: Bidirectional Vision-Language Guidance for Visual Question Answering.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Unsupervised Multi-Channel Separation And Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Scaling Vision-Language Foundation Model to 12 Billion Parameters in Baidu Dynamic Image Advertising.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
Profiling temporal learning interests with time-aware transformers and knowledge graph for online course recommendation.
Electron. Commer. Res., December, 2023

Incorporating heterogeneous information in deep learning with informative meta-paths for community recommendations.
J. Inf. Sci., October, 2023

A Faster and Lighter Detection Method for Foreign Objects in Coal Mine Belt Conveyors.
Sensors, July, 2023

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform.
CoRR, 2023

Zero-Shot Semantic Segmentation with Decoupled One-Pass Network.
CoRR, 2023

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Research on Named Entity Recognition of Laboratory Safety Knowledge based on Deep Learning.
Proceedings of the 4th International Conference on Big Data & Artificial Intelligence & Software Engineering, 2023

Phoneme-Level Bert for Enhanced Prosody of Text-To-Speech with Grapheme Predictions.
Proceedings of the IEEE International Conference on Acoustics, 2023

Online Binaural Speech Separation Of Moving Speakers With A Wavesplit Network.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

2022
Extensible Proxy for Efficient NAS.
CoRR, 2022

StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis.
CoRR, 2022

Styletts-VC: One-Shot Voice Conversion by Knowledge Transfer From Style-Based TTS Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Improving Conversational Recommendation Systems' Quality with Context-Aware Item Meta-Information.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Multi-Channel Speech Denoising for Machine Ears.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Group Communication With Context Codec for Lightweight Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

PGM-face: Pose-guided margin loss for cross-pose face recognition.
Neurocomputing, 2021

Identity-and-pose-guided generative adversarial network for face rotation.
Neurocomputing, 2021

Dual-Path RNN for Long Recording Speech Separation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Distortion-Controlled Training for end-to-end Reverberant Speech Separation with Auxiliary Autoencoding Loss.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Empirical Analysis of Generalized Iterative Speech Separation Networks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Binaural Speech Separation of Moving Speakers With Preserved Spatial Cues.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Continuous Speech Separation Using Speaker Inventory for Long Recording.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Ultra-Lightweight Speech Separation Via Group Communication.
Proceedings of the IEEE International Conference on Acoustics, 2021

Rethinking The Separation Layers In Speech Separation Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Dual-Path Modeling for Long Recording Speech Separation in Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording.
CoRR, 2020

Group Communication with Context Codec for Ultra-Lightweight Source Separation.
CoRR, 2020

Incentive Mechanism Design for ROI-constrained Auto-bidding.
CoRR, 2020

Ultra-Lightweight Speech Separation via Group Communication.
CoRR, 2020

A Reliability-and-Energy-Balanced Service Function Chain Mapping and Migration Method for Internet of Things.
IEEE Access, 2020

Real-Time Binaural Speech Separation with Preserved Spatial Cues.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
On the Validation of a Multiple-Network Poroelastic Model Using Arterial Spin Labeling MRI Data.
Frontiers Comput. Neurosci., 2019

A Multi-objective Service Function Chain Mapping Mechanism for IoT networks.
Proceedings of the 15th International Wireless Communications & Mobile Computing Conference, 2019

Co-simulation of Omnidirectional Mobile Platform Based on Fuzzy Control.
Proceedings of the Intelligent Robotics and Applications - 12th International Conference, 2019

Online Deep Attractor Network for Real-time Single-channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Research Based on Log Current Spectrum for Experiment Approaches to Diagnosis of Motor Broken Rotor Bar Fault.
Proceedings of the 12th International Congress on Image and Signal Processing, 2019

FaSNet: Low-Latency Adaptive Beamforming for Multi-Microphone Audio Processing.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2015
The National Environmental and Geological Information System for Remote Sensing Survey and Monitoring.
Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium, 2015

2007
Experimental design for regression analysis when the responses are subject to censoring.
Comput. Methods Programs Biomed., 2007

