Soo-Whan Chung

Orcid: 0000-0001-6529-2196

According to our database1, Soo-Whan Chung authored at least 24 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Diffusion-Based Generative Speech Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Imaginary Voice: Face-Styled Diffusion Model for Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

MoLE : Mixture Of Language Experts For Multi-Lingual Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

An Empirical Study on Speech Restoration Guided by Self-Supervised Speech Representation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan.
CoRR, 2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SASV 2022: The First Spoofing-Aware Speaker Verification Challenge.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
End-To-End Lip Synchronisation Based on Pattern Classification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Look Who's Talking: Active Speaker Detection in the Wild.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Perfect Match: Self-Supervised Embeddings for Cross-Modal Retrieval.
IEEE J. Sel. Top. Signal Process., 2020

End-to-End Lip Synchronisation.
CoRR, 2020

Intra-Class Variation Reduction of Speaker Representation in Disentanglement Framework.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

MIRNet: Learning Multiple Identities Representations in Overlapped Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Seeing Voices and Hearing Voices: Learning Discriminative Embeddings Using Cross-Modal Self-Supervision.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

FaceFilter: Audio-Visual Speech Separation Using Still Images.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Orthonormal Embedding-based Deep Clustering for Single-channel Speech Separation.
CoRR, 2019

Gradient-based Active Learning Query Strategy for End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Perfect Match: Improved Cross-modal Embeddings for Audio-visual Synchronisation.
Proceedings of the IEEE International Conference on Acoustics, 2019

2017
A study on search grid points for data-driven 3-D beamsteering.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017


  Loading...