Jee-Weon Jung
Orcid: 0000-0003-0505-2988
According to our database1,
Jee-Weon Jung
authored at least 85 papers
between 2017 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech.
CoRR, 2024
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels.
CoRR, 2024
CoRR, 2024
Beyond Silence: Bias Analysis through Loss and Asymmetric Approach in Audio Anti-Spoofing.
CoRR, 2024
CoRR, 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement.
CoRR, 2024
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.
CoRR, 2024
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024
AugSumm: towards generalizable speech summarization using synthetic labels from large language model.
CoRR, 2024
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024
VoxtLM: Unified Decoder-Only Models for Consolidating Speech Recognition, Synthesis and Speech, Text Continuation Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
One Model to Rule Them All ? Towards End-to-End Joint Speaker Diarization and Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.
CoRR, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Advancing the Dimensionality Reduction of Speaker Embeddings for Speaker Diarisation: Disentangling Noise and Informing Speech Activity.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
CoRR, 2022
CoRR, 2022
SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan.
CoRR, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Multi-Scale Speaker Embedding-Based Graph Attention Networks For Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2022
AASIST: Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
CoRR, 2021
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection.
CoRR, 2021
Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes.
CoRR, 2021
Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System.
CoRR, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Three-Class Overlapped Speech Detection Using a Convolutional Recurrent Neural Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Capturing scattered discriminative information using a deep architecture in acoustic scene classification.
CoRR, 2020
Improved RawNet with Filter-wise Rescaling for Text-independent Speaker Verification using Raw Waveforms.
CoRR, 2020
CoRR, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Self-Supervised Pre-Training with Acoustic Configurations for Replay Spoofing Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Improved RawNet with Feature Map Scaling for Text-Independent Speaker Verification Using Raw Waveforms.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Audio Tag Representation Guided Dual Attention Network for Acoustic Scene Classification.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
2019
Replay Attack Detection with Complementary High-Resolution Information Using End-to-End DNN for the ASVspoof 2019 Challenge.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
RawNet: Advanced End-to-End Deep Neural Network Using Raw Waveforms for Text-Independent Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
End-to-End Losses Based on Speaker Basis Vectors and All-Speaker Hard Negative Mining for Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Distilling the Knowledge of Specialist Deep Neural Networks in Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Short Utterance Compensation in Speaker Verification via Cosine-Based Teacher-Student Learning of Speaker Embeddings.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
CoRR, 2018
Replay Spoofing Detection System for Automatic Speaker Verification Using Multi-Task Learning of Noise Classes.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2018
Avoiding Speaker Overfitting in End-to-End DNNs Using Raw Waveform for Text-Independent Speaker Verification.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
A Complete End-to-End Speaker Verification System Using Deep Neural Networks: From Raw Signals to Verification Result.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
DNN-Based Audio Scene Classification for DCASE2017: Dual Input Features, Balancing Cost, and Stochastic Data Duplication.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017