Jee-Weon Jung

Orcid: 0000-0003-0505-2988

According to our database1, Jee-Weon Jung authored at least 85 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The VoxCeleb Speaker Recognition Challenge: A Retrospective.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

SpoofCeleb: Speech Deepfake Detection and SASV In The Wild.
CoRR, 2024

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech.
CoRR, 2024

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels.
CoRR, 2024

Text-To-Speech Synthesis In The Wild.
CoRR, 2024

ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale.
CoRR, 2024

Beyond Silence: Bias Analysis through Loss and Asymmetric Approach in Audio Anti-Spoofing.
CoRR, 2024

Disentangled Representation Learning for Environment-agnostic Speaker Recognition.
CoRR, 2024

To what extent can ASV systems naturally defend against spoofing attacks?
CoRR, 2024

Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement.
CoRR, 2024

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.
CoRR, 2024

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024

Improving Design of Input Condition Invariant Speech Enhancement.
CoRR, 2024

AugSumm: towards generalizable speech summarization using synthetic labels from large language model.
CoRR, 2024

a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Improving Design of Input Condition Invariant Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2024

Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024

VoxtLM: Unified Decoder-Only Models for Consolidating Speech Recognition, Synthesis and Speech, Text Continuation Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2024

VoxMM: Rich Transcription of Conversations in the Wild.
Proceedings of the IEEE International Conference on Acoustics, 2024

AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

One Model to Rule Them All ? Towards End-to-End Joint Speaker Diarization and Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Understanding Probe Behaviors Through Variational Bounds of Mutual Information.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
Proceedings of the IEEE International Conference on Acoustics, 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.
CoRR, 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge.
CoRR, 2023

Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Disentangled Representation Learning for Multilingual Speaker Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards Single Integrated Spoofing-aware Speaker Verification Embeddings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Encoder-decoder Multimodal Speaker Change Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Curriculum Learning for Self-supervised Speaker Verification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Absolute Decision Corrupts Absolutely: Conservative Online Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Advancing the Dimensionality Reduction of Speaker Embeddings for Speaker Diarisation: Disentangling Noise and Informing Speech Activity.
Proceedings of the IEEE International Conference on Acoustics, 2023

In Search of Strong Embedding Extractors for Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

High-Resolution Embedding Extractor for Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Disentangled representation learning for multilingual speaker recognition.
CoRR, 2022

Large-scale learning of generalised representations for speaker recognition.
CoRR, 2022

Selective Kernel Attention for Robust Speaker Verification.
CoRR, 2022

SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan.
CoRR, 2022

Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

SASV 2022: The First Spoofing-Aware Speaker Verification Challenge.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Pushing the limits of raw waveform speaker recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Attentive Max Feature Map and Joint Training for Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-Scale Speaker Embedding-Based Graph Attention Networks For Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2022

AASIST: Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Disentangled dimensionality reduction for noise-robust speaker diarisation.
CoRR, 2021

End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection.
CoRR, 2021

Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes.
CoRR, 2021

Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System.
CoRR, 2021

Graph Attention Networks for Anti-Spoofing.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Adapting Speaker Embeddings for Speaker Diarisation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Three-Class Overlapped Speech Detection Using a Convolutional Recurrent Neural Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events.
Proceedings of the IEEE International Conference on Acoustics, 2021

Graph Attention Networks for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Capturing scattered discriminative information using a deep architecture in acoustic scene classification.
CoRR, 2020

Integrated Replay Spoofing-aware Text-independent Speaker Verification.
CoRR, 2020

Improved RawNet with Filter-wise Rescaling for Text-independent Speaker Verification using Raw Waveforms.
CoRR, 2020

A study on the role of subsidiary information in replay attack spoofing detection.
CoRR, 2020

Knowledge Distillation in Acoustic Scene Classification.
IEEE Access, 2020

Selective Deep Speaker Embedding Enhancement for Speaker Verification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Self-Supervised Pre-Training with Acoustic Configurations for Replay Spoofing Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Segment Aggregation for Short Utterances Speaker Verification Using Raw Waveforms.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Acoustic Scene Classification Using Audio Tagging.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improved RawNet with Feature Map Scaling for Text-Independent Speaker Verification Using Raw Waveforms.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Audio Tag Representation Guided Dual Attention Network for Acoustic Scene Classification.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Cosine similarity-based adversarial process.
CoRR, 2019

Replay Attack Detection with Complementary High-Resolution Information Using End-to-End DNN for the ASVspoof 2019 Challenge.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

RawNet: Advanced End-to-End Deep Neural Network Using Raw Waveforms for Text-Independent Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

End-to-End Losses Based on Speaker Basis Vectors and All-Speaker Hard Negative Mining for Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic Scene Classification Using Teacher-Student Learning with Soft-Labels.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Distilling the Knowledge of Specialist Deep Neural Networks in Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Short Utterance Compensation in Speaker Verification via Cosine-Based Teacher-Student Learning of Speaker Embeddings.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Replay attack spoofing detection system using replay noise by multi-task learning.
CoRR, 2018

Replay Spoofing Detection System for Automatic Speaker Verification Using Multi-Task Learning of Noise Classes.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2018

Avoiding Speaker Overfitting in End-to-End DNNs Using Raw Waveform for Text-Independent Speaker Verification.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Complete End-to-End Speaker Verification System Using Deep Neural Networks: From Raw Signals to Verification Result.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

DNN based multi-level feature ensemble for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Joint Training of Expanded End-to-End DNN for Text-Dependent Speaker Verification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

DNN-Based Audio Scene Classification for DCASE2017: Dual Input Features, Balancing Cost, and Stochastic Data Duplication.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017


  Loading...