Yu-An Chung

Orcid: 0000-0001-9451-7956

According to our database1, Yu-An Chung authored at least 40 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models.
CoRR, 2024

COLLD: Contrastive Layer-to-Layer Distillation for Compressing Multilingual Pre-Trained Speech Encoders.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-to-Speech Translation for a Real-world Unwritten Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Self-Supervised Learning for Speech Processing
PhD thesis, 2022

Autoregressive Predictive Coding: A Comprehensive Study.
IEEE J. Sel. Top. Signal Process., 2022

SSAST: Self-Supervised Audio Spectrogram Transformer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training.
CoRR, 2021

PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation.
CoRR, 2021

SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

AST: Audio Spectrogram Transformer.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Similarity Analysis of Self-Supervised Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2021

w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Semi-Supervised Speech-Language Joint Pre-Training for Spoken Language Understanding.
CoRR, 2020

Clinical Text Summarization with Syntax-Based Negation and Semantic Concept Identification.
CoRR, 2020

Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation.
Proceedings of the International Conference on Technologies and Applications of Artificial Intelligence, 2020

Vector-Quantized Autoregressive Predictive Coding.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Generative Pre-Training for Speech with Autoregressive Predictive Coding.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improved Speech Representations with Multi-Target Autoregressive Predictive Coding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
SummAE: Zero-Shot Abstractive Text Summarization using Length-Agnostic Auto-Encoders.
CoRR, 2019

Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models.
CoRR, 2019

Unsupervised Clinical Language Translation.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

An Unsupervised Autoregressive Model for Speech Representation Learning.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2019

Towards Unsupervised Speech-to-text Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Semi-supervised Training for Improving Data Efficiency in End-to-end Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Supervised and Unsupervised Transfer Learning for Question Answering.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Learning Deep Representations of Medical Images using Siamese CNNs with Application to Content-Based Image Retrieval.
CoRR, 2017

Learning Word Embeddings from Speech.
CoRR, 2017

libact: Pool-based Active Learning in Python.
CoRR, 2017

2016
Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation.
CoRR, 2016

Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Cost-Aware Pre-Training for Multiclass Cost-Sensitive Deep Learning.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
High-level energy consumption model of embedded graphic processors.
Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015


  Loading...