Changhan Wang

According to our database1, Changhan Wang authored at least 59 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models.
CoRR, 2024

Characterizing and Efficiently Accelerating Multimodal Generation Model Inference.
CoRR, 2024

MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation.
CoRR, 2024

An Empirical Study of Speech Language Models for Prompt-Conditioned Speech Synthesis.
CoRR, 2024

XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Pre-training for Speech Translation: CTC Meets Optimal Transport.
Proceedings of the International Conference on Machine Learning, 2023

Enhancing Speech-To-Speech Translation with Multiple TTS Targets.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Speech-to-Speech Translation Through Unlabeled Text.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Introducing Semantics into Speech Encoders.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Simple and Effective Unsupervised Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-to-Speech Translation for a Real-world Unwritten Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Introducing Semantics into Speech Encoders.
CoRR, 2022

Textless Speech-to-Speech Translation on Real Data.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022


Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Unified Speech-Text Pre-training for Speech Translation and Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Direct Speech-to-Speech Translation With Discrete Units.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Textless Speech-to-Speech Translation on Real Data.
CoRR, 2021

Incremental Speech Synthesis For Speech-To-Speech Translation.
CoRR, 2021

Direct speech-to-speech translation with discrete units.
CoRR, 2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021


Large-Scale Self- and Semi-Supervised Learning for Speech Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

CoVoST 2 and Massively Multilingual Speech Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2021

fairseq S\^2: A Scalable and Integrable Speech Synthesis Toolkit.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multilingual Speech Translation from Efficient Finetuning of Pretrained Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Lightweight Adapter Tuning for Multilingual Speech Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Cross-Modal Transfer Learning for Multilingual Speech-to-Text Translation.
CoRR, 2020

CoVoST 2: A Massively Multilingual Speech-to-Text Translation Corpus.
CoRR, 2020

Facebook AI's WMT20 News Translation Task Submission.
Proceedings of the Fifth Conference on Machine Translation, 2020

CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


Self-Supervised Representations Improve End-to-End Speech Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations, 2020

SIMULEVAL: An Evaluation Toolkit for Simultaneous Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Neural Machine Translation with Byte-Level Subwords.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Levenshtein Transformer.
CoRR, 2019

Levenshtein Transformer.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

VizSeq: a visual analysis toolkit for text generation tasks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Does Object Recognition Work for Everyone?
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Context-Attentive Embeddings for Improved Sentence Representations.
CoRR, 2018

Dynamic Meta-Embeddings for Improved Sentence Representations.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Code-Switched Named Entity Recognition with Embedding Attention.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2015
A unified framework for automatic wound segmentation and analysis with deep convolutional neural networks.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

2013
InSide: Interactive Sketching for Image Database Exploration.
Proceedings of the 2013 International Conference on Computer-Aided Design and Computer Graphics, 2013


  Loading...