Yun Tang

ORCID: 0000-0002-3122-5881

Affiliations:
  • Facebook AI, USA


According to our database, Yun Tang authored at least 41 papers between 2004 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Exploration on HuBERT with Multiple Resolutions.
CoRR, 2023


Exploration on HuBERT with Multiple Resolutions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhancing Speech-To-Speech Translation with Multiple TTS Targets.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Speech-to-Speech Translation Through Unlabeled Text.
Proceedings of the IEEE International Conference on Acoustics, 2023

Named Entity Detection and Injection for Direct Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

Simple and Effective Unsupervised Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Unified Speech-Text Pre-training for Speech Translation and Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Direct Speech-to-Speech Translation With Discrete Units.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Direct simultaneous speech to speech translation.
CoRR, 2021

Incremental Speech Synthesis For Speech-To-Speech Translation.
CoRR, 2021

Direct speech-to-speech translation with discrete units.
CoRR, 2021

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multilingual Speech Translation from Efficient Finetuning of Pretrained Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Cross-Modal Transfer Learning for Multilingual Speech-to-Text Translation.
CoRR, 2020

Self-Training for End-to-End Speech Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations, 2020

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Zero-Shot Text-to-SQL Learning with Auxiliary Task.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Relation Module for Non-answerable Prediction on Question Answering.
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Towards adversarial learning of speaker-invariant representation for speech emotion recognition.
CoRR, 2019

Multi-Stride Self-Attention for Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Deep Speaker Embedding Learning with Multi-level Pooling for Text-independent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Relation Module for Non-Answerable Predictions on Reading Comprehension.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

End-to-End Structure-Aware Convolutional Networks for Knowledge Base Completion.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2006
A Fast Framework for the Constrained Mean Trajectory Segment Model by Avoidance of Redundant Computation on Segment.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006

All-Path Decoding Algorithm for Segmental Based Speech Recognition.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

One-Pass Coarse-to-Fine Segmental Speech Decoding Algorithm.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2004
A framework for fast segment model by avoidance of redundant computation on segment.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Trigram duration modeling in speech recognition.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

