Yuan Cao

Affiliations:
  • Google Research, Mountain View, CA, USA
  • Johns Hopkins University, Baltimore, MD, USA (former)


According to our database1, Yuan Cao authored at least 52 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024

Retrieval Augmented End-to-End Spoken Dialog Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

IG Captioner: Information Gain Captioners Are Strong Zero-Shot Classifiers.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
SLM: Bridge the thin gap between speech and text foundation models.
CoRR, 2023

Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding.
CoRR, 2023

MUX-PLMs: Pre-training Language Models with Data Multiplexing.
Proceedings of the 8th Workshop on Representation Learning for NLP, 2023

Tree of Thoughts: Deliberate Problem Solving with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Speech Aware Dialog System Technology Challenge (DSTC11).
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ReAct: Synergizing Reasoning and Acting in Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AnyTOD: A Programmable Task-Oriented Dialog System.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MUX-PLMs: Data Multiplexing for High-throughput Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners.
CoRR, 2022

Building Machine Translation Systems for the Next Thousand Languages.
CoRR, 2022

The Implicit Length Bias of Label Smoothing on Beam Search Decoding.
CoRR, 2022

Description-Driven Task-Oriented Dialog Modeling.
CoRR, 2022

Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning.
CoRR, 2022

Unsupervised Slot Schema Induction for Task-oriented Dialog.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SimVLM: Simple Visual Language Model Pretraining with Weak Supervision.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Knowledge-grounded Dialog State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Towards Zero-Label Language Learning.
CoRR, 2021

Improving Longer-range Dialogue State Tracking.
CoRR, 2021

Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models.
Proceedings of the 9th International Conference on Learning Representations, 2021

Echo State Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Effective Sequence-to-Sequence Dialogue State Tracking.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Rapid Domain Adaptation for Machine Translation with Monolingual Data.
CoRR, 2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior.
CoRR, 2020

Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sample Selection for Large-scale MT Discriminative Training.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

Gmail Smart Compose: Real-Time Assisted Writing.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Hierarchical Generative Modeling for Controllable Speech Synthesis.
Proceedings of the 7th International Conference on Learning Representations, 2019

Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Training Deeper Neural Machine Translation Models with Transparent Attention.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.
CoRR, 2016

2015
Joshua 6: A phrase-based and hierarchical statistical machine translation system.
Prague Bull. Math. Linguistics, 2015

2014
Translations of the Callhome Egyptian Arabic corpus for conversational speech translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Online Learning in Tensor Space.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Joshua 5.0: Sparser, Better, Faster, Server.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

2012
Review of Hypothesis Alignment Algorithms for MT System Combination via Confusion Network Decoding.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Joshua 4.0: Packing, PRO, and Paraphrases.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Deriving conversation-based features from unlabeled speech for discriminative language modeling.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012




2011
Description of the JHU System Combination Scheme for WMT 2011.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011


  Loading...