Yuan Cao

Affiliations:

Google Research, Mountain View, CA, USA
Johns Hopkins University, Baltimore, MD, USA (former)

According to our database¹, Yuan Cao authored at least 52 papers between 2011 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

RoboVQA: Multimodal Long-Horizon Reasoning for Robotics.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Retrieval Augmented End-to-End Spoken Dialog Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

IG Captioner: Information Gain Captioners Are Strong Zero-Shot Classifiers.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

SLM: Bridge the thin gap between speech and text foundation models.

[BibT_eX]

[DOI]

CoRR, 2023

Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding.

[BibT_eX]

[DOI]

CoRR, 2023

MUX-PLMs: Pre-training Language Models with Data Multiplexing.

[BibT_eX]

[DOI]

Proceedings of the 8th Workshop on Representation Learning for NLP, 2023

Tree of Thoughts: Deliberate Problem Solving with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Speech Aware Dialog System Technology Challenge (DSTC11).

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ReAct: Synergizing Reasoning and Acting in Language Models.

[BibT_eX]

[DOI]

Karthik R. Narasimhan

Yuan Cao

Proceedings of the Eleventh International Conference on Learning Representations, 2023

AnyTOD: A Programmable Task-Oriented Dialog System.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MUX-PLMs: Data Multiplexing for High-throughput Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners.

[BibT_eX]

[DOI]

CoRR, 2022

Building Machine Translation Systems for the Next Thousand Languages.

[BibT_eX]

[DOI]

CoRR, 2022

The Implicit Length Bias of Label Smoothing on Beam Search Decoding.

[BibT_eX]

[DOI]

Bowen Liang

Pidong Wang

Yuan Cao

CoRR, 2022

Description-Driven Task-Oriented Dialog Modeling.

[BibT_eX]

[DOI]

CoRR, 2022

Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Unsupervised Slot Schema Induction for Task-oriented Dialog.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SimVLM: Simple Visual Language Model Pretraining with Weak Supervision.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Knowledge-grounded Dialog State Tracking.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Towards Zero-Label Language Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Improving Longer-range Dialogue State Tracking.

[BibT_eX]

[DOI]

CoRR, 2021

Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Echo State Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Effective Sequence-to-Sequence Dialogue State Tracking.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Rapid Domain Adaptation for Machine Translation with Monolingual Data.

[BibT_eX]

[DOI]

CoRR, 2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior.

[BibT_eX]

[DOI]

CoRR, 2020

Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sample Selection for Large-scale MT Discriminative Training.

[BibT_eX]

[DOI]

Yuan Cao

Sanjeev Khudanpur

Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation.

[BibT_eX]

[DOI]

Sneha Reddy Kudugunta

Naveen Arivazhagan

Yonghui Wu

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges.

[BibT_eX]

[DOI]

CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

CoRR, 2019

Gmail Smart Compose: Real-Time Assisted Writing.

[BibT_eX]

[DOI]

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Hierarchical Generative Modeling for Controllable Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Training Deeper Neural Machine Translation Models with Transparent Attention.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2016

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2016

2015

Joshua 6: A phrase-based and hierarchical statistical machine translation system.

[BibT_eX]

[DOI]

Matt Post

Yuan Cao

Gaurav Kumar

Prague Bull. Math. Linguistics, 2015

2014

Translations of the Callhome Egyptian Arabic corpus for conversational speech translation.

[BibT_eX]

[DOI]

Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Online Learning in Tensor Space.

[BibT_eX]

[DOI]

Yuan Cao

Sanjeev Khudanpur

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013

Joshua 5.0: Sparser, Better, Faster, Server.

[BibT_eX]

[DOI]

Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

2012

Review of Hypothesis Alignment Algorithms for MT System Combination via Confusion Network Decoding.

[BibT_eX]

[DOI]

Antti-Veikko I. Rosti

Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Joshua 4.0: Packing, PRO, and Paraphrases.

[BibT_eX]

[DOI]

Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Deriving conversation-based features from unlabeled speech for discriminative language modeling.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Continuous space discriminative language modeling.

[BibT_eX]

[DOI]

Puyang Xu

Sanjeev Khudanpur

Maider Lehr

Emily Tucker Prud'hommeaux

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Hallucinated n-best lists for discriminative language modeling.

[BibT_eX]

[DOI]

Kenji Sagae

Maider Lehr

Emily Tucker Prud'hommeaux

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Semi-supervised discriminative language modeling for Turkish ASR.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Description of the JHU System Combination Scheme for WMT 2011.

[BibT_eX]

[DOI]

Daguang Xu

Yuan Cao

Damianos G. Karakos

Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Yuan Cao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...