Tao Yu

Orcid: 0000-0001-9939-2216

Affiliations:
  • University of Hong Kong, Department of Computer Science, Hong Kong
  • University of Washington, Paul G. Allen School of Computer Science & Engineering, Seattle, WA, USA
  • Yale University, Department of Computer Science, New Haven, CT, USA (PhD)


According to our database, Tao Yu authored at least 57 papers between 2016 and 2024.

Bibliography

2024
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval.
CoRR, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
CoRR, 2024

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments.
CoRR, 2024

ARKS: Active Retrieval in Knowledge Soup for Code Generation.
CoRR, 2024

Generative Representational Instruction Tuning.
CoRR, 2024

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement.
CoRR, 2024

Lemur: Harmonizing Natural Language and Code for Language Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

EvoR: Evolving Retrieval for Code Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
Trans. Mach. Learn. Res., 2023

OpenAgents: An Open Platform for Language Agents in the Wild.
CoRR, 2023

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning.
CoRR, 2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations.
CoRR, 2023

Automated Self-Supervised Learning for Recommendation.
Proceedings of the ACM Web Conference 2023, 2023

Coder Reviewer Reranking for Code Generation.
Proceedings of the International Conference on Machine Learning, 2023

Compositional Exemplars for In-context Learning.
Proceedings of the International Conference on Machine Learning, 2023

DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation.
Proceedings of the International Conference on Machine Learning, 2023

Selective Annotation Makes Language Models Better Few-Shot Learners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Binding Language Models in Symbolic Languages.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generating Data for Symbolic Language with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Batch Prompting: Efficient Inference with Large Language Model APIs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Complex Reasoning in Natural Language.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

One Embedder, Any Task: Instruction-Finetuned Text Embeddings.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
When Geometric Deep Learning Meets Pretrained Protein Language Models.
CoRR, 2022

NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries.
CoRR, 2022

FOLIO: Natural Language Reasoning with First-Order Logic.
CoRR, 2022

ZeroGen: Efficient Zero-shot Learning via Dataset Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

In-Context Learning for Few-Shot Dialogue State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Prefix-to-SQL: Text-to-SQL Generation from Incomplete User Questions.
CoRR, 2021

QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

DART: Open-Domain Structured Data Record to Text Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing.
Proceedings of the 9th International Conference on Learning Representations, 2021

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing.
Proceedings of the 9th International Conference on Learning Representations, 2021

Testing Cross-Database Semantic Parsers With Canonical Utterances.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Effective Fine-Tuning Methods for Cross-lingual Adaptation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

SummerTime: Text Summarization Toolkit for Non-experts.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

An Exploratory Study on Long Dialogue Summarization: What Works and What's Next.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Logic-Consistency Text Generation from Semantic Parses.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text-to-SQL.
CoRR, 2020

DART: Open-Domain Structured Data Record to Text Generation.
CoRR, 2020

Semantic Evaluation for Text-to-SQL with Distilled Test Suites.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Online Conversation Disentanglement with Pointer Networks.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

SParC: Cross-Domain Semantic Parsing in Context.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Cross-lingual sentiment transfer with limited resources.
Mach. Transl., 2018

SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task.
CoRR, 2018

TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Leveraging Sparse and Dense Feature Combinations for Sentiment Classification.
CoRR, 2017

The Columbia-GWU System at the 2017 TAC KBP BeSt Evaluation.
Proceedings of the 2017 Text Analysis Conference, 2017

2016
The Columbia-GWU System at the 2016 TAC KBP BeSt Evaluation.
Proceedings of the 2016 Text Analysis Conference, 2016

