2024
Towards Large Language Models for Everyone: Instruction Following, Knowledge Retrieval and Multilingualism
PhD thesis, 2024
LMFusion: Adapting Pretrained Language Models for Multimodal Generation.
CoRR, 2024
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Sirius: Contextual Sparsity with Correction for Efficient LLMs.
CoRR, 2024
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts.
CoRR, 2024
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
SIRIUS : Contexual Sparisty with Correction for Efficient LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
In-Context Pretraining: Language Modeling Beyond Document Boundaries.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
RA-DIT: Retrieval-Augmented Dual Instruction Tuning.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Twelfth International Conference on Learning Representations, 2024
FOLIO: Natural Language Reasoning with First-Order Logic.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Instruction-tuned Language Models are Better Knowledge Learners.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model.
CoRR, 2023
LEVER: Learning to Verify Language-to-Code Generation with Execution.
Proceedings of the International Conference on Machine Learning, 2023
Training Trajectories of Language Models Across Scales.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Reimagining Retrieval Augmented Language Models for Answering Queries.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
FeTaQA: Free-form Table Question Answering.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Trans. Assoc. Comput. Linguistics, 2022
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
FOLIO: Natural Language Reasoning with First-Order Logic.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
OPT: Open Pre-trained Transformer Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Lifting the Curse of Multilinguality by Pre-training Modular Transformers.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Few-shot Learning with Multilingual Generative Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Efficient Large Scale Language Modeling with Mixtures of Experts.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022
On Continual Model Refinement in Out-of-Distribution Data Streams.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
Efficient Large Scale Language Modeling with Mixtures of Experts.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021
Few-shot Learning with Multilingual Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021
FeTaQA: Free-form Table Question Answering.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021
Learning to Synthesize Data for Semantic Parsing.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
DART: Open-Domain Structured Data Record to Text Generation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing.
Proceedings of the 9th International Conference on Learning Representations, 2021
Testing Cross-Database Semantic Parsers With Canonical Utterances.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021
Stage-wise Fine-tuning for Graph-to-Text Generation.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021
2020
ColloQL: Robust Cross-Domain Text-to-SQL Over Search Queries.
CoRR, 2020
DART: Open-Domain Structured Data Record to Text Generation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2020
NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash Commands.
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020
Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Photon: A Robust Cross-Domain Text-to-SQL System.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020
Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
SParC: Cross-Domain Semantic Parsing in Context.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
NL2Bash: A Corpus and Semantic Parser for Natural Language Interface to the Linux Operating System.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Multi-Hop Knowledge Graph Reasoning with Reward Shaping.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
2016
Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016