Dzmitry Bahdanau

According to our database, Dzmitry Bahdanau authored at least 49 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator.
CoRR, 2024

LLMs can learn self-restraint through iterative self-reflection.
CoRR, 2024

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders.
CoRR, 2024

Evaluating In-Context Learning of Libraries for Code Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
StarCoder: may the source be with you!
Trans. Mach. Learn. Res., 2023

The Stack: 3 TB of permissively licensed source code.
Trans. Mach. Learn. Res., 2023

In-Context Learning for Text Classification with Many Labels.
CoRR, 2023

RepoFusion: Training Code Models to Understand Your Repository.
CoRR, 2023

SantaCoder: don't reach for the stars!
CoRR, 2023

PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Evaluating the Text-to-SQL Capabilities of Large Language Models.
CoRR, 2022

On the Compositional Generalization Gap of In-Context Learning.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Compositional Generalization in Dependency Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Data Augmentation for Intent Classification with Off-the-shelf Large Language Models.
Proceedings of the 4th Workshop on NLP for Conversational AI, 2022

2021
LAGr: Labeling Aligned Graphs for Improving Systematic Generalization in Semantic Parsing.
CoRR, 2021

Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention.
CoRR, 2021

Systematic Generalization with Edge Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DuoRAT: Towards Simpler Text-to-SQL Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Understanding by Understanding Not: Modeling Negation in Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Combating False Negatives in Adversarial Imitation Learning.
Proceedings of the International Joint Conference on Neural Networks, 2021

PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Towards Ecologically Valid Research on Language User Interfaces.
CoRR, 2020

BabyAI 1.1.
CoRR, 2020

Combating False Negatives in Adversarial Imitation Learning (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CLOSURE: Assessing Systematic Generalization of CLEVR Models.
CoRR, 2019

Automated curriculum generation for Policy Gradients from Demonstrations.
CoRR, 2019

CLOSURE: Assessing Systematic Generalization of CLEVR Models.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Systematic Generalization: What Is Required and Can It Be Learned?
Proceedings of the 7th International Conference on Learning Representations, 2019

Learning to Understand Goal Specifications by Modelling Reward.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
BabyAI: First Steps Towards Grounded Language Learning With a Human In the Loop.
CoRR, 2018

Learning to Follow Language Instructions with Adversarial Reward Induction.
CoRR, 2018

Commonsense mining as knowledge base completion? A study on the impact of novelty.
CoRR, 2018

Jointly Learning "What" and "How" from Instructions and Goal-States.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Learning to Compute Word Embeddings On the Fly.
CoRR, 2017

Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control.
Proceedings of the 34th International Conference on Machine Learning, 2017

An Actor-Critic Algorithm for Sequence Prediction.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Theano: A Python framework for fast computation of mathematical expressions.
CoRR, 2016

End-to-end attention-based large vocabulary speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

2015
Blocks and Fuel: Frameworks for deep learning.
CoRR, 2015

Task Loss Estimation for Sequence Prediction.
CoRR, 2015

Neural Machine Translation by Jointly Learning to Align and Translate.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Attention-Based Models for Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014
End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results.
CoRR, 2014

Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation.
Proceedings of SSST@EMNLP 2014, 2014

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches.
Proceedings of SSST@EMNLP 2014, 2014

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

