Caglar Gulcehre

CoRR, 2024

SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

The Role of Deep Learning Regularizations on Actors in Offline RL.

[BibT_eX]

[DOI]

Denis Tarasov

Anja Surina

CoRR, 2024

In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning.

[BibT_eX]

[DOI]

Mikhail Terekhov

Kilian Konstantin Haefeli

CoRR, 2024

Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

HiPPO-Prophecy: State-Space Models can Provably Learn Dynamical Systems in Context.

[BibT_eX]

[DOI]

Federico Arangath Joseph

Noah Liniger

CoRR, 2024

Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers.

[BibT_eX]

[DOI]

CoRR, 2024

Promises, Outlooks and Challenges of Diffusion Language Modeling.

[BibT_eX]

[DOI]

Justin Deschenaux

CoRR, 2024

Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering.

[BibT_eX]

[DOI]

CoRR, 2024

No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO.

[BibT_eX]

[DOI]

CoRR, 2024

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models.

[BibT_eX]

[DOI]

George-Cristian Muraru

CoRR, 2024

Universality of Linear Recurrences Followed by Non-linear Projections: Finite-Width Guarantees and Benefits of Complex Eigenvalues.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Simple Hierarchical Planning with Diffusion.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Aligning Large Language Models with Diverse Political Viewpoints.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Self-Recognition in Language Models.

[BibT_eX]

[DOI]

Tim R. Davidson

Viacheslav Surkov

Veniamin Veselovsky

Giuseppe Russo Latona

Robert West

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

Reinforced Self-Training (ReST) for Language Modeling.

[BibT_eX]

[DOI]

CoRR, 2023

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

On the Universality of Linear Recurrences Followed by Nonlinear Projections.

[BibT_eX]

[DOI]

CoRR, 2023

Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Resurrecting Recurrent Neural Networks for Long Sequences.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

2022

An empirical study of implicit regularization in deep offline RL.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2022

On Instrumental Variable Regression for Deep Offline Policy Evaluation.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2022

2021

Regularized Behavior Value Estimation.

[BibT_eX]

[DOI]

Sergio Gómez Colmenarejo

CoRR, 2021

Active Offline Policy Selection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

Offline Learning from Demonstrations and Unlabeled Experience.

[BibT_eX]

[DOI]

CoRR, 2020

Post-Workshop Report on Science meets Engineering in Deep Learning, NeurIPS 2019, Vancouver.

[BibT_eX]

[DOI]

Stefano Sarao Mannelli

CoRR, 2020

Hyperparameter Selection for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

RL Unplugged: Benchmarks for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Sergio Gómez Colmenarejo

CoRR, 2020

Acme: A Research Framework for Distributed Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Sergio Gómez Colmenarejo

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Critic Regularized Regression.

[BibT_eX]

[DOI]

Jost Tobias Springenberg

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Stabilizing Transformers for Reinforcement Learning.

[BibT_eX]

[DOI]

Siddhant M. Jayakumar

Max Jaderberg

Raphaël Lopez Kaufman

Proceedings of the 37th International Conference on Machine Learning, 2020

Improving the Gating Mechanism of Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Gated Orthogonal Recurrent Units: On Learning to Forget.

[BibT_eX]

[DOI]

Neural Comput., 2019

Grandmaster level in StarCraft II using multi-agent reinforcement learning.

[BibT_eX]

[DOI]

Nat., 2019

Improving the Gating Mechanism of Recurrent Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Hyperbolic Attention Networks.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Sample Efficient Adaptive Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

Dynamic Neural Turing Machine with Continuous and Discrete Addressing Schemes.

[BibT_eX]

[DOI]

Neural Comput., 2018

Intrinsic Social Motivation via Causal Influence in Multi-Agent RL.

[BibT_eX]

[DOI]

CoRR, 2018

Relational inductive biases, deep learning, and graph networks.

[BibT_eX]

[DOI]

CoRR, 2018

2017

On integrating a language model into neural machine translation.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder.

[BibT_eX]

[DOI]

CoRR, 2017

Memory Augmented Neural Networks with Wormhole Connections.

[BibT_eX]

[DOI]

Sarath Chandar

Nicolas Boulanger-Lewandowski

CoRR, 2017

Machine Comprehension by Text-to-Text Neural Question Generation.

[BibT_eX]

[DOI]

Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Plan, Attend, Generate: Character-Level Neural Machine Translation with Planning.

[BibT_eX]

[DOI]

Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Plan, Attend, Generate: Planning for Sequence-to-Sequence Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

A robust adaptive stochastic gradient method for deep learning.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Mollifying Networks.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Recurrent Batch Normalization.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

EmoNets: Multimodal deep learning approaches for emotion recognition in video.

[BibT_eX]

[DOI]

Samira Ebrahimi Kahou

Raul Chandias Ferrari

Christopher Joseph Pal

Nicolas Boulanger-Lewandowski

J. Multimodal User Interfaces, 2016

Policy Distillation.

[BibT_eX]

[DOI]

Andrei A. Rusu

Sergio Gomez Colmenarejo

Proceedings of the 4th International Conference on Learning Representations, 2016

Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes.

[BibT_eX]

[DOI]

CoRR, 2016

Theano: A Python framework for fast computation of mathematical expressions.

[BibT_eX]

[DOI]

Xavier Bouthillier

Alexandre de Brébisson

Samira Ebrahimi Kahou

Pierre-Antoine Manzagol

Christopher Joseph Pal

S. Ramana Subramanyam

CoRR, 2016

Noisy Activation Functions.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond.

[BibT_eX]

[DOI]

Ramesh Nallapati

Bowen Zhou

Cícero Nogueira dos Santos

Bing Xiang

Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus.

[BibT_eX]

[DOI]

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Pointing the Unknown Words.

[BibT_eX]

[DOI]

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015

On Using Monolingual Corpora in Neural Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2015

Gated Feedback Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

2014

How to Construct Deep Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Conference on Learning Representations, 2014

ADASECANT: Robust Adaptive Secant Method for Stochastic Gradient.

[BibT_eX]

[DOI]

CoRR, 2014

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling.

[BibT_eX]

[DOI]

CoRR, 2014

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2014

Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013

Knowledge Matters: Importance of Prior Information for Optimization

[BibT_eX]

[DOI]