Florian Strub

According to our database1, Florian Strub authored at least 41 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Averaging log-likelihoods in direct alignment.
CoRR, 2024

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
CoRR, 2024

Language Evolution with Deep Learning.
CoRR, 2024

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Language Model Alignment with Elastic Reset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Edge of Orthogonality: A Simple View of What Makes BYOL Tick.
Proceedings of the International Conference on Machine Learning, 2023

SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Over-communicate no more: Situated RL agents learn concise communication protocols.
CoRR, 2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning.
CoRR, 2022

Developing, evaluating and scaling learning agents in multi-agent environments.
AI Commun., 2022

Emergent Communication: Generalization and Overfitting in Lewis Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Natural Language Generation with Truncated Reinforcement Learning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

On the role of population heterogeneity in emergent communication.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Emergent Communication at Scale.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Learning Natural Language Generation from Scratch.
CoRR, 2021

Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Broaden Your Views for Self-Supervised Video Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Multimodal and Interactive Models for Visually Grounded Language Learning. (Développement de modèles multimodaux intéractifs pour l'apprentissage du language dans des environnements visuels).
PhD thesis, 2020

BYOL works even without batch statistics.
CoRR, 2020

The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction.
CoRR, 2020

HIGhER: Improving instruction following with Hindsight Generation for Experience Replay.
Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence, 2020

Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Machine of Few Words: Interactive Speaker Recognition with Reinforcement Learning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Countering Language Drift with Seeded Iterated Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Supervised Seeded Iterated Learning for Interactive Language Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Accurate reconstruction of EBSD datasets by a multimodal data approach using an evolutionary algorithm.
CoRR, 2019

Correction of Electron Back-scattered Diffraction datasets using an evolutionary algorithm.
CoRR, 2019

Self-Educated Language Agent with Hindsight Experience Replay for Instruction Following.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

2018
Deep Reinforcement Learning and the Deadly Triad.
CoRR, 2018

HoME: a Household Multimodal Environment.
Proceedings of the 6th International Conference on Learning Representations, 2018

Visual Reasoning with Multi-hop Feature Modulation.
Proceedings of the Computer Vision - ECCV 2018, 2018

FiLM: Visual Reasoning with a General Conditioning Layer.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning Visual Reasoning Without Strong Priors.
CoRR, 2017

Modulating early visual processing by language.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

End-to-end optimization of goal-driven and visually grounded dialogue systems.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

GuessWhat?! Visual Object Discovery through Multi-modal Dialogue.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Nash Equilibrium for General-Sum Markov Games from Batch Data.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Hybrid Collaborative Filtering with Neural Networks.
CoRR, 2016

Hybrid Recommender System based on Autoencoders.
Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, 2016


  Loading...