Florian Strub
According to our database1,
Florian Strub
authored at least 41 papers
between 2016 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
CoRR, 2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
Figure Data for the paper "Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning".
Dataset, October, 2022
CoRR, 2022
CoRR, 2022
AI Commun., 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
2020
Multimodal and Interactive Models for Visually Grounded Language Learning. (Développement de modèles multimodaux intéractifs pour l'apprentissage du language dans des environnements visuels).
PhD thesis, 2020
The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction.
CoRR, 2020
HIGhER: Improving instruction following with Hindsight Generation for Experience Replay.
Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
2019
Accurate reconstruction of EBSD datasets by a multimodal data approach using an evolutionary algorithm.
CoRR, 2019
Correction of Electron Back-scattered Diffraction datasets using an evolutionary algorithm.
CoRR, 2019
Self-Educated Language Agent with Hindsight Experience Replay for Instruction Following.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019
2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017
2016
Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, 2016