2025
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains.
CoRR, January, 2025
2024
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
Trans. Mach. Learn. Res., 2024
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability.
CoRR, 2024
Semi-Supervised One Shot Imitation Learning.
RLJ, 2024
Generative Hierarchical Materials Search.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Improving Factuality and Reasoning in Language Models through Multiagent Debate.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Scalable Diffusion for Materials Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy.
CoRR, 2023
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"
CoRR, 2023
Open X-Embodiment: Robotic Learning Datasets and RT-X Models.
CoRR, 2023
Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning.
CoRR, 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for Robot Control.
CoRR, 2023
RT-1: Robotics Transformer for Real-World Control at Scale.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Masked Trajectory Models for Prediction, Representation, and Control.
Proceedings of the International Conference on Machine Learning, 2023
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets.
Proceedings of the International Conference on Machine Learning, 2023
PaLM-E: An Embodied Multimodal Language Model.
Proceedings of the International Conference on Machine Learning, 2023
Composing Ensembles of Pre-trained Models via Iterative Consensus.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control.
Proceedings of the Conference on Robot Learning, 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
2022
CoRR, 2022
VeLO: Training Versatile Learned Optimizers by Scaling Up.
CoRR, 2022
FCM: Forgetful Causal Masking Makes Causal Language Models Better Zero-Shot Learners.
CoRR, 2022
Implicit Offline Reinforcement Learning via Supervised Learning.
CoRR, 2022
Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning.
CoRR, 2022
Pre-Trained Language Models for Interactive Decision-Making.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Multi-Game Decision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents.
Proceedings of the International Conference on Machine Learning, 2022
Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022
Learning Iterative Reasoning through Energy Minimization.
Proceedings of the International Conference on Machine Learning, 2022
Inner Monologue: Embodied Reasoning through Planning with Language Models.
Proceedings of the Conference on Robot Learning, 2022
Energy-Based Models for Continual Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2022
Frozen Pretrained Transformers as Universal Computation Engines.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning.
CoRR, 2021
Pretrained Transformers as Universal Computation Engines.
CoRR, 2021
The Neural MMO Platform for Massively Multiagent Research.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Brax - A Differentiable Physics Engine for Large Scale Rigid Body Simulation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Unsupervised Learning of Compositional Energy Concepts.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Model-Based Reinforcement Learning via Latent-Space Collocation.
Proceedings of the 38th International Conference on Machine Learning, 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot.
Proceedings of the 38th International Conference on Machine Learning, 2021
Improved Contrastive Divergence Training of Energy-Based Models.
Proceedings of the 38th International Conference on Machine Learning, 2021
Reset-Free Lifelong Learning with Skill-Space Planning.
Proceedings of the 9th International Conference on Learning Representations, 2021
Implicit Behavioral Cloning.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021
2020
Energy-Based Models for Continual Learning.
CoRR, 2020
Rearrangement: A Challenge for Embodied AI.
CoRR, 2020
γ-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction.
CoRR, 2020
Compositional Visual Generation and Inference with Energy Based Models.
CoRR, 2020
Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Compositional Visual Generation with Energy Based Models.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
A Game Theoretic Framework for Model Based Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control.
Proceedings of the 37th International Conference on Machine Learning, 2020
Emergent Tool Use From Multi-Agent Autocurricula.
Proceedings of the 8th International Conference on Learning Representations, 2020
Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
2019
Adaptive Online Planning for Continual Lifelong Learning.
CoRR, 2019
Implicit Generation and Generalization in Energy-Based Models.
CoRR, 2019
Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents.
CoRR, 2019
Implicit Generation and Modeling with Energy Based Models.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control.
Proceedings of the 7th International Conference on Learning Representations, 2019
Skill emergence and transfer in multi-agent environments.
Proceedings of the Genetic and Evolutionary Computation Conference Companion, 2019
Multi-Agent Reinforcement Learning with Multi-Step Generative Models.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019
Model-Based Planning with Energy-Based Models.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019
2018
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines.
Proceedings of the 6th International Conference on Learning Representations, 2018
Concept Learning with Energy-Based Models.
Proceedings of the 6th International Conference on Learning Representations, 2018
Emergent Complexity via Multi-Agent Competition.
Proceedings of the 6th International Conference on Learning Representations, 2018
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments.
Proceedings of the 6th International Conference on Learning Representations, 2018
Learning with Opponent-Learning Awareness.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018
Emergence of Grounded Compositional Language in Multi-Agent Populations.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Interpretable and Pedagogical Examples.
CoRR, 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Prediction and Control with Temporal Segment Models.
Proceedings of the 34th International Conference on Machine Learning, 2017
2016
A Paradigm for Situated and Goal-Driven Language Learning.
CoRR, 2016
Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model.
CoRR, 2016
Combining model-based policy search with online model learning for control of physical humanoids.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016
2015
Automated Discovery and Learning of Complex Movement Behaviors.
PhD thesis, 2015
Interactive Control of Diverse Complex Characters with Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Ensemble-CIO: Full-body dynamic motion planning that transfers to physical humanoids.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015
Physics-based trajectory optimization for grasping in cluttered environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015
2014
Combining the benefits of function approximation and trajectory optimization.
Proceedings of the Robotics: Science and Systems X, 2014
2013
Animating human lower limbs using contact-invariant optimization.
ACM Trans. Graph., 2013
Stylizing animation by example.
ACM Trans. Graph., 2013
2012
Discovery of complex behaviors through contact-invariant optimization.
ACM Trans. Graph., 2012
Contact-Invariant Optimization for Hand Manipulation.
Proceedings of the 2012 Eurographics/ACM SIGGRAPH Symposium on Computer Animation, 2012
2010
Robust physics-based locomotion using low-dimensional planning.
ACM Trans. Graph., 2010
Feature-based locomotion controllers.
ACM Trans. Graph., 2010
2009
Multiscale 3D navigation.
Proceedings of the 2009 Symposium on Interactive 3D Graphics, 2009
2008
ViewCube: a 3D orientation indicator and controller.
Proceedings of the 2008 Symposium on Interactive 3D Graphics, 2008
Proceedings of the 2008 Symposium on Interactive 3D Graphics, 2008
2007
Interface techniques for 3D control of spatial keyframing.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2007