Archit Sharma

According to our database¹, Archit Sharma authored at least 34 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison.

[BibT_eX]

[DOI]

Judy Hanwen Shen

Archit Sharma

Jun Qin

CoRR, 2024

Stream of Search (SoS): Learning to Search in Language.

[BibT_eX]

[DOI]

CoRR, 2024

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset.

[BibT_eX]

[DOI]

CoRR, 2024

Yell At Your Robot: Improving On-the-Fly from Language Corrections.

[BibT_eX]

[DOI]

CoRR, 2024

A Critical Evaluation of AI Feedback for Aligning Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration.

[BibT_eX]

[DOI]

Henrik I. Christensen

Keerthana Gopalakrishnan

Lawrence Yunliang Chen

Nur Muhammad (Mahi) Shafiullah

Roberto Martín-Martín

Subramanian Ramamoorthy

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

RLVF: Learning from Verbal Feedback without Overgeneralization.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Model Detectors Are Easily Optimized Against.

[BibT_eX]

[DOI]

Christopher D. Manning

Chelsea Finn

Stefano Ermon

Proceedings of the Twelfth International Conference on Learning Representations, 2024

An Emulator for Fine-tuning Large Language Models using Small Language Models.

[BibT_eX]

[DOI]

Christopher D. Manning

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Evolutionary Search of Optimal Hyperparameters for Learning Various Robot Manipulation Tasks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Congress on Evolutionary Computation, 2024

2023

Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment.

[BibT_eX]

[DOI]

CoRR, 2023

Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias.

[BibT_eX]

[DOI]

CoRR, 2023

Direct Preference Optimization: Your Language Model is Secretly a Reward Model.

[BibT_eX]

[DOI]

Rafael Rafailov

Archit Sharma

Eric Mitchell

Christopher D. Manning

Stefano Ermon

Chelsea Finn

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback.

[BibT_eX]

[DOI]

Christopher D. Manning

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Waypoint-Based Imitation Learning for Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

APTSumm at BioLaySumm Task 1: Biomedical Breakdown, Improving Readability by Relevancy Based Selection.

[BibT_eX]

[DOI]

Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, 2023

2022

When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

You Only Live Once: Single-Life Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning.

[BibT_eX]

[DOI]

Archit Sharma

Rehaan Ahmad

Chelsea Finn

Proceedings of the International Conference on Machine Learning, 2022

Autonomous Reinforcement Learning: Formalism and Benchmarking.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Persistent Reinforcement Learning via Subgoal Curricula.

[BibT_eX]

[DOI]

CoRR, 2021

Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Discriminator Augmented Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Autonomous Reinforcement Learning via Subgoal Curricula.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XVI, 2020

Dynamics-Aware Unsupervised Discovery of Skills.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

A flexible probabilistic framework for large-margin mixture of experts.

[BibT_eX]

[DOI]

Archit Sharma

Siddhartha Saxena

Piyush Rai

Mach. Learn., 2019

2018

TrueChain: Highly Performant Decentralized Public Ledger.

[BibT_eX]

[DOI]

Archit Sharma

Jasper L

Eric Zhang

CoRR, 2018

Archit Sharma

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...