Archit Sharma

According to our database, Archit Sharma authored at least 34 papers between 2018 and 2024.

Bibliography

2024
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval.
CoRR, 2024

Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison.
CoRR, 2024

Stream of Search (SoS): Learning to Search in Language.
CoRR, 2024

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset.
CoRR, 2024

Yell At Your Robot: Improving On-the-Fly from Language Corrections.
CoRR, 2024

A Critical Evaluation of AI Feedback for Aligning Large Language Models.
CoRR, 2024

Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RLVF: Learning from Verbal Feedback without Overgeneralization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Model Detectors Are Easily Optimized Against.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

An Emulator for Fine-tuning Large Language Models using Small Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Evolutionary Search of Optimal Hyperparameters for Learning Various Robot Manipulation Tasks.
Proceedings of the IEEE Congress on Evolutionary Computation, 2024

2023
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment.
CoRR, 2023

Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias.
CoRR, 2023

Direct Preference Optimization: Your Language Model is Secretly a Reward Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Waypoint-Based Imitation Learning for Robotic Manipulation.
Proceedings of the Conference on Robot Learning, 2023

Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning.
Proceedings of the Conference on Robot Learning, 2023

APTSumm at BioLaySumm Task 1: Biomedical Breakdown, Improving Readability by Relevancy Based Selection.
Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, 2023

2022
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

You Only Live Once: Single-Life Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Autonomous Reinforcement Learning: Formalism and Benchmarking.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Persistent Reinforcement Learning via Subgoal Curricula.
CoRR, 2021

Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning.
CoRR, 2021

Discriminator Augmented Model-Based Reinforcement Learning.
CoRR, 2021

Autonomous Reinforcement Learning via Subgoal Curricula.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning.
Proceedings of the Robotics: Science and Systems XVI, 2020

Dynamics-Aware Unsupervised Discovery of Skills.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
A flexible probabilistic framework for large-margin mixture of experts.
Mach. Learn., 2019

2018
TrueChain: Highly Performant Decentralized Public Ledger.
CoRR, 2018

