Rishabh Agarwal

According to our database1, Rishabh Agarwal authored at least 56 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
Trans. Mach. Learn. Res., 2024

A comprehensive survey on answer generation methods using NLP.
Nat. Lang. Process. J., 2024

Evolving Alignment via Asymmetric Self-Play.
CoRR, 2024

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models.
CoRR, 2024

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling.
CoRR, 2024

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning.
CoRR, 2024

Not All LLM Reasoners Are Created Equal.
CoRR, 2024

Training Language Models to Self-Correct via Reinforcement Learning.
CoRR, 2024

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling.
CoRR, 2024

Generative Verifiers: Reward Modeling as Next-Token Prediction.
CoRR, 2024

Don't Throw Away Data: Better Sequence Knowledge Distillation.
CoRR, 2024

On scalable oversight with weak LLMs judging strong LLMs.
CoRR, 2024

Many-Shot In-Context Learning.
CoRR, 2024

Transformers Can Achieve Length Generalization But Not Robustly.
CoRR, 2024

V-STaR: Training Verifiers for Self-Taught Reasoners.
CoRR, 2024

SiT: Symmetry-invariant Transformers for Generalisation in Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DistillSpec: Improving Speculative Decoding via Knowledge Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy.
CoRR, 2023

GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models.
CoRR, 2023

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisiting Bellman Errors for Offline Model Selection.
Proceedings of the International Conference on Machine Learning, 2023

The Dormant Neuron Phenomenon in Deep Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Bigger, Better, Faster: Human-level Atari with human-level efficiency.
Proceedings of the International Conference on Machine Learning, 2023

Bootstrapped Representations in Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Investigating Multi-task Pretraining and Generalization in Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Revisiting Bisimulation: A Sampling-Based State Similarity Pseudo-metric.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Beyond Tabula Rasa: Reincarnating Reinforcement Learning.
CoRR, 2022

Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Detection Of Crop Water Stress In Maize Using Drone Based Hyperspectral Imaging.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2022

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

On the Generalization of Representations in Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Additive Models: Interpretable Machine Learning with Neural Nets.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Field Boundary Identification using Convolutional Neural Network and GIS on High Resolution Satellite Observations.
Proceedings of the 9th International Conference on Agro-Geoinformatics, 2021

2020
RL Unplugged: Benchmarks for Offline Reinforcement Learning.
CoRR, 2020

Neural Additive Models: Interpretable Machine Learning with Neural Nets.
CoRR, 2020

IITK at SemEval-2020 Task 10: Transformers for Emphasis Selection.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Revisiting Fundamentals of Experience Replay.
Proceedings of the 37th International Conference on Machine Learning, 2020

An Optimistic Perspective on Offline Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Striving for Simplicity in Off-policy Deep Reinforcement Learning.
CoRR, 2019

Evaluation Function Approximation for Scrabble.
CoRR, 2019

Measurement of shear forces during gripping tasks with a low-cost tactile sensing system.
Proceedings of the IEEE International Conference on Soft Robotics, 2019

Learning to Generalize from Sparse and Underspecified Rewards.
Proceedings of the 36th International Conference on Machine Learning, 2019

2017
Computing Theory Prime Implicates in Modal Logic.
Proceedings of the Intelligent Systems Design and Applications, 2017

S-Pencil: A Smart Pencil Grip Monitoring System for Kids Using Sensors.
Proceedings of the 2017 IEEE Global Communications Conference, 2017

Development of a Low-Cost Education Platform: RoboMuse 4.0.
Proceedings of the Advances in Robotics, 2017

2016
Touchless human-mobile robot interaction using a projectable interactive surface.
Proceedings of the 2016 IEEE/SICE International Symposium on System Integration, 2016


  Loading...