Rishabh Agarwal

According to our database, Rishabh Agarwal authored at least 56 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.

Trans. Mach. Learn. Res., 2024

A comprehensive survey on answer generation methods using NLP.

Nat. Lang. Process. J., 2024

Evolving Alignment via Asymmetric Self-Play.

CoRR, 2024

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models.

CoRR, 2024

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling.

CoRR, 2024

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning.

CoRR, 2024

Not All LLM Reasoners Are Created Equal.

CoRR, 2024

Training Language Models to Self-Correct via Reinforcement Learning.

Feryal M. P. Behbahani

Aleksandra Faust

CoRR, 2024

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling.

CoRR, 2024

Generative Verifiers: Reward Modeling as Next-Token Prediction.

CoRR, 2024

Don't Throw Away Data: Better Sequence Knowledge Distillation.

CoRR, 2024

On scalable oversight with weak LLMs judging strong LLMs.

CoRR, 2024

Many-Shot In-Context Learning.

Feryal M. P. Behbahani

Aleksandra Faust

Hugo Larochelle

CoRR, 2024

Transformers Can Achieve Length Generalization But Not Robustly.

CoRR, 2024

V-STaR: Training Verifiers for Self-Taught Reasoners.

CoRR, 2024

SiT: Symmetry-invariant Transformers for Generalisation in Reinforcement Learning.

Matthias Weissenbacher

Rishabh Agarwal

Yoshinobu Kawahara

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

DistillSpec: Improving Speculative Decoding via Knowledge Distillation.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy.

CoRR, 2023

GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models.

CoRR, 2023

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisiting Bellman Errors for Offline Model Selection.

Proceedings of the International Conference on Machine Learning, 2023

The Dormant Neuron Phenomenon in Deep Reinforcement Learning.

Proceedings of the International Conference on Machine Learning, 2023

Bigger, Better, Faster: Human-level Atari with human-level efficiency.

Max Schwarzer

Johan Samir Obando-Ceron

Proceedings of the International Conference on Machine Learning, 2023

Bootstrapped Representations in Reinforcement Learning.

Proceedings of the International Conference on Machine Learning, 2023

Investigating Multi-task Pretraining and Generalization in Reinforcement Learning.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Revisiting Bisimulation: A Sampling-Based State Similarity Pseudo-metric.

Charline Le Lan

Rishabh Agarwal

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Beyond Tabula Rasa: Reincarnating Reinforcement Learning.

CoRR, 2022

Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Detection Of Crop Water Stress In Maize Using Drone Based Hyperspectral Imaging.

Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2022

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization.

Proceedings of the Tenth International Conference on Learning Representations, 2022

On the Generalization of Representations in Reinforcement Learning.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Deep Reinforcement Learning at the Edge of the Statistical Precipice.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Additive Models: Interpretable Machine Learning with Neural Nets.

Benjamin J. Lengerich

Rich Caruana

Geoffrey E. Hinton

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning.

Proceedings of the 9th International Conference on Learning Representations, 2021

Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning.

Proceedings of the 9th International Conference on Learning Representations, 2021

Field Boundary Identification using Convolutional Neural Network and GIS on High Resolution Satellite Observations.

Proceedings of the 9th International Conference on Agro-Geoinformatics, 2021

2020

RL Unplugged: Benchmarks for Offline Reinforcement Learning.

Sergio Gómez Colmenarejo

CoRR, 2020

Neural Additive Models: Interpretable Machine Learning with Neural Nets.

CoRR, 2020

IITK at SemEval-2020 Task 10: Transformers for Emphasis Selection.

Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning.

Sergio Gómez Colmenarejo

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Revisiting Fundamentals of Experience Replay.

Proceedings of the 37th International Conference on Machine Learning, 2020

An Optimistic Perspective on Offline Reinforcement Learning.

Rishabh Agarwal

Dale Schuurmans

Mohammad Norouzi

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Striving for Simplicity in Off-policy Deep Reinforcement Learning.

Rishabh Agarwal

Dale Schuurmans

Mohammad Norouzi

CoRR, 2019

Evaluation Function Approximation for Scrabble.

Rishabh Agarwal

CoRR, 2019

Measurement of shear forces during gripping tasks with a low-cost tactile sensing system.

Rishabh Agarwal

Sarah Bergbreiter

Proceedings of the IEEE International Conference on Soft Robotics, 2019

Learning to Generalize from Sparse and Underspecified Rewards.

Proceedings of the 36th International Conference on Machine Learning, 2019

2017

Computing Theory Prime Implicates in Modal Logic.

Manoj K. Raut

Tushar V. Kokane

Rishabh Agarwal

Proceedings of the Intelligent Systems Design and Applications, 2017

S-Pencil: A Smart Pencil Grip Monitoring System for Kids Using Sensors.

Proceedings of the 2017 IEEE Global Communications Conference, 2017

Development of a Low-Cost Education Platform: RoboMuse 4.0.

Proceedings of the Advances in Robotics, 2017

2016

Touchless human-mobile robot interaction using a projectable interactive surface.

Proceedings of the 2016 IEEE/SICE International Symposium on System Integration, 2016
