Mridul Agarwal

According to our database1, Mridul Agarwal authored at least 36 papers between 2005 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Reinforcement Learning for Joint Optimization of Multiple Rewards.
J. Mach. Learn. Res., 2023

On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization.
Proceedings of the International Conference on Machine Learning, 2023

Poze with Vogue.
Proceedings of the 14th International Conference on Computing Communication and Networking Technologies, 2023

2022
Concave Utility Reinforcement Learning with Zero-Constraint Violations.
Trans. Mach. Learn. Res., 2022

On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC).
J. Mach. Learn. Res., 2022

Multi-Agent Multi-Armed Bandits with Limited Communication.
J. Mach. Learn. Res., 2022

Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm.
J. Artif. Intell. Res., 2022

Learning-Based Online QoE Optimization in Multi-Agent Video Streaming.
Algorithms, 2022

Reinforcement Learning for Mean-Field Game.
Algorithms, 2022

An explore-then-commit algorithm for submodular maximization under full-bandit feedback.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Regret guarantees for model-based reinforcement learning with long-term average constraints.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Multi-Objective Reinforcement Learning with Non-Linear Scalarization.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Stochastic Top <i>K</i>-Subset Bandits with Linear Space and Non-Linear Feedback with Applications to Social Influence Maximization.
Trans. Data Sci., 2021

Blind decision making: Reinforcement learning with delayed observations.
Pattern Recognit. Lett., 2021

Markov Decision Processes with Long-Term Average Constraints.
CoRR, 2021

Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm.
CoRR, 2021

SARTRES: a semi-autonomous robot teleoperation environment for surgery.
Comput. methods Biomech. Biomed. Eng. Imaging Vis., 2021

Communication efficient parallel reinforcement learning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Dexterous Skill Transfer between Surgical Procedures for Teleoperated Robotic Surgery.
Proceedings of the 30th IEEE International Conference on Robot & Human Interactive Communication, 2021

DESERTS: DElay-tolerant SEmi-autonomous Robot Teleoperation for Surgery.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Stochastic Top-K Subset Bandits with Linear Space and Non-Linear Feedback.
Proceedings of the Algorithmic Learning Theory, 2021

DART: Adaptive Accept Reject Algorithm for Non-Linear Combinatorial Bandits.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
DART: aDaptive Accept RejecT for non-linear top-K subset identification.
CoRR, 2020

Escaping Saddle Points for Zeroth-order Non-convex Optimization using Estimated Gradient Descent.
Proceedings of the 54th Annual Conference on Information Sciences and Systems, 2020

2019
Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent.
CoRR, 2019

Encoders and Decoders for Quantum Expander Codes Using Machine Learning.
CoRR, 2019

A Reinforcement Learning Based Approach for Joint Multi-Agent Decision Making.
CoRR, 2019

Transferring Dexterous Surgical Skill Knowledge between Robots for Semi-autonomous Teleoperation.
Proceedings of the 28th IEEE International Conference on Robot and Human Interactive Communication, 2019

2018
Regret Bounds for Stochastic Combinatorial Multi-Armed Bandits with Linear Space Complexity.
CoRR, 2018

2012
Grasping Region Identification in Novel Objects Using Microsoft Kinect.
Proceedings of the Neural Information Processing - 19th International Conference, 2012

2008
Optimized Circuit Failure Prediction for Aging: Practicality and Promise.
Proceedings of the 2008 IEEE International Test Conference, 2008

2007
Circuit Failure Prediction and Its Application to Transistor Aging.
Proceedings of the 25th IEEE VLSI Test Symposium (VTS 2007), 2007

Circuit failure prediction to overcome scaled CMOS reliability challenges.
Proceedings of the 2007 IEEE International Test Conference, 2007

2006
Statistical interconnect metrics for physical-design optimization.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2006

2005
Statistical modeling of cross-coupling effects in VLSI interconnects.
Proceedings of the 2005 Conference on Asia South Pacific Design Automation, 2005


  Loading...