Abhijit Gosavi

Orcid: 0000-0002-9703-4076

According to our database, Abhijit Gosavi authored at least 30 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Deep reinforcement learning for approximate policy iteration: convergence analysis and a post-earthquake disaster response case study.
Optim. Lett., December, 2024

Maintenance optimization in a digital twin for Industry 4.0.
Ann. Oper. Res., September, 2024

2022
On general multi-server queues with non-Poisson arrivals and medium traffic: a new approximation and a COVID-19 ventilator case study.
Oper. Res., 2022

2020
Discrete-Event-Based Simulation Model for Performance Evaluation of Post-Earthquake Restoration in a Smart City.
IEEE Trans. Engineering Management, 2020

The Actor-Critic Algorithm for Infinite Horizon Discounted Cost Revisited.
Proceedings of the Winter Simulation Conference, 2020

2019
A bounded actor-critic reinforcement learning algorithm applied to airline revenue management.
Eng. Appl. Artif. Intell., 2019

2017
Predicting Response of Risk-Seeking Systems During Project Negotiations in a System of Systems.
IEEE Syst. J., 2017

2016
Solving Markov decision processes with downside risk adjustment.
Int. J. Autom. Comput., 2016

2015
Analyzing Responses from Likert Surveys and Risk-adjusted Ranking: A Data Analytics Perspective.
Proceedings of the Complex Adaptive Systems 2015 Conference, San Jose, 2015

2014
Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques.
Int. J. Gen. Syst., 2014

How to Rein in the Volatile Actor: A New Bounded Perspective.
Proceedings of the Complex Adaptive Systems 2014 Conference, 2014

Beyond exponential utility functions: A variance-adjusted approach for risk-averse reinforcement learning.
Proceedings of the 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2014

2013
Relative value iteration for average reward semi-Markov control via simulation.
Proceedings of the Winter Simulation Conference: Simulation Making Decisions in a Complex World, 2013

2012
Approximate Policy Iteration for Markov Control Revisited.
Proceedings of the Complex Adaptive Systems 2012 Conference, 2012

2011
Approximate Policy Iteration for Semi-Markov Control Revisited.
Proceedings of the Complex Adaptive Systems 2011 Conference, 2011

On the distribution of the number stranded in bulk-arrival, bulk-service queues of the M/G/1 form.
Eur. J. Oper. Res., 2011

Stochastic policy search for variance-penalized semi-Markov control.
Proceedings of the Winter Simulation Conference 2011, 2011

Model-building semi-Markov adaptive critics.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010
Finite horizon Markov control with one-step variance penalties.
Proceedings of the 48th Annual Allerton Conference on Communication, 2010

2009
Reinforcement Learning: A Tutorial Survey and Recent Advances.
INFORMS J. Comput., 2009

Reinforcement Learning for Model Building and Variance-penalized Control.
Proceedings of the 2009 Winter Simulation Conference, 2009

2008
Stochastic Modeling of an Automated Guided Vehicle System With One Vehicle and a Closed-Loop Path.
IEEE Trans. Autom. Sci. Eng., 2008

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning.
Proceedings of the 2008 Winter Simulation Conference, Global Gateway to Discovery, 2008

2007
Simulation optimization for revenue management of airlines with cancellations and overbooking.
OR Spectr., 2007

Simulation-based optimisation for material dispatching in Vendor-Managed Inventory systems.
Int. J. Simul. Process. Model., 2007

2006
Boundedness of iterates in Q-Learning.
Syst. Control. Lett., 2006

A risk-sensitive approach to total productive maintenance.
Autom., 2006

2004
A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis.
Mach. Learn., 2004

Reinforcement learning for long-run average cost.
Eur. J. Oper. Res., 2004

Simulation-Based Optimization for Material Dispatching in a Retailer Network.
Proceedings of the 36th conference on Winter simulation, 2004

