Abhijit Gosavi

Orcid: 0000-0002-9703-4076

According to our database¹, Abhijit Gosavi authored at least 30 papers between 2004 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2005

2010

2015

2020

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Deep reinforcement learning for approximate policy iteration: convergence analysis and a post-earthquake disaster response case study.

[BibT_eX]

[DOI]

Abhijit Gosavi

Lesley H. Sneed

L. A. Spearing

Optim. Lett., December, 2024

Maintenance optimization in a digital twin for Industry 4.0.

[BibT_eX]

[DOI]

Abhijit Gosavi

Vy K. Le

Ann. Oper. Res., September, 2024

2022

On general multi-server queues with non-poisson arrivals and medium traffic: a new approximation and a COVID-19 ventilator case study.

[BibT_eX]

[DOI]

Carlos Chaves

Abhijit Gosavi

Oper. Res., 2022

2020

Discrete-Event-Based Simulation Model for Performance Evaluation of Post-Earthquake Restoration in a Smart City.

[BibT_eX]

[DOI]

IEEE Trans. Engineering Management, 2020

The Actor-Critic Algorithm for Infinite Horizon Discounted Cost Revisited.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the Winter Simulation Conference, 2020

2019

A bounded actor-critic reinforcement learning algorithm applied to airline revenue management.

[BibT_eX]

[DOI]

Ryan J. Lawhead

Abhijit Gosavi

Eng. Appl. Artif. Intell., 2019

2017

Predicting Response of Risk-Seeking Systems During Project Negotiations in a System of Systems.

[BibT_eX]

[DOI]

Abhijit Gosavi

Siddhartha Agarwal

Cihan H. Dagli

IEEE Syst. J., 2017

2016

Solving Markov decision processes with downside risk adjustment.

[BibT_eX]

[DOI]

Abhijit Gosavi

Anish Parulekar

Int. J. Autom. Comput., 2016

2015

Analyzing Responses from Likert Surveys and Risk-adjusted Ranking: A Data Analytics Perspective.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the Complex Adaptive Systems 2015 Conference, San Jose, 2015

2014

Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques.

[BibT_eX]

[DOI]

Abhijit Gosavi

Int. J. Gen. Syst., 2014

How to Rein in the Volatile Actor: A New Bounded Perspective.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the Complex Adaptive Systems 2014 Conference, 2014

Beyond exponential utility functions: A variance-adjusted approach for risk-averse reinforcement learning.

[BibT_eX]

[DOI]

Abhijit Gosavi

Sajal K. Das

Susan L. Murray

Proceedings of the 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2014

2013

Relative value iteration for average reward semi-Markov control via simulation.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the Winter Simulations Conference: Simulation Making Decisions in a Complex World, 2013

2012

Approximate Policy Iteration for Markov Control Revisited.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the Complex Adaptive Systems 2012 Conference, 2012

2011

Approximate Policy Iteration for Semi-Markov Control Revisited.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the Complex Adaptive Systems 2011 Conference, 2011

On the distribution of the number stranded in bulk-arrival, bulk-service queues of the M/G/1 form.

[BibT_eX]

[DOI]

Aykut F. Kahraman

Abhijit Gosavi

Eur. J. Oper. Res., 2011

Stochastic policy search for variance-penalized semi-Markov control.

[BibT_eX]

[DOI]

Abhijit Gosavi

Mandar Purohit

Proceedings of the Winter Simulation Conference 2011, 2011

Model-building semi-Markov adaptive critics.

[BibT_eX]

[DOI]

Abhijit Gosavi

Susan L. Murray

Jiaqiao Hu

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010

Finite horizon Markov control with one-step variance penalties.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the 48th Annual Allerton Conference on Communication, 2010

2009

Reinforcement Learning: A Tutorial Survey and Recent Advances.

[BibT_eX]

[DOI]

Abhijit Gosavi

INFORMS J. Comput., 2009

Reinforcement Learning for Model Building and Variance-penalized Control.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the 2009 Winter Simulation Conference, 2009

2008

Stochastic Modeling of an Automated Guided Vehicle System With One Vehicle and a Closed-Loop Path.

[BibT_eX]

[DOI]

Aykut F. Kahraman

Abhijit Gosavi

Karla J. Oty

IEEE Trans Autom. Sci. Eng., 2008

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning.

[BibT_eX]

[DOI]

Abhijit Gosavi

Proceedings of the 2008 Winter Simulation Conference, Global Gateway to Discovery, 2008

2007

Simulation optimization for revenue management of airlines with cancellations and overbooking.

[BibT_eX]

[DOI]

Abhijit Gosavi

Emrah Ozkaya

Aykut F. Kahraman

OR Spectr., 2007

Simulation-based optimisation for material dispatching in Vendor-Managed Inventory systems.

[BibT_eX]

[DOI]

Ganesh Subramaniam

Abhijit Gosavi

Int. J. Simul. Process. Model., 2007

2006

Boundedness of iterates in Q-Learning.

[BibT_eX]

[DOI]

Abhijit Gosavi

Syst. Control. Lett., 2006

A risk-sensitive approach to total productive maintenance.

[BibT_eX]

[DOI]

Abhijit Gosavi

Autom., 2006

2004

A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis.

[BibT_eX]

[DOI]

Abhijit Gosavi

Mach. Learn., 2004

Reinforcement learning for long-run average cost.

[BibT_eX]

[DOI]

Abhijit Gosavi

Eur. J. Oper. Res., 2004

Simulation-Based Optimization for Material Dispatching in a Retailer Network.

[BibT_eX]

[DOI]

Ganesh Subramaniam

Abhijit Gosavi

Proceedings of the 36th conference on Winter simulation, 2004

Abhijit Gosavi

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...