Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Resource Constrained Dialog Policy Learning Via Differentiable Inductive Logic Programming.

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Situated and Interactive Multimodal Conversations.

Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019

SIMMC: Situated Interactive Multi-Modal Conversational Data Collection And Evaluation Platform.

CoRR, 2019

Domain-Independent turn-level Dialogue Quality Evaluation via User Satisfaction Estimation.

Praveen Kumar Bodigutla

CoRR, 2019

2017

Learning Robust Dialog Policies in Noisy Environments.

CoRR, 2017

The Future of Artificially Intelligent Assistants.

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

2015

RLPy: a value-function-based reinforcement learning framework for education and research.

J. Mach. Learn. Res., 2015

2013

Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning.

Alborz Geramifard

Josh Redding

Jonathan P. How

J. Intell. Robotic Syst., 2013

A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning.

Found. Trends Mach. Learn., 2013

Batch-iFDD for Representation Expansion in Large MDPs.

Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Reinforcement learning with misspecified model classes.

Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Decentralized control of partially observable Markov decision processes.

Mykel J. Kochenderfer

Proceedings of the 52nd IEEE Conference on Decision and Control, 2013

2012

Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery.

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Model estimation within planning and learning.

Proceedings of the American Control Conference, 2012

2011

Online Discovery of Feature Dependencies.

Proceedings of the 28th International Conference on Machine Learning, 2011

UAV cooperative control with stochastic risk models.

Proceedings of the American Control Conference, 2011

2010

On the Design and Use of a Micro Air Vehicle to Track and Avoid Adversaries.

Int. J. Robotics Res., 2010

An intelligent Cooperative Control Architecture.

Proceedings of the American Control Conference, 2010

Actor-Critic Policy Learning in Cooperative Planning.

Josh Redding

Alborz Geramifard

Jonathan P. How

Proceedings of the Embedded Reasoning, 2010

2008

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping.

Proceedings of the UAI 2008, 2008

Co-ordinated Tracking and Planning Using Air and Ground Vehicles.

Proceedings of the Experimental Robotics, The Eleventh International Symposium, 2008

Sigma point policy iteration.

Michael H. Bowling

Alborz Geramifard

David Wingate

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

2006

A Hybrid Three Layer Architecture for Fire Agent Management in Rescue Simulation Environment

CoRR, 2006

iLSTD: Eligibility Traces and Convergence Analysis.

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Biased Cost Pathfinding.

Alborz Geramifard

Pirooz Chubak

Vadim Bulitko

Proceedings of the Second Artificial Intelligence and Interactive Digital Entertainment Conference, 2006

Incremental Least-Squares Temporal Difference Learning.

Alborz Geramifard

Michael H. Bowling

Richard S. Sutton

Proceedings of the Proceedings, 2006

Alborz Geramifard
