Andrey Kolobov

Orcid: 0000-0003-4966-7466

According to our database, Andrey Kolobov authored at least 50 papers between 2005 and 2024.

Bibliography

2024
The Importance of Directional Feedback for LLM-based Optimizers.
CoRR, 2024

PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem.
CoRR, 2024

WindSeer: Real-time volumetric wind prediction over complex terrain aboard a small UAV.
CoRR, 2024

Watching the Air Rise: Learning-Based Single-Frame Schlieren Detection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Offline RL by Blending Heuristics.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Influence of Heat Loss on the Chaotic Dynamics of Reaction Waves in the Model with Chain-Branching Reaction.
Int. J. Bifurc. Chaos, September, 2023

LLF-Bench: Benchmark for Interactive Learning from Language Feedback.
CoRR, 2023

Interactive Robot Learning from Verbal Correction.
CoRR, 2023

PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining.
CoRR, 2023

Survival Instinct in Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Exploring Levels of Control for a Navigation Assistant for Blind Travelers.
Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, 2023

PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining.
Proceedings of the Conference on Robot Learning, 2023

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control.
Proceedings of the Conference on Robot Learning, 2023

2022
The Sandbox Environment for Generalizable Agent Research (SEGAR).
CoRR, 2022

MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Heuristic-Guided Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Policy Improvement from Multiple Experts.
CoRR, 2020

Safe Reinforcement Learning via Curriculum Induction.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark.
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Policy Improvement via Imitation of Multiple Oracles.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Learning for Active Cache Synchronization.
Proceedings of the 37th International Conference on Machine Learning, 2020

MultiPoint: Cross-spectral registration of thermal and optical aerial imagery.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
Optimal Freshness Crawl Under Politeness Constraints.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Staying up to Date with Online Content Changes Using Reinforcement Learning for Scheduling.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Autonomous Thermalling as a Partially Observable Markov Decision Process (Extended Version).
CoRR, 2018

Autonomous Thermalling as a Partially Observable Markov Decision Process.
Proceedings of the Robotics: Science and Systems XIV, 2018

ArduSoar: An Open-Source Thermalling Controller for Resource-Constrained Autopilots.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2016
Interactive Teaching Strategies for Agent Training.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Metareasoning for Planning Under Uncertainty.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Selecting Robust Strategies in RTS Games via Concurrent Plan Augmentation.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

TODTLER: Two-Order-Deep Transfer Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Parallel Task Routing for Crowdsourcing.
Proceedings of the Second AAAI Conference on Human Computation and Crowdsourcing, 2014

Gauss meets Canadian traveler: shortest-path problems with correlated natural dynamics.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Saturated Path-Constrained MDP: Planning under Uncertainty and Deterministic Model-Checking Constraints.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Scalable Methods and Expressive Models for Planning Under Uncertainty.
PhD thesis, 2013

Joint Crowdsourcing of Multiple Tasks.
Proceedings of the Human Computation and Crowdsourcing: Works in Progress and Demonstration Abstracts, 2013

2012
Planning with Markov Decision Processes: An AI Perspective.
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01559-5, 2012

Discovering hidden structure in factored MDPs.
Artif. Intell., 2012

A Theory of Goal-Oriented MDPs with Dead Ends.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Reverse Iterative Deepening for Finite-Horizon MDPs with Large Branching Factors.
Proceedings of the Twenty-Second International Conference on Automated Planning and Scheduling, 2012

LRTDP Versus UCT for Online Probabilistic Planning.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Towards Scalable MDP Algorithms.
Proceedings of the IJCAI 2011, 2011

Heuristic Search for Generalized Stochastic Shortest Path MDPs.
Proceedings of the 21st International Conference on Automated Planning and Scheduling, 2011

2010
Classical Planning in MDP Heuristics: with a Little Help from Generalization.
Proceedings of the 20th International Conference on Automated Planning and Scheduling, 2010

SixthSense: Fast and Reliable Recognition of Dead Ends in MDPs.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
ReTrASE: Integrating Paradigms for Approximate Probabilistic Planning.
Proceedings of the IJCAI 2009, 2009

2005
BLOG: Probabilistic Models with Unknown Objects.
IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Approximate Inference for Infinite Contingent Bayesian Networks.
Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, 2005

