Shie Mannor
Orcid: 0000-0003-4439-7647Affiliations:
- Technion - Israel Institute of Technology, Department of Electrical Engineering, Haifa, Israel (PhD 2002)
- Nvidia Research, Tel Aviv-Yafo, Israel
According to our database1,
Shie Mannor
authored at least 437 papers
between 2000 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on linkedin.com
-
on twitter.com
-
on orcid.org
-
on d-nb.info
On csauthors.net:
Bibliography
2024
Dual Pricing to Prioritize Renewable Energy and Consumer Preferences in Electricity Markets.
CoRR, 2024
From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis.
CoRR, 2024
CoRR, 2024
On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes.
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024
Policy Gradient with Tree Search (PGTS) in Reinforcement Learning Evades Local Maxima.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Proceedings of the International Conference on Machine Learning, 2023
Learning Hidden Markov Models When the Locations of Missing Observations are Unknown.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
SIGMETRICS Perform. Evaluation Rev., 2022
CoRR, 2022
Whats Missing? Learning Hidden Markov Models When the Locations of Missing Observations are Unknown.
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Reinforcement Learning for Extended Intelligence.
Proceedings of the 19th International Conference on Informatics in Control, 2022
Proceedings of the Conference on Robot Learning, 2022
Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling.
CoRR, 2021
CoRR, 2021
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
On the Volatility of Optimal Control Policies of a Class of Linear Quadratic Regulators.
Proceedings of the 2021 American Control Conference, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems.
CoRR, 2020
CoRR, 2020
How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks.
CoRR, 2020
CoRR, 2020
CoRR, 2020
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the Conference on Learning Theory, 2020
Proceedings of the Algorithmic Learning Theory, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
IEEE/ACM Trans. Netw., 2019
CoRR, 2019
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces.
CoRR, 2019
Image Matters: Detecting Offensive and Non-Compliant Content / Logo in Product Images.
CoRR, 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching.
CoRR, 2019
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem.
Proceedings of the Conference on Learning Theory, 2019
On-Line Learning of Linear Dynamical Systems: Exponential Forgetting in Kalman Filters.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
IEEE Trans. Inf. Theory, 2018
CoRR, 2018
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning.
Proceedings of the Conference On Learning Theory, 2018
Proceedings of the Conference On Learning Theory, 2018
Is a Picture Worth a Thousand Words? A Deep Multi-Modal Architecture for Product Classification in E-Commerce.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017
SIAM J. Comput., 2017
Learn on Source, Refine on Target: A Model Transfer Learning Framework with Random Forests.
IEEE Trans. Pattern Anal. Mach. Intell., 2017
Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning.
CoRR, 2017
Proceedings of the 26th International Conference on World Wide Web Companion, 2017
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the IEEE Power & Energy Society Innovative Smart Grid Technologies Conference, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 30th Conference on Learning Theory, 2017
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
J. Mach. Learn. Res., 2016
Is a picture worth a thousand words? A Deep Multi-Modal Fusion Architecture for Product Classification in e-commerce.
CoRR, 2016
CoRR, 2016
Distributed scenario-based optimization for asset management in a hierarchical decision making environment.
Proceedings of the Power Systems Computation Conference, 2016
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
IEEE Trans. Inf. Theory, 2015
J. Artif. Intell. Res., 2015
Learning to coordinate without communication in multi-user multi-armed bandit problems.
CoRR, 2015
Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015
Proceedings of the 2015 IEEE Conference on Computer Communications, 2015
Proceedings of the 2015 IEEE Conference on Computer Communications, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of The 28th Conference on Learning Theory, 2015
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
Proceedings of the Learning for General Competency in Video Games, 2015
2014
High-Throughput Energy-Efficient LDPC Decoders Using Differential Binary Message Passing.
IEEE Trans. Signal Process., 2014
Math. Oper. Res., 2014
J. Mach. Learn. Res., 2014
Proceedings of the ACM Conference on Economics and Computation, 2014
Heterogeneous Stream Processing and Crowdsourcing for Traffic Monitoring: Highlights.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Scaling Up Approximate Value Iteration with Options: Better Policies with Fewer Iterations.
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Combining a Gauss-Markov model and Gaussian process for traffic prediction in Dublin city center.
Proceedings of the Workshops of the EDBT/ICDT 2014 Joint Conference (EDBT/ICDT 2014), 2014
Proceedings of the 17th International Conference on Extending Database Technology, 2014
Approachability in unknown games: Online learning meets multi-objective optimization.
Proceedings of The 27th Conference on Learning Theory, 2014
Proceedings of the IEEE 25th International Conference on Application-Specific Systems, 2014
2013
IEEE Trans. Pattern Anal. Mach. Intell., 2013
A State Action Frequency Approach to Throughput Maximization over Uncertain Wireless Channels.
Internet Math., 2013
Eur. J. Oper. Res., 2013
Online Learning for Loss Functions with Memory and Applications to Statistical Arbitrage
CoRR, 2013
CoRR, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the Fourteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2013
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the COLT 2013, 2013
2012
IEEE Trans. Pattern Anal. Mach. Intell., 2012
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012
Proceedings of the 4th Asian Conference on Machine Learning, 2012
CoRR, 2012
Ann. Oper. Res., 2012
Proceedings of the 2012 IEEE Workshop on Signal Processing Systems, 2012
Proceedings of the ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the 51th IEEE Conference on Decision and Control, 2012
Proceedings of the 51th IEEE Conference on Decision and Control, 2012
Proceedings of the 50th Annual Allerton Conference on Communication, 2012
2011
IEEE Trans. Parallel Distributed Syst., 2011
IEEE Trans. Comput. Intell. AI Games, 2011
Proceedings of the COLT 2011, 2011
Proceedings of the COLT 2011, 2011
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011
Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011
Proceedings of the Computational Physiology, 2011
2010
IEEE Trans. Signal Process., 2010
IEEE Trans. Signal Process., 2010
IEEE Trans. Circuits Syst. II Express Briefs, 2010
Oper. Res., 2010
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the First Workshop on Social Media Analytics, 2010
Proceedings of the 2010 International Conference on Distributed Computing Systems, 2010
A novel similarity measure for time series data with applications to gait and activity recognition.
Proceedings of the UbiComp 2010: Ubiquitous Computing, 12th International Conference, 2010
Proceedings of the Global Communications Conference, 2010
Proceedings of the COLT 2010, 2010
Proceedings of the COLT 2010, 2010
Proceedings of the 49th IEEE Conference on Decision and Control, 2010
Proceedings of the 49th IEEE Conference on Decision and Control, 2010
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010
2009
IEEE Trans. Autom. Control., 2009
IEEE Trans. Autom. Control., 2009
Games Econ. Behav., 2009
Proceedings of the IEEE Workshop on Signal Processing Systems, 2009
Proceedings of the 2009 IEEE Information Theory Workshop, 2009
Proceedings of the 26th Annual International Conference on Machine Learning, 2009
Proceedings of IEEE International Conference on Communications, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009
Online learning in Markov decision processes with arbitrarily changing rewards and transitions.
Proceedings of the 1st International Conference on Game Theory for Networks, 2009
Proceedings of the 1st International Conference on Game Theory for Networks, 2009
Proceedings of the 48th IEEE Conference on Decision and Control, 2009
Proceedings of the 48th IEEE Conference on Decision and Control, 2009
Proceedings of the 48th IEEE Conference on Decision and Control, 2009
Regularized Fitted Q-Iteration for planning in continuous-space Markovian decision problems.
Proceedings of the American Control Conference, 2009
2008
Games Econ. Behav., 2008
Proceedings of the Internet and Network Economics, 4th International Workshop, 2008
Proceedings of the 3rd International ICST Conference on Performance Evaluation Methodologies and Tools, 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008
Proceedings of the Machine Learning, 2008
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
Proceedings of the 21st Annual Conference on Learning Theory, 2008
Proceedings of the 46th Annual Allerton Conference on Communication, 2008
Proceedings of the 46th Annual Allerton Conference on Communication, 2008
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008
2007
IEEE Trans. Inf. Theory, 2007
Online calibrated forecasts: Memory efficiency versus universality for learning in games.
Mach. Learn., 2007
IEEE J. Sel. Areas Commun., 2007
An Area-Efficient FPGA-Based Architecture for Fully-Parallel Stochastic LDPC Decoding.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2007
Proceedings of the NETWORKING 2007. Ad Hoc and Sensor Networks, 2007
Proceedings of the 37th International Symposium on Multiple-Valued Logic, 2007
Percentile optimization in uncertain Markov decision processes with application to efficient exploration.
Proceedings of the Machine Learning, 2007
Proceedings of the Global Communications Conference, 2007
Proceedings of the 46th IEEE Conference on Decision and Control, 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
2006
Design of ℓ<sub>1</sub>-optimal controllers with flexible disturbance rejection level.
IEEE Trans. Autom. Control., 2006
Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems.
J. Mach. Learn. Res., 2006
Proceedings of the Advances in Neural Information Processing Systems 19, 2006
Proceedings of the INFOCOM 2006. 25th IEEE International Conference on Computer Communications, 2006
Automatic basis function construction for approximate dynamic programming and reinforcement learning.
Proceedings of the Machine Learning, 2006
Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006
Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006
Proceedings of the American Control Conference, 2006
2005
IEEE Trans. Autom. Control., 2005
On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies.
Math. Oper. Res., 2005
Ann. Oper. Res., 2005
The Workshop Program at the Nineteenth National Conference on Artificial Intelligence.
AI Mag., 2005
Proceedings of the Machine Learning, 2005
Proceedings of the Machine Learning, 2005
2004
J. Mach. Learn. Res., 2004
J. Mach. Learn. Res., 2004
Proceedings of the Machine Learning, 2004
Proceedings of the Machine Learning, 2004
Proceedings of the Learning Theory, 17th Annual Conference on Learning Theory, 2004
Proceedings of the 43rd IEEE Conference on Decision and Control, 2004
2003
The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes.
Math. Oper. Res., 2003
Greedy Algorithms for Classification -- Consistency, Convergence Rates, and Adaptivity.
J. Mach. Learn. Res., 2003
Proceedings of the Machine Learning, 2003
Proceedings of the Machine Learning, 2003
Proceedings of the Machine Learning, 2003
Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem.
Proceedings of the Computational Learning Theory and Kernel Machines, 2003
Proceedings of the Computational Learning Theory and Kernel Machines, 2003
2002
Mach. Learn., 2002
Proceedings of the Machine Learning: ECML 2002, 2002
Proceedings of the Machine Learning: ECML 2002, 2002
Proceedings of the Computational Learning Theory, 2002
Proceedings of the Computational Learning Theory, 2002
2001
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
Learning Embedded Maps of Markov Processes.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001
Adaptive Strategies and Regret Minimization in Arbitrarily Varying Markov Environments.
Proceedings of the Computational Learning Theory, 2001
Proceedings of the Computational Learning Theory, 2001
2000
Proceedings of the Advances in Neural Information Processing Systems 13, 2000