W. Bradley Knox

Orcid: 0000-0002-6006-9523

Affiliations:

University of Texas at Austin, USA

According to our database¹, W. Bradley Knox authored at least 37 papers between 2006 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Models of human preference for learning reward functions.

[BibT_eX]

[DOI]

W. Bradley Knox

Stephane Hatgis-Kessell

Serena Booth

Scott Niekum

Peter Stone

Alessandro Gabriele Allievi

Trans. Mach. Learn. Res., 2024

MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control.

[BibT_eX]

[DOI]

CoRR, 2024

Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions.

[BibT_eX]

[DOI]

Michael J. Q. Zhang

W. Bradley Knox

Eunsol Choi

CoRR, 2024

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms.

[BibT_eX]

[DOI]

CoRR, 2024

Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Optimal Advantage from Preferences and Mistaking It for Reward.

[BibT_eX]

[DOI]

W. Bradley Knox

Stephane Hatgis-Kessell

Sigurdur O. Adalgeirsson

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Reward (Mis)design for Autonomous Driving (Abstract Reprint).

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Reward (Mis)design for autonomous driving.

[BibT_eX]

[DOI]

Artif. Intell., March, 2023

Contrastive Preference Learning: Learning from Human Feedback without RL.

[BibT_eX]

[DOI]

CoRR, 2023

The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Toward Believable Acting for Autonomous Animated Characters.

[BibT_eX]

[DOI]

Cassidy J. Curtis

Sigurdur O. Adalgeirsson

Norberto Adrián Goussies

Tianyu Liu

Palash Nandy

Proceedings of the MIG '22: ACM SIGGRAPH Conference on Motion, Interaction and Games, Guanajuato, Mexico, November 3, 2022

2021

Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

The EMPATHIC Framework for Task Learning from Implicit Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

2018

Social interaction for efficient agent learning from human reward.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2018

2016

Using informative behavior to increase engagement while learning from human reward.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2016

Learning from the Wizard: Programming Social Interaction through Teleoperated Demonstrations (Extended Abstract).

[BibT_eX]

[DOI]

W. Bradley Knox

Samuel Spaulding

Cynthia Breazeal

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015

Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Artif. Intell., 2015

2014

Power to the People: The Role of Humans in Interactive Machine Learning.

[BibT_eX]

[DOI]

AI Mag., 2014

Learning from human reward benefits from socio-competitive feedback.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Development and Learning and on Epigenetic Robotics, 2014

Leveraging social networks to motivate humans to train agents.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013

Training a Robot via Human Feedback: A Case Study.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Cynthia Breazeal

Proceedings of the Social Robotics - 5th International Conference, 2013

Teaching agents with human feedback: a demonstration of the TAMER framework.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Cynthia Breazeal

Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Learning non-myopically from human-generated reward.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

IUI workshop on interactive machine learning.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Using informative behavior to increase engagement in the tamer framework.

[BibT_eX]

[DOI]

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012

How Humans Teach Agents - A New Experimental Perspective.

[BibT_eX]

[DOI]

Int. J. Soc. Robotics, 2012

Reinforcement learning from human reward: Discounting in episodic tasks.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication, 2012

Reinforcement learning from simultaneous human and MDP reward.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

2011

Computational, Neuroscientific, and Lifespan Perspectives on the Exploration-Exploitation Dilemma.

[BibT_eX]

[DOI]

Proceedings of the 33th Annual Meeting of the Cognitive Science Society, 2011

Reinforcement Learning with Human Feedback in Mountain Car.

[BibT_eX]

[DOI]

W. Bradley Knox

Adam Bradley Setapen

Peter Stone

Proceedings of the Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, 2011

2010

Training a Tetris agent via interactive shaping: a demonstration of the TAMER framework.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Combining manual feedback with subsequent MDP reward signals for reinforcement learning.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009

Interactively shaping agents via human reinforcement: the TAMER framework.

[BibT_eX]

[DOI]

W. Bradley Knox

Peter Stone

Proceedings of the 5th International Conference on Knowledge Capture (K-CAP 2009), 2009

Design Principles for Creating Human-Shapable Agents.

[BibT_eX]

[DOI]

W. Bradley Knox

Ian R. Fasel

Peter Stone

Proceedings of the Agents that Learn from Human Teachers, 2009

2008

Domestic Interaction on a Segway Base.

[BibT_eX]

[DOI]

W. Bradley Knox

Juhyun Lee

Peter Stone

Proceedings of the RoboCup 2008: Robot Soccer World Cup XII [papers from the 12th annual RoboCup International Symposium, 2008

Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration.

[BibT_eX]

[DOI]

W. Bradley Knox

Juhyun Lee

Peter Stone

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

2006

Know Thine Enemy: A Champion RoboCup Coach Agent.

[BibT_eX]

[DOI]

Gregory Kuhlmann

William B. Knox

Peter Stone

Proceedings of the Proceedings, 2006

W. Bradley Knox

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...