W. Bradley Knox

Orcid: 0000-0002-6006-9523

Affiliations:
  • University of Texas at Austin, USA


According to our database1, W. Bradley Knox authored at least 37 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Models of human preference for learning reward functions.
Trans. Mach. Learn. Res., 2024

MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control.
CoRR, 2024

Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions.
CoRR, 2024

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms.
CoRR, 2024

Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Optimal Advantage from Preferences and Mistaking It for Reward.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Reward (Mis)design for Autonomous Driving (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Reward (Mis)design for autonomous driving.
Artif. Intell., March, 2023

Contrastive Preference Learning: Learning from Human Feedback without RL.
CoRR, 2023

The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Toward Believable Acting for Autonomous Animated Characters.
Proceedings of the MIG '22: ACM SIGGRAPH Conference on Motion, Interaction and Games, Guanajuato, Mexico, November 3, 2022

2021
Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
The EMPATHIC Framework for Task Learning from Implicit Human Feedback.
Proceedings of the 4th Conference on Robot Learning, 2020

2018
Social interaction for efficient agent learning from human reward.
Auton. Agents Multi Agent Syst., 2018

2016
Using informative behavior to increase engagement while learning from human reward.
Auton. Agents Multi Agent Syst., 2016

Learning from the Wizard: Programming Social Interaction through Teleoperated Demonstrations (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015
Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance.
Artif. Intell., 2015

2014
Power to the People: The Role of Humans in Interactive Machine Learning.
AI Mag., 2014

Learning from human reward benefits from socio-competitive feedback.
Proceedings of the 4th International Conference on Development and Learning and on Epigenetic Robotics, 2014

Leveraging social networks to motivate humans to train agents.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013
Training a Robot via Human Feedback: A Case Study.
Proceedings of the Social Robotics - 5th International Conference, 2013

Teaching agents with human feedback: a demonstration of the TAMER framework.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Learning non-myopically from human-generated reward.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

IUI workshop on interactive machine learning.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Using informative behavior to increase engagement in the tamer framework.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012
How Humans Teach Agents - A New Experimental Perspective.
Int. J. Soc. Robotics, 2012

Reinforcement learning from human reward: Discounting in episodic tasks.
Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication, 2012

Reinforcement learning from simultaneous human and MDP reward.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

2011
Computational, Neuroscientific, and Lifespan Perspectives on the Exploration-Exploitation Dilemma.
Proceedings of the 33th Annual Meeting of the Cognitive Science Society, 2011

Reinforcement Learning with Human Feedback in Mountain Car.
Proceedings of the Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, 2011

2010
Training a Tetris agent via interactive shaping: a demonstration of the TAMER framework.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Combining manual feedback with subsequent MDP reward signals for reinforcement learning.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009
Interactively shaping agents via human reinforcement: the TAMER framework.
Proceedings of the 5th International Conference on Knowledge Capture (K-CAP 2009), 2009

Design Principles for Creating Human-Shapable Agents.
Proceedings of the Agents that Learn from Human Teachers, 2009

2008
Domestic Interaction on a Segway Base.
Proceedings of the RoboCup 2008: Robot Soccer World Cup XII [papers from the 12th annual RoboCup International Symposium, 2008

Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

2006
Know Thine Enemy: A Champion RoboCup Coach Agent.
Proceedings of the Proceedings, 2006


  Loading...