Martha White

Orcid: 0000-0002-5356-2950

Affiliations:
  • University of Alberta, Edmonton, Canada


According to our database1, Martha White authored at least 125 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GVFs in the real world: making predictions online for water treatment.
Mach. Learn., July, 2024

Offline Reinforcement Learning via Tsallis Regularization.
Trans. Mach. Learn. Res., 2024

Mitigating Value Hallucination in Dyna-Style Planning via Multistep Predecessor Models.
J. Artif. Intell. Res., 2024

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers.
CoRR, 2024

PC-Gym: Benchmark Environments For Process Control Problems.
CoRR, 2024

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning.
CoRR, 2024

q-exponential family for policy optimization.
CoRR, 2024

The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning.
CoRR, 2024

A New View on Planning in Online Reinforcement Learning.
CoRR, 2024

Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL.
CoRR, 2024

Investigating the Histogram Loss in Regression.
CoRR, 2024

What to Do When Your Discrete Optimization Is the Size of a Neural Network?
CoRR, 2024

Compound Returns Reduce Variance in Reinforcement Learning.
CoRR, 2024

Investigating the properties of neural network representations in reinforcement learning.
Artif. Intell., 2024

Cross-environment Hyperparameter Tuning for Reinforcement Learning.
RLJ, 2024

Investigating the Interplay of Prioritized Replay and Generalization.
RLJ, 2024

Demystifying the Recency Heuristic in Temporal-Difference Learning.
RLJ, 2024

Position: Benchmarking is Limited in Reinforcement Learning Research.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Averaging n-step Returns Reduces Variance in Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Robust Losses for Learning Value Functions.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

Resmax: An Alternative Soft-Greedy Operator for Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks.
J. Mach. Learn. Res., 2023

Off-Policy Actor-Critic with Emphatic Weightings.
J. Mach. Learn. Res., 2023

Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning.
J. Artif. Intell. Res., 2023

When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
CoRR, 2023

Coagent Networks: Generalized and Scaled.
CoRR, 2023

Empirical Design in Reinforcement Learning.
CoRR, 2023

Online Real-Time Recurrent Learning Using Sparse Connections and Selective Learning.
CoRR, 2023

Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence.
CoRR, 2023

General Munchausen Reinforcement Learning with Tsallis Kullback-Leibler Divergence.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

The In-Sample Softmax for Offline Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Measuring and Mitigating Interference in Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023

Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL.
Trans. Mach. Learn. Res., 2022

Representation Alignment in Neural Networks.
Trans. Mach. Learn. Res., 2022

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning.
J. Mach. Learn. Res., 2022

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences.
J. Mach. Learn. Res., 2022

Goal-Space Planning with Subgoal Models.
CoRR, 2022

Understanding and mitigating the limitations of prioritized experience replay.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

A Temporal-Difference Approach to Policy Gradient Estimation.
Proceedings of the International Conference on Machine Learning, 2022

Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum.
Proceedings of the Tenth International Conference on Learning Representations, 2022

An Alternate Policy Gradient Estimator for Softmax Policies.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Sim2Real in Robotics and Automation: Applications and Challenges.
IEEE Trans Autom. Sci. Eng., 2021

General Value Function Networks.
J. Artif. Intell. Res., 2021

Understanding Feature Transfer Through Representation Alignment.
CoRR, 2021

Exploiting Action Impact Regularity and Partially Known Models for Offline Reinforcement Learning.
CoRR, 2021

Predictive Representation Learning for Language Modeling.
CoRR, 2021

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning.
CoRR, 2021

Scalable Online Recurrent Learning Using Columnar Neural Networks.
CoRR, 2021

Continual Auxiliary Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Structural Credit Assignment in Neural Networks using Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Adapting Behavior via Intrinsic Reward: A Survey and Empirical Study.
J. Artif. Intell. Res., 2020

Perspectives on Sim2Real Transfer for Robotics: A Summary of the R: SS 2020 Workshop.
CoRR, 2020

From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?
CoRR, 2020

Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities.
CoRR, 2020

Towards a practical measure of interference for reinforcement learning.
CoRR, 2020

Learning Causal Models Online.
CoRR, 2020

Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models.
CoRR, 2020

Maximizing Information Gain in Partially Observable Environments via Prediction Reward.
CoRR, 2020

An implicit function learning approach for parametric modal regression.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Towards Safe Policy Improvement for Non-Stationary MDPs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Gradient Temporal-Difference Learning with Regularized Corrections.
Proceedings of the 37th International Conference on Machine Learning, 2020

Optimizing for the Future in Non-Stationary MDPs.
Proceedings of the 37th International Conference on Machine Learning, 2020

Selective Dyna-Style Planning Under Limited Model Capacity.
Proceedings of the 37th International Conference on Machine Learning, 2020

Training Recurrent Neural Networks Online by Learning Explicit State Variables.
Proceedings of the 8th International Conference on Learning Representations, 2020

Maxmin Q-learning: Controlling the Estimation Bias of Q-learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

From Language to Language-ish: How Brain-Like is an LSTM's Representation of Atypical Language Stimuli?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Maximizing Information Gain in Partially Observable Environments via Prediction Rewards.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Is Fast Adaptation All You Need?
CoRR, 2019

Adapting Behaviour via Intrinsic Reward: A Survey and Empirical Study.
CoRR, 2019

Importance Resampling for Off-policy Prediction.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Meta-Learning Representations for Continual Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Macroscopic Brain Connectomes via Group-Sparse Factorization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Planning with Expectation Models.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Hill Climbing on Value Estimates for Search-control in Dyna.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Two-Timescale Networks for Nonlinear Value Function Approximation.
Proceedings of the 7th International Conference on Learning Representations, 2019

The Utility of Sparse Representations for Control in Reinforcement Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Meta-Descent for Online, Continual Prediction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling.
CoRR, 2018

The Barbados 2018 List of Open Issues in Continual Learning.
CoRR, 2018

Online Off-policy Prediction.
CoRR, 2018

Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces.
CoRR, 2018

General Value Function Networks.
CoRR, 2018

Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods.
CoRR, 2018

Comparing Direct and Indirect Temporal-Difference Methods for Estimating the Variance of the Return.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

High-confidence error estimates for learned value functions.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Supervised autoencoders: Improving generalization performance with unsupervised regularizers.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Context-dependent upper-confidence bounds for directed exploration.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

An Off-policy Policy Gradient Theorem Using Emphatic Weightings.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control.
Proceedings of the 35th International Conference on Machine Learning, 2018

Improving Regression Performance with Distributional Losses.
Proceedings of the 35th International Conference on Machine Learning, 2018

2017
Effective sketching methods for value function approximation.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Multi-view Matrix Factorization for Linear Dynamical System Estimation.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning Sparse Representations in Reinforcement Learning with Sparse Coding.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Unifying Task Specification in Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Adapting Kernel Representations Online Using Submodular Maximization.
Proceedings of the 34th International Conference on Machine Learning, 2017

Accelerated Gradient Temporal Difference Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Recovering True Classifier Performance in Positive-Unlabeled Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning.
J. Mach. Learn. Res., 2016

Global optimization of factor models using alternating minimization.
CoRR, 2016

Nonparametric semi-supervised learning of class proportions.
CoRR, 2016

Estimating the class prior and posterior from noisy positives and unlabeled data.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Incremental Truncated LSTD.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A Greedy Approach to Adapting the Trace Parameter for Temporal Difference Learning.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Investigating Practical Linear Temporal Difference Learning.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015
Emphatic Temporal-Difference Learning.
CoRR, 2015

Incremental Truncated LSTD.
CoRR, 2015

Scalable Metric Learning for Co-Embedding.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Optimal Estimation of Multivariate ARMA Models.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2013
Partition Tree Weighting.
Proceedings of the 2013 Data Compression Conference, 2013

2012
Generalized Optimal Reverse Prediction.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Off-Policy Actor-Critic
CoRR, 2012

Convex Multi-view Subspace Learning.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Linear Off-Policy Actor-Critic.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Relaxed Clipping: A Global Training Method for Robust Regression and Classification.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

2009
Learning a Value Analysis Tool for Agent Evaluation.
Proceedings of the IJCAI 2009, 2009

Optimal reverse prediction: a unified perspective on supervised, unsupervised and semi-supervised learning.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009


  Loading...