Joar Skalse

According to our database, Joar Skalse has authored at least 18 papers between 2019 and 2024.

Bibliography

2024
The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret.
CoRR, 2024

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems.
CoRR, 2024

On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

STARC: A General Framework For Quantifying Differences Between Reward Functions.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Goodhart's Law in Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
On the limitations of Markovian rewards to express multi-objective, risk-sensitive, and modal tasks.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2023

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning.
Proceedings of the International Conference on Machine Learning, 2023

Misspecification in Inverse Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Defining and Characterizing Reward Hacking.
CoRR, 2022

Defining and Characterizing Reward Gaming.
Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022

Lexicographic Multi-Objective Reinforcement Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021
Is SGD a Bayesian sampler? Well, almost.
J. Mach. Learn. Res., 2021

A General Counterexample to Any Decision Theory and Some Responses.
CoRR, 2021

Reinforcement Learning in Newcomblike Environments.
Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021

Safety Properties of Inductive Logic Programming.
Proceedings of the Workshop on Artificial Intelligence Safety 2021 (SafeAI 2021) co-located with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021

2019
Neural networks are a priori biased towards Boolean functions with low entropy.
CoRR, 2019

Risks from Learned Optimization in Advanced Machine Learning Systems.
CoRR, 2019
