Zachary Kenton

According to our database1, Zachary Kenton authored at least 20 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
On scalable oversight with weak LLMs judging strong LLMs.
CoRR, 2024

The Ethics of Advanced AI Assistants.
CoRR, 2024

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI.
CoRR, 2024

Discovering Agents (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Discovering agents.
Artif. Intell., September, 2023

Challenges with unsupervised LLM knowledge discovery.
CoRR, 2023

Explaining grokking through circuit efficiency.
CoRR, 2023

2022
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals.
CoRR, 2022

Safe Deep RL in 3D Environments using Human Feedback.
CoRR, 2022

Taxonomy of Risks posed by Language Models.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

2021
Ethical and social risks of harm from Language Models.
CoRR, 2021

Alignment of Language Agents.
CoRR, 2021

2020
Imitating Interactive Intelligence.
CoRR, 2020

2019
A Systematic Comparison of Bayesian Deep Learning Robustness in Diabetic Retinopathy Tasks.
CoRR, 2019

Generalizing from a few environments in safety-critical reinforcement learning.
CoRR, 2019

On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
DNN's Sharpest Directions Along the SGD Trajectory.
CoRR, 2018

Finding Flatter Minima with SGD.
Proceedings of the 6th International Conference on Learning Representations, 2018

Width of Minima Reached by Stochastic Gradient Descent is Influenced by Learning Rate to Batch Size Ratio.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

2017
Three Factors Influencing Minima in SGD.
CoRR, 2017


  Loading...