Jacob Hilton

According to our database1, Jacob Hilton authored at least 17 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Obfuscated Activations Bypass LLM Latent-Space Defenses.
CoRR, 2024

Estimating the Probabilities of Rare Outputs in Language Models.
CoRR, 2024

Towards a Law of Iterated Expectations for Heuristic Estimators.
CoRR, 2024

Backdoor defense, learnability and obfuscation.
CoRR, 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

Scaling laws for single-agent reinforcement learning.
CoRR, 2023

Scaling Laws for Reward Model Overoptimization.
Proceedings of the International Conference on Machine Learning, 2023

2022
Teaching Models to Express Their Uncertainty in Words.
Trans. Mach. Learn. Res., 2022

Training language models to follow instructions with human feedback.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Batch size-invariance for policy optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

TruthfulQA: Measuring How Models Mimic Human Falsehoods.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
WebGPT: Browser-assisted question-answering with human feedback.
CoRR, 2021

Training Verifiers to Solve Math Word Problems.
CoRR, 2021

Phasic Policy Gradient.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark.
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Leveraging Procedural Generation to Benchmark Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2016
The Topological Pigeonhole Principle for Ordinals.
J. Symb. Log., 2016


  Loading...