2022

Self-critiquing models for assisting human evaluators.

[DOI]

William Saunders

Catherine Yeh

CoRR, 2022

Training language models to follow instructions with human feedback.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

WebGPT: Browser-assisted question-answering with human feedback.

[DOI]

CoRR, 2021

Recursively Summarizing Books with Human Feedback.

[DOI]

CoRR, 2021

2020

Learning to summarize from human feedback.

[DOI]

CoRR, 2020

Learning to summarize with human feedback.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2018

Bayesian Inference of Regular Expressions from Human-Generated Example Strings.

[DOI]

Long Ouyang

CoRR, 2018

webppl-oed: A practical optimal experiment design system.

[DOI]

Long Ouyang

Michael Henry Tessler

Daniel Ly

Noah D. Goodman

Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2017

Pedagogical Learning.

[DOI]

Long Ouyang

Michael C. Frank

CoRR, 2017

2016

Practical optimal experiment design with probabilistic programs.

[DOI]

Long Ouyang

Michael Henry Tessler

Daniel Ly

Noah D. Goodman

CoRR, 2016

Fabular: regression formulas as probabilistic programming.

[DOI]

Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2016

2012

Semantic Coherence Facilitates Distributional Learning of Word Meanings.

[DOI]

Long Ouyang

Lera Boroditsky

Michael C. Frank

Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012