Self-critiquing models for assisting human evaluators.
CoRR, 2022
Training language models to follow instructions with human feedback.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Recursively Summarizing Books with Human Feedback.
CoRR, 2021
Learning to summarize from human feedback.
CoRR, 2020
Learning to summarize with human feedback.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Bayesian Inference of Regular Expressions from Human-Generated Example Strings.
CoRR, 2018
webppl-oed: A practical optimal experiment design system.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018
Practical optimal experiment design with probabilistic programs.
CoRR, 2016
Fabular: regression formulas as probabilistic programming.
Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2016
Semantic Coherence Facilitates Distributional Learning of Word Meanings.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012