Nicholas Joseph
According to our database, Nicholas Joseph authored at least 20 papers between 2021 and 2023.
[Timeline chart: papers per year, 2021 to 2023. Legend: Book, In proceedings, Article, PhD thesis, Dataset, Other]
Bibliography
2023
CoRR, 2023
Towards Measuring the Representation of Subjective Global Opinions in Language Models.
CoRR, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned.
CoRR, 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.
CoRR, 2022
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022
2021