Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data.
CoRR, 2024
Should Users Trust Advanced AI Assistants? Justified Trust As a Function of Competence and Alignment.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024
Gaps in the Safety Evaluation of Generative AI.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024
The Code That Binds Us: Navigating the Appropriateness of Human-AI Assistant Relationships.
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024
All Too Human? Mapping and Mitigating the Risk from Anthropomorphic AI.
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024
Sociotechnical Safety Evaluation of Generative AI Systems.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Representation in AI Evaluations.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023
Manifestations of Xenophobia in AI Systems.
CoRR, 2022
A Human Rights-Based Approach to Responsible AI.
CoRR, 2022
In conversation with Artificial Intelligence: aligning language models with human values.
CoRR, 2022
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Taxonomy of Risks posed by Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022
Power to the People? Opportunities and Challenges for Participatory AI.
Proceedings of the Equity and Access in Algorithms, Mechanisms, and Optimization, 2022
Towards a Theory of Justice for Artificial Intelligence.
CoRR, 2021
Alignment of Language Agents.
CoRR, 2021
The Challenge of Value Alignment: from Fairer Algorithms to AI Safety.
CoRR, 2021
Modelling Cooperation in Network Games with Spatio-Temporal Complexity.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021
Artificial Intelligence, Values, and Alignment.
Minds Mach., 2020