2024

The PRISM Alignment Project: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models.

[DOI]

Hannah Rose Kirk

,

Alexander Whitefield

,

,

,

Katerina Margatina

,

,

Rafael Mosquera

,

,

,

,

,

CoRR, 2024

The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models.

[DOI]

Hannah Rose Kirk

,

Alexander Whitefield

,

,

,

Katerina Margatina

,

Rafael Mosquera Gómez

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Adversarial Nibbler: An Open Red-Teaming Method for Identifying Diverse Harms in Text-to-Image Generation.

[DOI]

,

,

,

,

Hannah Rose Kirk

,

,

,

,

,

,

,

Rafael Mosquera

,

,

Vijay Janapa Reddi

,

Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

2023

Speech Wikimedia: A 77 Language Multilingual Speech Dataset.

[DOI]

Rafael Mosquera Gómez

,

,

,

,

,

Kurt D. Bollacker

,

CoRR, 2023

Adversarial Nibbler: A Data-Centric Challenge for Improving the Safety of Text-to-Image Models.

[DOI]

,

Hannah Rose Kirk

,

,

,

,

,

,

Rafael Mosquera

,

,

,

,

Vijay Janapa Reddi

,

CoRR, 2023

DataPerf: Benchmarks for Data-Centric AI Development.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

DataPerf: Benchmarks for Data-Centric AI Development.

[DOI]

CoRR, 2022

2021

LSH methods for data deduplication in a Wikipedia artificial dataset.

[DOI]

,

,

,

CoRR, 2021

The People's Speech: A Large-Scale Diverse English Speech Recognition Dataset for Commercial Usage.

[DOI]

,

,

,

Juan Felipe Cerón

,

,

,

,

,

,

Vijay Janapa Reddi

CoRR, 2021

Multilingual Spoken Words Corpus.

[DOI]

,

Sharad Chitlangia

,

Colby R. Banbury

,

,

,

,

,

,

,

,

,

,

,

Vijay Janapa Reddi

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021