We stand with Ukraine

We stand with Ukraine

James Zou

Orcid: 0000-0001-8880-4764

Affiliations:

Stanford University, Department of Electrical Engineering, CA, USA
Harvard University, School of Engineering and Applied Sciences, Cambridge, MA, USA

According to our database¹, James Zou authored at least 238 papers between 2010 and 2024.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org
on people.fas.harvard.edu

On csauthors.net:

Bibliography

2024

Provable Membership Inference Privacy.

[BibT_eX]

[DOI]

,

,

Sercan Ö. Arik

,

Trans. Mach. Learn. Res., 2024

Author Correction: Bridging the literacy gap for surgical consents: an AI-human expert collaborative approach.

[BibT_eX]

[DOI]

,

Ian D. Connolly

,

,

Fatima N. Mirza

,

Benjamin Johnston

,

Hael F. Abdulrazeq

,

,

Paul F. Galamaga

,

Tiffany J. Libby

,

,

Michael W. Groff

,

Ziya L. Gokaslan

,

Albert E. Telfeian

,

,

,

,

Curtis E. Doberstein

npj Digit. Medicine, 2024

Bridging the literacy gap for surgical consents: an AI-human expert collaborative approach.

[BibT_eX]

[DOI]

,

Ian D. Connolly

,

,

Fatima N. Mirza

,

Benjamin Johnston

,

Hael F. Abdulrazeq

,

Paul F. Galamaga

,

Tiffany J. Libby

,

,

Michael W. Groff

,

Ziya L. Gokaslan

,

Albert E. Telfeian

,

,

,

,

Curtis E. Doberstein

npj Digit. Medicine, 2024

Generative AI for designing and validating easily synthesizable and structurally novel antibiotics.

[BibT_eX]

[DOI]

,

,

Denise B. Catacutan

,

,

,

Jonathan M. Stokes

Nat. Mac. Intell., 2024

Systematic analysis of 32,111 AI model cards characterizes documentation practice in AI.

[BibT_eX]

[DOI]

,

,

,

Ezinwanne Ozoani

,

,

,

Daniel Scott Smith

,

Nat. Mac. Intell., 2024

Belief in the Machine: Investigating Epistemological Blind Spots of Language Models.

[BibT_eX]

[DOI]

,

,

Federico Bianchi

,

,

,

,

CoRR, 2024

Reducing Hallucinations in Vision-Language Models via Latent Space Steering.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Locality Alignment Improves Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

Tatsunori Hashimoto

CoRR, 2024

Self-rationalization improves LLM as a fine-grained judge.

[BibT_eX]

[DOI]

,

,

Oliver Molenschot

,

Meghana Arakkal Rajeev

,

Rajkumar Ramamurthy

,

,

Tanveesh Singh Chaudhery

,

Jahnavi Jambholkar

,

,

CoRR, 2024

TFG: Unified Training-Free Guidance for Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Regulating AI Adaptation: An Analysis of AI Medical Device Updates.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models.

[BibT_eX]

[DOI]

,

,

,

Xiangliang Zhang

,

,

CoRR, 2024

Automated radiotherapy treatment planning guided by GPT-4Vision.

[BibT_eX]

[DOI]

,

Oscar Pastor-Serrano

,

,

Matthew Gopaulchan

,

,

Mark Buyyounouski

,

,

,

Michael Gensheimer

,

,

,

,

CoRR, 2024

AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

Michihiro Yasunaga

,

,

Vassilis N. Ioannidis

,

Karthik Subbian

,

,

CoRR, 2024

TextGrad: Automatic "Differentiation" via Text.

[BibT_eX]

[DOI]

Mert Yüksekgönül

,

Federico Bianchi

,

,

,

,

Carlos Guestrin

,

CoRR, 2024

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Marc Niethammer

,

,

,

,

,

,

,

,

CoRR, 2024

Truthful Dataset Valuation by Pointwise Mutual Information.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Accelerating Transformers with Spectrum-Preserving Token Merging.

[BibT_eX]

[DOI]

,

Duy M. H. Nguyen

,

,

TrungTin Nguyen

,

,

,

,

,

,

Mathias Niepert

CoRR, 2024

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases.

[BibT_eX]

[DOI]

,

,

Michihiro Yasunaga

,

,

,

,

Vassilis N. Ioannidis

,

Karthik Subbian

,

,

CoRR, 2024

Optimizing Calibration by Gaining Aware of Prediction Correctness.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Mapping the Increasing Use of LLMs in Scientific Papers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Christopher Potts

,

Christopher D. Manning

,

CoRR, 2024

Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems.

[BibT_eX]

[DOI]

,

Jared Quincy Davis

,

,

,

,

,

CoRR, 2024

Simple linear attention language models balance the recall-throughput tradeoff.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Christopher Ré

CoRR, 2024

Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content.

[BibT_eX]

[DOI]

Federico Bianchi

,

CoRR, 2024

What's documented in AI? Systematic Analysis of 32K AI Model Cards.

[BibT_eX]

[DOI]

,

,

,

Ezinwanne Ozoani

,

,

,

Daniel Scott Smith

,

CoRR, 2024

How well do LLMs cite relevant medical references? An evaluation framework and analyses.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Patricia Shi Riantawan

,

,

CoRR, 2024

Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution.

[BibT_eX]

[DOI]

,

,

,

,

Tatsunori Hashimoto

CoRR, 2024

Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on Hugging Face.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

TrustLLM: Trustworthiness in Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Bhavya Kailkhura

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

John C. Mitchell

,

,

,

,

,

,

,

Neil Zhenqiang Gong

,

,

,

,

,

,

,

,

,

,

,

,

,

Xiangliang Zhang

,

,

,

,

,

,

,

,

CoRR, 2024

Can AI Be as Creative as Humans?

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Michael Qizhe Xie

,

,

Kenji Kawaguchi

CoRR, 2024

ADMET-AI: a machine learning ADMET platform for evaluation of large-scale chemical libraries.

[BibT_eX]

[DOI]

,

,

,

Souhrid Mukherjee

,

,

Rabindra V. Shivnaraine

,

Bioinform., 2024

Learning and Forgetting Unsafe Examples in Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits.

[BibT_eX]

[DOI]

Jiachen T. Wang

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations.

[BibT_eX]

[DOI]

Kailas Vodrahalli

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals.

[BibT_eX]

[DOI]

,

,

Magnus Ruud Kjær

,

Hyatt E. Moore IV

,

,

Emmanuel Mignot

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Prospector Heads: Generalized Feature Attribution for Large Models & Data.

[BibT_eX]

[DOI]

Gautam Machiraju

,

Alexander Derry

,

,

,

Amir-Hossein Karimi

,

,

,

Christopher Ré

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Selecting Large Language Model to Fine-tune via Rectified Scaling Law.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Daniel A. McFarland

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Position: TrustLLM: Trustworthiness in Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Bhavya Kailkhura

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Joaquin Vanschoren

,

John C. Mitchell

,

,

,

,

,

,

,

Neil Zhenqiang Gong

,

,

,

,

,

,

,

,

,

,

,

,

,

Xiangliang Zhang

,

,

,

,

,

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scaling Laws for the Value of Individual Data Points in Machine Learning.

[BibT_eX]

[DOI]

Ian Connick Covert

,

,

Tatsunori Hashimoto

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Simple linear attention language models balance the recall-throughput tradeoff.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Christopher Ré

Proceedings of the Forty-first International Conference on Machine Learning, 2024

How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis.

[BibT_eX]

[DOI]

Federico Bianchi

,

Patrick John Chia

,

Mert Yüksekgönül

,

Jacopo Tagliabue

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on HuggingFace.

[BibT_eX]

[DOI]

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Zoology: Measuring and Improving Recall in Efficient Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Christopher Ré

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions.

[BibT_eX]

[DOI]

Federico Bianchi

,

,

Giuseppe Attanasio

,

,

,

Tatsunori Hashimoto

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Model ChangeLists: Characterizing Updates to ML Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Christopher Ré

,

Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

2023

A clinically applicable AI system for diagnosis of congenital heart diseases based on computed tomography images.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Medical Image Anal., December, 2023

GPT detectors are biased against non-native English writers.

[BibT_eX]

[DOI]

,

Mert Yüksekgönül

,

,

,

Patterns, July, 2023

Machine learning modeling of RNA structures: methods, challenges and future perspectives.

[BibT_eX]

[DOI]

,

,

Briefings Bioinform., July, 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Aarohi Srivastava

,

Abhinav Rastogi

,

,

Abu Awal Md Shoeb

,

,

,

,

,

,

Adrià Garriga-Alonso

,

Agnieszka Kluska

,

Aitor Lewkowycz

,

,

,

,

,

Alexander W. Kocurek

,

,

,

,

,

,

,

,

,

,

,

Anantharaman S. Iyer

,

Anders Andreassen

,

,

Andrea Santilli

,

Andreas Stuhlmüller

,

,

,

Andrew K. Lampinen

,

,

,

,

,

,

,

Antonio Norelli

,

,

Arash Gholamidavoodi

,

,

,

Arun Kirubarajan

,

Asher Mullokandov

,

Ashish Sabharwal

,

,

,

,

,

B. Ryan Roberts

,

,

,

Bartlomiej Bojanowski

,

Batuhan Özyurt

,

Behnam Hedayatnia

,

Behnam Neyshabur

,

,

,

,

Bill Yuchen Lin

,

,

,

,

,

Catherine Stinson

,

Cedrick Argueta

,

Cèsar Ferri Ramírez

,

,

Charles Rathkopf

,

,

,

,

Chris Callison-Burch

,

,

Christian Voigt

,

Christopher D. Manning

,

Christopher Potts

,

,

Clara E. Rivera

,

,

,

Courtney Ashcraft

,

Cristina Garbacea

,

,

,

,

,

,

,

Daniel Khashabi

,

,

Daniel Moseguí González

,

Danielle Perszyk

,

Danny Hernandez

,

,

Daphne Ippolito

,

,

,

,

,

Debajyoti Datta

,

,

,

,

,

,

,

,

,

,

Dimitri Coelho Mollo

,

,

,

,

Ekaterina Shutova

,

Ekin Dogus Cubuk

,

,

Eleanor Hagerman

,

Elizabeth Barnes

,

Elizabeth Donoway

,

,

Emanuele Rodolà

,

,

,

,

,

,

,

,

Ethan J. Jerzak

,

,

Eunice Engefu Manyasi

,

Evgenii Zheltonozhskii

,

,

,

Fernando Martínez-Plumed

,

Francesca Happé

,

François Chollet

,

,

,

Genta Indra Winata

,

,

Germán Kruszewski

,

Giambattista Parascandolo

,

Giorgio Mariani

,

,

Gonzalo Jaimovitch-López

,

,

,

Hana Galijasevic

,

,

,

Hannaneh Hajishirzi

,

,

,

,

Hinrich Schütze

,

,

,

,

,

,

,

Jack Geissinger

,

Jackson Kernion

,

,

,

Jaime Fernández Fisac

,

,

,

,

,

,

,

Janelle Wingfield

,

,

,

Jascha Sohl-Dickstein

,

,

,

,

Jekaterina Novikova

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Jonathan Batchelder

,

Jonathan Berant

,

,

,

José Hernández-Orallo

,

Joseph Boudeman

,

,

,

Joshua B. Tenenbaum

,

,

,

,

,

,

Karthik Gopalakrishnan

,

Katerina Ignatyeva

,

,

Kaustubh D. Dhole

,

,

,

,

Kristen Chiafullo

,

Ksenia Shkaruta

,

,

,

Kyle Richardson

,

,

,

,

,

,

Lidia Contreras Ochando

,

Louis-Philippe Morency

,

,

,

,

,

,

Luis Oliveros Colón

,

,

Lütfi Kerem Senel

,

,

,

Maartje ter Hoeve

,

,

,

,

,

,

,

María José Ramírez-Quintana

,

,

Mario Giulianelli

,

,

Martin Potthast

,

Matthew L. Leavitt

,

,

Mátyás Schubert

,

Medina Baitemirova

,

,

Melvin McElrath

,

,

,

,

Michael I. Ivanitskiy

,

Michael Starritt

,

,

Michal Swedrowski

,

Michele Bevilacqua

,

Michihiro Yasunaga

,

,

,

,

,

,

,

,

Moin Aminnaseri

,

,

,

Mukund Varma T.

,

,

,

,

Neta Gur-Ari Krakover

,

Nicholas Cameron

,

Nicholas Roberts

,

,

Nicole Martinez

,

,

,

Niklas Muennighoff

,

Nitish Shirish Keskar

,

,

,

,

,

,

,

Omar Elbaghdadi

,

,

,

Pablo Antonio Moreno Casares

,

,

,

,

,

Pegah Alipoormolabashi

,

,

,

,

Peter Eckersley

,

,

,

Piotr Milkowski

,

,

Pouya Pezeshkpour

,

,

,

,

,

,

Rachel Etta Rudolph

,

,

,

,

Raphaël Millière

,

,

,

,

,

Robbe Raymaekers

,

,

,

,

,

,

,

,

,

Ruslan Salakhutdinov

,

,

,

,

,

,

,

Saif M. Mohammad

,

,

,

,

,

Samuel Gruetter

,

Samuel R. Bowman

,

Samuel S. Schoenholz

,

,

,

,

Sarik Ghazarian

,

,

,

Sebastian Bischoff

,

Sebastian Gehrmann

,

Sebastian Schuster

,

Sepideh Sadeghi

,

,

,

Shashank Srivastava

,

,

,

,

Shixiang Shane Gu

,

Shubh Pachchigar

,

Shubham Toshniwal

,

,

Shyamolima (Shammie) Debnath

,

,

Simon Thormeyer

,

,

,

Sneha Priscilla Makini

,

,

,

Sriharsha Hatwar

,

Stanislas Dehaene

,

,

,

Stella Biderman

,

,

,

Steven T. Piantadosi

,

Stuart M. Shieber

,

Summer Misherghi

,

Svetlana Kiritchenko

,

,

,

,

,

,

,

Tatsu Hashimoto

,

,

Théo Desbordes

,

Theodore Rothschild

,

,

,

Tiberius Nkinyili

,

,

,

,

Tobias Gerstenberg

,

,

Trishala Neeraj

,

,

,

,

,

,

Victoria Nyamai

,

,

Vinay V. Ramasesh

,

Vinay Uday Prabhu

,

Vishakh Padmakumar

,

,

,

William Saunders

,

,

,

,

,

,

,

,

Yadollah Yaghoobzadeh

,

,

,

,

,

,

,

,

Yonatan Belinkov

,

,

,

,

,

,

,

,

,

Trans. Mach. Learn. Res., 2023

Skin Tone Analysis for Representation in Educational Materials (STAR-ED) using machine learning.

[BibT_eX]

[DOI]

Girmaw Abebe Tadesse

,

,

Kush R. Varshney

,

Peter W. J. Staar

,

Chinyere Agunwa

,

Skyler Speakman

,

,

Elizabeth E. Bailey

,

Ademide Adelekun

,

,

Ginikanwa Onyekaba

,

Jenna C. Lester

,

Veronica Rotemberg

,

,

Roxana Daneshjou

npj Digit. Medicine, 2023

A deep learning-based electrocardiogram risk score for long term cardiovascular death and disease.

[BibT_eX]

[DOI]

J. Weston Hughes

,

James E. Tooley

,

Jessica Torres Soto

,

Anna Ostropolets

,

,

Matthew Christensen

,

,

,

Dhamanpreet Kaur

,

,

Albert J. Rogers

,

Sanjiv M. Narayan

,

,

,

,

,

npj Digit. Medicine, 2023

Author Correction: Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.

[BibT_eX]

[DOI]

npj Digit. Medicine, 2023

Dynamic visualization of high-dimensional data.

[BibT_eX]

[DOI]

,

,

Nat. Comput. Sci., 2023

The Power of Contrast for Feature Learning: A Theoretical Analysis.

[BibT_eX]

[DOI]

,

,

,

,

J. Mach. Learn. Res., 2023

GraphMETRO: Mitigating Complex Distribution Shifts in GNNs via Mixture of Aligned Experts.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

ChatGPT Exhibits Gender and Racial Biases in Acute Coronary Syndrome Management.

[BibT_eX]

[DOI]

,

Mert Yüksekgönül

,

,

,

CoRR, 2023

Data Acquisition: A New Frontier in Data-centric AI.

[BibT_eX]

[DOI]

,

,

Newsha Ardalani

,

,

,

,

,

,

,

,

CoRR, 2023

DMLR: Data-centric Machine Learning Research - Past, Present and Future.

[BibT_eX]

[DOI]

CoRR, 2023

In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Kailas Vodrahalli

,

,

Daniel Scott Smith

,

,

Daniel A. McFarland

,

CoRR, 2023

Large language models in medicine: the potentials and pitfalls.

[BibT_eX]

[DOI]

Jesutofunmi A. Omiye

,

,

Shawheen J. Rezaei

,

,

Roxana Daneshjou

CoRR, 2023

Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

How is ChatGPT's behavior changing over time?

[BibT_eX]

[DOI]

,

,

CoRR, 2023

What Should Data Science Education Do with Large Language Models?

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy.

[BibT_eX]

[DOI]

,

,

,

,

Louis-Philippe Morency

,

Ruslan Salakhutdinov

CoRR, 2023

FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Last-Layer Fairness Fine-tuning is Simple and Effective for Neural Networks.

[BibT_eX]

[DOI]

,

,

,

,

Kenji Kawaguchi

,

CoRR, 2023

SkinCon: A skin disease dataset densely annotated by domain experts for fine-grained model debugging and analysis.

[BibT_eX]

[DOI]

Roxana Daneshjou

,

Mert Yüksekgönül

,

,

Roberto A. Novoa

,

CoRR, 2023

Beyond Confidence: Reliable Models Should Also Consider Atypicality.

[BibT_eX]

[DOI]

Mert Yüksekgönül

,

,

,

Carlos Guestrin

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DataPerf: Benchmarks for Data-Centric AI Development.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy.

[BibT_eX]

[DOI]

,

,

,

,

Louis-Philippe Morency

,

Ruslan Salakhutdinov

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

OpenDataVal: a Unified Benchmark for Data Valuation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TWIGMA: A dataset of AI-Generated Images with Metadata From Twitter.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Multi-Granularity Approach to Similarity Search in Multiplexed Immunofluorescence Images.

[BibT_eX]

[DOI]

,

,

,

Alexandro Trevino

,

Proceedings of the Machine Learning in Computational Biology, November 30, 2023

TCR-BERT: learning the grammar of T-cell receptors for flexible antigen-binding analyses.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Ansuman Satpathy

,

,

Proceedings of the Machine Learning in Computational Biology, November 30, 2023

Discover and Cure: Concept-aware Mitigation of Spurious Correlation.

[BibT_eX]

[DOI]

,

Mert Yüksekgönül

,

,

Proceedings of the International Conference on Machine Learning, 2023

Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value.

[BibT_eX]

[DOI]

,

Proceedings of the International Conference on Machine Learning, 2023

Data-Driven Subgroup Identification for Linear Regression.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Machine Learning, 2023

Diagnosing and Rectifying Vision Models using Language.

[BibT_eX]

[DOI]

,

Jeff Z. HaoChen

,

Shih-Cheng Huang

,

Kuan-Chieh Wang

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Post-hoc Concept Bottleneck Models.

[BibT_eX]

[DOI]

Mert Yüksekgönül

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It?

[BibT_eX]

[DOI]

Mert Yüksekgönül

,

Federico Bianchi

,

Pratyusha Kalluri

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

FaiREE: fair classification with finite-sample and distribution-free guarantee.

[BibT_eX]

[DOI]

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale.

[BibT_eX]

[DOI]

Federico Bianchi

,

Pratyusha Kalluri

,

,

,

,

,

Tatsunori Hashimoto

,

,

,

Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

Collecting data when missingness is unknown: a method for improving model performance given under-reporting in patient populations.

[BibT_eX]

[DOI]

,

,

Christopher Hane

,

,

Proceedings of the Conference on Health, Inference, and Learning, 2023

Understanding and Predicting the Effect of Environmental Factors on People with Type 2 Diabetes.

[BibT_eX]

[DOI]

Kailas Vodrahalli

,

Gregory D. Lyng

,

,

Kimmo Kärkkäinen

,

Jeffrey Hertzberg

,

,

Proceedings of the Conference on Health, Inference, and Learning, 2023

Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data.

[BibT_eX]

[DOI]

,

Halil Ibrahim Gulluk

,

,

,

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models.

[BibT_eX]

[DOI]

,

Michihiro Yasunaga

,

,

Jeff Z. HaoChen

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

HAPI Explorer: Comprehension, Discovery, and Explanation on History of ML APIs.

[BibT_eX]

[DOI]

,

,

,

,

Christopher Ré

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Author Correction: Advances, challenges and opportunities in creating data for trustworthy AI.

[BibT_eX]

[DOI]

,

Girmaw Abebe Tadesse

,

,

,

,

,

Nat. Mac. Intell., October, 2022

Competition over data: how does data purchase affect users?

[BibT_eX]

[DOI]

,

,

Trans. Mach. Learn. Res., 2022

Systematic analysis of 50 years of Stanford University technology transfer and commercialization.

[BibT_eX]

[DOI]

,

,

Daniel A. McFarland

,

Patterns, 2022

Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.

[BibT_eX]

[DOI]

npj Digit. Medicine, 2022

Advances, challenges and opportunities in creating data for trustworthy AI.

[BibT_eX]

[DOI]

,

Girmaw Abebe Tadesse

,

,

,

,

,

Nat. Mach. Intell., 2022

AI reflections in 2021.

[BibT_eX]

[DOI]

Cameron Buckner

,

Risto Miikkulainen

,

Stephanie Forrest

,

,

,

,

Christopher Irrgang

,

,

,

Robin R. Murphy

,

Russell H. Taylor

,

,

,

Jathan Sadowski

,

Nat. Mach. Intell., 2022

Beyond Importance Scores: Interpreting Tabular ML by Visualizing Feature Semantics.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

,

,

,

Inf., 2022

A Spectral Method for Assessing and Combining Multiple Data Visualizations.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

SEAL : Interactive Tool for Systematic Error Analysis and Labeling.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

Knowledge-Driven New Drug Recommendation.

[BibT_eX]

[DOI]

,

,

,

David M. Liebovitz

,

,

,

,

CoRR, 2022

Data Budgeting for Machine Learning.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

Protein structure generation via folding diffusion.

[BibT_eX]

[DOI]

,

,

Rianne van den Berg

,

,

,

CoRR, 2022

Development and Clinical Evaluation of an AI Support Tool for Improving Telemedicine Photo Quality.

[BibT_eX]

[DOI]

Kailas Vodrahalli

,

,

Albert S. Chiou

,

Roberto A. Novoa

,

,

,

,

,

,

Roxana Daneshjou

CoRR, 2022

DataPerf: Benchmarks for Data-Centric AI Development.

[BibT_eX]

[DOI]

CoRR, 2022

GSCLIP : A Framework for Explaining Distribution Shifts in Natural Language.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

A Unified f-divergence Framework Generalizing VAE and GAN.

[BibT_eX]

[DOI]

Jaime Roquero Gimenez

,

CoRR, 2022

Improving genetic risk prediction across diverse population by disentangling ancestry representations.

[BibT_eX]

[DOI]

Prashnna K. Gyawali

,

,

,

,

,

CoRR, 2022

Electrocardiographic Deep Learning for Predicting Post-Procedural Mortality.

[BibT_eX]

[DOI]

CoRR, 2022

Disparities in Dermatology AI Performance on a Diverse, Curated Clinical Image Set.

[BibT_eX]

[DOI]

Roxana Daneshjou

,

Kailas Vodrahalli

,

Roberto A. Novoa

,

Melissa Jenkins

,

,

Veronica Rotemberg

,

,

Susan M. Swetter

,

Elizabeth E. Bailey

,

Olivier Gevaert

,

Pritam Mukherjee

,

,

,

,

Rachna Sahasrabudhe

,

Johan A. C. Allerup

,

Utako Okata-Karigane

,

,

CoRR, 2022

Submix: Practical Private Prediction for Large-Scale Language Models.

[BibT_eX]

[DOI]

,

Laurens van der Maaten

,

,

CoRR, 2022

C-Mixup: Improving Generalization in Regression.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Uncalibrated Models Can Improve Human-AI Collaboration.

[BibT_eX]

[DOI]

Kailas Vodrahalli

,

Tobias Gerstenberg

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

WeightedSHAP: analyzing and improving Shapley based feature attributions.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SkinCon: A skin disease dataset densely annotated by domain experts for fine-grained debugging and analysis.

[BibT_eX]

[DOI]

Roxana Daneshjou

,

Mert Yüksekgönül

,

,

Roberto A. Novoa

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Estimating and Explaining Model Performance When Both Covariates and Labels Shift.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions.

[BibT_eX]

[DOI]

,

,

,

Christopher Ré

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Predicting Immune Escape with Pretrained Protein Language Model Embeddings.

[BibT_eX]

[DOI]

,

,

Proceedings of the Machine Learning in Computational Biology, 21-22 November 2022, Online, 2022

Ensembling improves stability and power of feature selection for deep learning models.

[BibT_eX]

[DOI]

Prashnna K. Gyawali

,

,

,

Proceedings of the Machine Learning in Computational Biology, 21-22 November 2022, Online, 2022

When and How Mixup Improves Calibration.

[BibT_eX]

[DOI]

,

,

Kenji Kawaguchi

,

Proceedings of the International Conference on Machine Learning, 2022

Improving Out-of-Distribution Robustness via Selective Augmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Efficient Online ML API Selection for Multi-Label Classification Tasks.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Machine Learning, 2022

Meaningfully debugging model mistakes using conceptual counterfactual explanations.

[BibT_eX]

[DOI]

,

Mert Yüksekgönül

,

Proceedings of the International Conference on Machine Learning, 2022

MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts.

[BibT_eX]

[DOI]

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Domino: Discovering Systematic Errors with Cross-Modal Embeddings.

[BibT_eX]

[DOI]

,

,

Khaled Kamal Saab

,

Jean-Benoit Delbrouck

,

Christopher Lee-Messer

,

,

,

Christopher Ré

Proceedings of the Tenth International Conference on Learning Representations, 2022

How Did the Model Change? Efficiently Assessing Machine Learning API Shifts.

[BibT_eX]

[DOI]

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

SEAL: Interactive Tool for Systematic Error Analysis and Labeling.

[BibT_eX]

[DOI]

,

,

,

Margaret Mitchell

,

Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

dcbench: a benchmark for data-centric AI systems.

[BibT_eX]

[DOI]

,

,

Christopher Ré

,

,

Proceedings of the DEEM '22: Proceedings of the Sixth Workshop on Data Management for End-To-End Machine Learning Philadelphia, 2022

Clustering Plotted Data by Image Segmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning.

[BibT_eX]

[DOI]

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

How to Learn when Data Gradually Reacts to Your Model.

[BibT_eX]

[DOI]

,

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

MLDemon: Deployment Monitoring for Machine Learning Systems.

[BibT_eX]

[DOI]

,

Martin Jinye Zhang

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Do Humans Trust Advice More if it Comes from AI?: An Analysis of Human-AI Interactions.

[BibT_eX]

[DOI]

Kailas Vodrahalli

,

Roxana Daneshjou

,

Tobias Gerstenberg

,

Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

Data Sculpting: Interpretable Algorithm for End-to-End Cohort Selection.

[BibT_eX]

[DOI]

,

Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022

Data Shapley Valuation for Efficient Batch Active Learning.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

,

Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022

Grading of Prostate Whole-slide Images Using Weak Self-supervised Learning.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

,

Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022

2021

Large language models associate Muslims with violence.

[BibT_eX]

[DOI]

,

,

Nat. Mach. Intell., 2021

Patient Experience Surveys Reveal Gender-Biased Descriptions of Their Care Providers.

[BibT_eX]

[DOI]

,

,

Christina Topham

,

Kathryn Schwarzenberger

,

,

,

Teri M. Greiling

J. Medical Syst., 2021

Explaining medical AI performance disparities across sites with confounder Shapley value analysis.

[BibT_eX]

[DOI]

,

,

CoRR, 2021

Disparities in Dermatology AI: Assessments Using Diverse Clinical Images.

[BibT_eX]

[DOI]

Roxana Daneshjou

,

Kailas Vodrahalli

,

,

Roberto A. Novoa

,

Melissa Jenkins

,

Veronica Rotemberg

,

,

Susan M. Swetter

,

Elizabeth E. Bailey

,

Olivier Gevaert

,

Pritam Mukherjee

,

,

,

,

Rachna Sahasrabudhe

,

,

CoRR, 2021

Did the Model Change? Efficiently Assessing Machine Learning API Shifts.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2021

Do Humans Trust Advice More if it Comes from AI? An Analysis of Human-AI Interactions.

[BibT_eX]

[DOI]

Kailas Vodrahalli

,

Tobias Gerstenberg

,

CoRR, 2021

Meaningfully Explaining a Model's Mistakes.

[BibT_eX]

[DOI]

,

CoRR, 2021

High-Throughput Precision Phenotyping of Left Ventricular Hypertrophy with Cardiovascular Deep Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Matthew J. Shun-Shin

,

Kevin M. Alexander

,

,

Matthew P. Lungren

,

,

,

Ingela Schnittger

,

,

,

,

Ronald Witteles

,

,

CoRR, 2021

Group-Structured Adversarial Training.

[BibT_eX]

[DOI]

,

Amirali Aghazadeh

,

,

CoRR, 2021

FrugalMCT: Efficient Online ML API Selection for Multi-Label Classification Tasks.

[BibT_eX]

[DOI]

,

,

CoRR, 2021

TrueImage: A Machine Learning Algorithm to Improve the Quality of Telehealth Photos.

[BibT_eX]

[DOI]

Kailas Vodrahalli

,

Roxana Daneshjou

,

Roberto A. Novoa

,

,

,

Proceedings of the Biocomputing 2021: Proceedings of the Pacific Symposium, 2021

Adversarial Training Helps Transfer Learning via Better Representations.

[BibT_eX]

[DOI]

,

,

Kailas Vodrahalli

,

Kenji Kawaguchi

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Group Testing to Accelerate Deep Learning.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE International Symposium on Information Theory, 2021

Mixed Dimension Embeddings with Application to Memory-Efficient Recommendation Systems.

[BibT_eX]

[DOI]

Antonio A. Ginart

,

,

Dheevatsa Mudigere

,

,

Proceedings of the IEEE International Symposium on Information Theory, 2021

Improving Generalization in Meta-learning via Task Augmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

How to Learn when Data Reacts to Your Model: Performative Gradient Descent.

[BibT_eX]

[DOI]

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

How Does Mixup Help With Robustness and Generalization?

[BibT_eX]

[DOI]

,

,

Kenji Kawaguchi

,

Amirata Ghorbani

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Racial Representation Analysis in Dermatology Academic Materials.

[BibT_eX]

[DOI]

Girmaw Abebe Tadesse

,

,

Roxana Daneshjou

,

Kush R. Varshney

,

Peter W. J. Staar

,

Skyler Speakman

,

,

Chinyere Agunwa

,

,

Elizabeth E. Bailey

,

,

Ginikanwa Onyekaba

,

Veronica Rotemberg

,

Ademide Adelekun

,

Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

Efficient Computation and Analysis of Distributional Shapley Values.

[BibT_eX]

[DOI]

,

Manuel A. Rivas

,

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Approximate Data Deletion from Machine Learning Models.

[BibT_eX]

[DOI]

,

Mary Anne Smart

,

Kamalika Chaudhuri

,

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Competing AI: How does competition feedback affect machine learning?

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Improving Adversarial Robustness via Unlabeled Out-of-Domain Data.

[BibT_eX]

[DOI]

,

,

Amirata Ghorbani

,

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Who's Responsible? Jointly Quantifying the Contribution of the Learning Algorithm and Data.

[BibT_eX]

[DOI]

,

Amirata Ghorbani

,

Proceedings of the AIES '21: AAAI/ACM Conference on AI, 2021

Persistent Anti-Muslim Bias in Large Language Models.

[BibT_eX]

[DOI]

,

,

Proceedings of the AIES '21: AAAI/ACM Conference on AI, 2021

2020

How Much Does Your Data Exploration Overfit? Controlling Bias via Information Usage.

[BibT_eX]

[DOI]

,

IEEE Trans. Inf. Theory, 2020

Deep learning interpretation of echocardiograms.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

,

,

,

Jonathan H. Chen

,

Robert A. Harrington

,

,

,

npj Digit. Medicine, 2020

Video-based AI for beat-to-beat assessment of cardiac function.

[BibT_eX]

[DOI]

,

,

Amirata Ghorbani

,

,

,

Curtis P. Langlotz

,

Paul A. Heidenreich

,

Robert A. Harrington

,

,

,

Nat., 2020

An online platform for interactive feedback in biomedical machine learning.

[BibT_eX]

[DOI]

,

,

,

,

Abdulrahman Alfozan

,

Nat. Mach. Intell., 2020

Data Valuation for Medical Imaging Using Shapley Value: Application on A Large-scale Chest X-ray Dataset.

[BibT_eX]

[DOI]

,

Amirata Ghorbani

,

Rikiya Yamashita

,

,

Jared A. Dunnmon

,

,

Daniel L. Rubin

CoRR, 2020

Competing AI: How competition feedback affects machine learning.

[BibT_eX]

[DOI]

,

,

CoRR, 2020

Improving Training on Noisy Stuctured Labels.

[BibT_eX]

[DOI]

,

CoRR, 2020

Approximate Data Deletion from Machine Learning Models: Algorithms and Evaluations.

[BibT_eX]

[DOI]

,

Mary Anne Smart

,

Kamalika Chaudhuri

,

CoRR, 2020

Predicting target genes of non-coding regulatory variants with IRT.

[BibT_eX]

[DOI]

,

Nilah M. Ioannidis

,

,

Russell Schwartz

Bioinform., 2020

LitGen: Genetic Literature Recommendation Guided by Human Explanations.

[BibT_eX]

[DOI]

,

Arturo L. Pineda

,

,

,

,

,

,

Carlos D. Bustamante

,

Proceedings of the Pacific Symposium on Biocomputing 2020, 2020

MOPO: Model-based Offline Policy Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neuron Shapley: Discovering the Responsible Neurons.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

FrugalML: How to use ML Prediction APIs more accurately and cheaply.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Distributional Framework For Data Valuation.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

Learning transport cost from subset correspondence.

[BibT_eX]

[DOI]

,

Akshay Balsubramani

,

Proceedings of the 8th International Conference on Learning Representations, 2020

ALICE: Active Learning with Contrastive Natural Language Explanations.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation.

[BibT_eX]

[DOI]

,

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

VetTag: improving automated veterinary diagnosis coding via large-scale language modeling.

[BibT_eX]

[DOI]

,

,

,

Rodney López Page

,

npj Digit. Medicine, 2019

Sex and gender analysis improves science and engineering.

[BibT_eX]

[DOI]

Cara Tannenbaum

,

Robert P. Ellis

,

Friederike Eyssel

,

,

Londa Schiebinger

Nat., 2019

Feedback GAN for DNA optimizes protein functions.

[BibT_eX]

[DOI]

,

Nat. Mach. Intell., 2019

Who's responsible? Jointly quantifying the contribution of the learning algorithm and training data.

[BibT_eX]

[DOI]

,

Amirata Ghorbani

,

CoRR, 2019

Gradio: Hassle-Free Sharing and Testing of ML Models in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

Abdulrahman Alfozan

,

CoRR, 2019

Contrastive Variational Autoencoder Enhances Salient Features.

[BibT_eX]

[DOI]

,

CoRR, 2019

AdaFDR: A Fast, Powerful and Covariate-Adaptive Approach to Multiple Hypothesis Testing.

[BibT_eX]

[DOI]

Martin J. Zhang

,

,

Proceedings of the Research in Computational Molecular Biology, 2019

Making AI Forget You: Data Deletion in Machine Learning.

[BibT_eX]

[DOI]

,

,

Gregory Valiant

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Automatic Concept-based Explanations.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings.

[BibT_eX]

[DOI]

Dorottya Demszky

,

,

,

,

,

Matthew Gentzkow

,

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits.

[BibT_eX]

[DOI]

Martin J. Zhang

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Discovering Conditionally Salient Features with Statistical Guarantees.

[BibT_eX]

[DOI]

Jaime Roquero Gimenez

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Data Shapley: Equitable Valuation of Data for Machine Learning.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Concrete Autoencoders: Differentiable Feature Selection and Reconstruction.

[BibT_eX]

[DOI]

Muhammed Fatih Balin

,

,

Proceedings of the 36th International Conference on Machine Learning, 2019

Contingent Payment Mechanisms for Resource Utilization.

[BibT_eX]

[DOI]

,

,

David C. Parkes

,

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Contrastive Multivariate Singular Spectrum Analysis.

[BibT_eX]

[DOI]

Abdi-Hakin Dirie

,

,

Proceedings of the 57th Annual Allerton Conference on Communication, 2019

Improving the Stability of the Knockoff Procedure: Multiple Simultaneous Knockoffs and Entropy Maximization.

[BibT_eX]

[DOI]

Jaime Roquero Gimenez

,

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Knockoffs for the Mass: New Feature Importance Statistics with False Discovery Guarantees.

[BibT_eX]

[DOI]

Jaime Roquero Gimenez

,

Amirata Ghorbani

,

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Multiaccuracy: Black-Box Post-Processing for Fairness in Classification.

[BibT_eX]

[DOI]

,

Amirata Ghorbani

,

Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

Interpretation of Neural Networks Is Fragile.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Word embeddings quantify 100 years of gender and ethnic stereotypes.

[BibT_eX]

[DOI]

,

Londa Schiebinger

,

,

Proc. Natl. Acad. Sci. USA, 2018

DeepTag: inferring diagnoses from veterinary clinical notes.

[BibT_eX]

[DOI]

,

,

Rodney López Page

,

,

Arturo López Pineda

,

Manuel A. Rivas

,

Carlos D. Bustamante

,

npj Digit. Medicine, 2018

Large-scale Generative Modeling to Improve Automated Veterinary Disease Coding.

[BibT_eX]

[DOI]

,

,

CoRR, 2018

Autowarp: Learning a Warping Distance from Unlabeled Time Series Using Sequence Autoencoders.

[BibT_eX]

[DOI]

,

CoRR, 2018

DeepTag: inferring all-cause diagnoses from clinical notes in under-resourced medical domain.

[BibT_eX]

[DOI]

,

,

Rodney López Page

,

Arturo L. Pineda

,

Manuel A. Rivas

,

Carlos D. Bustamante

,

CoRR, 2018

Feedback GAN (FBGAN) for DNA: a Novel Feedback-Loop Architecture for Optimizing Protein Functions.

[BibT_eX]

[DOI]

,

CoRR, 2018

Stochastic EM for Shuffled Linear Regression.

[BibT_eX]

[DOI]

,

CoRR, 2018

Learning a Warping Distance from Unlabeled Time Series Using Sequence Autoencoders.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

CoVeR: Learning Covariate-Specific Vector Representations with Tensor Decompositions.

[BibT_eX]

[DOI]

,

,

Proceedings of the 35th International Conference on Machine Learning, 2018

The Effects of Memory Replay in Reinforcement Learning.

[BibT_eX]

[DOI]

,

Proceedings of the 56th Annual Allerton Conference on Communication, 2018

Embedding for Informative Missingness: Deep Learning With Incomplete Data.

[BibT_eX]

[DOI]

Amirata Ghorbani

,

Proceedings of the 56th Annual Allerton Conference on Communication, 2018

A Stochastic Expectation-Maximization Approach to Shuffled Linear Regression.

[BibT_eX]

[DOI]

,

Proceedings of the 56th Annual Allerton Conference on Communication, 2018

Why Adaptively Collected Data Have Negative Bias and How to Correct for It.

[BibT_eX]

[DOI]

,

,

Jonathan Taylor

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

Contrastive Principal Component Analysis.

[BibT_eX]

[DOI]

,

Vivek Kumar Bagaria

,

Martin J. Zhang

,

CoRR, 2017

Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

NeuralFDR: Learning Discovery Thresholds from Hypothesis Features.

[BibT_eX]

[DOI]

,

Martin J. Zhang

,

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning Latent Space Models with Angular Constraints.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

Estimating the unseen from multiple populations.

[BibT_eX]

[DOI]

Aditi Raghunathan

,

Gregory Valiant

,

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation.

[BibT_eX]

[DOI]

Akash Srivastava

,

,

,

CoRR, 2016

Contingent Payment Mechanisms to Maximize Resource Utilization.

[BibT_eX]

[DOI]

,

,

David C. Parkes

,

CoRR, 2016

Quantifying and Reducing Stereotypes in Word Embeddings.

[BibT_eX]

[DOI]

Tolga Bolukbasi

,

,

,

Venkatesh Saligrama

,

Adam Tauman Kalai

CoRR, 2016

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings.

[BibT_eX]

[DOI]

Tolga Bolukbasi

,

,

,

Venkatesh Saligrama

,

Adam Tauman Kalai

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015

Inferring parental genomic ancestries using pooled semi-Markov processes.

[BibT_eX]

[DOI]

,

,

Esteban Gonzàlez Burchard

,

Sriram Sankararaman

Bioinform., 2015

Incentive-Compatible Experimental Design.

[BibT_eX]

[DOI]

,

David C. Parkes

,

,

Proceedings of the Sixteenth ACM Conference on Economics and Computation, 2015

Crowdsourcing Feature Discovery via Adaptively Chosen Comparisons.

[BibT_eX]

[DOI]

,

Kamalika Chaudhuri

,

Adam Tauman Kalai

Proceedings of the Third AAAI Conference on Human Computation and Crowdsourcing, 2015

Strategic Voting Behavior in Doodle Polls.

[BibT_eX]

[DOI]

,

,

David C. Parkes

Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, 2015

2013

Contrastive Learning Using Spectral Methods.

[BibT_eX]

[DOI]

,

,

David C. Parkes

,

Ryan Prescott Adams

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012

Mechanism Design for Time Critical and Cost Critical Task Execution via Crowdsourcing.

[BibT_eX]

[DOI]

,

,

,

Yadati Narahari

,

Proceedings of the Internet and Network Economics - 8th International Workshop, 2012

Priors for Diversity in Generative Latent Variable Models.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A Slime Mold Solver for Linear Programming Problems.

[BibT_eX]

[DOI]

Anders Johannson

,

Proceedings of the How the World Computes, 2012

Threats and Trade-Offs in Resource Critical Crowdsourcing Tasks Over Networks.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2010

Tolerable Manipulability in Dynamic Assignment without Money.

[BibT_eX]

[DOI]

,

,

David C. Parkes

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Loading...