Pang Wei Koh

Orcid: 0000-0003-4330-6969

According to our database, Pang Wei Koh authored at least 60 papers between 2010 and 2024.

Bibliography

2024
OLMoE: Open Mixture-of-Experts Language Models.
CoRR, 2024

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations.
CoRR, 2024

Scaling Retrieval-Based Language Models with a Trillion-Token Datastore.
CoRR, 2024

PLeaS - Merging Models with Permutations and Least Squares.
CoRR, 2024

Data-Centric AI in the Age of Large Language Models.
CoRR, 2024

DataComp-LM: In search of the next generation of training sets for language models.
CoRR, 2024

The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better.
CoRR, 2024

MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning.
CoRR, 2024

Multilingual Diversity Improves Vision-Language Representations.
CoRR, 2024

Information-Theoretic Distillation for Reference-less Summarization.
CoRR, 2024

Reliable, Adaptable, and Attributable Language Models with Retrieval.
CoRR, 2024

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models.
CoRR, 2024

Instructional Fingerprinting of Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Improving Domain Generalization with Domain Relations.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

The Generative AI Paradox: "What It Can Create, It May Not Understand".
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Beyond the Stethoscope: Operationalizing Interactive Clinical Reasoning in Large Language Models via Proactive Information Seeking.
Proceedings of the 12th IEEE International Conference on Healthcare Informatics, 2024

Position Paper: Data-Centric AI in the Age of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Annotation alignment: Comparing LLM and human annotations of conversational safety.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Use large language models to promote equity.
CoRR, 2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models.
CoRR, 2023

Are aligned neural networks adversarially aligned?
CoRR, 2023

Proximity-Informed Calibration for Deep Neural Networks.
CoRR, 2023

Leveraging Domain Relations for Domain Generalization.
CoRR, 2023

On the Trade-off of Intra-/Inter-class Diversity for Supervised Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Are aligned neural networks adversarially aligned?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Out-of-Domain Robustness via Targeted Augmentations.
Proceedings of the International Conference on Machine Learning, 2023

FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Stronger data poisoning attacks break data sanitization defenses.
Mach. Learn., 2022

Impossibility Theorems for Feature Attribution.
CoRR, 2022

Wild-Time: A Benchmark of in-the-Wild Distribution Shift over Time.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Extending the WILDS Benchmark for Unsupervised Adaptation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Mobility network models of COVID-19 explain inequities and inform reopening.
Nat., 2021

On the Opportunities and Risks of Foundation Models.
CoRR, 2021

Supporting COVID-19 Policy Response with Large-scale Mobility-based Modeling.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Accuracy on the Line: on the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Just Train Twice: Improving Group Robustness without Training Group Information.
Proceedings of the 38th International Conference on Machine Learning, 2021

Selective Classification Can Magnify Disparities Across Groups.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts.
CoRR, 2020

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims.
CoRR, 2020

An Investigation of Why Overparameterization Exacerbates Spurious Correlations.
Proceedings of the 37th International Conference on Machine Learning, 2020

Concept Bottleneck Models.
Proceedings of the 37th International Conference on Machine Learning, 2020

Distributionally Robust Neural Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

ExpBERT: Representation Engineering with Natural Language Explanations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization.
CoRR, 2019

On the Accuracy of Influence Functions for Measuring Group Effects.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Inferring Multidimensional Rates of Aging from Cross-Sectional Data.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Identifying and exploiting influential training examples.
Proceedings of the Symposium Interpretable AI for Well-being: Understanding Cognitive Bias and Social Embeddedness co-located with Association for the Advancement of Artificial Intelligence 2019 Spring Symposium (AAAI-Spring Symposium 2019), 2019

2018
Inferring Multi-Dimensional Rates of Aging from Cross-Sectional Data.
CoRR, 2018

2017
Denoising genome-wide histone ChIP-seq with convolutional neural networks.
Bioinform., 2017

Certified Defenses for Data Poisoning Attacks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Understanding Black-box Predictions via Influence Functions.
Proceedings of the 34th International Conference on Machine Learning, 2017

2011
Sparse Filtering.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

On Random Weights and Unsupervised Feature Learning.
Proceedings of the 28th International Conference on Machine Learning, 2011

Learning Deep Energy Models.
Proceedings of the 28th International Conference on Machine Learning, 2011

2010
Tiled convolutional neural networks.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

