Bill Y. Lin

Orcid: 0000-0002-1149-0186

Affiliations:
  • University of Southern California, USA
  • Shanghai Jiao Tong University, China


According to our database1, Bill Y. Lin authored at least 83 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Knowledge-augmented Methods for Natural Language Processing
Springer Briefs in Computer Science, Springer, ISBN: 978-981-97-0749-2, 2024

TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks.
Trans. Mach. Learn. Res., 2024

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs.
CoRR, 2024

Visual Perception in Text Strings.
CoRR, 2024

HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions.
CoRR, 2024

SimulBench: Evaluating Language Models with Creative Simulation Tasks.
CoRR, 2024

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation.
CoRR, 2024

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism.
CoRR, 2024

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.
CoRR, 2024

ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates.
CoRR, 2024

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences.
CoRR, 2024

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing.
CoRR, 2024

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.
CoRR, 2024

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild.
CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.
CoRR, 2024

CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting.
CoRR, 2024

VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?
CoRR, 2024

RewardBench: Evaluating Reward Models for Language Modeling.
CoRR, 2024

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects.
CoRR, 2024

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Agent Lumos: Unified and Modular Training for Open-Source Language Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs.
CoRR, 2023

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging.
CoRR, 2023

Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4.
CoRR, 2023

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition.
CoRR, 2023

Knowledge-Augmented Methods for Natural Language Processing.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Faith and Fate: Limits of Transformers on Compositionality.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Complex Reasoning in Natural Languag.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

On Grounded Planning for Embodied Tasks with Language Models.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
PairReranker: Pairwise Reranking for Natural Language Generation.
CoRR, 2022

Unsupervised Cross-Task Generalization via Retrieval Augmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On the Robustness of Reading Comprehension Models to Entity Renaming.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Reflect, Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

On Continual Model Refinement in Out-of-Distribution Data Streams.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
AutoTriggER: Named Entity Recognition with Auxiliary Trigger Extraction.
CoRR, 2021

Probing Causal Common Sense in Dialogue Response Generation.
CoRR, 2021

FedNLP: A Research Platform for Federated Learning in Natural Language Processing.
CoRR, 2021

RiddleSense: Answering Riddle Questions as Commonsense Reasoning.
CoRR, 2021

Differentiable Open-Ended Commonsense Reasoning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Probing Commonsense Explanation in Dialogue Response Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

RockNER: A Simple Method to Create Adversarial Examples for Evaluating the Robustness of Named Entity Recognition Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Pre-training Text-to-Text Transformers for Concept-centric Common Sense.
CoRR, 2020

Can BERT Reason? Logically Equivalent Probes for Evaluating the Inference Capabilities of Language Models.
CoRR, 2020

LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation.
CoRR, 2020

NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-Trained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
CommonGen: A Constrained Text Generation Dataset Towards Generative Commonsense Reasoning.
CoRR, 2019

KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
ExtRA: Extracting Prominent Review Aspects from Customer Feedback.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Adaptation Layers for Cross-domain Named Entity Recognition.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Automatic Extraction of Commonsense LocatedNear Knowledge.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Mining Cross-Cultural Differences and Similarities in Social Media.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Transfer Learning for Traffic Speed Prediction: A Preliminary Study.
Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Dynamic Detection of Communities and Their Evolutions in Temporal Social Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Commonsense LocatedNear Relation Extraction.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

Multi-channel BiLSTM-CRF Model for Emerging Named Entity Recognition in Social Media.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2016
Cross-region traffic prediction for China on OpenStreetMap.
Proceedings of the 9th ACM SIGSPATIAL International Workshop on Computational Transportation Science, 2016


  Loading...