Byron C. Wallace

Orcid: 0000-0003-2409-7735

Affiliations:
  • Northeastern University, Boston, MA, USA


According to our database1, Byron C. Wallace authored at least 156 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Do Multi-Document Summarization Models <i>Synthesize</i>?
Trans. Assoc. Comput. Linguistics, 2024

Leveraging generative AI for clinical evidence synthesis needs to ensure trustworthiness.
J. Biomed. Informatics, 2024

Question answering systems for health professionals at the point of care - a systematic review.
J. Am. Medical Informatics Assoc., 2024

NNsight and NDIF: Democratizing Access to Foundation Model Internals.
CoRR, 2024

Learning from Natural Language Explanations for Generalizable Entity Matching.
CoRR, 2024

Automatically Extracting Numerical Results from Randomized Controlled Trials with Large Language Models.
CoRR, 2024

Standardizing the Measurement of Text Diversity: A Tool and a Comparative Analysis of Scores.
CoRR, 2024

How Much Annotation is Needed to Compare Summarization Models?
CoRR, 2024

GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence.
CoRR, 2024

Towards Reducing Diagnostic Errors with Interpretable Risk Prediction.
CoRR, 2024

On-the-fly Definition Augmentation of LLMs for Biomedical NER.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Reducing Diagnostic Errors with Interpretable Risk Prediction.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Function Vectors in Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Evaluating the Zero-shot Robustness of Instruction-tuned Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning from Natural Language Explanations for Generalizable Entity Matching.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Investigating Mysteries of CoT-Augmented Distillation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Detection and Measurement of Syntactic Templates in Generated Text.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Evaluating the Factuality of Zero-shot Summarizers Across Varied Domains.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Open (Clinical) LLMs are Sensitive to Instruction Phrasings.
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Leveraging Generative AI for Clinical Evidence Summarization Needs to Achieve Trustworthiness.
CoRR, 2023

Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges.
CoRR, 2023

Do Multi-Document Summarization Models Synthesize?
CoRR, 2023

SemEval-2023 Task 8: Causal Medical Claim Identification and Related PIO Frame Extraction from Social Media Posts.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs.
Proceedings of the Machine Learning for Healthcare Conference, 2023

Accomodating User Expressivity while Maintaining Safety for a Virtual Alcohol Misuse Counselor.
Proceedings of the 23rd ACM International Conference on Intelligent Virtual Agents, 2023

Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CHiLL: Zero-shot Custom Interpretable Feature Extraction from Clinical Notes with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

USB: A Unified Summarization Benchmark Across Tasks and Domains.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Multilingual Simplification of Medical Texts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

How Many and Which Training Points Would Need to be Removed to Flip this Prediction?
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

RedHOT: A Corpus of Annotated Medical Questions, Experiences, and Claims on Social Media.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Automatically Summarizing Evidence from Clinical Trials: A Prototype Highlighting Current Challenges.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Future Lens: Anticipating Subsequent Tokens from a Single Hidden State.
Proceedings of the 27th Conference on Computational Natural Language Learning, 2023

Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Revisiting Relation Extraction in the era of Large Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Summarizing, Simplifying, and Synthesizing Medical Evidence using GPT-3 (with Varying Success).
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Learning to Ask Like a Physician.
CoRR, 2022

Self-Repetition in Abstractive Neural Summarizers.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

PHEE: A Dataset for Pharmacovigilance Event Extraction from Text.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Influence Functions for Sequence Tagging Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Overview of MSLR2022: A Shared Task on Multi-document Summarization for Literature Reviews.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Intermediate Entity-based Sparse Interpretable Representation Learning.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

Combining Feature and Instance Attribution to Detect Artifacts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Evaluating Factuality in Text Simplification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Kronecker Factorization for Preventing Catastrophic Forgetting in Large-scale Medical Entity Linking.
CoRR, 2021

What Would it Take to get Biomedical QA Systems into Practice?
CoRR, 2021

Interpretability Analysis for Named Entity Recognition to Understand System Predictions and How They Can Improve.
Comput. Linguistics, 2021

An Empirical Comparison of Instance Attribution Methods for NLP.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Does BERT Pretrained on Clinical Notes Reveal Sensitive Data?
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Paragraph-level Simplification of Medical Texts.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On the Impact of Random Seeds on the Fairness of Clinical Classifiers.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Disentangling Representations of Text by Masking Transformers.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Unsupervised Data Augmentation with Naive Augmentation and without Unlabeled Data.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Applying State of the Art Language Models to Enable Better Clinical Natural Language Processing.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

Identifying Communication Behavior Indicators in Secure Messages.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

Biomedical Interpretable Entity Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Trialstreamer: A living, automatically updated database of clinical trial reports.
J. Am. Medical Informatics Assoc., 2020

Understanding Clinical Trial Reports: Extracting Medical Entities and Their Relations.
CoRR, 2020

Generating (Factual?) Narrative Summaries of RCTs: Experiments with Neural Multi-Document Summarization.
CoRR, 2020

Entity-Switched Datasets: An Approach to Auditing the In-Domain Robustness of Named Entity Recognition Models.
CoRR, 2020

Query-Focused EHR Summarization to Aid Imaging Diagnosis.
Proceedings of the Machine Learning for Healthcare Conference, 2020

MMiDaS-AE: multi-modal missing data aware stacked autoencoder for biomedical abstract screening.
Proceedings of the ACM CHIL '20: ACM Conference on Health, 2020

Evidence Inference 2.0: More Data, Better Models.
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 2020

Towards a Computational Framework for Automating Substance Use Counseling with Virtual Agents.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Semi-Automating Knowledge Base Construction for Cancer Genetics.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Trialstreamer: Mapping and Browsing Medical Evidence in Real-Time.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Learning to Faithfully Rationalize by Construction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ERASER: A Benchmark to Evaluate Rationalized NLP Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study.
BMC Medical Informatics Decis. Mak., 2019

Learning to Identify Patients at Risk of Uncontrolled Hypertension Using Electronic Health Records Data.
CoRR, 2019

An Analysis of Attention over Clinical Notes for Predictive Tasks.
CoRR, 2019

Predicting Annotation Difficulty to Improve Task Routing and Model Performance for Biomedical Information Extraction.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Inferring Which Medical Treatments Work from Reports of Clinical Trials.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Attention is not Explanation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Are Online Reviews of Physicians Biased Against Female Providers?
Proceedings of the Machine Learning for Healthcare Conference, 2019

Mash: software tools for developing interactive and transparent machine learning systems.
Proceedings of the Joint Proceedings of the ACM IUI 2019 Workshops co-located with the 24th ACM Conference on Intelligent User Interfaces (ACM IUI 2019), 2019

Explainable modeling of annotations in crowdsourcing.
Proceedings of the 24th International Conference on Intelligent User Interfaces, 2019

What Does the Evidence Say? Models to Help Make Sense of the Biomedical Literature.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Practical Obstacles to Deploying Active Learning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Structured Neural Topic Models for Reviews.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Neural information retrieval: at the end of the early years.
Inf. Retr. J., 2018

Structured Representations for Reviews: Aspect-Based Variational Hidden Factor Models.
CoRR, 2018

How transferable are the datasets collected by active learners?
CoRR, 2018

Believe it or not: Designing a Human-AI Partnership for Mixed-Initiative Fact-Checking.
Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology, 2018

Automating Biomedical Evidence Synthesis: Recent Work and Directions Forward.
Proceedings of the 3rd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2018) co-located with the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018), 2018

Syntactic Patterns Improve Information Extraction for Medical Search.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Structured Multi-Label Biomedical Text Tagging via Attentive Neural Tree Decoding.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Learning Disentangled Representations of Texts with Application to Biomedical Abstracts.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Corpus with Multi-Level Annotations of Patients, Interventions and Outcomes to Support Language Processing for Medical Literature.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

An Interpretable Joint Graphical Model for Fact-Checking From Crowds.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Identifying reports of randomized controlled trials (RCTs) via a hybrid machine learning and crowdsourcing approach.
J. Am. Medical Informatics Assoc., 2017

Reports of the Workshops of the Thirty-First AAAI Conference on Artificial Intelligence.
AI Mag., 2017

Quantifying Mental Health from Social Media with Neural User Embeddings.
Proceedings of the Machine Learning for Health Care Conference, 2017

Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness.
Proceedings of the MEDINFO 2017: Precision Healthcare through Informatics, 2017

A Sensitivity Analysis of (and Practitioners' Guide to) Convolutional Neural Networks for Sentence Classification.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

PheKnow-Cloud: A Tool for Evaluating High-Throughput Phenotype Candidates using Online Medical Literature.
Proceedings of the Summit on Clinical Research Informatics, 2017

Identifying Diagnostic Test Accuracy Publications using a Deep Model.
Proceedings of the Working Notes of CLEF 2017, 2017

A Neural Candidate-Selector Architecture for Automatic Structured Clinical Text Annotation.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Detecting Twitter posts with Adverse Drug Reactions using Convolutional Neural Networks.
Proceedings of the 2nd Social Media Mining for Health Research and Applications Workshop co-located with the American Medical Informatics Association Annual Symposium (AMIA 2017), 2017

Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Aggregating and Predicting Sequence Labels from Crowd Annotations.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Automating Biomedical Evidence Synthesis: RobotReviewer.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Active Discriminative Text Representation Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Editorial: special issue on machine learning for health and medicine.
Mach. Learn., 2016

Extracting PICO Sentences from Clinical Trial Reports using Supervised Distant Supervision.
J. Mach. Learn. Res., 2016

Improving the utility of MeSH® terms using the TopicalMeSH representation.
J. Biomed. Informatics, 2016

RobotReviewer: evaluation of a system for automatically assessing bias in clinical trials.
J. Am. Medical Informatics Assoc., 2016

Active Discriminative Word Embedding Learning.
CoRR, 2016

Neural Information Retrieval: A Literature Review.
CoRR, 2016

Crowdsourcing Information Extraction for Biomedical Systematic Reviews.
CoRR, 2016

Reports of the 2016 AAAI Workshop Program.
AI Mag., 2016

A Correlated Worker Model for Grouped, Imbalanced and Multitask Data.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

MGNC-CNN: A Simple Approach to Exploiting Multiple Word Embeddings for Sentence Classification.
Proceedings of the NAACL HLT 2016, 2016

Using Electronic Medical Records and Physician Data to Improve Information Retrieval for Evidence-Based Care.
Proceedings of the 2016 IEEE International Conference on Healthcare Informatics, 2016

Probabilistic Modeling for Crowdsourcing Partially-Subjective Ratings.
Proceedings of the Fourth AAAI Conference on Human Computation and Crowdsourcing, 2016

Rationale-Augmented Convolutional Neural Networks for Text Classification.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Modelling Context with User Embeddings for Sarcasm Detection in Social Media.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Automated Verification of Phenotypes using PubMed.
Proceedings of the 7th ACM International Conference on Bioinformatics, 2016

Retrofitting Word Vectors of MeSH Terms to Improve Semantic Similarity Measures.
Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis, 2016

Leveraging coreference to identify arms in medical abstracts: An experimental study.
Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis, 2016

2015
Automating Risk of Bias Assessment for Clinical Trials.
IEEE J. Biomed. Health Informatics, 2015

Computational irony: A survey and new perspectives.
Artif. Intell. Rev., 2015

Reports of the AAAI 2014 Conference Workshops.
AI Mag., 2015

Healthcare Data Analytics Challenge.
Proceedings of the 2015 International Conference on Healthcare Informatics, 2015

Combining Crowd and Expert Labels Using Decision Theoretic Active Learning.
Proceedings of the Third AAAI Conference on Human Computation and Crowdsourcing, 2015

Improving Retrieval of PubMed Articles Using the TopicalMeSH Representation.
Proceedings of the AMIA 2015, 2015

Sparse, Contextually Informed Models for Irony Detection: Exploiting User Communities, Entities and Sentiment.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

What Predicts Media Coverage of Health Science Articles?
Proceedings of the World Wide Web and Public Health Intelligence, 2015

Graph-Sparse LDA: A Topic Model with Structured Sparsity.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Improving class probability estimates for imbalanced data.
Knowl. Inf. Syst., 2014

A large-scale quantitative analysis of latent factors and sentiment in online doctor reviews.
J. Am. Medical Informatics Assoc., 2014

Spá: A Web-Based Viewer for Text Mining in Evidence Based Medicine.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Can Cognitive Scientists Help Computers Recognize Irony?
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Humans Require Context to Infer Ironic Intent (so Computers Probably do, too).
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Identifying Differences in Physician Communication Styles with a Log-Linear Transition Component Model.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Organizers.
Proceedings of the Modern Artificial Intelligence for Health Analytics, 2014

Discovering Better AAAI Keywords via Clustering with Community-Sourced Constraints.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Preface.
Proceedings of the Modern Artificial Intelligence for Health Analytics, 2014

2013
A Generative Joint, Additive, Sequential Model of Topics and Speech Acts in Patient-Doctor Communication.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
Challenges and Opportunities in Applied Machine Learning.
AI Mag., 2012

Multiple Narrative Disentanglement: Unraveling Infinite Jest.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Deploying an interactive machine learning system in an evidence-based practice center: abstrackr.
Proceedings of the ACM International Health Informatics Symposium, 2012

Class Probability Estimates are Unreliable for Imbalanced Data (and How to Fix Them).
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

2011
Who Should Label What? Instance Allocation in Multiple Expert Active Learning.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

The Constrained Weight Space SVM: Learning with Ranked Features.
Proceedings of the 28th International Conference on Machine Learning, 2011

Class Imbalance, Redux.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

2010
Semi-automated screening of biomedical citations for systematic reviews.
BMC Bioinform., 2010

Active learning for biomedical citation screening.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Modeling annotation time to reduce workload in comparative effectiveness reviews.
Proceedings of the ACM International Health Informatics Symposium, 2010


  Loading...