Emma Strubell

Orcid: 0000-0003-2798-0726

Affiliations:
  • CMU, Pittsburgh, USA
  • University of Massachusetts Amherst, USA


According to our database, Emma Strubell authored at least 63 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Collage: Decomposable Rapid Prototyping for Information Extraction on Scientific PDFs.
CoRR, 2024

Stereotype or Personalization? User Identity Biases Chatbot Recommendations.
CoRR, 2024

What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions.
CoRR, 2024

Carbon Connect: An Ecosystem for Sustainable Computing.
CoRR, 2024

Source-Aware Training Enables Knowledge Attribution in Language Models.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

Power Hungry Processing: Watts Driving the Cost of AI Deployment?
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Gradient Localization Improves Lifelong Pretraining of Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


2023
Efficient Methods for Natural Language Processing: A Survey.
Trans. Assoc. Comput. Linguistics, 2023

An Empirical Investigation of the Role of Pre-training in Lifelong Learning.
J. Mach. Learn. Res., 2023

Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation.
CoRR, 2023

Queer People are People First: Deconstructing Sexual Identity Stereotypes in Large Language Models.
CoRR, 2023

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research.
CoRR, 2023

Large Language Model Distillation Doesn't Need a Teacher.
CoRR, 2023

Regularizing Self-training for Unsupervised Domain Adaptation via Structural Constraints.
CoRR, 2023

The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment.
CoRR, 2023

On the Interactions of Structural Constraints and Data Resources for Structured Prediction.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

Making Scalable Meta Learning Practical.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Data-efficient Active Learning for Structured Prediction with Partial Annotation and Self-Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Energy and Carbon Considerations of Fine-Tuning BERT.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DSI++: Updating Transformer Memory with New Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Understanding the Effect of Model Compression on Social Bias in Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Annotating Mentions Alone Enables Efficient Domain Adaptation for Coreference Resolution.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Efficient and Equitable Natural Language Processing in the Age of Deep Learning (Dagstuhl Seminar 22232).
Dagstuhl Reports, 2022

Error-aware Quantization through Noise Tempering.
CoRR, 2022

Mention Annotations Alone Enable Efficient Domain Adaptation for Coreference Resolution.
CoRR, 2022

SQuAT: Sharpness- and Quantization-Aware Training for BERT.
CoRR, 2022

Measuring the Carbon Intensity of AI in Cloud Instances.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

A Survey of Active Learning for Natural Language Processing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Transfer Learning from Semantic Role Labeling to Event Argument Extraction with Template-based Slot Querying.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Train Flat, Then Compress: Sharpness-Aware Minimization Learns More Compressible Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Bridging Fairness and Environmental Sustainability in Natural Language Processing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Compositional Generalization with Self-Training for Data-to-Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Improving Compositional Generalization with Self-Training for Data-to-Text Generation.
CoRR, 2021

WiFiMod: Transformer-based Indoor Human Mobility Modeling using Passive Sensing.
CoRR, 2021

On the Benefit of Syntactic Supervision for Cross-lingual Transfer in Semantic Role Labeling.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

WiFiMod: Transformer-based Indoor Human Mobility Modeling using Passive Sensing.
Proceedings of the COMPASS '21: ACM SIGCAS Conference on Computing and Sustainable Societies, Virtual Event, Australia, 28 June 2021, 2021

Comparing Span Extraction Methods for Semantic Role Labeling.
Proceedings of the 5th Workshop on Structured Prediction for NLP, 2021

2020
Inorganic Materials Synthesis Planning with Literature-Trained Neural Networks.
J. Chem. Inf. Model., 2020

Energy and Policy Considerations for Modern Deep Learning Research.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
The Materials Science Procedural Text Corpus: Annotating Materials Synthesis Procedures with Shallow Semantic Structures.
Proceedings of the 13th Linguistic Annotation Workshop, 2019

Energy and Policy Considerations for Deep Learning in NLP.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Syntax Helps ELMo Understand Semantics: Is Syntax Still Relevant in a Deep Neural Architecture for SRL?
CoRR, 2018

Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Linguistically-Informed Self-Attention for Semantic Role Labeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Multi-Task Learning For Parsing The Alexa Meaning Representation Language.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Automatically Extracting Action Graphs from Materials Science Synthesis Procedures.
CoRR, 2017

Fast and Accurate Sequence Labeling with Iterated Dilated Convolutions.
CoRR, 2017

Fast and Accurate Entity Recognition with Iterated Dilated Convolutions.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Dependency Parsing with Dilated Iterated Graph CNNs.
Proceedings of the 2nd Workshop on Structured Prediction for Natural Language Processing, 2017

Attending to All Mention Pairs for Full Abstract Biological Relation Extraction.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

2016
Extracting Multilingual Relations under Limited Resources: TAC 2016 Cold-Start KB construction and Slot-Filling using Compositional Universal Schema.
Proceedings of the 2016 Text Analysis Conference, 2016

Multilingual Relation Extraction using Compositional Universal Schema.
Proceedings of the NAACL HLT 2016, 2016

2015
Building Knowledge Bases with Universal Schema: Cold Start and Slot-Filling Approaches.
Proceedings of the 2015 Text Analysis Conference, 2015

Learning Dynamic Feature Selection for Fast Sequential Prediction.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Training for Fast Sequential Prediction Using Dynamic Feature Selection.
CoRR, 2014

