Shannon Shen
ORCID: 0009-0009-2704-6950
Affiliations:
- MIT, Cambridge, MA, USA
- Allen Institute for AI, Seattle, USA (former)
- Nanjing Tech University, School of Computer Science and Technology, China (former)
According to our database, Shannon Shen authored at least 26 papers between 2019 and 2024.
Bibliography
2024
Machine learning to predict notes for chart review in the oncology setting: a proof of concept strategy for improving clinician note-writing.
J. Am. Medical Informatics Assoc., 2024
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature.
CoRR, 2024
A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models.
CoRR, 2024
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces.
CoRR, 2023
American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes.
Proceedings of the Machine Learning for Healthcare Conference, 2023
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups.
Trans. Assoc. Comput. Linguistics, 2022
Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search.
CoRR, 2022
Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
2021
CoRR, 2021
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Deep Learning based Framework for Automatic Damage Detection in Aircraft Engine Borescope Inspection.
Proceedings of the International Conference on Computing, Networking and Communications, 2019