Shannon Shen

Affiliations:
  • MIT, Cambridge, MA, USA
  • Allen Institute for AI, Seattle, USA (former)
  • Nanjing Tech University, School of Computer Science and Technology, China (former)


According to our database1, Shannon Shen authored at least 22 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Design Space for Intelligent and Interactive Writing Assistants.
CoRR, 2024

Learning to Decode Collaboratively with Multiple Language Models.
CoRR, 2024

A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models.
CoRR, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.
CoRR, 2024

2023
Towards Verifiable Text Generation with Symbolic References.
CoRR, 2023

Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks.
CoRR, 2023

The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces.
CoRR, 2023

The Semantic Scholar Open Data Platform.
CoRR, 2023

American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes.
Proceedings of the Machine Learning for Healthcare Conference, 2023

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups.
Trans. Assoc. Comput. Linguistics, 2022

Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search.
CoRR, 2022

Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Incorporating Visual Layout Structures for Scientific Text Classification.
CoRR, 2021

LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

PAWLS: PDF Annotation With Labels and Structure.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
OLALA: Object-Level Active Learning Based Layout Annotation.
CoRR, 2020

Generating Object Stamps.
CoRR, 2020

A Large Dataset of Historical Japanese Documents with Complex Layouts.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Deep Learning based Framework for Automatic Damage Detection in Aircraft Engine Borescope Inspection.
Proceedings of the International Conference on Computing, Networking and Communications, 2019


  Loading...