Tong Sun

Orcid: 0000-0003-4836-0737

Affiliations:
  • Adobe Research


According to our database1, Tong Sun authored at least 39 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding.
CoRR, 2024

LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models.
CoRR, 2024

ARTIST: Improving the Generation of Text-rich Images by Disentanglement.
CoRR, 2024

Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation.
CoRR, 2024

DocSynthv2: A Practical Autoregressive Modeling for Document Generation.
CoRR, 2024

Improve Temporal Awareness of LLMs for Sequential Recommendation.
CoRR, 2024

Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models.
CoRR, 2024

Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models.
CoRR, 2024

ATLAS: A System for PDF-centric Human Interaction Data Collection.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

ADOPD: A Large-Scale Document Page Decomposition Dataset.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SOHES: Self-supervised Open-world Hierarchical Entity Segmentation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Customization Assistant for Text-to-image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TRINS: Towards Multimodal Language Models that Can Read.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Marco: Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023
Token-Level Adversarial Prompt Detection Based on Perplexity Measures and Contextual Information.
CoRR, 2023

AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models.
CoRR, 2023

LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding.
CoRR, 2023

Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach.
CoRR, 2023

Label-Retrieval-Augmented Diffusion Models for Learning from Noisy Labels.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Critical Analysis of Document Out-of-Distribution Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

StandARone: Infrared-Watermarked Documents as Portable Containers of AR Interaction and Personalization.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
Unified Pretraining Framework for Document Understanding.
CoRR, 2022

MGDoc: Pre-training with Multi-granular Hierarchy for Document Image Understanding.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Language-Free Training for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dually Noted: Layout-Aware Annotations with Smartphone Augmented Reality.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Secure and Efficient Agreement Signing Atop Blockchain and Decentralized Identity.
Proceedings of the Blockchain and Trustworthy Systems - 4th International Conference, 2022

User-Entity Differential Privacy in Learning Natural Language Models.
Proceedings of the IEEE International Conference on Big Data, 2022

Learning Adaptive Axis Attentions in Fine-tuning: Beyond Fixed Sparse Attention Patterns.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

TiGAN: Text-Based Interactive Image Generation and Manipulation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
LAFITE: Towards Language-Free Training for Text-to-Image Generation.
CoRR, 2021

Lets Make A Story Measuring MR Child Engagement.
CoRR, 2021

UniDoc: Unified Pretraining Framework for Document Understanding.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Open-Domain Question Answering with Pre-Constructed Question Spaces.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021

Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

A Mixed-Reality System to Promote Child Engagement in Remote Intergenerational Storytelling.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality Adjunct, 2021

2020
Using Behavioral Interactions from a Mobile Device to Classify the Reader's Prior Familiarity and Goal Conditions.
CoRR, 2020

Self-Supervised Relationship Probing.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Scalable Differential Privacy with Certified Robustness in Adversarial Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Cross-Domain Document Object Detection: Benchmark Suite and Method.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


  Loading...