Yiheng Xu

Orcid: 0000-0002-6278-7916

According to our database1, Yiheng Xu authored at least 18 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments.
CoRR, 2024

2023
ML-based Modeling to Predict I/O Performance on Different Storage Sub-systems.
CoRR, 2023

OpenAgents: An Open Platform for Language Agents in the Wild.
CoRR, 2023

Lemur: Harmonizing Natural Language and Code for Language Agents.
CoRR, 2023

In-Context Learning with Many Demonstration Examples.
CoRR, 2023

2022
DiT: Self-supervised Pre-training for Document Image Transformer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Document AI: Benchmarks, Models and Applications.
CoRR, 2021

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding.
CoRR, 2021

Learning Massive Graph Embeddings on a Single Machine.
CoRR, 2021

LayoutReader: Pre-training of Text and Layout for Reading Order Detection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
LayoutLM: Pre-training of Text and Layout for Document Image Understanding.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

DocBank: A Benchmark Dataset for Document Layout Analysis.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Graph Convolutional Networks with Markov Random Field Reasoning for Social Spammer Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2009
Comparative Analysis of Multi-period Portfolio Strategies.
Proceedings of the Business Intelligence: Artificial Intelligence in Business, 2009

2008
An Improved Discrete Particle Swarm Optimization Based on Cooperative Swarms.
Proceedings of the 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2008


  Loading...