Haoyu Dong

Orcid: 0000-0003-0692-2228

Affiliations:
  • Microsoft Research, China


According to our database1, Haoyu Dong authored at least 29 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator Fine-tuning.
CoRR, 2024

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models.
CoRR, 2024

Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities.
CoRR, 2024

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries.
CoRR, 2024

TTC-QuAli: A Text-Table-Chart Dataset for Multimodal Quantity Alignment.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Large Language Models for Tabular Data: Progresses and Future Directions.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

OpenTE: Open-Structure Table Extraction From Text.
Proceedings of the IEEE International Conference on Acoustics, 2024

Encoding Spreadsheets for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

KET-QA: A Dataset for Knowledge Enhanced Table Question Answering.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Personalized Educational Video Evaluation Combining Student's Cognitive and Teaching Style.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

HermEs: Interactive Spreadsheet Formula Prediction via Hierarchical Formulet Expansion.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SheetPT: Spreadsheet Pre-training Based on Hierarchical Attention Network.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Reflection of Thought: Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems.
CoRR, 2022

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data.
CoRR, 2022

Table Pre-training: A Survey on Model Architectures, Pretraining Objectives, and Downstream Tasks.
CoRR, 2022

Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

FORTAP: Using Formulas for Numerical-Reasoning-Aware Table Pretraining.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
FORTAP: Using Formulae for Numerical-Reasoning-Aware Table Pretraining.
CoRR, 2021

TUTA: Tree-based Transformers for Generally Structured Table Pre-training.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Semantic table structure identification in spreadsheets.
Proceedings of the ISSTA '21: 30th ACM SIGSOFT International Symposium on Software Testing and Analysis, 2021

2020
Structure-aware Pre-training for Table Understanding with Tree-based Transformers.
CoRR, 2020

Learning Formatting Style Transfer and Structure Extraction for Spreadsheet Tables with a Hybrid Neural Network Architecture.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Neural Formatting for Spreadsheet Tables.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
Visualization Assessment: A Machine Learning Approach.
Proceedings of the 30th IEEE Visualization Conference, 2019

TableSense: Spreadsheet Table Detection with Convolutional Neural Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019


  Loading...