Haoyu Cao

Orcid: 0000-0002-3789-9705

According to our database¹, Haoyu Cao authored at least 19 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

Integrating hydrological knowledge into deep learning for DEM super-resolution.

Int. J. Geogr. Inf. Sci., February, 2025

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction.

CoRR, January, 2025

2024

DEM super-resolution framework based on deep learning: decomposing terrain trends and residuals.

Int. J. Digit. Earth, December, 2024

Turning a CLIP Model Into a Scene Text Spotter.

IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Communication-efficient clustered federated learning via model distance.

Mach. Learn., June, 2024

AS2LS: Adaptive Anatomical Structure-Based Two-Layer Level Set Framework for Medical Image Segmentation.

Tianyi Han

Haoyu Cao

Yunyun Yang

IEEE Trans. Image Process., 2024

HRVDA: High-Resolution Visual Document Assistant.

CoRR, 2024

Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HRVDA: High-Resolution Visual Document Assistant.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

HDNeXt: Hybrid Dynamic MedNeXt with Level Set Regularization for Medical Image Segmentation.

Haoyu Cao

Tianyi Han

Yunyun Yang

Proceedings of the Computer Vision - ACCV 2024, 2024

2023

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration.

CoRR, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

GMN: Generative Multi-modal Network for Practical Document Information Extraction.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Relational Representation Learning in Visually-Rich Documents.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Query-driven Generative Network for Document Information Extraction in the Wild.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
