Deqiang Jiang

Orcid: 0000-0003-3987-2431

According to our database1, Deqiang Jiang authored at least 33 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Communication-efficient clustered federated learning via model distance.
Mach. Learn., June, 2024

HRVDA: High-Resolution Visual Document Assistant.
CoRR, 2024

Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HRVDA: High-Resolution Visual Document Assistant.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise.
CoRR, 2023

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration.
CoRR, 2023

Looking and Listening: Audio Guided Text Recognition.
CoRR, 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution.
CoRR, 2023

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation.
CoRR, 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-End Solution.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Turning a CLIP Model into a Scene Text Detector.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TaCo: Textual Attribute Recognition via Contrastive Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

The Devil Is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-training.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training.
CoRR, 2022

Semantic-Preserving Abstractive Text Summarization with Siamese Generative Adversarial Net.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

GMN: Generative Multi-modal Network for Practical Document Information Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Relational Representation Learning in Visually-Rich Documents.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Query-driven Generative Network for Document Information Extraction in the Wild.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Neural Collaborative Graph Machines for Table Structure Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Perceiving Stroke-Semantic Context: Hierarchical Contrastive Learning for Robust Scene Text Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Sequence-to-Action: Grammatical Error Correction with Action Guided Sequence Generation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

RecycleNet: An Overlapped Text Instance Recovery Approach.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Hierarchical Multi-label Text Classification with Horizontal and Vertical Category Correlations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
PuzzleNet: Scene Text Detection by Segment Context Graph Learning.
CoRR, 2020

Accurate Structured-Text Spotting for Arithmetical Exercise Correction.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020


  Loading...