Mengdan Zhang

Orcid: 0009-0008-2911-5369

According to our database¹, Mengdan Zhang authored at least 30 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2024

VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models.

[BibT_eX]

[DOI]

CoRR, 2024

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

Multimodal Inplace Prompt Tuning for Open-set Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Aligning and Prompting Everything All at Once for Universal Visual Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise.

[BibT_eX]

[DOI]

CoRR, 2023

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Multi-modal Queried Object Detection in the Wild.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

Transformers in computational visual media: A survey.

[BibT_eX]

[DOI]

Comput. Vis. Media, 2022

Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization.

[BibT_eX]

[DOI]

CoRR, 2022

Efficient Decoder-Free Object Detection with Transformers.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

ARM: Any-Time Super-Resolution Method.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

On The Consistency Training for Open-Set Semi-Supervised Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Dig into Multi-modal Cues for Video Retrieval with Hierarchical Alignment.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Integrated Modalities And Multi-Level Granularity: Towards A Unified Video-Text Retrieval Framework.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Learning to Know Where to See: A Visibility-Aware Approach for Occluded Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Dive Deeper into Box for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

2018

Do not Lose the Details: Reinforced Representation Learning for High Performance Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Online Multi-Target Tracking with Tensor-Based High-Order Graph Matching.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

SPCNet: Scale Position Correlation Network for End-to-End Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Visual Tracking via Spatially Aligned Correlation Filters Network.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

DCFNet: Discriminant Correlation Filters Network for Visual Tracking.

[BibT_eX]

[DOI]

CoRR, 2017

Robust Visual Object Tracking with Top-down Reasoning.

[BibT_eX]

[DOI]

Mengdan Zhang

Jiashi Feng

Weiming Hu

Proceedings of the 2017 ACM on Multimedia Conference, 2017

The Visual Object Tracking VOT2017 Challenge Results.

[BibT_eX]

[DOI]

Abdelrahman Eldesokey

Alireza Memarmoghadam

Gorthi R. K. Sai Subrahmanyam

Goutam Bhat

Guan Huang

Guilherme Sousa Bastos

Kannappan Palaniappan

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

A New Approach to Compute CNNs for Extremely Large Images.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016

The Visual Object Tracking VOT2016 Challenge Results.

[BibT_eX]

[DOI]

Alireza Memarmoghadam

Gorthi R. K. Sai Subrahmanyam

Guilherme Sousa Bastos

Kannappan Palaniappan

Mario Edoardo Maresca

Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

2015

Robust visual tracking using joint scale-spatial correlation filters.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Joint Scale-Spatial Correlation Tracking with Adaptive Rotation Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Mengdan Zhang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...