Mengdan Zhang

Orcid: 0009-0008-2911-5369

According to our database1, Mengdan Zhang authored at least 30 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection.
IEEE Trans. Image Process., 2024

VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models.
CoRR, 2024

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis.
CoRR, 2024

Multimodal Inplace Prompt Tuning for Open-set Object Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Aligning and Prompting Everything All at Once for Universal Visual Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise.
CoRR, 2023

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models.
CoRR, 2023

Multi-modal Queried Object Detection in the Wild.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Transformers in computational visual media: A survey.
Comput. Vis. Media, 2022

Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization.
CoRR, 2022

Efficient Decoder-Free Object Detection with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

ARM: Any-Time Super-Resolution Method.
Proceedings of the Computer Vision - ECCV 2022, 2022

Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
On The Consistency Training for Open-Set Semi-Supervised Learning.
CoRR, 2021

Dig into Multi-modal Cues for Video Retrieval with Hierarchical Alignment.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Integrated Modalities And Multi-Level Granularity: Towards A Unified Video-Text Retrieval Framework.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Learning to Know Where to See: A Visibility-Aware Approach for Occluded Person Re-identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Dive Deeper into Box for Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

2018
Do not Lose the Details: Reinforced Representation Learning for High Performance Visual Tracking.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Online Multi-Target Tracking with Tensor-Based High-Order Graph Matching.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

SPCNet: Scale Position Correlation Network for End-to-End Visual Tracking.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Visual Tracking via Spatially Aligned Correlation Filters Network.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
DCFNet: Discriminant Correlation Filters Network for Visual Tracking.
CoRR, 2017

Robust Visual Object Tracking with Top-down Reasoning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

The Visual Object Tracking VOT2017 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

A New Approach to Compute CNNs for Extremely Large Images.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
The Visual Object Tracking VOT2016 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

2015
Robust visual tracking using joint scale-spatial correlation filters.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Joint Scale-Spatial Correlation Tracking with Adaptive Rotation Estimation.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015


  Loading...