Zhilong Ji

Orcid: 0000-0002-8799-3409

According to our database¹, Zhilong Ji authored at least 45 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

Real face foundation representation learning for generalized deepfake detection.

Pattern Recognit., 2025

CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024

Explicit Relational Reasoning Network for Scene Text Detection.

CoRR, 2024

VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models.

CoRR, 2024

CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models.

CoRR, 2024

Two Optimizers Are Better Than One: LLM Catalyst for Enhancing Gradient-Based Optimization.

CoRR, 2024

MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation.

CoRR, 2024

MuMath: Multi-perspective Data Augmentation for Mathematical Reasoning in Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Collaborative Domain Alignment for Multi-source Domain Adaptation.

Proceedings of the Pattern Recognition - 27th International Conference, 2024

DPA-2D: Depth Propagation and Alignment with 2D Observations Guidance for Human Mesh Recovery.

Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation.

Proceedings of the Computer Vision - ECCV 2024, 2024

HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CK12: A Rounded K12 Knowledge Graph Based Benchmark for Chinese Holistic Cognition Evaluation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Decoupled Textual Embeddings for Customized Image Generation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Patch Is Not All You Need.

CoRR, 2023

Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition.

CoRR, 2023

Masked and Permuted Implicit Context Learning for Scene Text Recognition.

CoRR, 2023

Only Classification Head Is Sufficient for Medical Image Segmentation.

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

CCLAP: Controllable Chinese Landscape Painting Generation Via Latent Diffusion Model.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Decoupling Visual-Semantic Features Learning with Dual Masked Autoencoder for Self-Supervised Scene Text Recognition.

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ViSA: Visual and Semantic Alignment for Robust Scene Text Recognition.

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition.

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Black-Box Tuning of Vision-Language Models with Effective Gradient Approximation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Texts as Images in Prompt Tuning for Multi-Label Image Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ReCoT: Regularized Co-Training for Facial Action Unit Recognition with Noisy Labels.

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation.

CoRR, 2022

Position-Aware Contrastive Alignment for Referring Image Segmentation.

CoRR, 2022

1st Place Solutions for UG2+ Challenge 2022 ATMOSPHERIC TURBULENCE MITIGATION.

CoRR, 2022

1st Place Solutions for the UVO Challenge 2022.

CoRR, 2022

Towards Diverse and Faithful One-shot Adaption of Generative Adversarial Networks.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Vision Transformer Based Scene Text Recognizer with Multi-grained Encoding and Decoding.

Proceedings of the Frontiers in Handwriting Recognition - 18th International Conference, 2022

When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition.

Proceedings of the Computer Vision - ECCV 2022, 2022

Syntax-Aware Network for Handwritten Mathematical Expression Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Gaze Estimation with an Ensemble of Four Architectures.

CoRR, 2021

1st Place Solutions for UG2+ Challenge 2021 - (Semi-)supervised Face detection in the low light condition.

CoRR, 2021

Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

3DCNN Backed Conv-LSTM Auto Encoder for Micro Facial Expression Video Recognition.

Adam Ahmed Qaid Mohammed

Yongsheng Sang

Proceedings of the Machine Learning and Intelligent Communications, 2021

Automatic 3D Skeleton-based Dynamic Hand Gesture Recognition Using Multi-Layer Convolutional LSTM.

Proceedings of the ICRAI 2021: 7th International Conference on Robotics and Artificial Intelligence, Guangzhou, China, November 19, 2021

Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Local Global Relational Network for Facial Action Units Recognition.

Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

2020

TAL EmotioNet Challenge 2020 Rethinking the Model Chosen Problem in Multi-Task Learning.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
