Zhilong Ji

Orcid: 0000-0002-8799-3409

According to our database1, Zhilong Ji authored at least 42 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models.
CoRR, 2024

CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models.
CoRR, 2024

Two Optimizers Are Better Than One: LLM Catalyst for Enhancing Gradient-Based Optimization.
CoRR, 2024

MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation.
CoRR, 2024

MuMath: Multi-perspective Data Augmentation for Mathematical Reasoning in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Collaborative Domain Alignment for Multi-source Domain Adaptation.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

DPA-2D: Depth Propagation and Alignment with 2D Observations Guidance for Human Mesh Recovery.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CK12: A Rounded K12 Knowledge Graph Based Benchmark for Chinese Holistic Cognition Evaluation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Decoupled Textual Embeddings for Customized Image Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Patch Is Not All You Need.
CoRR, 2023

Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition.
CoRR, 2023

Masked and Permuted Implicit Context Learning for Scene Text Recognition.
CoRR, 2023

Only Classification Head Is Sufficient for Medical Image Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

CCLAP: Controllable Chinese Landscape Painting Generation Via Latent Diffusion Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Decoupling Visual-Semantic Features Learning with Dual Masked Autoencoder for Self-Supervised Scene Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ViSA: Visual and Semantic Alignment for Robust Scene Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Black-Box Tuning of Vision-Language Models with Effective Gradient Approximation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Texts as Images in Prompt Tuning for Multi-Label Image Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ReCoT: Regularized Co-Training for Facial Action Unit Recognition with Noisy Labels.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation.
CoRR, 2022

Position-Aware Contrastive Alignment for Referring Image Segmentation.
CoRR, 2022

1st Place Solutions for UG2+ Challenge 2022 ATMOSPHERIC TURBULENCE MITIGATION.
CoRR, 2022

1st Place Solutions for the UVO Challenge 2022.
CoRR, 2022

Towards Diverse and Faithful One-shot Adaption of Generative Adversarial Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Vision Transformer Based Scene Text Recognizer with Multi-grained Encoding and Decoding.
Proceedings of the Frontiers in Handwriting Recognition - 18th International Conference, 2022

When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Syntax-Aware Network for Handwritten Mathematical Expression Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Gaze Estimation with an Ensemble of Four Architectures.
CoRR, 2021

1st Place Solutions for UG2+ Challenge 2021 - (Semi-)supervised Face detection in the low light condition.
CoRR, 2021

Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

3DCNN Backed Conv-LSTM Auto Encoder for Micro Facial Expression Video Recognition.
Proceedings of the Machine Learning and Intelligent Communications, 2021

Automatic 3D Skeleton-based Dynamic Hand Gesture Recognition Using Multi-Layer Convolutional LSTM.
Proceedings of the ICRAI 2021: 7th International Conference on Robotics and Artificial Intelligence, Guangzhou, China, November 19, 2021

Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Local Global Relational Network for Facial Action Units Recognition.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

2020
TAL EmotioNet Challenge 2020 Rethinking the Model Chosen Problem in Multi-Task Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


  Loading...