Sheng Jin

Orcid: 0000-0001-5736-7434

Affiliations:
  • The University of Hong Kong, Hong Kong, SAR, China
  • SenseTime Research
  • Tsinghua University, Beijing, China (former)


According to our database1, Sheng Jin authored at least 34 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
TCFormer: Visual Recognition via Token Clustering Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension.
CoRR, 2024

TCFormer: Visual Recognition via Token Clustering Transformer.
CoRR, 2024

F-LMM: Grounding Frozen Large Multimodal Models.
CoRR, 2024

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks.
CoRR, 2024

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

When Pedestrian Detection Meets Multi-modal Learning: Generalist Model and Benchmark Dataset.
Proceedings of the Computer Vision - ECCV 2024, 2024

GKGNet: Group K-Nearest Neighbor Based Graph Convolutional Network for Multi-label Image Recognition.
Proceedings of the Computer Vision - ECCV 2024, 2024

UniFS: Universal Few-Shot Instance Perception with Point Representations.
Proceedings of the Computer Vision - ECCV 2024, 2024

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-person Multi-task Human-Centric Perception.
Proceedings of the Computer Vision - ECCV 2024, 2024

CLIM: Contrastive Language-Image Mosaic for Region Representation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ZoomNAS: Searching for Whole-Body Human Pose Estimation in the Wild.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Aligning Bag of Regions for Open-Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
ZoomNAS: Searching for Whole-body Human Pose Estimation in the Wild.
CoRR, 2022

Pose for Everything: Towards Category-Agnostic Pose Estimation.
CoRR, 2022

Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Pose for Everything: Towards Category-Agnostic Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal.
Proceedings of the Computer Vision - ECCV 2022, 2022

PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Robust Few-Shot Learning for User-Provided Data.
IEEE Trans. Neural Networks Learn. Syst., 2021

Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
When Counterpoint Meets Chinese Folk Melodies.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

HiEve ACM MM Grand Challenge 2020: Pose Tracking in Crowded Scenes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Whole-Body Human Pose Estimation in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

Differentiable Hierarchical Graph Grouping for Multi-person Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Hierarchical automatic curriculum learning: Converting a sparse reward navigation task into dense reward.
Neurocomputing, 2019

TRB: A Novel Triplet Representation for Understanding 2D Human Body.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-Person Articulated Tracking With Spatial and Temporal Embeddings.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Connectionist Temporal Classification with Maximum Entropy Regularization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018


  Loading...