Yongming Rao

Orcid: 0000-0003-3952-8753

According to our database, Yongming Rao authored at least 67 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Procedure-Aware Action Quality Assessment: Datasets and Performance Evaluation.
Int. J. Comput. Vis., December, 2024

Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

DIML: Deep Interpretable Metric Learning via Structural Matching.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

StableSwap: Stable Face Swapping in a Shared and Controllable Latent Space.
IEEE Trans. Multim., 2024

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution.
CoRR, 2024

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model.
CoRR, 2024

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models.
CoRR, 2024

Efficient Inference of Vision Instruction-Following Models with Elastic Cache.
Proceedings of the Computer Vision - ECCV 2024, 2024

X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Generative Multimodal Models are In-Context Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
AdaPoinTr: Diverse Point Cloud Completion With Adaptive Geometry-Aware Transformers.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

3D Point-Voxel Correlation Fields for Scene Flow Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

GFNet: Global Filter Networks for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

PointGLR: Unsupervised Structural Representation Learning of 3D Point Clouds.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Generative Multimodal Models are In-Context Learners.
CoRR, 2023

UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PLOT: Prompt Learning with Optimal Transport for Vision-Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unleashing Text-to-Image Diffusion Models for Visual Perception.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TCOVIS: Temporally Consistent Online Video Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FLAG3D: A 3D Fitness Activity Dataset with Language Instruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
VideoABC: A Real-World Video Dataset for Abductive Visual Reasoning.
IEEE Trans. Image Process., 2022

Video Saliency Forecasting Transformer.
IEEE Trans. Circuits Syst. Video Technol., 2022

Structure-Preserving Image Super-Resolution.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Prompt Learning with Optimal Transport for Vision-Language Models.
CoRR, 2022

P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation.
Proceedings of the Conference on Robot Learning, 2022

2021
Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results.
CoRR, 2021

Global Filter Networks for Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Towards Interpretable Deep Metric Learning with Structural Matching.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Group-aware Contrastive Regression for Action Quality Assessment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multi-Proxy Wasserstein Classifier for Image Classification.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Detection Method of Chip Surface Weak Defect Based on Convolution Denoising Auto-encoders.
Computer Science (计算机科学), 2020

MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Temporal Coherence or Temporal Motion: Which Is More Critical for Video-Based Person Re-identification?
Proceedings of the Computer Vision - ECCV 2020, 2020

Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Structure-Preserving Super Resolution With Gradient Guidance.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Deep Face Super-Resolution With Iterative Collaboration Between Attentive Recovery and Landmark Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Runtime Network Routing for Efficient Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Visual Analysis Method of Traffic Accident Spatial-Temporal Pattern.
Computer Science (计算机科学), 2019

Learning Discriminative Aggregation Network for Video-Based Face Recognition and Person Re-identification.
Int. J. Comput. Vis., 2019

P²GNet: Pose-Guided Point Cloud Generating Networks for 6-DoF Object Pose Estimation.
CoRR, 2019

COIN: A Large-Scale Dataset for Comprehensive Instructional Video Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Spherical Fractal Convolutional Neural Networks for Point Cloud Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Learning Globally Optimized Object Detector via Policy Gradient.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Runtime Neural Pruning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

V-Tree: Efficient kNN Search on Moving Objects with Road-Network Constraints.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Learning Discriminative Aggregation Network for Video-Based Face Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Attention-Aware Deep Reinforcement Learning for Video Face Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2017

