Yongming Rao

Francisco Gómez Fernández

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Generative Multimodal Models are In-Context Learners.

CoRR, 2023

UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PLOT: Prompt Learning with Optimal Transport for Vision-Language Models.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unleashing Text-to-Image Diffusion Models for Visual Perception.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TCOVIS: Temporally Consistent Online Video Instance Segmentation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FLAG3D: A 3D Fitness Activity Dataset with Language Instruction.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

VideoABC: A Real-World Video Dataset for Abductive Visual Reasoning.

IEEE Trans. Image Process., 2022

Video Saliency Forecasting Transformer.

IEEE Trans. Circuits Syst. Video Technol., 2022

Structure-Preserving Image Super-Resolution.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Prompt Learning with Optimal Transport for Vision-Language Models.

CoRR, 2022

P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection.

Proceedings of the Computer Vision - ECCV 2022, 2022

AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers.

Proceedings of the Computer Vision - ECCV 2022, 2022

Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation.

Proceedings of the Conference on Robot Learning, 2022

2021

Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results.

Qinlong Wang

Yang Yang

CoRR, 2021

Global Filter Networks for Image Classification.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Towards Interpretable Deep Metric Learning with Structural Matching.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Group-aware Contrastive Regression for Action Quality Assessment.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multi-Proxy Wasserstein Classifier for Image Classification.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Detection Method of Chip Surface Weak Defect Based on Convolution Denoising Auto-encoders (基于卷积去噪自编码器的芯片表面弱缺陷检测方法).

Computer Science (计算机科学), 2020

MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation.

Proceedings of the Computer Vision - ECCV 2020, 2020

Temporal Coherence or Temporal Motion: Which Is More Critical for Video-Based Person Re-identification?

Proceedings of the Computer Vision - ECCV 2020, 2020

Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Structure-Preserving Super Resolution With Gradient Guidance.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Deep Face Super-Resolution With Iterative Collaboration Between Attentive Recovery and Landmark Estimation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Runtime Network Routing for Efficient Image Classification.

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Visual Analysis Method of Traffic Accident Spatial-Temporal Pattern (交通事故时空模式可视分析方法).

Computer Science (计算机科学), 2019

Learning Discriminative Aggregation Network for Video-Based Face Recognition and Person Re-identification.

Int. J. Comput. Vis., 2019

P²GNet: Pose-Guided Point Cloud Generating Networks for 6-DoF Object Pose Estimation.

CoRR, 2019

COIN: A Large-Scale Dataset for Comprehensive Instructional Video Analysis.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Spherical Fractal Convolutional Neural Networks for Point Cloud Recognition.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Learning Globally Optimized Object Detector via Policy Gradient.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Runtime Neural Pruning.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

V-Tree: Efficient kNN Search on Moving Objects with Road-Network Constraints.

Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Learning Discriminative Aggregation Network for Video-Based Face Recognition.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Attention-Aware Deep Reinforcement Learning for Video Face Recognition.
