Rui Zhao

Orcid: 0000-0001-5874-131X

Affiliations:
  • Sense Time Research, China
  • Shanghai Jiaotong University, Qingyuan Research Institute, China
  • Chinese Academy of Science, Shenzhen Institutes of Advanced Technology, SIAT-SenseTime Joint Lab, China (former)
  • Chinese University of Hong Kong, Department of Electronic Engineering, Hong Kong (PhD 2015)


According to our database1, Rui Zhao authored at least 128 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A position-enhanced sequential feature encoding model for lung infections and lymphoma classification on CT images.
Int. J. Comput. Assist. Radiol. Surg., October, 2024

A General Scenario-Agnostic Reinforcement Learning for Traffic Signal Control.
IEEE Trans. Intell. Transp. Syst., September, 2024

Structured Domain Adaptation With Online Relation Regularization for Unsupervised Person Re-ID.
IEEE Trans. Neural Networks Learn. Syst., January, 2024

Relation-Aware Distribution Representation Network for Person Clustering With Multiple Modalities.
IEEE Trans. Multim., 2024

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
CoRR, 2024

Hybrid Mamba for Few-Shot Segmentation.
CoRR, 2024

QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning.
CoRR, 2024

GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control Agents.
CoRR, 2024

TimeCMA: Towards LLM-Empowered Time Series Forecasting via Cross-Modality Alignment.
CoRR, 2024

Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation.
CoRR, 2024

Causal Evaluation of Language Models.
CoRR, 2024

PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency.
CoRR, 2024

Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation.
CoRR, 2024

Towards A Better Metric for Text-to-Video Generation.
CoRR, 2024

Spatial-Temporal Large Language Model for Traffic Prediction.
Proceedings of the 25th IEEE International Conference on Mobile Data Management, 2024

CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

InstructDET: Diversifying Referring Object Detection with Generalized Instructions.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Eliminating Feature Ambiguity for Few-Shot Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

DragAnything: Motion Control for Anything Using Entity Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

SQL-to-Schema Enhances Schema Linking in Text-to-SQL.
Proceedings of the Database and Expert Systems Applications, 2024

X- Adapter: Universal Compatibility of Plugins for Upgraded Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-Supervised Representation Learning from Arbitrary Scenarios.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Instruct-ReID: A Multi-Purpose Person Re-Identification Task with Instructions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Non-Neighbors Also Matter to Kriging: A New Contrastive-Prototypical Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
COCAS+: Large-Scale Clothes-Changing Person Re-Identification With Clothes Templates.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

AutoMA: Towards Automatic Model Augmentation for Transferable Adversarial Attacks.
IEEE Trans. Multim., 2023

VisionTraj: A Noise-Robust Trajectory Recovery Framework based on Large-scale Camera Network.
CoRR, 2023

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model.
CoRR, 2023

Hulk: A Universal Knowledge Translator for Human-Centric Tasks.
CoRR, 2023

Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach.
CoRR, 2023

TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems.
CoRR, 2023

KITS: Inductive Spatio-Temporal Kriging with Increment Training Strategy.
CoRR, 2023

Reboost Large Language Model-based Text-to-SQL, Text-to-Python, and Text-to-Function - with Real Applications in Traffic Domain.
CoRR, 2023

DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing.
CoRR, 2023

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
CoRR, 2023

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation.
CoRR, 2023

Link-Context Learning for Multimodal LLMs.
CoRR, 2023

TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents.
CoRR, 2023

Exposing the Troublemakers in Described Object Detection.
CoRR, 2023

Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic.
CoRR, 2023

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis.
CoRR, 2023

Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions.
CoRR, 2023

3D Model-based Zero-Shot Pose Estimation Pipeline.
CoRR, 2023

Advancing Referring Expression Segmentation Beyond Single Image.
CoRR, 2023

Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems.
CoRR, 2023

Better Aligning Text-to-Image Models with Human Preference.
CoRR, 2023

Efficient Masked Autoencoders with Self-Consistency.
CoRR, 2023

Saliency Guided Contrastive Learning on Scene Images.
CoRR, 2023

Described Object Detection: Liberating Object Detection with Flexible Expressions.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MM-DAG: Multi-task DAG Learning for Multi-modal Data - with Application for Traffic Congestion Analysis.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contextual Image Masking Modeling via Synergized Contrasting without View Augmentation for Faster and Better Visual Pretraining.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Cycle-consistent Masked AutoEncoder for Unsupervised Domain Generalization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SparseMAE: Sparse Training Meets Masked Autoencoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Advancing Referring Expression Segmentation Beyond Single Image.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Human Preference Score: Better Aligning Text-to-image Models with Human Preference.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Trust Your Partner's Friends: Hierarchical Cross-Modal Contrastive Pre-Training for Video-Text Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Critical Perceptual Pre-trained Model for Complex Trajectory Recovery.
Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Searching and Mining Large Collections of Geospatial Data, 2023

CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Balancing Logit Variation for Long-Tailed Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

HumanBench: Towards General Human-Centric Perception with Projector Assisted Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

UniHCP: A Unified Model for Human-Centric Perceptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Correlated Time Series Self-Supervised Representation Learning via Spatiotemporal Bootstrapping.
Proceedings of the 19th IEEE International Conference on Automation Science and Engineering, 2023

Dynamic Causal Graph Convolutional Network for Traffic Prediction.
Proceedings of the 19th IEEE International Conference on Automation Science and Engineering, 2023

SeqCo-DETR: Sequence Consistency Training for Self-Supervised Object Detection with Transformers.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Uni6Dv2: Noise Elimination for 6D Pose Estimation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Exploring Stochastic Autoregressive Image Modeling for Visual Representation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection.
CoRR, 2022

Uni6Dv3: 5D Anchor Mechanism for 6D Pose Estimation.
CoRR, 2022

DOTIN: Dropping Task-Irrelevant Nodes for GNNs.
CoRR, 2022

Unsupervised Object Detection Pretraining with Joint Object Priors Generation and Detector Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Focus Your Distribution: Coarse-to-Fine Non-Contrastive Learning for Anomaly Detection and Localization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Zero-CL: Instance and Feature decorrelation for negative-free symmetric contrastive learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Domain Invariant Masked Autoencoders for Self-supervised Learning from Multi-domains.
Proceedings of the Computer Vision - ECCV 2022, 2022

Relative Contrastive Loss for Unsupervised Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unifying Visual Contrastive Learning for Object Recognition from a Graph Perspective.
Proceedings of the Computer Vision - ECCV 2022, 2022

Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

Align Representations with Base: A New Approach to Self-Supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Feature Erasing and Diffusion Network for Occluded Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Revisiting the Transferability of Supervised Pretraining: an MLP Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Uni6D: A Unified CNN Framework without Projection Breakdown for 6D Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Jointly Contrastive Representation Learning on Road Network and Trajectory.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
FastFlow: Unsupervised Anomaly Detection and Localization via 2D Normalizing Flows.
CoRR, 2021

Multiple Domain Experts Collaborative Learning: Multi-Source Domain Generalization For Person Re-Identification.
CoRR, 2021

Neighbourhood-guided Feature Reconstruction for Occluded Person Re-Identification.
CoRR, 2021

Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification.
CoRR, 2021

Consensus-Guided Correspondence Denoising.
CoRR, 2021

Continual Representation Learning for Biometric Identification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

MST: Masked Self-Supervised Transformer for Visual Representation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Efficient Open-Set Adversarial Attacks on Deep Face Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Improving Facial Attribute Recognition by Group and Graph Learning.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Progressive Correspondence Pruning by Consensus Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Structured Domain Adaptation for Unsupervised Person Re-identification.
CoRR, 2020

Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax.
Proceedings of the Computer Vision - ECCV 2020, 2020

Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization.
Proceedings of the Computer Vision - ECCV 2020, 2020

COCAS: A Large-Scale Clothes Changing Person Dataset for Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning to Cluster Faces via Confidence and Connectivity Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Density-Aware Feature Embedding for Face Clustering.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Memory-Based Neighbourhood Embedding for Visual Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

P2SGrad: Refined Gradients for Optimizing Deep Face Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Attention-Aware Compositional Network for Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Person Re-Identification by Saliency Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

2016
Crossing-Line Crowd Counting with Two-Phase Deep Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Saliency detection by multi-context deep learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Person Re-identification: System Design and Evaluation Overview.
Proceedings of the Person Re-Identification, 2014

Highly Efficient Forward and Backward Propagation of Convolutional Neural Networks for Pixelwise Classification.
CoRR, 2014

Learning Mid-level Filters for Person Re-identification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

DeepReID: Deep Filter Pairing Neural Network for Person Re-identification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Counting Vehicles from Semantic Regions.
IEEE Trans. Intell. Transp. Syst., 2013

Person Re-identification by Salience Matching.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Unsupervised Salience Learning for Person Re-identification.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Human Reidentification with Transferred Metric Learning.
Proceedings of the Computer Vision - ACCV 2012, 2012

2010
SVD based linear filtering in DCT domain.
Proceedings of the International Conference on Image Processing, 2010


  Loading...