Yu Liu

ORCID: 0009-0005-1369-3250

Affiliations:

SenseTime Group Limited, Beijing, China
Chinese University of Hong Kong, Hong Kong

According to our database¹, Yu Liu authored at least 103 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

Enhancing Vision-Language Model with Unmasked Token Alignment.

Trans. Mach. Learn. Res., 2024

Adaptive pessimism via target Q-value for offline reinforcement learning.

Neural Networks, 2024

SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction.

CoRR, 2024

CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation.

CoRR, 2024

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines.

CoRR, 2024

PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements.

CoRR, 2024

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models.

CoRR, 2024

UniZero: Generalized and Efficient Planning with Scalable Latent World Models.

CoRR, 2024

Phased Consistency Model.

Fu-Yun Wang

Zhaoyang Huang

Alexander William Bergman

CoRR, 2024

MoVA: Adapting Mixture of Vision Experts to Multimodal Context.

CoRR, 2024

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching.

CoRR, 2024

FlashFace: Human Image Personalization with High-fidelity Identity Preservation.

CoRR, 2024

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models.

CoRR, 2024

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning.

CoRR, 2024

AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data.

Proceedings of the SIGGRAPH Asia 2024 Technical Communications, 2024

Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks.

Proceedings of the Computer Vision - ECCV 2024, 2024

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Be-Your-Outpainter: Mastering Video Outpainting Through Input-Specific Adaptation.

Proceedings of the Computer Vision - ECCV 2024, 2024

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model.

Proceedings of the Computer Vision - ECCV 2024, 2024

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.

Proceedings of the Computer Vision - ECCV 2024, 2024

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LMDrive: Closed-Loop End-to-End Driving with Large Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GLID: Pre-training a Generalist Encoder-Decoder Vision Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Critic-Guided Decision Transformer for Offline Reinforcement Learning.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Teach-DETR: Better Training DETR With Teachers.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding.

IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

UniFormer: Unifying Convolution and Self-Attention for Visual Recognition.

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

LMDrive: Closed-Loop End-to-End Driving with Large Language Models.

CoRR, 2023

Towards Large-scale Masked Face Recognition.

CoRR, 2023

Masked Pretraining for Multi-Agent Decision Making.

CoRR, 2023

Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations.

CoRR, 2023

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.

CoRR, 2023

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths.

CoRR, 2023

Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising.

CoRR, 2023

Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding.

CoRR, 2023

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Transformer-based Open-world Instance Segmentation with Cross-task Consistency Regularization.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Accelerating Reinforcement Learning for Autonomous Driving Using Task-Agnostic and Ego-Centric Motion Skills.

IROS, 2023

DETRs with Collaborative Hybrid Assignments Training.

Zhuofan Zong

Guanglu Song

Yu Liu

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

ReasonNet: End-to-End Driving with Temporal and Global Reasoning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Large-batch Optimization for Dense Visual Predictions.

CoRR, 2022

Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization.

CoRR, 2022

MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning.

CoRR, 2022

Meta Knowledge Distillation.

CoRR, 2022

UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning.

CoRR, 2022

Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Self-slimmed Vision Transformer.

Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Robust Face Recognition with Comprehensive Search.

Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking Robust Representation Learning Under Fine-Grained Noisy Faces.

Proceedings of the Computer Vision - ECCV 2022, 2022

TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers.

Proceedings of the Computer Vision - ECCV 2022, 2022

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP.

Proceedings of the Computer Vision - ECCV 2022, 2022

Unifying Visual Perception by Dispersible Points Learning.

Proceedings of the Computer Vision - ECCV 2022, 2022

Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer.

Proceedings of the Conference on Robot Learning, 2022

2021

INTERN: A New Learning Paradigm Towards General Vision.

CoRR, 2021

FNAS: Uncertainty-Aware Fast Neural Architecture Search.

CoRR, 2021

Rectifying the Data Bias in Knowledge Distillation.

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Switchable K-class Hyperplanes for Noise-Robust Representation Learning.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020.

CoRR, 2020

1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020.

CoRR, 2020

1st Place Solutions for OpenImage2019 - Object Detection and Instance Segmentation.

CoRR, 2020

Top-1 Solution of Multi-Moments in Time Challenge 2019.

CoRR, 2020

Discriminability Distillation in Group Representation Learning.

Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Where to Focus for Efficient Video Object Detection.

Proceedings of the Computer Vision - ECCV 2020, 2020

Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Revisiting the Sibling Head in Object Detector.

Guanglu Song

Yu Liu

Xiaogang Wang

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

KPNet: Towards Minimal Face Detector.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection.

Int. J. Comput. Vis., 2019

An incremental model transfer method for complex process fault diagnosis.

Xiaogang Wang

Xiyu Liu

Yu Liu

IEEE CAA J. Autom. Sinica, 2019

Learning Motion Priors for Efficient Video Object Detection.

CoRR, 2019

Towards Flops-Constrained Face Recognition.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Correlation Congruence for Knowledge Distillation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Differentiable Kernel Evolution.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Knowledge Distillation via Route Constrained Optimization.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Conditional Adversarial Generative Flow for Controllable Image Synthesis.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Gradient Harmonized Single-Stage Detector.

Buyu Li

Yu Liu

Xiaogang Wang

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Crafting GBD-Net for Object Detection.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Transductive Centroid Projection for Semi-supervised Large-Scale Recognition.

Proceedings of the Computer Vision - ECCV 2018, 2018

Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Exploring Disentangled Feature Representation Beyond Face Identification.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Region-Based Quality Estimation Network for Large-Scale Person Re-Identification.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Rethinking Feature Discrimination and Polymerization for Large-scale Recognition.

Yu Liu

Hongyang Li

Xiaogang Wang

CoRR, 2017

Learning Deep Features via Congenerous Cosine Loss for Person Recognition.

Yu Liu

Hongyang Li

Xiaogang Wang

CoRR, 2017

Zoom Out-and-In Network with Recursive Training for Object Proposal.

CoRR, 2017

Do we really need more training data for object localization.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Recurrent Scale Approximation for Object Detection in CNN.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Quality Aware Network for Set to Set Recognition.

Yu Liu

Junjie Yan

Wanli Ouyang

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Scale-Aware Face Detection.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

3D object understanding with 3D Convolutional Neural Networks.

Inf. Sci., 2016

Cascade shallow CNN structure for face verification and identification.

Neurocomputing, 2016

POI: Multiple Object Tracking with High Performance Detection and Appearance Feature.

Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016
