Yu Liu

Orcid: 0009-0005-1369-3250

Affiliations:
  • Sensetime Group Limited, Beijing, China
  • Chinese University of Hong Kong, Hong Kong


According to our database1, Yu Liu authored at least 103 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

2016
2017
2018
2019
2020
2021
2022
2023
2024
0
5
10
15
20
25
14
11
5
2
4
3
1
3
2
12
13
9
3
5
7
4
4
1

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Enhancing Vision-Language Model with Unmasked Token Alignment.
Trans. Mach. Learn. Res., 2024

Adaptive pessimism via target Q-value for offline reinforcement learning.
Neural Networks, 2024

SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction.
CoRR, 2024

CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation.
CoRR, 2024

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines.
CoRR, 2024

PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements.
CoRR, 2024

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models.
CoRR, 2024

UniZero: Generalized and Efficient Planning with Scalable Latent World Models.
CoRR, 2024

Phased Consistency Model.
CoRR, 2024

MoVA: Adapting Mixture of Vision Experts to Multimodal Context.
CoRR, 2024

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching.
CoRR, 2024

FlashFace: Human Image Personalization with High-fidelity Identity Preservation.
CoRR, 2024

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models.
CoRR, 2024

AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning.
CoRR, 2024

AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data.
Proceedings of the SIGGRAPH Asia 2024 Technical Communications, 2024

Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks.
Proceedings of the Computer Vision - ECCV 2024, 2024

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Be-Your-Outpainter: Mastering Video Outpainting Through Input-Specific Adaptation.
Proceedings of the Computer Vision - ECCV 2024, 2024

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LMDrive: Closed-Loop End-to-End Driving with Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GLID: Pre-training a Generalist Encoder-Decoder Vision Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Critic-Guided Decision Transformer for Offline Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Teach-DETR: Better Training DETR With Teachers.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

UniFormer: Unifying Convolution and Self-Attention for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

LMDrive: Closed-Loop End-to-End Driving with Large Language Models.
CoRR, 2023

Towards Large-scale Masked Face Recognition.
CoRR, 2023

Masked Pretraining for Multi-Agent Decision Making.
CoRR, 2023

Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations.
CoRR, 2023

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.
CoRR, 2023

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths.
CoRR, 2023

Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising.
CoRR, 2023

Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding.
CoRR, 2023

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Transformer-based Open-world Instance Segmentation with Cross-task Consistency Regularization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Accelerating Reinforcement Learning for Autonomous Driving Using Task-Agnostic and Ego-Centric Motion Skills.
IROS, 2023

DETRs with Collaborative Hybrid Assignments Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

ReasonNet: End-to-End Driving with Temporal and Global Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Large-batch Optimization for Dense Visual Predictions.
CoRR, 2022

Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization.
CoRR, 2022

MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning.
CoRR, 2022

Meta Knowledge Distillation.
CoRR, 2022

UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning.
CoRR, 2022

Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Self-slimmed Vision Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Robust Face Recognition with Comprehensive Search.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking Robust Representation Learning Under Fine-Grained Noisy Faces.
Proceedings of the Computer Vision - ECCV 2022, 2022

TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unifying Visual Perception by Dispersible Points Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer.
Proceedings of the Conference on Robot Learning, 2022

2021
INTERN: A New Learning Paradigm Towards General Vision.
CoRR, 2021

FNAS: Uncertainty-Aware Fast Neural Architecture Search.
CoRR, 2021

Rectifying the Data Bias in Knowledge Distillation.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Switchable K-class Hyperplanes for Noise-Robust Representation Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020.
CoRR, 2020

1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020.
CoRR, 2020

1st Place Solutions for OpenImage2019 - Object Detection and Instance Segmentation.
CoRR, 2020

Top-1 Solution of Multi-Moments in Time Challenge 2019.
CoRR, 2020

Discriminability Distillation in Group Representation Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Where to Focus for Efficient Video Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Revisiting the Sibling Head in Object Detector.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

KPNet: Towards Minimal Face Detector.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection.
Int. J. Comput. Vis., 2019

An incremental model transfer method for complex process fault diagnosis.
IEEE CAA J. Autom. Sinica, 2019

Learning Motion Priors for Efficient Video Object Detection.
CoRR, 2019

Towards Flops-Constrained Face Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Correlation Congruence for Knowledge Distillation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Differentiable Kernel Evolution.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Knowledge Distillation via Route Constrained Optimization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Conditional Adversarial Generative Flow for Controllable Image Synthesis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Gradient Harmonized Single-Stage Detector.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Crafting GBD-Net for Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Transductive Centroid Projection for Semi-supervised Large-Scale Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Exploring Disentangled Feature Representation Beyond Face Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Region-Based Quality Estimation Network for Large-Scale Person Re-Identification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Rethinking Feature Discrimination and Polymerization for Large-scale Recognition.
CoRR, 2017

Learning Deep Features via Congenerous Cosine Loss for Person Recognition.
CoRR, 2017

Zoom Out-and-In Network with Recursive Training for Object Proposal.
CoRR, 2017

Do we really need more training data for object localization.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Recurrent Scale Approximation for Object Detection in CNN.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Quality Aware Network for Set to Set Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Scale-Aware Face Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
3D object understanding with 3D Convolutional Neural Networks.
Inf. Sci., 2016

Cascade shallow CNN structure for face verification and identification.
Neurocomputing, 2016

POI: Multiple Object Tracking with High Performance Detection and Appearance Feature.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016


  Loading...