Li Zhang

ORCID: 0000-0003-1031-5420

Affiliations:
  • Fudan University, School of Data Science, Shanghai Key Lab of Intelligent Information Processing, Shanghai, China
  • University of Oxford, Department of Engineering Science, UK (former)
  • Queen Mary University of London, UK (PhD 2017)


According to our database, Li Zhang authored at least 103 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Vision Transformers: From Semantic Segmentation to Dense Prediction.
Int. J. Comput. Vis., December, 2024

RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM With Neural Radiance Fields.
IEEE Robotics Autom. Lett., September, 2024

Softmax-Free Linear Transformers.
Int. J. Comput. Vis., August, 2024

A Hybrid Approach for Cross-Modality Pose Estimation Between Image and Point Cloud.
IEEE Robotics Autom. Lett., May, 2024

DeepInteraction++: Multi-Modality Interaction for Autonomous Driving.
CoRR, 2024

Gaussian Splatting with Localized Points Management.
CoRR, 2024

Tetrahedron Splatting for 3D Generation.
CoRR, 2024

Automating the Diagnosis of Human Vision Disorders by Cross-modal 3D Generation.
CoRR, 2024

LaneCorrect: Self-supervised Lane Detection.
CoRR, 2024

Diffusion²: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models.
CoRR, 2024

OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation.
CoRR, 2024

From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs.
CoRR, 2024

S-Agents: Self-organizing Agents in Open-ended Environments.
CoRR, 2024

S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation.
CoRR, 2024

Fast Dynamic 3D Object Generation from a Single-view Video.
CoRR, 2024

Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Reason2Drive: Towards Interpretable and Chain-Based Reasoning for Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2024, 2024

Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing.
Proceedings of the Computer Vision - ECCV 2024, 2024

LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
When, Where and How Does it Fail? A Spatial-Temporal Visual Analytics Approach for Interpretable Object Detection in Autonomous Driving.
IEEE Trans. Vis. Comput. Graph., December, 2023

Dynamic Graph Message Passing Networks.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

SiamMask: A Framework for Fast Online Object Tracking and Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Rethinking Local and Global Feature Representation for Dense Prediction.
Pattern Recognit., 2023

Harnessing Diffusion Models for Visual Perception with Meta Prompts.
CoRR, 2023

WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation.
CoRR, 2023

Periodic Vibration Gaussian: Dynamic Urban Scene Reconstruction and Real-time Rendering.
CoRR, 2023

Relightable 3D Gaussian: Real-time Point Cloud Relighting with BRDF Decomposition and Ray Tracing.
CoRR, 2023

Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting.
CoRR, 2023

Probabilistic Computation with Emerging Covariance: Towards Efficient Uncertainty Quantification.
CoRR, 2023

Preconditioned Score-based Generative Models.
CoRR, 2023

ImpDet: Exploring Implicit Fields for 3D Object Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

SUIT: Learning Significance-Guided Information for 3D Temporal Detection.
IROS, 2023

SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Generative Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Dual Prior Learning for Blind and Blended Image Restoration.
IEEE Trans. Image Process., 2022

SGM3D: Stereo Guided Monocular 3D Object Detection.
IEEE Robotics Autom. Lett., 2022

How to Trust Unlabeled Data? Instance Credibility Inference for Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

DigestPath: A benchmark dataset with challenge review for the pathological detection and segmentation of digestive-system.
Medical Image Anal., 2022

The devil is in the face: Exploiting harmonious representations for facial expression recognition.
Neurocomputing, 2022

Dynamic Graph Message Passing Networks for Visual Recognition.
CoRR, 2022

Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective.
CoRR, 2022

Softmax-free Linear Transformers.
CoRR, 2022

PolarFormer: Multi-camera 3D Object Detection with Polar Transformers.
CoRR, 2022

Accelerating Score-based Generative Models for High-Resolution Image Synthesis.
CoRR, 2022

DeepInteraction: 3D Object Detection via Modality Interaction.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RCLane: Relay Chain Prediction for Lane Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Ego 3D Representation as Ray Tracing.
Proceedings of the Computer Vision - ECCV 2022, 2022

FashionViL: Fashion-Focused Vision-and-Language Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

ONCE-3DLanes: Building Monocular 3D Lane Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UIGR: Unified Interactive Garment Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Global Aggregation Then Local Distribution for Scene Parsing.
IEEE Trans. Image Process., 2021

Towards Efficient Scene Understanding via Squeeze Reasoning.
IEEE Trans. Image Process., 2021

Large-scale gastric cancer screening and localization using multi-task deep neural network.
Neurocomputing, 2021

SGM3D: Stereo Guided Monocular 3D Object Detection.
CoRR, 2021

Progressive Coordinate Transforms for Monocular 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

SOFT: Softmax-free Transformer with Linear Complexity.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Few-shot Learning for Multi-Modality Tasks.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Unified Efficient Pyramid Transformer for Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Boundary-sensitive Pre-training for Temporal Localization in Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Delving into Data: Effectively Substitute Training for Black-box Attack.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Few-shot Action Recognition with Prototype-centered Attentive Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Text-Based Person Search with Limited Data.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Rethinking local and global feature representation for semantic segmentation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Learning a Few-shot Embedding Model with Contrastive Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Egocentric Action Recognition by Video Attention and Temporal Context.
CoRR, 2020

Self-supervised Video Object Segmentation.
CoRR, 2020

Few-shot Action Recognition via Improved Attention with Self-supervision.
CoRR, 2020

Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Few-Shot Action Recognition with Permutation-Invariant Attention.
Proceedings of the Computer Vision - ECCV 2020, 2020

XingGAN for Person Image Generation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Improving Semantic Segmentation via Decoupled Body and Edge Supervision.
Proceedings of the Computer Vision - ECCV 2020, 2020

The Eighth Visual Object Tracking VOT2020 Challenge Results.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Dynamic Graph Message Passing Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Instance Credibility Inference for Few-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

An Embarrassingly Simple Baseline to One-shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Style Normalization and Restitution for Generalizable Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Convolutional Neural Network with Asymmetric Encoding and Decoding Structure for Brain Vessel Segmentation on Computed Tomographic Angiography.
Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2020

Long-Term Cloth-Changing Person Re-identification.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
The Seventh Visual Object Tracking VOT2019 Challenge Results.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Fast Online Object Tracking and Segmentation: A Unifying Approach.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dual Graph Convolutional Network for Semantic Segmentation.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Global Aggregation then Local Distribution in Fully Convolutional Networks.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Cross-view learning.
PhD thesis, 2018

Learn to Interpret Atari Agents.
CoRR, 2018

Learning to Compare: Relation Network for Few-Shot Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Actor-Critic Sequence Training for Image Captioning.
CoRR, 2017

Learning to Learn: Meta-Critic Networks for Sample Efficient Learning.
CoRR, 2017

Learning a Deep Embedding Model for Zero-Shot Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Learning a Discriminative Null Space for Person Re-identification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

