Xinge Zhu

ORCID: 0000-0003-0107-8099

According to our database, Xinge Zhu authored at least 66 papers between 2017 and 2024.

Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation.
IEEE Trans. Multim., 2024

MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas.
CoRR, 2024

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation.
CoRR, 2024

A Unified Framework for Human-centric Point Cloud Video Understanding.
CoRR, 2024

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment.
CoRR, 2024

HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes.
CoRR, 2024

OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries.
CoRR, 2023

SAM-guided Unsupervised Domain Adaptation for 3D Segmentation.
CoRR, 2023

Model2Scene: Learning 3D Scene Representation via Contrastive Language-CAD Models Pre-training.
CoRR, 2023

Cross-modal and Cross-domain Knowledge Transfer for Label-free 3D Segmentation.
CoRR, 2023

PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection.
CoRR, 2023

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language.
CoRR, 2023

Cross-Modal and Cross-Domain Knowledge Transfer for Label-Free 3D Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Towards Label-free Scene Understanding by Vision Foundation Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bridging Language and Geometric Primitives for Zero-shot Point Cloud Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

ContrastMotion: Self-supervised Scene Motion Learning for Large-Scale LiDAR Point Clouds.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

One Training for Multiple Deployments: Polar-based Adaptive BEV Perception for Autonomous Driving.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

GANet: Goal Area Network for Motion Forecasting.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Human-centric Scene Understanding for 3D Large-scale Scenarios.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Range View Representation for LiDAR Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SCPNet: Semantic Scene Completion on Point Cloud.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-Based Perception.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Correction to: AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach.
Int. J. Comput. Vis., 2022

AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach.
Int. J. Comput. Vis., 2022

Zero-shot Point Cloud Segmentation by Transferring Geometric Primitives.
CoRR, 2022

Rethinking Trajectory Prediction via "Team Game".
CoRR, 2022

Vision-Centric BEV Perception: A Survey.
CoRR, 2022

MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular Backbones.
CoRR, 2022

Towards 3D Scene Understanding by Referring Synthetic Models.
CoRR, 2022

LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network.
CoRR, 2022

SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Self-Supervised Point Cloud Completion on Real Traffic Scenes Via Scene-Concerned Bottom-Up Mechanism.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Efficient Point Cloud Analysis Using Hilbert Curve.
Proceedings of the Computer Vision - ECCV 2022, 2022

Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Input-Output Balanced Framework for Long-Tailed Lidar Semantic Segmentation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

LiDAR-Based Panoptic Segmentation via Dynamic Shifting Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Probabilistic and Geometric Depth: Detecting Objects in Perspective.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

Channel-wise Alignment for Adaptive Object Detection.
CoRR, 2020

Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation.
CoRR, 2020

Adversarial Attacks on Monocular Depth Estimation.
CoRR, 2020

SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds.
Proceedings of the Computer Vision - ECCV 2020, 2020

AutoTrajectory: Label-Free Trajectory Extraction and Prediction from Videos Using Dynamic Points.
Proceedings of the Computer Vision - ECCV 2020, 2020

Tensor Low-Rank Reconstruction for Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks.
Proceedings of the 4th Conference on Robot Learning, 2020

Reconfigurable Voxels: A New Representation for LiDAR-Based Point Clouds.
Proceedings of the 4th Conference on Robot Learning, 2020

High Performance Gesture Recognition via Effective and Efficient Temporal Modeling.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Adapting Object Detectors via Selective Cross-Domain Alignment.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Not All Areas Are Equal: Transfer Learning for Semantic Segmentation via Hierarchical Region Selection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Pose Guided Human Video Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Generative Adversarial Frontal View to Bird View Synthesis.
Proceedings of the 2018 International Conference on 3D Vision, 2018

Image Tagging by Joint Deep Visual-Semantic Propagation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
