Xiao Tan

Orcid: 0000-0001-9162-8570

Affiliations:
  • Baidu Inc., Department of Computer Vision Technology, Beijing, China
  • University of Hong Kong (former)
  • University of New South Wales, Sydney, Australia (PhD 2014)


According to our database1, Xiao Tan authored at least 89 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction.
CoRR, 2024

Uni<sup>2</sup>Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection.
CoRR, 2024

Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection.
CoRR, 2024

BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space.
CoRR, 2024

PointMamba: A Simple State Space Model for Point Cloud Analysis.
CoRR, 2024

Uni4DAL: A Unified Baseline for Multi-dataset 4D Auto-Labeling.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Interactive 3D Object Detection with Prompts.
Proceedings of the Computer Vision - ECCV 2024, 2024

OPEN: Object-Wise Position Embedding for Multi-view 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Decoupled Pseudo-Labeling for Semi-Supervised Monocular 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Semi-supervised Cycle-GAN for face photo-sketch translation in the wild.
Comput. Vis. Image Underst., October, 2023

Multi-Modal 3D Object Detection by Box Matching.
CoRR, 2023

ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box.
CoRR, 2023

A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Forward Flow for Novel View Synthesis of Dynamic Scenes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Semi-DETR: Semi-Supervised Object Detection with Detection Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CAPE: Camera View Position Embedding for Multi-View 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Command-driven Articulated Object Understanding and Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-Based 3D Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
SGM3D: Stereo Guided Monocular 3D Object Detection.
IEEE Robotics Autom. Lett., 2022

Detaching and Boosting: Dual Engine for Scale-Invariant Self-Supervised Monocular Depth Estimation.
IEEE Robotics Autom. Lett., 2022

Segment as Points for Efficient and Effective Online Multi-Object Tracking and Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

AGO-Net: Association-Guided 3D Point Cloud Object Detection Network.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task.
CoRR, 2022

Spatial Pruned Sparse Convolution for Efficient 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Paint and Distill: Boosting 3D Object Detection with Semantic Passing Network.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Repainting and Imitating Learning for Lane Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022


Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

A Multi-granularity Retrieval System for Natural Language-based Vehicle Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Box-Grained Reranking Matching for Multi-Camera Multi-Target Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

TWIST: Two-Way Inter-label Self-Training for Semi-supervised 3D Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Deformable Voxel Grid for Fast Optimization of Dynamic View Synthesis.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
SGM3D: Stereo Guided Monocular 3D Object Detection.
CoRR, 2021

Improving Video Retrieval by Adaptive Margin.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Lifting the Veil of Frequency in Joint Segmentation and Depth Estimation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

AggNet for Self-supervised Monocular Depth Estimation: Go An Aggressive Step Furthe.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DANet: Dimension Apart Network for Radar Object Detection.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Revealing the Reciprocal Relations between Self-Supervised Stereo and Monocular Depth Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Good Practices and a Strong Baseline for Traffic Anomaly Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

A Robust MTMC Tracking System for AI-City Challenge 2021.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Robust and Online Vehicle Counting at Crowded Intersections.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Robust Vehicle Re-Identification via Rigid Structure Prior.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
A modified quantum-inspired evolutionary algorithm for minimising network coding operations.
Int. J. Wirel. Mob. Comput., 2020

Understanding Image Retrieval Re-Ranking: A Graph Neural Network Perspective.
CoRR, 2020

Coherent Loss: A Generic Framework for Stable Video Segmentation.
CoRR, 2020

PointTrack++ for Effective Online Multi-Object Tracking and Segmentation.
CoRR, 2020

Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Modularized Framework with Category-Sensitive Abnormal Filter for City Anomaly Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Monocular 3D Object Detection via Feature Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Segment as Points for Efficient Online Multi-Object Tracking and Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Going Beyond Real Data: A Robust Visual Representation for Vehicle Re-identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Dynamic Inference: A New Approach Toward Efficient Video Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Movement-Specific Vehicle Counting at Crowded Intersections.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Granularity Tracking with Modularlized Components for Unsupervised Vehicles Anomaly Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Refined 3D Pose Dataset for Fine-Grained Object Categories.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Recognizing Part Attributes With Insufficient Data.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Perspective-Guided Convolution Networks for Crowd Counting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-camera vehicle tracking and re-identification based on visual and spatial-temporal features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Improving Annotation for 3D Pose Dataset of Fine-Grained Object Categories.
CoRR, 2018

Face Sketch Synthesis with Style Transfer Using Pyramid Column Feature.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Fine-Grained Video Categorization with Redundancy Reduction Attention.
Proceedings of the Computer Vision - ECCV 2018, 2018

3D Pose Estimation for Fine-Grained Object Categories.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Semi-supervised Learning for Face Sketch Synthesis in the Wild.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Large-scale image retrieval with supervised sparse hashing.
Neurocomputing, 2017

2016
Edge-Aware Filtering with Local Polynomial Approximation and Rectangle-Based Weighting.
IEEE Trans. Cybern., 2016

Stereo matching based on multi-direction polynomial model.
Signal Process. Image Commun., 2016

Guided image completion by confidence propagation.
Pattern Recognit., 2016

Orientation-guided geodesic weighting for PatchMatch-based stereo matching.
Inf. Sci., 2016

Single View 3D Reconstruction under an Uncalibrated Camera and an Unknown Mirror Sphere.
Proceedings of the Fourth International Conference on 3D Vision, 2016

2015
Feature matching in stereo images encouraging uniform spatial distribution.
Pattern Recognit., 2015

2014
Advanced Stereo Matching Algorithms.
PhD thesis, 2014

Stereo matching using cost volume watershed and region merging.
Signal Process. Image Commun., 2014

Soft Cost Aggregation with Multi-resolution Fusion.
Proceedings of the Computer Vision - ECCV 2014, 2014

Multipoint Filtering with Local Polynomial Approximation and Range Guidance.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2012
Tree structural watershed for stereo matching.
Proceedings of the Image and Vision Computing New Zealand, 2012

Feature Correspondence with Even Distribution.
Proceedings of the 2012 International Conference on Digital Image Computing Techniques and Applications, 2012

Cross Image Inference Scheme for Stereo Matching.
Proceedings of the Computer Vision, 2012


  Loading...