Tong He

Orcid: 0000-0003-2772-9320

Affiliations:

Shanghai AI Lab, Shanghai
University of Adelaide, SA, Australia (PhD 2020)

According to our database¹, Tong He authored at least 77 papers between 2014 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2025

2024

Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2024

CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction.

[BibT_eX]

[DOI]

CoRR, 2024

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling.

[BibT_eX]

[DOI]

CoRR, 2024

Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning.

[BibT_eX]

[DOI]

CoRR, 2024

DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild.

[BibT_eX]

[DOI]

CoRR, 2024

DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion.

[BibT_eX]

[DOI]

CoRR, 2024

Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction.

[BibT_eX]

[DOI]

CoRR, 2024

Depth Any Video with Scalable Synthetic Data.

[BibT_eX]

[DOI]

CoRR, 2024

SPA: 3D Spatial-Awareness Enables Effective Embodied Representation.

[BibT_eX]

[DOI]

CoRR, 2024

GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction.

[BibT_eX]

[DOI]

CoRR, 2024

DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting.

[BibT_eX]

[DOI]

CoRR, 2024

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction.

[BibT_eX]

[DOI]

CoRR, 2024

NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction.

[BibT_eX]

[DOI]

CoRR, 2024

Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space.

[BibT_eX]

[DOI]

CoRR, 2024

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers.

[BibT_eX]

[DOI]

CoRR, 2024

Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision.

[BibT_eX]

[DOI]

CoRR, 2024

EMR-Merging: Tuning-Free High-Performance Model Merging.

[BibT_eX]

[DOI]

CoRR, 2024

Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Pixel-GS: Density Control with Pixel-Aware Gradient for 3D Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Agent3D-Zero: An Agent for Zero-Shot 3D Understanding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

PredBench: Benchmarking Spatio-Temporal Prediction Across Diverse Disciplines.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

GVGEN: Text-to-3D Generation with Volumetric Representation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

UniPAD: A Universal Pre-Training Paradigm for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Point Transformer V3: Simpler, Faster, Stronger.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CaMML: Context-Aware Multimodal Learner for Large Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Boosting Residual Networks with Group Knowledge.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Frozen CLIP Transformer Is an Efficient Point Cloud Encoder.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

CP3: Unifying Point Cloud Completion by Pretrain-Prompt-Predict Paradigm.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Dynamic Convolution for 3D Point Cloud Instance Segmentation.

[BibT_eX]

[DOI]

Tong He

Chunhua Shen

Anton van den Hengel

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Partial Fine-Tuning: A Successor to Full Fine-Tuning for Vision Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

Hulk: A Universal Knowledge Translator for Human-Centric Tasks.

[BibT_eX]

[DOI]

CoRR, 2023

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm.

[BibT_eX]

[DOI]

CoRR, 2023

Experts Weights Averaging: A New General Training Scheme for Vision Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

When Hyperspectral Image Classification Meets Diffusion Models: An Unsupervised Feature Learning Framework.

[BibT_eX]

[DOI]

CoRR, 2023

SAM3D: Segment Anything in 3D Scenes.

[BibT_eX]

[DOI]

CoRR, 2023

Stimulative Training++: Go Beyond The Performance Limits of Residual Networks.

[BibT_eX]

[DOI]

CoRR, 2023

β-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search.

[BibT_eX]

[DOI]

CoRR, 2023

Ponder: Point Cloud Pre-training via Neural Rendering.

[BibT_eX]

[DOI]

CoRR, 2023

Ponder: Point Cloud Pre-training via Neural Rendering.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Crossing the Gap: Domain Generalization for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

FCOS: A Simple and Strong Anchor-Free Object Detector.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

ABCNet v2: Adaptive Bezier-Curve Network for Real-Time End-to-End Text Spotting.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Frozen CLIP Model is An Efficient Point Cloud Backbone.

[BibT_eX]

[DOI]

CoRR, 2022

3D-QueryIS: A Query-based Framework for 3D Instance Segmentation.

[BibT_eX]

[DOI]

CoRR, 2022

The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

PointInst3D: Segmenting 3D Instances by Points.

[BibT_eX]

[DOI]

Tong He

Chunhua Shen

Anton van den Hengel

CoRR, 2022

Reconstructing Hand-Held Objects from Monocular Video.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

PointInst3D: Segmenting 3D Instances by Points.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2021

HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution.

[BibT_eX]

[DOI]

Tong He

Chunhua Shen

Anton van den Hengel

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Instance-Aware Embedding for Point Cloud Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Exploring the Capacity of Sequential-free Box Discretization Network for Omnidirectional Scene Text Detection.

[BibT_eX]

[DOI]

CoRR, 2019

FCOS: Fully Convolutional One-Stage Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Knowledge Adaptation for Efficient Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

An End-to-End TextSpotter With Explicit Alignment and Attention.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Single Shot Text Detector with Regional Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Orientation-Aware Text Proposals Network for Scene Text Detection.

[BibT_eX]

[DOI]

Proceedings of the Biometric Recognition - 12th Chinese Conference, 2017

2016

Text-Attentional Convolutional Neural Network for Scene Text Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network.

[BibT_eX]

[DOI]

CoRR, 2016

Detecting Text in Natural Image with Connectionist Text Proposal Network.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

2015

Text-Attentional Convolutional Neural Networks for Scene Text Detection.

[BibT_eX]

[DOI]

CoRR, 2015

An efficient method for text detection from indoor panorama images using Extremal Regions.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Information and Automation, 2015

2014

Accurate multi-scale license plate localization via image saliency.

[BibT_eX]

[DOI]

Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014

Tong He

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...