Tong He

Orcid: 0000-0003-2772-9320

Affiliations:
  • Shanghai AI Lab, Shanghai
  • University of Adelaide, SA, Australia (PhD 2020)


According to our database1, Tong He authored at least 73 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2024

DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion.
CoRR, 2024

Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction.
CoRR, 2024

Depth Any Video with Scalable Synthetic Data.
CoRR, 2024

SPA: 3D Spatial-Awareness Enables Effective Embodied Representation.
CoRR, 2024

GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction.
CoRR, 2024

DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting.
CoRR, 2024

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction.
CoRR, 2024

NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction.
CoRR, 2024

Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation.
CoRR, 2024

Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space.
CoRR, 2024

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers.
CoRR, 2024

Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision.
CoRR, 2024

EMR-Merging: Tuning-Free High-Performance Model Merging.
CoRR, 2024

Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting.
CoRR, 2024

Agent3D-Zero: An Agent for Zero-shot 3D Understanding.
CoRR, 2024

Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning.
CoRR, 2024

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM.
Proceedings of the Computer Vision - ECCV 2024, 2024

PredBench: Benchmarking Spatio-Temporal Prediction Across Diverse Disciplines.
Proceedings of the Computer Vision - ECCV 2024, 2024

GVGEN: Text-to-3D Generation with Volumetric Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

UniPAD: A Universal Pre-Training Paradigm for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Point Transformer V3: Simpler, Faster, Stronger.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CaMML: Context-Aware Multimodal Learner for Large Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Boosting Residual Networks with Group Knowledge.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Frozen CLIP Transformer Is an Efficient Point Cloud Encoder.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

CP3: Unifying Point Cloud Completion by Pretrain-Prompt-Predict Paradigm.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Dynamic Convolution for 3D Point Cloud Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection.
IEEE Trans. Image Process., 2023

Partial Fine-Tuning: A Successor to Full Fine-Tuning for Vision Transformers.
CoRR, 2023

Hulk: A Universal Knowledge Translator for Human-Centric Tasks.
CoRR, 2023

GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection.
CoRR, 2023

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm.
CoRR, 2023

Experts Weights Averaging: A New General Training Scheme for Vision Transformers.
CoRR, 2023

When Hyperspectral Image Classification Meets Diffusion Models: An Unsupervised Feature Learning Framework.
CoRR, 2023

SAM3D: Segment Anything in 3D Scenes.
CoRR, 2023

Stimulative Training++: Go Beyond The Performance Limits of Residual Networks.
CoRR, 2023

β-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search.
CoRR, 2023

Ponder: Point Cloud Pre-training via Neural Rendering.
CoRR, 2023

Ponder: Point Cloud Pre-training via Neural Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PVT-SSD: Single-Stage 3D Object Detector with Point-Voxel Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Crossing the Gap: Domain Generalization for Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
FCOS: A Simple and Strong Anchor-Free Object Detector.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

ABCNet v2: Adaptive Bezier-Curve Network for Real-Time End-to-End Text Spotting.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Frozen CLIP Model is An Efficient Point Cloud Backbone.
CoRR, 2022

3D-QueryIS: A Query-based Framework for 3D Instance Segmentation.
CoRR, 2022

The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition.
CoRR, 2022

PointInst3D: Segmenting 3D Instances by Points.
CoRR, 2022

Reconstructing Hand-Held Objects from Monocular Video.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

PointInst3D: Segmenting 3D Instances by Points.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection.
Int. J. Comput. Vis., 2021

HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Instance-Aware Embedding for Point Cloud Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Exploring the Capacity of Sequential-free Box Discretization Network for Omnidirectional Scene Text Detection.
CoRR, 2019

FCOS: Fully Convolutional One-Stage Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Knowledge Adaptation for Efficient Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
An End-to-End TextSpotter With Explicit Alignment and Attention.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Single Shot Text Detector with Regional Attention.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Orientation-Aware Text Proposals Network for Scene Text Detection.
Proceedings of the Biometric Recognition - 12th Chinese Conference, 2017

2016
Text-Attentional Convolutional Neural Network for Scene Text Detection.
IEEE Trans. Image Process., 2016

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network.
CoRR, 2016

Detecting Text in Natural Image with Connectionist Text Proposal Network.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Text-Attentional Convolutional Neural Networks for Scene Text Detection.
CoRR, 2015

An efficient method for text detection from indoor panorama images using Extremal Regions.
Proceedings of the IEEE International Conference on Information and Automation, 2015

2014
Accurate multi-scale license plate localization via image saliency.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014


  Loading...