Tong He

ORCID: 0000-0002-2388-8618

Affiliations:
  • Amazon Web Services
  • Facebook Reality Labs Research, USA
  • University of California, Los Angeles, CA, USA


According to our database, Tong He authored at least 45 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Improving Semantic Segmentation via Efficient Self-Training.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

VideoSAM: Open-World Video Segmentation.
CoRR, 2024

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos.
CoRR, 2024

Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation.
CoRR, 2024

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation.
CoRR, 2024

GQE: Generalized Query Expansion for Enhanced Text-Video Retrieval.
CoRR, 2024

Unified Lexical Representation for Interpretable Visual-Language Alignment.
CoRR, 2024

New Desiderata for Direct Preference Optimization.
CoRR, 2024

Hallucination of Multimodal Large Language Models: A Survey.
CoRR, 2024

BloomGML: Graph Machine Learning through the Lens of Bilevel Optimization.
CoRR, 2024

Consistent Video-to-Video Transfer Using Synthetic Dataset.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning for Transductive Threshold Calibration in Open-World Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Graph Machine Learning through the Lens of Bilevel Optimization.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
Learning for Open-World Calibration with Graph Neural Networks.
CoRR, 2023

LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation.
CoRR, 2023

Bridging the Gap to Real-World Object-Centric Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Object-Centric Multiple Object Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Coarse-to-Fine Amodal Segmentation with Shape Prior.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Open-Vocabulary Object Localization in Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Learning Manifold Dimensions with Conditional Variational Autoencoders.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Self-supervised Amodal Video Object Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PSS: Progressive Sample Selection for Open-World Visual Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

ResNeSt: Split-Attention Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Deep 3D Embodied Visual Recognition.
PhD thesis, 2021

Skeleton-Based Action Recognition With Focusing-Diffusion Graph Convolutional Networks.
IEEE Signal Process. Lett., 2021

Progressive Coordinate Transforms for Monocular 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Hierarchical Graph Neural Networks for Image Clustering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ARCH++: Animation-Ready Clothed Human Reconstruction Revisited.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
J. Mach. Learn. Res., 2020

Improving Semantic Segmentation via Self-Training.
CoRR, 2020

Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

SAM: Squeeze-and-Mimic Networks for Conditional Visual Driving Policy Learning.
Proceedings of the 4th Conference on Robot Learning, 2020

DeepVoxels++: Enhancing the Fidelity of Novel View Synthesis from 3D Voxel Embeddings.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Focusing and Diffusion: Bidirectional Attentive Graph Convolutional Networks for Skeleton-based Action Recognition.
CoRR, 2019

LaTeS: Latent Space Distillation for Teacher-Student Driving Policy Learning.
CoRR, 2019

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
CoRR, 2019

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.
CoRR, 2019

Bag of Freebies for Training Object Detection Neural Networks.
CoRR, 2019

GIF2Video: Color Dequantization and Temporal Interpolation of GIF Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Bag of Tricks for Image Classification with Convolutional Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

GeoNet: Deep Geodesic Networks for Point Cloud Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Mono3D++: Monocular 3D Vehicle Detection with Two-Scale 3D Hypotheses and Task Priors.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

