Tong He

Orcid: 0000-0002-2388-8618

Affiliations:

Amazon Web Services
Facebook Reality Labs Research, USA
University of California Los Angeles, CA, USA

According to our database, Tong He authored at least 45 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

Improving Semantic Segmentation via Efficient Self-Training.

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

VideoSAM: Open-World Video Segmentation.

CoRR, 2024

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos.

CoRR, 2024

Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation.

CoRR, 2024

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation.

CoRR, 2024

GQE: Generalized Query Expansion for Enhanced Text-Video Retrieval.

CoRR, 2024

Unified Lexical Representation for Interpretable Visual-Language Alignment.

CoRR, 2024

New Desiderata for Direct Preference Optimization.

Xiangkun Hu

Tong He

David Wipf

CoRR, 2024

Hallucination of Multimodal Large Language Models: A Survey.

CoRR, 2024

BloomGML: Graph Machine Learning through the Lens of Bilevel Optimization.

CoRR, 2024

Consistent Video-to-Video Transfer Using Synthetic Dataset.

Jiaxin Cheng

Tianjun Xiao

Tong He

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning for Transductive Threshold Calibration in Open-World Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Graph Machine Learning through the Lens of Bilevel Optimization.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Learning for Open-World Calibration with Graph Neural Networks.

CoRR, 2023

LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation.

CoRR, 2023

Bridging the Gap to Real-World Object-Centric Learning.

Carl-Johann Simon-Gabriel

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Object-Centric Multiple Object Tracking.

Carl-Johann Simon-Gabriel

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Coarse-to-Fine Amodal Segmentation with Shape Prior.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Open-Vocabulary Object Localization in Videos.

Carl-Johann Simon-Gabriel

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Learning Manifold Dimensions with Conditional Variational Autoencoders.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Self-supervised Amodal Video Object Segmentation.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PSS: Progressive Sample Selection for Open-World Visual Representation Learning.

Proceedings of the Computer Vision - ECCV 2022, 2022

ResNeSt: Split-Attention Networks.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021

Deep 3D Embodied Visual Recognition.

Tong He

PhD thesis, 2021

Skeleton-Based Action Recognition With Focusing-Diffusion Graph Convolutional Networks.

IEEE Signal Process. Lett., 2021

Progressive Coordinate Transforms for Monocular 3D Object Detection.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Hierarchical Graph Neural Networks for Image Clustering.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ARCH++: Animation-Ready Clothed Human Reconstruction Revisited.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

J. Mach. Learn. Res., 2020

Improving Semantic Segmentation via Self-Training.

CoRR, 2020

Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

SAM: Squeeze-and-Mimic Networks for Conditional Visual Driving Policy Learning.

Proceedings of the 4th Conference on Robot Learning, 2020

DeepVoxels++: Enhancing the Fidelity of Novel View Synthesis from 3D Voxel Embeddings.

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019

Focusing and Diffusion: Bidirectional Attentive Graph Convolutional Networks for Skeleton-based Action Recognition.

CoRR, 2019

LaTeS: Latent Space Distillation for Teacher-Student Driving Policy Learning.

CoRR, 2019

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

CoRR, 2019

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.

CoRR, 2019

Bag of Freebies for Training Object Detection Neural Networks.

CoRR, 2019

GIF2Video: Color Dequantization and Temporal Interpolation of GIF Images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Bag of Tricks for Image Classification with Convolutional Neural Networks.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

GeoNet: Deep Geodesic Networks for Point Cloud Analysis.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Mono3D++: Monocular 3D Vehicle Detection with Two-Scale 3D Hypotheses and Task Priors.

Tong He

Stefano Soatto

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
