Long Zhao

ORCID: 0000-0001-8921-8564

Affiliations:
  • Rutgers University, Department of Computer Science, New Brunswick, NJ, USA


According to our database, Long Zhao authored at least 47 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

ε-VAE: Denoising as Visual Decoding.
CoRR, 2024

Steering Prototypes with Prompt-tuning for Rehearsal-free Continual Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

VideoPrism: A Foundational Visual Encoder for Video Understanding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Taming Self-Training for Open-Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Generating Enhanced Negatives for Training Language-Based Object Detectors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Distilling Vision-Language Models on Millions of Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Deep Deformable Models: Learning 3D Shape Abstractions with Part Consistency.
CoRR, 2023

Improving Pseudo Labels for Open-Vocabulary Object Detection.
CoRR, 2023

VideoGLUE: Video General Understanding Evaluation of Foundation Models.
CoRR, 2023

Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding.
CoRR, 2023

Steering Prototype with Prompt-tuning for Rehearsal-free Continual Learning.
CoRR, 2023

More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Unified Visual Relationship Detection with Vision and Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Disentangling audio content and emotion with adaptive instance normalization for expressive facial animation synthesis.
Comput. Animat. Virtual Worlds, 2022

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose.
Int. J. Comput. Vis., 2022

COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality.
Proceedings of the Computer Vision - ECCV 2022, 2022

Exploiting Unlabeled Data with Vision and Language Models for Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Hierarchically Self-supervised Transformer for Human Skeleton Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Global Matching with Overlapping Attention for Optical Flow Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Are Multimodal Transformers Robust to Missing Modality?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
COMPOSER: Compositional Learning of Group Activity in Videos.
CoRR, 2021

Out-of-domain Generalization from a Single Source: A Uncertainty Quantification Approach.
CoRR, 2021

Aggregating Nested Transformers.
CoRR, 2021

More Than Just Attention: Learning Cross-Modal Attentions with Contrastive Constraints.
CoRR, 2021

Improved Transformer for High-Resolution GANs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SMIL: Multimodal Learning with Severely Missing Modality.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Towards Image-to-Video Translation: A Structure-Aware Approach via Multi-stage Generative Adversarial Networks.
Int. J. Comput. Vis., 2020

Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning to Learn Single Domain Generalization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Knowledge As Priors: Cross-Modal Knowledge Generalization for Datasets Without Superior Knowledge.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Cartoonish sketch-based face editing in videos using identity deformation transfer.
Comput. Graph., 2019

Rethinking Kernel Methods for Node Representation Learning on Graphs.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Semantic Graph Convolutional Networks for 3D Human Pose Regression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Construct Dynamic Graphs for Hand Gesture Recognition via Spatial-Temporal Attention.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
CR-GAN: Learning Complete Representations for Multi-view Generation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Learning to Forecast and Refine Residual Motion for Image-to-Video Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Sketch-based Face Editing in Video Using Identity Deformation Transfer.
CoRR, 2017

2015
Learning best views of 3D shapes from sketch contour.
Vis. Comput., 2015

Object proposal by multi-branch hierarchical segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Sketch-Based Retrieval Using Content-Aware Hashing.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Size and Location Matter: A New Baseline for Salient Object Detection.
Proceedings of the Computer Vision - ACCV 2014, 2014

