Haoqi Fan

Affiliations:
  • Facebook
  • Carnegie Mellon University, Pittsburgh, PA, USA (former)


According to our database1, Haoqi Fan authored at least 28 papers between 2016 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
MAViL: Masked Audio-Video Learners.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles.
Proceedings of the International Conference on Machine Learning, 2023

The effectiveness of MAE pre-pretraining for billion-scale pretraining.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diffusion Models as Masked Autoencoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Scaling Language-Image Pre-Training via Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Masked Autoencoders As Spatiotemporal Learners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On the Importance of Asymmetry for Siamese Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reversible Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unified Transformer Tracker for Object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Masked Feature Prediction for Self-Supervised Visual Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Improved Multiscale Vision Transformers for Classification and Detection.
CoRR, 2021

PyTorchVideo: A Deep Learning Library for Video Understanding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multiview Pseudo-Labeling for Semi-supervised Learning from Video.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multiscale Vision Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Beyond Short Clips: End-to-End Video-Level Learning With Collaborative Memories.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Can Temporal Information Help with Contrastive Self-Supervised Learning?
CoRR, 2020

Improved Baselines with Momentum Contrastive Learning.
CoRR, 2020

Momentum Contrast for Unsupervised Visual Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
SlowFast Networks for Video Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Order-Aware Generative Modeling Using the 3D-Craft Dataset.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Long-Term Feature Banks for Detailed Video Understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Stacked Latent Attention for Multimodal Reasoning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Efficient K-Shot Learning With Regularized Deep Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2016
Going Deeper into First-Person Activity Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016


  Loading...