Christoph Feichtenhofer

Orcid: 0000-0001-9756-7238

According to our database1, Christoph Feichtenhofer authored at least 59 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SAM 2: Segment Anything in Images and Videos.
CoRR, 2024

Window Attention is Bugged: How not to Interpolate Position Embeddings.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Demystifying CLIP Data.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Altogether: Image Captioning via Re-aligning Alt-text.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
MAViL: Masked Audio-Video Learners.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles.
Proceedings of the International Conference on Machine Learning, 2023

Token Merging: Your ViT But Faster.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The effectiveness of MAE pre-pretraining for billion-scale pretraining.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diffusion Models as Masked Autoencoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CiT: Curation in Training for Effective Vision-Language Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multiview Compressive Coding for 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

On the Benefits of 3D Pose and Tracking for Human Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Scaling Language-Image Pre-Training via Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Masked Autoencoders As Spatiotemporal Learners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Masked Autoencoders that Listen.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TrackFormer: Multi-Object Tracking with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reversible Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


Masked Feature Prediction for Self-Supervised Visual Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

A ConvNet for the 2020s.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Improved Multiscale Vision Transformers for Classification and Detection.
CoRR, 2021

Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PyTorchVideo: A Deep Learning Library for Video Understanding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multiview Pseudo-Labeling for Semi-supervised Learning from Video.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multiscale Vision Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Modeling Human Motion with Quaternion-Based Neural Networks.
Int. J. Comput. Vis., 2020

Deep Insights into Convolutional Networks for Video Recognition.
Int. J. Comput. Vis., 2020

Feature Pyramid Grids.
CoRR, 2020

Audiovisual SlowFast Networks for Video Recognition.
CoRR, 2020

A Multigrid Method for Efficiently Training Video Models.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Ego-Topo: Environment Affordances From Egocentric Video.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

X3D: Expanding Architectures for Efficient Video Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Grounded Human-Object Interaction Hotspots from Video (Extended Abstract).
CoRR, 2019

Learning Temporal Pose Estimation from Sparsely-Labeled Videos.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Grounded Human-Object Interaction Hotspots From Video.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SlowFast Networks for Video Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Long-Term Feature Banks for Detailed Video Understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Learning Discriminative Motion Features Through Detection.
CoRR, 2018

Camera-based vehicle velocity estimation from monocular video.
CoRR, 2018

What Have We Learned From Deep Representations for Action Recognition?
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Detect to Track and Track to Detect.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Spatiotemporal Multiplier Networks for Video Action Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Temporal Residual Networks for Dynamic Scene Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Dynamic Scene Recognition with Complementary Spatiotemporal Features.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Spatiotemporal Residual Networks for Video Action Recognition.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Convolutional Two-Stream Network Fusion for Video Action Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Dynamically encoded actions based on spacetime saliency.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Fusing RFID and computer vision for probabilistic tag localization.
Proceedings of the IEEE International Conference on RFID, 2014

Bags of Spacetime Energies for Dynamic Scene Recognition.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
A Perceptual Image Sharpness Metric Based on Local Edge Gradient Analysis.
IEEE Signal Process. Lett., 2013

Spatio-temporal Good Features to Track.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Spacetime Forests with Complementary Features for Dynamic Scene Recognition.
Proceedings of the British Machine Vision Conference, 2013


  Loading...