Ross B. Girshick

Affiliations:
  • University of California, Berkeley, USA


According to our database1, Ross B. Girshick authored at least 100 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.
CoRR, 2024

SAM 2: Segment Anything in Images and Videos.
CoRR, 2024

PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators.
CoRR, 2024

2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Segment Anything.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Exploring Plain Vision Transformer Backbones for Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Revisiting Weakly Supervised Pre-Training of Visual Perception Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Masked Autoencoders Are Scalable Vision Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Benchmarking Detection Transfer Learning with Vision Transformers.
CoRR, 2021

Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details.
CoRR, 2021

Early Convolutions Help Transformers See Better.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PyTorchVideo: A Deep Learning Library for Video Understanding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Fast and Accurate Model Scaling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Boundary IoU: Improving Object-Centric Image Segmentation Evaluation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Focal Loss for Dense Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Mask R-CNN.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Improved Baselines with Momentum Contrastive Learning.
CoRR, 2020

Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Training ASR Models By Generation of Contextual Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Are Labels Necessary for Neural Architecture Search?
Proceedings of the Computer Vision - ECCV 2020, 2020

A Multigrid Method for Efficiently Training Video Models.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Designing Network Design Spaces.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PointRend: Image Segmentation As Rendering.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Momentum Contrast for Unsupervised Visual Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
PHYRE: A New Benchmark for Physical Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Exploring Randomly Wired Neural Networks for Image Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Rethinking ImageNet Pre-Training.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

TensorMask: A Foundation for Dense Object Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Long-Term Feature Banks for Detailed Video Understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Panoptic Feature Pyramid Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

LVIS: A Dataset for Large Vocabulary Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Exploring the Limits of Weakly Supervised Pretraining.
Proceedings of the Computer Vision - ECCV 2018, 2018

Low-Shot Learning From Imaginary Data.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Data Distillation: Towards Omni-Supervised Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning by Asking Questions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Segment Every Thing.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Detecting and Recognizing Human-Object Interactions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Non-Local Neural Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Object Detection Networks on Convolutional Feature Maps.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Object Instance Segmentation and Fine-Grained Localization Using Hypercolumns.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Editorial- Deep Learning for Computer Vision.
Comput. Vis. Image Underst., 2017

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour.
CoRR, 2017

Inferring and Executing Programs for Visual Reasoning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Low-Shot Visual Recognition by Shrinking and Hallucinating Features.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Aggregated Residual Transformations for Deep Neural Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Features by Watching Objects Move.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Feature Pyramid Networks for Object Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
The three R's of computer vision: Recognition, reconstruction and reorganization.
Pattern Recognit. Lett., 2016

Region-Based Convolutional Networks for Accurate Object Detection and Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Low-shot visual object recognition.
CoRR, 2016

Reducing Overfitting in Deep Networks by Decorrelating Representations.
Proceedings of the 4th International Conference on Learning Representations, 2016


Unsupervised Deep Embedding for Clustering Analysis.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Training Region-Based Object Detectors with Online Hard Example Mining.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

You Only Look Once: Unified, Real-Time Object Detection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Generalized Sparselet Models for Real-Time Multiclass Object Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation.
Int. J. Comput. Vis., 2015

Learning Visual Classifiers using Human-centric Annotations.
CoRR, 2015

Inferring 3D Object Pose in RGB-D Images.
CoRR, 2015

Exploring Nearest Neighbor Approaches for Image Captioning.
CoRR, 2015

Actions and Attributes from Wholes and Parts.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Contextual Action Recognition with R*CNN.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Fast R-CNN.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Hypercolumns for object segmentation and fine-grained localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Aligning 3D models to RGB-D images of cluttered scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deformable part models are convolutional neural networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
One-Bit Object Detection: On learning to localize objects with minimal supervision.
CoRR, 2014

Microsoft COCO: Common Objects in Context.
CoRR, 2014

DenseNet: Implementing Efficient ConvNet Descriptor Pyramids.
CoRR, 2014

LSDA: Large Scale Detection Through Adaptation.
CoRR, 2014

R-CNNs for Pose Estimation and Action Detection.
CoRR, 2014

LSDA: Large Scale Detection through Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Caffe: Convolutional Architecture for Fast Feature Embedding.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

On learning to localize objects with minimal supervision.
Proceedings of the 31th International Conference on Machine Learning, 2014

Part-Based R-CNNs for Fine-Grained Category Detection.
Proceedings of the Computer Vision - ECCV 2014, 2014

Simultaneous Detection and Segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Learning Rich Features from RGB-D Images for Object Detection and Segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Analyzing the Performance of Multilayer Neural Networks for Object Recognition.
Proceedings of the Computer Vision - ECCV 2014, 2014

Understanding Objects in Detail with Fine-Grained Attributes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Using k-Poselets for Detecting People and Localizing Their Keypoints.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Efficient Human Pose Estimation from Single Depth Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Visual object detection with deformable part models.
Commun. ACM, 2013

Discriminatively Activated Sparselets.
Proceedings of the 30th International Conference on Machine Learning, 2013

Training Deformable Part Models with Decorrelated Features.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Sparselet Models for Efficient Multiclass Object Detection.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Object Detection with Grammar Models.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Efficient regression of general-activity human poses from depth images.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
Object Detection with Discriminatively Trained Part-Based Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Discriminative Latent Variable Models for Object Detection.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Cascade object detection with deformable part models.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Visibility constraints on features of 3D objects.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2004
Simulating Chinese brush painting: the parametric hairy brush.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2004


  Loading...