Ishan Misra

According to our database, Ishan Misra authored at least 77 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
DINOv2: Learning Robust Visual Features without Supervision.
Trans. Mach. Learn. Res., 2024

Movie Gen: A Cast of Media Foundation Models.
CoRR, 2024

The Llama 3 Herd of Models.
CoRR, 2024

Factorizing Text-to-Video Generation by Explicit Image Conditioning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Generating Illustrated Instructions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

InstanceDiffusion: Instance-Level Control for Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
On Bringing Robots Home.
CoRR, 2023

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning.
CoRR, 2023

SelfEval: Leveraging the discriminative nature of generative models for evaluation.
CoRR, 2023

DINOv2: Learning Robust Visual Features without Supervision.
CoRR, 2023

Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities.
CoRR, 2023

A Simple Recipe for Competitive Low-compute Self supervised Vision Models.
CoRR, 2023

MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Poses.
Proceedings of the International Conference on Machine Learning, 2023

RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The hidden uniform cluster prior in self-supervised learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Vision-Language Models Performing Zero-Shot Tasks Exhibit Disparities Between Gender Groups.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

The effectiveness of MAE pre-pretraining for billion-scale pretraining.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MOST: Multiple Object localization with Self-supervised Transformers for object discovery.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GeneCIS: A Benchmark for General Conditional Image Similarity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OmniMAE: Single Model Masked Pretraining on Images and Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ImageBind: One Embedding Space to Bind Them All.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cut and Learn for Unsupervised Object Detection and Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Video Representations from Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
The Hidden Uniform Cluster Prior in Self-Supervised Learning.
CoRR, 2022

Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos.
CoRR, 2022

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision.
CoRR, 2022

A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments.
CoRR, 2022

A Data-Augmentation Is Worth A Thousand Samples: Analytical Moments And Sampling-Free Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Frame Averaging for Invariant and Equivariant Network Design.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Detecting Twenty-Thousand Classes Using Image-Level Supervision.
Proceedings of the Computer Vision - ECCV 2022, 2022

Masked Siamese Networks for Label-Efficient Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Omnivore: A Single Model for Many Visual Modalities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Masked-attention Mask Transformer for Universal Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scaling up Instance Segmentation using Approximately Localized Phrases.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Mask2Former for Video Instance Segmentation.
CoRR, 2021

Self-supervised Pretraining of Visual Features in the Wild.
CoRR, 2021

Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Barlow Twins: Self-Supervised Learning via Redundancy Reduction.
Proceedings of the 38th International Conference on Machine Learning, 2021

Self-Supervised Pretraining of 3D Features on any Point-Cloud.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

An End-to-End Transformer Model for 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MDETR - Modulated Detection for End-to-End Multi-Modal Understanding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Emerging Properties in Self-Supervised Vision Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

3D Spatial Recognition Without Spatially Labeled 3D.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Audio-Visual Instance Discrimination with Cross-Modal Agreement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Robust Audio-Visual Instance Discrimination.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Can Temporal Information Help with Contrastive Self-Supervised Learning?
CoRR, 2020

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ClusterFit: Improving Generalization of Visual Representations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Self-Supervised Learning of Pretext-Invariant Representations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

In Defense of Grid Features for Visual Question Answering.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Binary Image Selection (BISON): Interpretable Evaluation of Visual Grounding.
CoRR, 2019

Evaluating Text-to-Image Matching using Binary Image Selection (BISON).
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

3D-RelNet: Joint Object and Relational Network for 3D Prediction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Scaling and Benchmarking Self-Supervised Visual Representation Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Does Object Recognition Work for Everyone?
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Learning by Asking Questions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

From Red Wine to Red Tomato: Composition with Context.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Generating Natural Questions About an Image.
CoRR, 2016

Unsupervised Learning using Sequential Verification for Action Recognition.
CoRR, 2016

Shuffle and Learn: Unsupervised Learning Using Temporal Order Verification.
Proceedings of the Computer Vision - ECCV 2016, 2016

Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Cross-Stitch Networks for Multi-task Learning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Generating Natural Questions About an Image.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Applying artificial vision models to human scene understanding.
Frontiers Comput. Neurosci., 2015

Learning Visual Classifiers using Human-centric Annotations.
CoRR, 2015

Watch and learn: Semi-supervised learning of object detectors from videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Data-driven exemplar model selection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

2013
CPU and/or GPU: Revisiting the GPU Vs. CPU Myth.
CoRR, 2013

2011
Hybrid implementation of error diffusion dithering.
Proceedings of the 18th International Conference on High Performance Computing, 2011

