Ali Farhadi

ORCID: 0000-0001-7249-2380

Affiliations:
  • AI2, Allen Institute for Artificial Intelligence, Seattle, WA, USA


According to our database, Ali Farhadi authored at least 199 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement.
Trans. Mach. Learn. Res., 2024

Bytes Are All You Need: Transformers Operating Directly On File Bytes.
Trans. Mach. Learn. Res., 2024

ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition.
CoRR, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.
CoRR, 2024

FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning.
CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.
CoRR, 2024

Task Me Anything.
CoRR, 2024

Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass.
CoRR, 2024

Selective Visual Representations Improve Convergence and Generalization for Embodied AI.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning to Build by Building Your Own Instructions.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
lo-fi: distributed fine-tuning without communication.
Trans. Mach. Learn. Res., 2023

FLUID: A Unified Evaluation Framework for Flexible Sequential Data.
Trans. Mach. Learn. Res., 2023

Are "Hierarchical" Visual Representations Hierarchical?
CoRR, 2023

MatFormer: Nested Transformer for Elastic Inference.
CoRR, 2023

Stable and low-precision training for large-scale vision-language models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural Priming for Sample-Efficient Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AdANNS: A Framework for Adaptive Semantic Search.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Connection between Pre-training Data Diversity and Fine-tuning Robustness.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Localized Symbolic Knowledge Distillation for Visual Commonsense Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


Objaverse-XL: A Universe of 10M+ 3D Objects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-Supervised Object Goal Navigation with In-Situ Finetuning.
IROS, 2023

Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Impossibly Good Experts and How to Follow Them.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Neural Radiance Field Codebooks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

FastFill: Efficient Compatible Model Update.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Editing models with task arithmetic.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

What does a platypus look like? Generating customized prompts for zero-shot image classification.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SHARCS: Efficient Transformers Through Routing with Dynamic Width Sub-networks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Objaverse: A Universe of Annotated 3D Objects.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Phone2Proc: Bringing Robust Robots into Our Chaotic World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SAFER: Safe Collision Avoidance Using Focused and Efficient Trajectory Search with Reinforcement Learning.
Proceedings of the 19th IEEE International Conference on Automation Science and Engineering, 2023

2022
RangeAugment: Efficient Online Augmentation with Range Learning.
CoRR, 2022

Object Goal Navigation with End-to-End Self-Supervision.
CoRR, 2022

Phone2Proc: Bringing Robust Robots Into Our Chaotic World.
CoRR, 2022

Editing Models with Task Arithmetic.
CoRR, 2022

Retrospectives on the Embodied AI Workshop.
CoRR, 2022

Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents.
CoRR, 2022

Safe Real-World Reinforcement Learning for Mobile Agent Obstacle Avoidance.
CoRR, 2022

What does a platypus look like? Generating customized prompts for zero-shot image classification.
CoRR, 2022

ProcTHOR: Large-Scale Embodied AI Using Procedural Generation.
CoRR, 2022

Matryoshka Representations for Adaptive Deployment.
CoRR, 2022

The Introspective Agent: Interdependence of Strategy, Physiology, and Sensing for Embodied Agents.
CoRR, 2022

Matryoshka Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Patching open-vocabulary models by interpolating weights.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exposing the Limits of Video-Text Models through Contrast Sets.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Layer-Wise Data-Free CNN Compression.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time.
Proceedings of the International Conference on Machine Learning, 2022

Break and Make: Interactive Structural Understanding Using LEGO Bricks.
Proceedings of the Computer Vision - ECCV 2022, 2022

Object Manipulation via Visual Target Localization.
Proceedings of the Computer Vision - ECCV 2022, 2022

MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Robust fine-tuning of zero-shot models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Forward Compatible Training for Large-Scale Embedding Retrieval Systems.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Forward Compatible Training for Representation Learning.
CoRR, 2021

LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time.
CoRR, 2021

Robust fine-tuning of zero-shot models.
CoRR, 2021

Spectral acceleration prediction using genetic programming based approaches.
Appl. Soft Comput., 2021

MERLOT: Multimodal Neural Script Knowledge Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

TuringAdvice: A Generative and Dynamic Evaluation of Language Use.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Probing Contextual Language Models for Common Ground with Visual Representations.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Learning Neural Network Subspaces.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Generalizable Visual Representations via Interactive Gameplay.
Proceedings of the 9th International Conference on Learning Representations, 2021

What Can You Learn From Your Muscles? Learning Visual Representation from Human Interactions.
Proceedings of the 9th International Conference on Learning Representations, 2021

Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Pushing It Out of the Way: Interactive Visual Navigation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

LanguageRefer: Spatial-Language Model for 3D Visual Grounding.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
In the Wild: From ML Models to Pragmatic ML Systems.
CoRR, 2020

Probing Text Models for Common Ground with Visual Representations.
CoRR, 2020

Visual Commonsense Graphs: Reasoning about the Dynamic Context of a Still Image.
CoRR, 2020

Evaluating Machines by their Real-World Language Use.
CoRR, 2020

Watching the World Go By: Representation Learning from Unlabeled Videos.
CoRR, 2020

Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping.
CoRR, 2020

Enabling AI at the edge with XNOR-networks.
Commun. ACM, 2020

Supermasks in Superposition.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Soft Threshold Weight Reparameterization for Learnable Sparsity.
Proceedings of the 37th International Conference on Machine Learning, 2020

Grounded Situation Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

VisualCOMET: Reasoning About the Dynamic Context of a Still Image.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Visual Reaction: Learning to Play Catch With Your Drone.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

What's Hidden in a Randomly Weighted Neural Network?
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Use the Force, Luke! Learning to Predict Physical Forces by Simulating Effects.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

RoboTHOR: An Open Simulation-to-Real Embodied AI Platform.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Butterfly Transform: An Efficient FFT Based Neural Architecture Design.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Artificial Agents Learn Flexible Visual Representations by Playing a Hiding Game.
CoRR, 2019

Butterfly Transform: An Efficient FFT Based Neural Architecture Design.
CoRR, 2019

What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning.
CoRR, 2019

Defending Against Neural Fake News.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Discovering Neural Wirings.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Visual Semantic Navigation using Scene Priors.
Proceedings of the 7th International Conference on Learning Representations, 2019

From Recognition to Cognition: Visual Commonsense Reasoning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ELASTIC: Improving CNNs With Dynamic Scaling Policies.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Two Body Problem: Collaborative Visual Task Completion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Conditional Driving from Natural Language Instructions.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

HellaSwag: Can a Machine Really Finish Your Sentence?
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
PhotoShape: photorealistic materials for large-scale shape collections.
ACM Trans. Graph., 2018

Re3: Real-Time Recurrent Regression Networks for Visual Tracking of Generic Objects.
IEEE Robotics Autom. Lett., 2018

ELASTIC: Improving CNNs with Instance Specific Scaling Policies.
CoRR, 2018

Label Refinery: Improving ImageNet Classification through Label Progression.
CoRR, 2018

Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos.
CoRR, 2018

YOLOv3: An Incremental Improvement.
CoRR, 2018

Transferring Common-Sense Knowledge for Object Detection.
CoRR, 2018

Neural Speed Reading via Skim-RNN.
Proceedings of the 6th International Conference on Learning Representations, 2018

Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

DOCK: Detecting Objects by Transferring Common-Sense Knowledge.
Proceedings of the Computer Vision - ECCV 2018, 2018

Imagine This! Scripts to Compositions to Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

Actor and Observer: Joint Modeling of First and Third-Person Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

IQA: Visual Question Answering in Interactive Environments.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

SeGAN: Segmenting and Generating the Invisible.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Who Let the Dogs Out? Modeling Dog Behavior From Visual Data.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Structured Set Matching Networks for One-Shot Part Labeling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

AJILE Movement Prediction: Multimodal Deep Learning for Natural Human Neural Recordings and Video.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Semantic Highlight Retrieval and Term Prediction.
IEEE Trans. Image Process., 2017

Summarizing Unconstrained Videos Using Salient Montages.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

AI2-THOR: An Interactive 3D Environment for Visual AI.
CoRR, 2017

Re3: Real-Time Recurrent Regression Networks for Object Tracking.
CoRR, 2017

Toward visual intelligence.
Proceedings of the Workshop on Trends in Machine-Learning (and impact on computer architecture), 2017

Target-driven visual navigation in indoor scenes using deep reinforcement learning.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Query-Reduction Networks for Question Answering.
Proceedings of the 5th International Conference on Learning Representations, 2017

Bidirectional Attention Flow for Machine Comprehension.
Proceedings of the 5th International Conference on Learning Representations, 2017

Visual Semantic Planning Using Deep Successor Representations.
Proceedings of the IEEE International Conference on Computer Vision, 2017

See the Glass Half Full: Reasoning About Liquid Containers, Their Volume and Content.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Commonly Uncommon: Semantic Sparsity in Situation Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Asynchronous Temporal Fields for Action Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

YOLO9000: Better, Faster, Stronger.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

LCNN: Lookup-Based Convolutional Neural Network.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Ranking Highlights in Personal Videos by Analyzing Edited Videos.
IEEE Trans. Image Process., 2016

Query-Regression Networks for Machine Comprehension.
CoRR, 2016

NCAM: Near-Data Processing for Nearest Neighbor Search.
CoRR, 2016

Stating the Obvious: Extracting Visual Common Sense Knowledge.
Proceedings of the NAACL HLT 2016, 2016

Unsupervised Deep Embedding for Clustering Analysis.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Semantic highlight retrieval.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Much Ado About Time: Exhaustive Annotation of Temporal Data.
Proceedings of the Fourth AAAI Conference on Human Computation and Crowdsourcing, 2016

Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding.
Proceedings of the Computer Vision - ECCV 2016, 2016

FigureSeer: Parsing Result-Figures in Research Papers.
Proceedings of the Computer Vision - ECCV 2016, 2016

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

"What Happens If..." Learning to Predict the Effect of Forces in Images.
Proceedings of the Computer Vision - ECCV 2016, 2016

A Diagram is Worth a Dozen Images.
Proceedings of the Computer Vision - ECCV 2016, 2016

Situation Recognition: Visual Semantic Role Labeling for Image Understanding.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Actions ~ Transformations.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

You Only Look Once: Unified, Real-Time Object Detection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A Task-Oriented Approach for Cost-Sensitive Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Toward a Taxonomy and Computational Models of Abnormalities in Images.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
On the Application of Genetic Programming for New Generation of Ground Motion Prediction Equations.
Proceedings of the Handbook of Genetic Programming Applications, 2015

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing.
CoRR, 2015

Learning to Select and Order Vacation Photographs.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Visalogy: Answering Visual Analogy Questions.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Deep Classifiers from Image Tags in the Wild.
Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Solving Geometry Problems: Combining Text and Diagram Interpretation.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

VisKE: Visual knowledge extraction and question answering by visual verification of relation phrases.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Discriminative and consistent similarities in instance-level Multiple Instance Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Abnormal Object Recognition: A Comprehensive Study.
CoRR, 2014

Image Classification and Retrieval from User-Supplied Tags.
CoRR, 2014

Multi-Resolution Language Grounding with Weak Supervision.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Salient Montages from Unconstrained Videos.
Proceedings of the Computer Vision - ECCV 2014, 2014

Ranking Domain-Specific Highlights by Analyzing Edited Videos.
Proceedings of the Computer Vision - ECCV 2014, 2014

Towards Transparent Systems: Semantic Characterization of Failure Modes.
Proceedings of the Computer Vision - ECCV 2014, 2014

Predicting Failures of Vision Systems.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Incorporating Scene Context and Object Layout into Appearance Modeling.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Learning Everything about Anything: Webly-Supervised Visual Concept Learning.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Action Recognition in the Presence of One Egocentric and Multiple Static Cameras.
Proceedings of the Computer Vision - ACCV 2014, 2014

Diagram Understanding in Geometry Questions.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Phrasal Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Object-Centric Anomaly Detection by Attribute-Based Reasoning.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Multi-attribute Queries: To Merge or Not to Merge?
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Adding Unlabeled Samples to Categories by Learned Attributes.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Semantic Understanding of Professional Soccer Commentaries.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Attribute Discovery via Predictable Discriminative Binary Codes.
Proceedings of the Computer Vision - ECCV 2012, 2012

Building a dictionary of image fragments.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Designing representational architectures in recognition.
PhD thesis, 2011

Using Classification to Protect the Integrity of Spectrum Measurements in White Space Networks.
Proceedings of the Network and Distributed System Security Symposium, 2011

Understanding egocentric activities.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Recognition using visual phrases.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
It's All About the Data.
Proc. IEEE, 2010

Every Picture Tells a Story: Generating Sentences from Images.
Proceedings of the Computer Vision - ECCV 2010, 2010

Attribute-centric recognition for cross-category generalization.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

The benefits and challenges of collecting richer object annotations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Unlabeled data improves word prediction.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

A latent model of discriminative aspect.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Describing objects by their attributes.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Scene Discovery by Matrix Factorization.
Proceedings of the Computer Vision - ECCV 2008, 2008

Learning to Recognize Activities from the Wrong View Point.
Proceedings of the Computer Vision - ECCV 2008, 2008

2007
Transfer Learning in Sign language.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
How to tell the difference between a cat and a dog?
Int. J. Imaging Syst. Technol., 2006

An application of linear predictive coding and computational geometry to iris recognition.
Int. J. Imaging Syst. Technol., 2006

Aligning ASL for Statistical Translation Using a Discriminative Word Model.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2003
Image segmentation via local higher order statistics.
Int. J. Imaging Syst. Technol., 2003

