Kristen Grauman

Orcid: 0000-0002-9591-5873

Affiliations:
  • University of Texas at Austin, USA


According to our database1, Kristen Grauman authored at least 270 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Human Action Anticipation: A Survey.
CoRR, 2024

ExpertAF: Expert Actionable Feedback from Video.
CoRR, 2024

HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness.
CoRR, 2024

Sim2Real Transfer for Audio-Visual Navigation with Frequency-Adaptive Acoustic Field Prediction.
CoRR, 2024

ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling.
CoRR, 2024

Institute for Foundations of Machine Learning (IFML): Advancing AI systems that will transform our world.
AI Mag., 2024

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024

4DIFF: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Object State Changes in Videos: An Open-World Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Detours for Navigating Instructional Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Few-View Object Reconstruction with Unknown Categories and Camera Poses.
Proceedings of the International Conference on 3D Vision, 2024

2023
Visually-Guided Audio Spatialization in Video with Geometry-Aware Multi-task Learning.
Int. J. Comput. Vis., October, 2023

Editorial: Special Section on Egocentric Perception.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

A domain-agnostic approach for characterization of lifelong learning systems.
Neural Networks, March, 2023

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
CoRR, 2023

Egocentric Video Task Translation @ Ego4D Challenge 2022.
CoRR, 2023

What You Say Is What You Show: Visual Narration Detection in Instructional Videos.
CoRR, 2023

Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EgoDistill: Egocentric Head Motion Distillation for Efficient Video Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-Supervised Visual Acoustic Matching.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EgoEnv: Human-centric environment representations from egocentric video.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Single-Stage Visual Query Localization in Egocentric Videos.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Video-Mined Task Graphs for Keystep Recognition in Instructional Videos.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning to Map Efficiently by Active Echolocation.
IROS, 2023

SpotEM: Efficient Video Search for Episodic Memory.
Proceedings of the International Conference on Machine Learning, 2023

Learning Audio-Visual Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Egocentric Video Task Translation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Novel-View Acoustic Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

HierVL: Learning Hierarchical Video-Language Embeddings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Learning Spherical Convolution for $360^{\circ }$360∘ Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Retrospectives on the Embodied AI Workshop.
CoRR, 2022

Egocentric scene context for human-centric environment understanding from video.
CoRR, 2022

Active Audio-Visual Separation of Dynamic Sound Sources.
CoRR, 2022

Discovering Underground Maps from Fashion.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Few-Shot Audio-Visual Learning of Environment Acoustics.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Environment Predictive Coding for Visual Navigation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Active Audio-Visual Separation of Dynamic Sound Sources.
Proceedings of the Computer Vision - ECCV 2022, 2022

Egocentric Activity Recognition and Localization on a 3D Map.
Proceedings of the Computer Vision - ECCV 2022, 2022

PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


Visual Acoustic Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Modeling Fashion Influence From Photos.
IEEE Trans. Multim., 2021

Learning Compressible 360$^{\circ }$∘ Video Isomers.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

An Exploration of Embodied Visual Exploration.
Int. J. Comput. Vis., 2021

Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

Shapes as Product Differentiation: Neural Network Embedding in the Analysis of Markets for Fonts.
CoRR, 2021

Environment Predictive Coding for Embodied Agents.
CoRR, 2021

Shaping embodied agent behavior with activity-context priors from egocentric video.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Dexterous Grasping with Object-Centric Visual Affordances.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Learning to Set Waypoints for Audio-Visual Navigation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Multiview Pseudo-Labeling for Semi-supervised Learning from Video.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Audio-Visual Floorplan Reconstruction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Move2Hear: Active Audio-Visual Source Separation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

From Culture to Clothing: Discovering the World Events Behind A Century of Fashion Images.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Anticipative Video Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Ego-Exo: Transferring Visual Representations From Third-Person to First-Person Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semantic Audio-Visual Navigation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DexVIP: Learning Dexterous Grasping with Human Hand Pose Priors from Video.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Geometry-Aware Multi-Task Learning for Binaural Audio Generation from Video.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Densifying Supervision for Fine-Grained Visual Comparisons.
Int. J. Comput. Vis., 2020

Dexterous Robotic Grasping with Object-Centric Visual Affordances.
CoRR, 2020

Audio-Visual Waypoints for Navigation.
CoRR, 2020

Learning Affordance Landscapes forInteraction Exploration in 3D Environments.
CoRR, 2020

Learning Patterns of Tourist Movement and Photography from Geotagged Photos at Archaeological Heritage Sites in Cuzco, Peru.
CoRR, 2020

Audiovisual SlowFast Networks for Video Recognition.
CoRR, 2020

Computer Vision for Fashion: From Individual Recommendations to World-wide Trends.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Learning Affordance Landscapes for Interaction Exploration in 3D Environments.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Occupancy Anticipation for Efficient Exploration and Navigation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Proposal-Based Video Completion.
Proceedings of the Computer Vision - ECCV 2020, 2020

VisualEchoes: Spatial Image Representation Learning Through Echolocation.
Proceedings of the Computer Vision - ECCV 2020, 2020

SoundSpaces: Audio-Visual Navigation in 3D Environments.
Proceedings of the Computer Vision - ECCV 2020, 2020

Don't Judge an Object by Its Context: Learning to Overcome Contextual Bias.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

You2Me: Inferring Body Pose in Egocentric Video via First and Second Person Interactions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Ego-Topo: Environment Affordances From Egocentric Video.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

ViBE: Dressing for Diverse Body Shapes.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Listen to Look: Action Recognition by Previewing Audio.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

From Paris to Berlin: Discovering Fashion Style Influences Around the World.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Emergence of exploratory look-around behaviors through active observation completion.
Sci. Robotics, 2019

Pixel Objectness: Learning to Segment Generic Objects Automatically in Images and Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

End-to-End Policy Learning for Active Visual Categorization.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Click Carving: Interactive Object Segmentation in Images and Videos with Point Clicks.
Int. J. Comput. Vis., 2019

Predicting How to Distribute Work Between Algorithms and Humans to Segment an Image Batch.
Int. J. Comput. Vis., 2019

Audio-Visual Embodied Navigation.
CoRR, 2019

Dressing for Diverse Body Shapes.
CoRR, 2019

Grounded Human-Object Interaction Hotspots from Video (Extended Abstract).
CoRR, 2019

Grounded Human-Object Interaction Hotspots From Video.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Fashion++: Minimal Edits for Outfit Improvement.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Co-Separating Sounds of Visual Objects.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Thinking Outside the Pool: Active Training Image Creation for Relative Attributes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Extreme Relative Pose Estimation for RGB-D Scans via Scene Completion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Less Is More: Learning Highlight Detection From Video Duration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Kernel Transformer Networks for Compact Spherical Convolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

SpotTune: Transfer Learning Through Adaptive Fine-Tuning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2.5D Visual Sound.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Predicting Foreground Object Ambiguity and Efficiently Crowdsourcing the Segmentation(s).
Int. J. Comput. Vis., 2018

Subjects and Their Objects: Localizing Interactees for a Person-Centric View of Importance.
Int. J. Comput. Vis., 2018

Snap Angle Prediction for 360° Panorama.
CoRR, 2018

Attributes as Operators.
CoRR, 2018

Visual Question Answer Diversity.
Proceedings of the Sixth AAAI Conference on Human Computation and Crowdsourcing, 2018

Retrospective Encoders for Video Summarization.
Proceedings of the Computer Vision - ECCV 2018, 2018

Snap Angle Prediction for 360 ∘ Panoramas.
Proceedings of the Computer Vision - ECCV 2018, 2018

Sidekick Policy Learning for Active Visual Exploration.
Proceedings of the Computer Vision - ECCV 2018, 2018

Attributes as Operators: Factorizing Unseen Attribute-Object Compositions.
Proceedings of the Computer Vision - ECCV 2018, 2018

ShapeCodes: Self-supervised Feature Learning by Lifting Views to Viewgrids.
Proceedings of the Computer Vision - ECCV 2018, 2018

BlockDrop: Dynamic Inference Paths in Residual Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Compressible 360deg Video Isomers.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Learning Compressible 360° Video Isomers.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Creating Capsule Wardrobes From Fashion Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

VizWiz Grand Challenge: Answering Visual Questions From Blind People.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Im2Flow: Motion Hallucination From Static Images for Action Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Separate Object Sounds by Watching Unlabeled Video.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Compare and Contrast: Learning Prominent Visual Differences.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

BrowseWithMe: An Online Clothes Shopping Assistant for People with Visual Impairments.
Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility, 2018

2017
Geodesic Flow Kernel and Landmarks: Kernel Methods for Unsupervised Domain Adaptation.
Proceedings of the Domain Adaptation in Computer Vision Applications., 2017

Guest Editorial: Best of CVPR 2015.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Efficient Activity Detection in Untrimmed Video with Max-Subgraph Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Next-active-object prediction from egocentric videos.
J. Vis. Commun. Image Represent., 2017

Learning Image Representations Tied to Egomotion from Unlabeled Video.
Int. J. Comput. Vis., 2017

Learning to look around.
CoRR, 2017

Unsupervised learning through one-shot image-based shape reconstruction.
CoRR, 2017

Flat2Sphere: Learning Spherical Convolution for Fast Features from 360° Imagery.
CoRR, 2017

FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos.
CoRR, 2017

Pixel Objectness.
CoRR, 2017

Pano2Vid: Automatic Cinematography for Watching 360° Videos.
Proceedings of the 6th Workshop on Intelligent Cinematography and Editing, 2017

Learning Spherical Convolution for Fast Features from 360° Imagery.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Semantic Jitter: Dense Supervision for Visual Comparisons via Synthetic Images.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning the Latent "Look": Unsupervised Discovery of a Style-Coherent Embedding from Fashion Images.
Proceedings of the IEEE International Conference on Computer Vision, 2017

On-demand Learning for Deep Image Restoration.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Fashion Forward: Forecasting Visual Style in Fashion.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Seeing Invisible Poses: Estimating 3D Body Pose from Egocentric Video.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

CrowdVerge: Predicting If People Will Agree on the Answer to a Visual Question.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

2016
Crowdsourcing in Computer Vision.
Found. Trends Comput. Graph. Vis., 2016

Dense Supervision for Visual Comparisons via Synthetic Images.
CoRR, 2016

Visual Question: Predicting If a Crowd Will Agree on the Answer.
CoRR, 2016

From One-Trick Ponies to All-Rounders: On-Demand Learning for Image Restoration.
CoRR, 2016

Video Analysis for Body-worn Cameras in Law Enforcement.
CoRR, 2016

Text detection in stores using a repetition prior.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Click Carving: Segmenting Objects in Video with Point Clicks.
Proceedings of the Fourth AAAI Conference on Human Computation and Crowdsourcing, 2016

Video Summarization with Long Short-Term Memory.
Proceedings of the Computer Vision - ECCV 2016, 2016

Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video.
Proceedings of the Computer Vision - ECCV 2016, 2016

Detecting Engagement in Egocentric Video.
Proceedings of the Computer Vision - ECCV 2016, 2016

Look-Ahead Before You Leap: End-to-End Active Recognition by Forecasting the Effect of Motion.
Proceedings of the Computer Vision - ECCV 2016, 2016

Summary Transfer: Exemplar-Based Subset Selection for Video Summarization.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Slow and Steady Feature Analysis: Higher Order Temporal Coherence in Video.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Active Image Segmentation Propagation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Pull the Plug? Predicting If Computers or Humans Should Segment Images.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Object-Centric Representation Learning from Unlabeled Videos.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
Boundary Preserving Dense Local Regions.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Predicting Important Objects for Egocentric Video Summarization.
Int. J. Comput. Vis., 2015

WhittleSearch: Interactive Image Search with Relative Attribute Feedback.
Int. J. Comput. Vis., 2015

Discovering Attribute Shades of Meaning with the Crowd.
Int. J. Comput. Vis., 2015

Learning image representations equivariant to ego-motion.
CoRR, 2015

Large-Margin Determinantal Point Processes.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Just Noticeable Differences in Visual Attributes.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning Image Representations Tied to Ego-Motion.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Action and Attention in First-person Vision.
Proceedings of the British Machine Vision Conference 2015, 2015

Intentional Photos from an Unintentional Photographer: Detecting Snap Points in Egocentric Video with a Web Photo Prior.
Proceedings of the Mobile Cloud Visual Media Computing - From Interaction to Service, 2015

2014
Hashing Hyperplane Queries to Near Points with Applications to Large-Scale Active Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds.
Int. J. Comput. Vis., 2014

Editorial: Special Issue on Active and Interactive Methods in Computer Vision.
Int. J. Comput. Vis., 2014

Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition.
Int. J. Comput. Vis., 2014

Predicting Useful Neighborhoods for Lazy Local Learning.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Zero-shot recognition with unreliable attributes.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Diverse Sequential Subset Selection for Supervised Video Summarization.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Keynote Speakers.
Proceedings of the Seconf AAAI Conference on Human Computation and Crowdsourcing, 2014

Detecting Snap Points in Egocentric Video with a Web Photo Prior.
Proceedings of the Computer Vision - ECCV 2014, 2014

Supervoxel-Consistent Foreground Propagation in Video.
Proceedings of the Computer Vision - ECCV 2014, 2014

Fine-Grained Visual Comparisons with Local Learning.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Beyond Comparing Image Pairs: Setwise Active Learning for Relative Attributes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Decorrelating Semantic Visual Attributes by Resisting the Urge to Share.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Inferring Unseen Views of People.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Inferring Analogous Attributes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Which Image Pairs Will Cosegment Well? Predicting Partners for Cosegmentation.
Proceedings of the Computer Vision - ACCV 2014, 2014

Predicting the Location of "interactees" in Novel Human-Object Interactions.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Learning Binary Hash Codes for Large-Scale Image Search.
Proceedings of the Machine Learning for Computer Vision, 2013

Reconstructing a fragmented face from a cryptographic identification protocol.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Reshaping Visual Datasets for Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Analogy-preserving Semantic Embedding for Visual Object Categorization.
Proceedings of the 30th International Conference on Machine Learning, 2013

Connecting the Dots with Landmarks: Discriminatively Learning Domain-Invariant Features for Unsupervised Domain Adaptation.
Proceedings of the 30th International Conference on Machine Learning, 2013

Implied Feedback: Learning Nuances of User Behavior in Image Search.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Attribute Adaptation for Personalized Image Search.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Attribute Pivots for Guiding Relevance Feedback in Image Search.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Predicting Sufficient Annotation Strength for Interactive Foreground Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Active Learning of an Action Detector from Untrimmed Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Story-Driven Summarization for Egocentric Video.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Deformable Spatial Pyramid Matching for Fast Dense Correspondences.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Object-Centric Spatio-Temporal Pyramids for Egocentric Activity Recognition.
Proceedings of the British Machine Vision Conference, 2013

2012
Object-Graphs for Context-Aware Visual Category Discovery.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Kernelized Locality-Sensitive Hashing.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Reading between the Lines: Object Localization Using Implicit Cues from Image Tags.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Learning the Relative Importance of Objects from Tagged Images for Retrieval and Cross-Modal Search.
Int. J. Comput. Vis., 2012

Semantic Kernel Forests from Multiple Taxonomies.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Active Frame Selection for Label Propagation in Videos.
Proceedings of the Computer Vision - ECCV 2012, 2012

Shape Sharing for Object Segmentation.
Proceedings of the Computer Vision - ECCV 2012, 2012

Discovering important people and objects for egocentric video summarization.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

WhittleSearch: Image search with relative attribute feedback.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Geodesic flow kernel for unsupervised domain adaptation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Discovering localized attributes for fine-grained recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Efficient activity detection with max-subgraph search.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Relative Attributes for Enhanced Human-Machine Communication.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Visual Object Recognition
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01553-3, 2011

Cost-Sensitive Active Visual Category Learning.
Int. J. Comput. Vis., 2011

Learning a Tree of Metrics with Disjoint Visual Features.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Learning with Whom to Share in Multi-task Feature Learning.
Proceedings of the 28th International Conference on Machine Learning, 2011

Relative attributes.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Key-segments for video object segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Actively selecting annotations among objects and attributes.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Annotator rationales for visual recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Efficient region search for object detection.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Interactively building a discriminative vocabulary of nameable attributes.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Learning the easy things first: Self-paced visual category discovery.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Sharing features between objects and their attributes.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Clues from the beaten path: Location estimation with bursty sequences of tourist photos.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Face Discovery with Social Context.
Proceedings of the British Machine Vision Conference, 2011

2010
Gaussian Processes for Object Categorization.
Int. J. Comput. Vis., 2010

A task-driven intelligent workspace system to provide guidance feedback.
Comput. Vis. Image Underst., 2010

Efficiently searching for similar images.
Commun. ACM, 2010

Far-sighted active learning on a budget for image and video recognition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Top-down pairwise potentials for piecing together multi-class segmentation puzzles.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Collect-cut: Segmentation with top-down cues discovered in multi-object images.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Object-graphs for context-aware category discovery.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Learning a hierarchy of discriminative space-time neighborhood features for human action recognition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Asymmetric region-to-image matching for comparing images with generic object categories.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

3D Facial similarity: Automatic assessment versus perceptual judgments.
Proceedings of the Fourth IEEE International Conference on Biometrics: Theory Applications and Systems, 2010

Accounting for the Relative Importance of Objects in Image Retrieval.
Proceedings of the British Machine Vision Conference, 2010

2009
Fast Similarity Search for Learned Metrics.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Foreground Focus: Unsupervised Learning from Partially Matching Images.
Int. J. Comput. Vis., 2009

Kernelized locality-sensitive hashing for scalable image search.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

What's it going to cost you?: Predicting effort vs. informativeness for multi-label image annotations.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Shape discovery from unlabeled image collections.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Observe locally, infer globally: A space-time MRF for detecting abnormal activities with incremental updates.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Watch, Listen & Learn: Co-training on Captioned Images and Videos.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2008

Multi-Level Active Prediction of Useful Image Annotations for Recognition.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Online Metric Learning and Fast Similarity Search.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Keywords to visual categories: Multiple-instance learning forweakly supervised object categorization.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Fast image search for learned metrics.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Foreground Focus: Finding Meaningful Features in Unlabeled Images.
Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

2007
The Pyramid Match Kernel: Efficient Learning with Sets of Features.
J. Mach. Learn. Res., 2007

Active Learning with Gaussian Processes for Object Categorization.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Pyramid Match Hashing: Sub-Linear Time Indexing Over Partial Correspondences.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

The Pyramid Match: Efficient Learning with Partial Correspondences.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Matching sets of features for efficient retrieval and recognition.
PhD thesis, 2006

Approximate Correspondences in High Dimensions.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Unsupervised Learning of Categories from Sets of Partially Matching Image Features.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Avoiding the "Streetlight Effect": Tracking by Exploring Likelihood Modes.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Efficient Image Matching with Distributions of Local Invariant Features.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

A picture is worth a thousand keywords: image-based object search on a mobile platform.
Proceedings of the Extended Abstracts Proceedings of the 2005 Conference on Human Factors in Computing Systems, 2005

2004
Virtual Visual Hulls: Example-Based 3D Shape Inference from Silhouettes.
Proceedings of the Statistical Methods in Video Processing, 2004

Fast Contour Matching Using Approximate Earth Mover's Distance.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

2003
Communication via eye blinks and eyebrow raises: video-based human-computer interfaces.
Univers. Access Inf. Soc., 2003

Inferring 3D Structure with a Statistical Image-Based Shape Model.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

A Bayesian Approach to Image-Based Visual Hull Reconstruction.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

2001
Communication via Eye Blinks - Detection and Duration Analysis in Real Time.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001


  Loading...