Irfan A. Essa

Orcid: 0000-0002-6236-2969

Affiliations:
  • Georgia Institute of Technology, Atlanta GA, USA


According to our database1, Irfan A. Essa authored at least 220 papers between 1990 and 2024.

Collaborative distances:
  • Dijkstra number2 of two.
  • Erdős number3 of three.

Awards

IEEE Fellow

IEEE Fellow 2012, "For contributions to computer vision and graphics".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition.
Sensors, February, 2024

Exploring Efficient Foundational Multi-modal Models for Video Summarization.
CoRR, 2024

Mamba Fusion: Learning Actions Through Questioning.
CoRR, 2024

Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them.
CoRR, 2024

Cropper: Vision-Language Model for Image Cropping through In-Context Learning.
CoRR, 2024

CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers.
CoRR, 2024

3D Semantic MapNet: Building Maps for Multi-Object Re-Identification in 3D.
CoRR, 2024

On the Efficacy of Text-Based Input Modalities for Action Anticipation.
CoRR, 2024


Language Model Beats Diffusion - Tokenizer is key to visual generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Photorealistic Video Generation with Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Prompt-Free Diffusion: Taking "Text" Out of Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Washing Machine is All You Need? On the Feasibility of Machine Data for Self-Supervised Human Activity Recognition.
Proceedings of the International Conference on Activity and Behavior Computing, 2024

2023
Emergence of Maps in the Memories of Blind Navigation Agents.
AI Matters, June, 2023

Cascaded Compositional Residual Learning for Complex Interactive Behaviors.
IEEE Robotics Autom. Lett., 2023

BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning.
CoRR, 2023

Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement.
CoRR, 2023

Automatic Multi-Path Web Story Creation from a Structural Article.
CoRR, 2023

Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition.
CoRR, 2023

StyleDrop: Text-to-Image Generation in Any Style.
CoRR, 2023

Learning Disentangled Prompts for Compositional Image Synthesis.
CoRR, 2023

Tackling Hate Speech in Low-resource Languages with Context Experts.
CoRR, 2023

Investigating Enhancements to Contrastive Predictive Coding for Human Activity Recognition.
Proceedings of the IEEE International Conference on Pervasive Computing and Communications, 2023

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

StyleDrop: Text-to-Image Synthesis of Any Style.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Discrete Predictor-Corrector Diffusion Models for Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

MAGVIT: Masked Generative Video Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Prompt Tuning for Generative Transfer Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MaskSketch: Unpaired Structure-guided Masked Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Text and Click inputs for unambiguous open vocabulary instance segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Integrating Noisy Knowledge into Language Representations for E-Commerce Applications.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
Assessing the State of Self-Supervised Human Activity Recognition Using Wearables.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2022

Multi-Stage Based Feature Fusion of Multi-Modal Data for Human Activity Recognition.
CoRR, 2022

End-to-End Multimodal Representation Learning for Video Dialog.
CoRR, 2022

Video based Object 6D Pose Estimation using Transformers.
CoRR, 2022

Finding Islands of Predictability in Action Forecasting.
CoRR, 2022

Learning Temporal Rules from Noisy Timeseries Data.
CoRR, 2022

Sharing Decoders: Network Fission for Multi-task Pixel Prediction.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Synthesis-Assisted Video Prototyping From a Document.
Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Discrete Representations Strengthen Vision Transformer Robustness.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Improved Masked Image Generation with Token-Critic.
Proceedings of the Computer Vision - ECCV 2022, 2022

BLT: Bidirectional Layout Transformer for Controllable Layout Generation.
Proceedings of the Computer Vision - ECCV 2022, 2022

How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021
Contrastive Predictive Coding for Human Activity Recognition.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2021

VideoPose: Estimating 6D object pose from videos.
CoRR, 2021

Unsupervised Action Segmentation for Instructional Videos.
CoRR, 2021

Automatic Non-Linear Video Editing Transfer.
CoRR, 2021

PLAN-B: Predicting Likely Alternative Next Best Sequences for Action Prediction.
CoRR, 2021

Automatic Instructional Video Creation from a Markdown-Formatted Tutorial.
Proceedings of the UIST '21: The 34th Annual ACM Symposium on User Interface Software and Technology, 2021

Text as Neural Operator: Image Manipulation by Text Instruction.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Automatic Generation of Two-Level Hierarchical Tutorials from Instructional Makeup Videos.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Unsupervised Discovery of Actions in Instructional Videos.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Semantic MapNet: Building Allocentric SemanticMaps and Representations from Egocentric Views.
CoRR, 2020

Analyzing Visual Representations in Embodied Navigation Tasks.
CoRR, 2020

Automatic Video Creation From a Web Page.
Proceedings of the UIST '20: The 33rd Annual ACM Symposium on User Interface Software and Technology, 2020

Masked reconstruction based self-supervision for human activity recognition.
Proceedings of the ISWC '20: 2020 ACM International Symposium on Wearable Computers, 2020

DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames.
Proceedings of the 8th International Conference on Learning Representations, 2020

Neural Design Network: Graphic Layout Generation with Constraints.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Neural Design Network: Graphic Layout Generation with Constraints.
CoRR, 2019

Decentralized Distributed PPO: Solving PointGoal Navigation.
CoRR, 2019

Estimating Mass Distribution of Articulated Objects through Physical Interaction.
CoRR, 2019

Novel evaluation of surgical activity recognition models using task-based efficiency metrics.
Int. J. Comput. Assist. Radiol. Surg., 2019

Eyemotion: Classifying Facial Expressions in VR Using Eye-Tracking Cameras.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

A Data-Driven Predictive Model of Individual-Specific Effects of FES on Human Gait Dynamics.
Proceedings of the International Conference on Robotics and Automation, 2019

Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.
Proceedings of the IEEE International Conference on Acoustics, 2019

Embodied Question Answering in Photorealistic Environments With Point Cloud Perception.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Audio Visual Scene-Aware Dialog.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Unbiasing Semantic Segmentation For Robot Perception using Synthetic Data Feature Transfer.
CoRR, 2018

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7.
CoRR, 2018

Object category learning and retrieval with weak supervision.
CoRR, 2018

Let's Dance: Learning From Online Dance Videos.
CoRR, 2018

DiscrimNet: Semi-Supervised Action Recognition from Videos using Generative Adversarial Networks.
CoRR, 2018

Video and accelerometer-based motion analysis for automated surgical skills assessment.
Int. J. Comput. Assist. Radiol. Surg., 2018

Automated surgical skill assessment in RMIS training.
Int. J. Comput. Assist. Radiol. Surg., 2018

rtCaptcha: A Real-Time CAPTCHA Based Liveness Detection System.
Proceedings of the 25th Annual Network and Distributed System Security Symposium, 2018

Surgical Activity Recognition in Robot-Assisted Radical Prostatectomy Using Deep Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Identifying Gait Phases from Joint Kinematics during Walking with Switched Linear Dynamical Systems.
Proceedings of the 7th IEEE International Conference on Biomedical Robotics and Biomechatronics, 2018

2017
Computer Vision in Sports.
Comput. Vis. Image Underst., 2017

Complex Event Recognition from Images with Few Training Examples.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Exploring Symmetric and Asymmetric Bimanual Eating Detection with Inertial Sensors on the Wrist.
Proceedings of the 1st Workshop on Digital Biomarkers, 2017

Towards using visual attributes to infer image sentiment of social events.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Selfie-Presentation in Everyday Life: A Large-Scale Characterization of Selfie Contexts on Instagram.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

One-Shot Learning for Semantic Segmentation.
Proceedings of the British Machine Vision Conference 2017, 2017

Challenges and Opportunities in Automated Detection of Eating Activity.
Proceedings of the Mobile Health - Sensors, Analytic Methods, and Applications, 2017

2016
Automated video-based assessment of surgical skills for training and evaluation in medical schools.
Int. J. Comput. Assist. Radiol. Surg., 2016

Discovering picturesque highlights from egocentric vacation videos.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Leveraging Contextual Cues for Generating Basketball Highlights.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015
Depth Extraction from Videos Using Geometric Context and Occlusion Boundaries.
CoRR, 2015

Finding Temporally Consistent Occlusion Boundaries in Videos Using Geometric Context.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Semantic Instance Labeling Leveraging Hierarchical Segmentation.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Leveraging Context to Support Automated Food Recognition in Restaurants.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Egocentric Field-of-View Localization Using First-Person Point-of-View Devices.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Automated Assessment of Surgical Skills Using Frequency Analysis.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, 2015

Inferring Meal Eating Activities in Real World Settings from Ambient Sounds: A Feasibility Study.
Proceedings of the 20th International Conference on Intelligent User Interfaces, 2015

Predicting daily activities from egocentric images using deep learning.
Proceedings of the 2015 ACM International Symposium on Wearable Computers, 2015

A practical approach for recognizing eating moments with wrist-mounted inertial sensing.
Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2015

2014
A visualization framework for team sports captured using multiple static cameras.
Comput. Vis. Image Underst., 2014

Automated surgical OSATS prediction from videos.
Proceedings of the IEEE 11th International Symposium on Biomedical Imaging, 2014

Measuring Child Visual Attention using Markerless Head Tracking from Color and Depth Sensing Cameras.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Efficient Hierarchical Graph-Based Segmentation of RGBD Videos.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Clustering Social Event Images Using Kernel Canonical Correlation Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

Depth Extraction from Videos Using Geometric Context and Occlusion Boundaries.
Proceedings of the British Machine Vision Conference, 2014

2013
The Manifold of Human Emotions
Proceedings of the 1st International Conference on Learning Representations, 2013

Feasibility of identifying eating moments from first-person images leveraging human computation.
Proceedings of the International SenseCam & Pervasive Imaging Conference 2013, 2013


Post-processing approach for radiometric self-calibration of video.
Proceedings of the IEEE International Conference on Computational Photography, 2013

Technological approaches for addressing privacy concerns when recognizing eating behaviors with wearable cameras.
Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2013

Decoding Children's Social Behavior.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Geometric Context from Videos.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Augmenting Bag-of-Words: Data-Driven Discovery of Temporal and Structural Information for Activity Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Beyond Sentiment: The Manifold of Human Emotions.
Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 2013

2012
Linguistic transfer of human assembly tasks to robots.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Calibration-free rolling shutter removal.
Proceedings of the 2012 IEEE International Conference on Computational Photography, 2012

Orientation-aware scene understanding for mobile cameras.
Proceedings of the 2012 ACM Conference on Ubiquitous Computing, 2012

Recognizing water-based activities in the home through infrastructure-mediated sensing.
Proceedings of the 2012 ACM Conference on Ubiquitous Computing, 2012

Weakly Supervised Learning of Object Segmentations from Web-Scale Video.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Detecting regions of interest in dynamic scenes with camera motions.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Augmenting aerial earth maps with dynamic information from videos.
Virtual Real., 2011

Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Guest Editors' Introduction to the Special Section on Award-Winning Papers from the IEEE Conference on Computer Vision and Pattern Recognition 2009 (CVPR 2009).
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Gaussian process regression flow for analysis of motion trajectories.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Auto-directed video stabilization with robust L1 optimal camera paths.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Fluid Simulation with Articulated Bodies.
IEEE Trans. Vis. Comput. Graph., 2010

Modulating video credibility via visualization of quality evaluations.
Proceedings of the 4th ACM Workshop on Information Credibility on the Web, 2010

Motion fields to predict play evolution in dynamic sport scenes.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Player localization using multiple static cameras for sports visualization.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Efficient hierarchical graph-based video segmentation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Discontinuous seam-carving for video retargeting.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Human Action Recognition Using Global Point Feature Histograms and Action Shapes.
Adv. Robotics, 2009

A novel sequence representation for unsupervised analysis of human activities.
Artif. Intell., 2009

Human video textures.
Proceedings of the 2009 Symposium on Interactive 3D Graphics, 2009

Augmenting Aerial Earth Maps with dynamic information.
Proceedings of the Science & Technology Proceedings, 2009

Learning the basic units in American Sign Language using discriminative segmental feature selection.
Proceedings of the IEEE International Conference on Acoustics, 2009

Videolyzer: quality analysis of online informational video for bloggers and journalists.
Proceedings of the 27th International Conference on Human Factors in Computing Systems, 2009

2008
Audio Puzzler: piecing together time-stamped speech transcripts with a puzzle game.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Localization and 3D reconstruction of urban scenes using GPS.
Proceedings of the 12th IEEE International Symposium on Wearable Computers (ISWC 2008), September 28, 2008

An annotation model for making sense of information quality in online video.
Proceedings of the 3rd International Conference on Pragmatic Web, 2008

3D Shape Context and Distance Transform for action recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Discriminative feature selection for hidden Markov models using Segmental Boosting.
Proceedings of the IEEE International Conference on Acoustics, 2008

Computational photography and video: interacting and creating with videos and images.
Proceedings of the working conference on Advanced Visual Interfaces, 2008

2007
Data-driven and Procedural Analysis and Synthesis of Multimedia.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

A Boosted Segmentation Method for Surgical Workflow Analysis.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2007, 10th International Conference, Brisbane, Australia, October 29, 2007

Improving Activity Discovery with Automatic Neighborhood Estimation.
Proceedings of the IJCAI 2007, 2007

Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Structure from Statistics - Unsupervised Activity Analysis using Suffix Trees.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Incorporating Phase Information for Source Separation via Spectrogram Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2007

Phase-Aware Non-negative Spectrogram Factorization.
Proceedings of the Independent Component Analysis and Signal Separation, 2007

The evolution of authorship in a remix society.
Proceedings of the HYPERTEXT 2007, 2007

Tree-based Classifiers for Bilayer Video Segmentation.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Discovering Multivariate Motifs using Subsequence Density Estimation and Greedy Mixture Learning.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Videotater: an approach for pen-based digital video segmentation and tagging.
Proceedings of the 19th Annual ACM Symposium on User Interface Software and Technology, 2006

Interactive mosaic generation for video navigation.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Unsupervised analysis of activity sequences using event-motifs.
Proceedings of the 4th ACM International Workshop on Video Surveillance and Sensor Networks, 2006

Discovering Characteristic Actions from On-Body Sensor Data.
Proceedings of the Tenth IEEE International Symposium on Wearable Computers (ISWC 2006), 2006

Source Detection Using Repetitive Structure.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Estimating the Spatial Position of Spectral Components in Audio.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Learning Temporal Sequence Model from Partially Labeled Data.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Element-Free Elastic Models for Volume Fitting and Capture.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Texture optimization for example-based synthesis.
ACM Trans. Graph., 2005

Experiences with optimizing two stream-based applications for cluster execution.
J. Parallel Distributed Comput., 2005

Mediating photo collage authoring.
Proceedings of the 18th Annual ACM Symposium on User Interface Software and Technology, 2005

Unsupervised Activity Discovery and Characterization From Event-Streams.
Proceedings of the UAI '05, 2005

Tracking Multiple Objects through Occlusions.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Video-based nonphotorealistic and expressive illustration of motion.
Proceedings of the Computer Graphics International 2005, 2005

2004
Image and video based painterly animation.
Proceedings of the 3rd International Symposium on Non-Photorealistic Animation and Rendering, 2004

Feature Weighting for Segmentation.
Proceedings of the ISMIR 2004, 2004

Parameterized Authentication.
Proceedings of the Computer Security, 2004

Novel Skeletal Representation for Articulated Creatures.
Proceedings of the Computer Vision, 2004

Asymmetrically Boosted HMM for Speech Reading.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Propagation Networks for Recognition of Partially Ordered Sequential Action.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Content Based Image Synthesis.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

2003
Graphcut textures: image and video synthesis using graph cuts.
ACM Trans. Graph., 2003

Presenting Movement in a Computer-Based Dance Tutor.
Int. J. Hum. Comput. Interact., 2003

Exemplar-Based Surface Texture.
Proceedings of the 8th International Fall Workshop on Vision, Modeling, and Visualization, 2003

Rhythmic similarity through elaboration.
Proceedings of the ISMIR 2003, 2003

Perceptual user interfaces using vision-based eye tracking.
Proceedings of the 5th International Conference on Multimodal Interfaces, 2003

Spectral Partitioning for Structure from Motion.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Mandatory human participation: a new authentication scheme for building secure systems.
Proceedings of the 12th International Conference on Computer Communications and Networks, 2003

Expectation Grammars: Leveraging High-Level Expectations for Activity Recognition.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

ARGMode - Activity Recognition using Graphical Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2003

Boosted Audio-Visual HMM for Speech Reading.
Proceedings of the 2003 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), 2003

2002
Controlled animation of video sprites.
Proceedings of the 2002 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, 2002

Learning Video Processing by Example.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Recognizing Multitasked Activities from Video Using Stochastic Context-Free Grammar.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001
Image-based motion blur for stop motion animation.
Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, 2001

Real-time Photo-Realistic Physically Based Rendering of Fine Scale Human Skin Structure.
Proceedings of the 12th Eurographics Workshop on Rendering Techniques, 2001

Towards reliable multimodal sensing in aware environments.
Proceedings of the 2001 workshop on Perceptive user interfaces, 2001

Propagation of Innovative Information in Non-Linear Least-Squares Structure from Motion.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Depth Layers from Occlusions.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

2000
Ubiquitous sensing for smart and aware environments.
IEEE Wirel. Commun., 2000

Video textures.
Proceedings of the 27th Annual Conference on Computer Graphics and Interactive Techniques, 2000

Machine Learning for Video-Based Rendering.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Detecting and Tracking Eyes by Using Their Physiological Properties, Dynamics, and Appearance.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Increasing the opportunities for aging in place.
Proceedings of the Proceedings on the 2000 conference on Universal Usability, 2000

A non-invasive computer vision system for reliable eye tracking.
Proceedings of the CHI '00 Extended Abstracts on Human Factors in Computing Systems, 2000

Living laboratories: the future computing environments group at the Georgia Institute of Technology.
Proceedings of the CHI '00 Extended Abstracts on Human Factors in Computing Systems, 2000

1999
Computers Seeing People.
AI Mag., 1999

Adaptive Parallelization of Model-Based Head Tracking.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

Exploiting Human Actions and Object Context for Recognition Tasks.
Proceedings of the International Conference on Computer Vision, 1999

Motion based Decompositing of Video.
Proceedings of the International Conference on Computer Vision, 1999

The Aware Home: A Living Laboratory for Ubiquitous Computing Research.
Proceedings of the Cooperative Buildings, 1999

1997
Coding, Analysis, Interpretation, and Recognition of Facial Expressions.
IEEE Trans. Pattern Anal. Mach. Intell., 1997

1996
Task-Specific Gesture Analysis in Real-Time Using Interpolated Views.
IEEE Trans. Pattern Anal. Mach. Intell., 1996

Motion regularization for model-based head tracking.
Proceedings of the 13th International Conference on Pattern Recognition, 1996

Vision-Based HCI - What's Next and What are the Difficult Problems?
Proceedings of the 2nd International Conference on Automatic Face and Gesture Recognition (FG '96), 1996

Modeling, Tracking and Interactive Animation of Faces and Heads Using Input from Video.
Proceedings of the Computer Animation 1996, 1996

1995
Facial Expression Recognition Using a Dynamic Model and Motion Energy.
Proceedings of the Procedings of the Fifth International Conference on Computer Vision (ICCV 95), 1995

1994
Correlation and Interpolation Networks for Real-time Expression Analysis/Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

A vision system for observing and extracting facial action parameters.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1994

Visually guided animation.
Proceedings of the Computer Animation 1994, 1994

1992
A Unified Approach for Physical and Geometric Modeling for Graphics and Animation.
Comput. Graph. Forum, 1992

1990
The ThingWorld modeling system: virtual sculpting by modal forces.
Proceedings of the 1990 symposium on Interactive 3D graphics, 1990


  Loading...