Ioannis Patras

Orcid: 0000-0003-3913-4738

According to our database1, Ioannis Patras authored at least 240 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Bilinear Models of Parts and Appearances in Generative Adversarial Networks.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

One-Shot Neural Face Reenactment via Finding Directions in GAN's Latent Space.
Int. J. Comput. Vis., August, 2024

CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition.
CoRR, 2024

MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance.
CoRR, 2024

Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
CoRR, 2024

Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization.
CoRR, 2024

Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges.
CoRR, 2024

EquiPrompt: Debiasing Diffusion Models via Iterative Bootstrapping in Chain of Thoughts.
CoRR, 2024

Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer.
CoRR, 2024

FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion.
CoRR, 2024

VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning.
CoRR, 2024

DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment.
CoRR, 2024

Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization.
CoRR, 2024

Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Improving Fairness using Vision-Language Driven Image Augmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

CLIPCleaner: Cleaning Noisy Labels with CLIP.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing.
Proceedings of the Computer Vision - ECCV 2024, 2024

LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-Supervised Facial Representation Learning with Facial Region Awareness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training.
Proceedings of the 12th International Winter Conference on Brain-Computer Interface, 2024

2023
Artistic neural style transfer using CycleGAN and FABEMD by adaptive information selection.
Pattern Recognit. Lett., January, 2023

Machine Learning Approaches for Fine-Grained Symptom Estimation in Schizophrenia: A Comprehensive Review.
CoRR, 2023

SimDETR: Simplifying self-supervised pretraining for DETR.
CoRR, 2023

"Just To See You Smile": SMILEY, a Voice-Guided GUY GAN.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Parts of Speech-Grounded Subspaces in Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Study on the Use of Attention for Explaining Video Summarization.
Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long Videos, 2023

Low-Light Image Enhancement Based on U-Net and Haar Wavelet Pooling.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

NarSUM '23: The 2nd Workshop on User-Centric Narrative Summarization of Long Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Controllable image generation and manipulation.
Proceedings of the 2nd ACM International Workshop on Multimedia AI against Disinformation, 2023

MOAB: Multi-Modal Outer Arithmetic Block for Fusion of Histopathological Images and Genetic Data for Brain Tumor Grading.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Selecting A Diverse Set Of Aesthetically-Pleasing and Representative Video Thumbnails Using Reinforcement Learning.
Proceedings of the IEEE International Conference on Image Processing, 2023

HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

A Simple Baseline for Knowledge-Based Visual Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DivClust: Controlling Diversity in Deep Clustering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Self-Supervised Video Similarity Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Prompting Visual-Language Models for Dynamic Facial Expression Recognition.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval.
Int. J. Comput. Vis., 2022

Capsule Network based Contrastive Learning of Unsupervised Visual Representations.
CoRR, 2022

ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences.
CoRR, 2022

Video Summarization in the Deep Learning Era: Current Landscape and Future Directions.
Proceedings of the NarSUM '22: Proceedings of the 1st Workshop on User-centric Narrative Summarization of Long Videos, 2022

Learning from Label Relationships in Human Affect.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Explaining video summarization based on the focus of attention.
Proceedings of the IEEE International Symposium on Multimedia, 2022

Adaptive Soft Contrastive Learning.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

SSR: An Efficient and Robust Framework for Learning with Unknown Label Noise.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

CovMix: Covariance Mixing Regularization for Motor Imagery Decoding.
Proceedings of the 10th International Winter Conference on Brain-Computer Interface, 2022

2021
AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization.
IEEE Trans. Circuits Syst. Video Technol., 2021

AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups.
IEEE Trans. Affect. Comput., 2021

SchiNet: Automatic Estimation of Symptoms of Schizophrenia from Facial Behaviour Analysis.
IEEE Trans. Affect. Comput., 2021

Video Summarization Using Deep Neural Networks: A Survey.
Proc. IEEE, 2021

S3: Supervised Self-supervised Learning under Label Noise.
CoRR, 2021

Relationship-based Neural Baby Talk.
CoRR, 2021

Uncertainty Propagation in Convolutional Neural Networks: Technical Report.
CoRR, 2021

Few-Shot Action Localization without Knowing Boundaries.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Combining Global and Local Attention with Positional Encoding for Video Summarization.
Proceedings of the IEEE International Symposium on Multimedia, 2021

WarpedGANSpace: Finding non-linear RBF paths in GAN latent space.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Tensor Component Analysis for Interpreting the Latent Space of GANs.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Pairwise Ranking Network for Affect Recognition.
Proceedings of the 9th International Conference on Affective Computing and Intelligent Interaction, 2021

Estimating continuous affect with label uncertainty.
Proceedings of the 9th International Conference on Affective Computing and Intelligent Interaction, 2021

2020
Temporal Action Localization with Variance-Aware Networks.
CoRR, 2020

Boundary Uncertainty in a Single-Stage Temporal Action Localization Network.
CoRR, 2020

Unsupervised Video Summarization via Attention-Driven Adversarial Learning.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Performance over Random: A Robust Evaluation Protocol for Video Summarization Methods.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cycle-Consistent Adversarial Networks and Fast Adaptive Bi-dimensional Empirical Mode Decomposition for Style Transfer.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Alone versus In-a-group: A Multi-modal Framework for Automatic Affect Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2019

FIVR: Fine-Grained Incident Video Retrieval.
IEEE Trans. Multim., 2019

Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation.
IEEE Trans. Circuits Syst. Video Technol., 2019

A deep generic to specific recognition model for group membership analysis using non-verbal cues.
Image Vis. Comput., 2019

Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild.
Comput. Vis. Image Underst., 2019

Universal Foreground Segmentation Based on Deep Feature Fusion Network for Multi-Scene Videos.
IEEE Access, 2019

Detecting Tampered Videos with Multimedia Forensics and Deep Learning.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019


A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization.
Proceedings of the 1st International Workshop on AI for Smart TV Content Production, 2019

Exploring Feature Representation and Training Strategies in Temporal Action Localization.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

ViSiL: Fine-Grained Spatio-Temporal Video Similarity Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Can Automatic Facial Expression Analysis Be Used for Treatment Outcome Estimation in Schizophrenia?
Proceedings of the IEEE International Conference on Acoustics, 2019

Your Fellows Matter: Affect Analysis across Subjects in Group Videos.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition.
Proceedings of the 30th British Machine Vision Conference 2019, 2019


Finding Semantically Related Videos in Closed Collections.
Proceedings of the Video Verification in the Fake News Era., 2019

Finding Near-Duplicate Videos in Large-Scale Collections.
Proceedings of the Video Verification in the Fake News Era., 2019

Video Fragmentation and Reverse Search on the Web.
Proceedings of the Video Verification in the Fake News Era., 2019

2018
Linear Maximum Margin Classifier for Learning from Uncertain Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Semi-supervised Fisher vector network.
CoRR, 2018


Visual and Audio Analysis of Movies Video for Emotion Detection @ Emotional Impact of Movies Task MediaEval 2018.
Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018

A Multi-Task Cascaded Network for Prediction of Affect, Personality, Mood and Social Context Using EEG Signals.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

LikeNet: A Siamese Motion Estimation Network Trained in an Unsupervised Way.
Proceedings of the British Machine Vision Conference 2018, 2018

Deep Mixture of MRFs for Human Pose Estimation.
Proceedings of the Computer Vision - ACCV 2018, 2018

Multimedia Processing Essentials.
Proceedings of the Personal Multimedia Preservation, 2018

2017
Gaze movement-driven random forests for query clustering in automatic video annotation.
Multim. Tools Appl., 2017

Discriminative convolutional Fisher vector network for action recognition.
CoRR, 2017

AMIGOS: A dataset for Mood, personality and affect research on Individuals and GrOupS.
CoRR, 2017


Comparison of Fine-Tuning and Extension Strategies for Deep Convolutional Neural Networks.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017


Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Query and Keyframe Representations for Ad-hoc Video Search.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Concept Language Models and Event-based Concept Number Selection for Zero-example Event Detection.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

VideoAnalysis4ALL: An On-line Tool for the Automatic Fragmentation and Concept-based Annotation, and the Interactive Exploration of Videos.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Near-Duplicate Video Retrieval with Deep Metric Learning.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

SmileNet: Registration-Free Smiling Face Detection In The Wild.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Deep Globally Constrained MRFs for Human Pose Estimation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Generic to Specific Recognition Models for Membership Analysis in Group Videos.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Deep Refinement Convolutional Networks for Human Pose Estimation.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Fusing Multilabel Deep Networks for Facial Action Unit Detection.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Background modelling based on generative unet.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

2016
Learning to detect video events from zero or very few video examples.
Image Vis. Comput., 2016

Action recognition using saliency learned from recorded human gaze.
Image Vis. Comput., 2016

Special Issue on Individual and Group Activities in Video Event Analysis.
Comput. Vis. Image Underst., 2016


Video Event Detection Using Kernel Support Vector Machine with Isotropic Gaussian Sample Uncertainty (KSVM-iGSU).
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Ordering of Visual Descriptors in a Classifier Cascade Towards Improved Video Concept Detection.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Alone versus In-a-group: A Comparative Analysis of Facial Affect Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Deep Multi-task Learning with Label Correlation Constraint for Video Concept Detection.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Action Recognition Using Convolutional Restricted Boltzmann Machines.
Proceedings of the 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction, 2016

Minimal filtered channel features for pedestrian detection.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Video aesthetic quality assessment using kernel Support Vector Machine with isotropic Gaussian sample uncertainty (KSVM-IGSU).
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Online multi-task learning for semantic concept detection in video.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Unsupervised convolutional neural networks for motion estimation.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Automatic Recognition of Emotions and Membership in Group Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015
Face Pose Analysis.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Fine-Tuning Regression Forests Votes for Object Alignment in the Wild.
IEEE Trans. Image Process., 2015

Robust Face Alignment Under Occlusion via Regional Predictive Power Estimation.
IEEE Trans. Image Process., 2015

Local Features and a Two-Layer Stacking Architecture for Semantic Concept Detection in Video.
IEEE Trans. Emerg. Top. Comput., 2015

Privileged Information-Based Conditional Structured Output Regression Forest for Facial Point Detection.
IEEE Trans. Circuits Syst. Video Technol., 2015

DECAF: MEG-Based Multimodal Database for Decoding Affective Physiological Responses.
IEEE Trans. Affect. Comput., 2015

Random Subspace Supervised Descent Method for Regression Problems in Computer Vision.
IEEE Signal Process. Lett., 2015

Cascade of forests for face alignment.
IET Comput. Vis., 2015


VERGE: A Multimodal Interactive Video Search Engine.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

A Study on the Use of a Binary Local Descriptor and Color Extensions of Local Descriptors for Video Concept Detection.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

A flexible calibration method of multiple Kinects for 3D human reconstruction.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Cascade of classifiers based on binary, non-binary and deep convolutional network descriptors for video concept detection.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Inference of personality traits and affect schedule by analysis of spontaneous reactions to affective videos.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Mirror, mirror on the wall, tell me, is the error small?
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Face Alignment Assisted by Head Pose Estimation.
Proceedings of the British Machine Vision Conference 2015, 2015

Identifying valence and arousal levels via connectivity between EEG channels.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Concept Detection in Multimedia Web Resources About Home Made Explosives.
Proceedings of the 10th International Conference on Availability, Reliability and Security, 2015

2014
Face Sketch Landmarks Localization in the Wild.
IEEE Signal Process. Lett., 2014

Multimodal random forest based tensor regression.
IET Comput. Vis., 2014


Learning visual saliency using topographic independent component analysis.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Non-invasive player experience estimation from body motion and game context.
Proceedings of the 2014 IEEE Conference on Computational Intelligence and Games, 2014

Structured Semi-supervised Forest for Facial Landmarks Localization with Face Mask Reasoning.
Proceedings of the British Machine Vision Conference, 2014

2013
High order pLSA for indexing tagged images.
Signal Process., 2013

Coupled Gaussian Processes for Pose-Invariant Facial Expression Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Fusion of facial expressions and EEG for implicit affective tagging.
Image Vis. Comput., 2013

Semi-supervised visual recognition with constrained graph regularized non negative matrix factorization.
Proceedings of the IEEE International Conference on Image Processing, 2013

Sieving Regression Forest Votes for Facial Feature Detection in the Wild.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Privileged information-based conditional regression forest for facial feature detection.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Supervised dictionary learning for action localization.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012
Tree-Structured Feature Extraction Using Mutual Information.
IEEE Trans. Neural Networks Learn. Syst., 2012

Tensor Learning for Regression.
IEEE Trans. Image Process., 2012

DEAP: A Database for Emotion Analysis ;Using Physiological Signals.
IEEE Trans. Affect. Comput., 2012

Higher rank Support Tensor Machines for visual recognition.
Pattern Recognit., 2012

Leveraging social media for scalable object detection.
Pattern Recognit., 2012

Max-margin Non-negative Matrix Factorization.
Image Vis. Comput., 2012

Exploiting gaze movements for automatic video annotation.
Proceedings of the 13th International Workshop on Image Analysis for Multimedia Interactive Services, 2012

Image Interpretation by Combining Ontologies and Bayesian Networks.
Proceedings of the Artificial Intelligence: Theories and Applications, 2012

Higher Rank Support Tensor Machines.
Proceedings of the Advances in Visual Computing - 8th International Symposium, 2012

Affective gaming: Beyond using sensors.
Proceedings of the 5th International Symposium on Communications, 2012

Coupled 3D tracking and pose optimization of rigid objects using particle filter.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

A simple and effective extrinsic calibration method of a camera and a single line scanning lidar.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Support tensor action spotting.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Learning codebook weights for action detection.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

Face Parts Localization Using Structured-Output Regression Forests.
Proceedings of the Computer Vision, 2012

Exploring the Similarities of Neighboring Spatiotemporal Points for Action Pair Matching.
Proceedings of the Computer Vision - ACCV 2012, 2012

Exploiting Depth and Intensity Information for Head Pose Estimation with Random Forests and Tensor Models.
Proceedings of the Computer Vision - ACCV 2012 Workshops, 2012

2011
Enhancing Computer Vision Using the Collective Intelligence of Social Media.
Proceedings of the New Directions in Web Data Management 1, 2011

Evidence-Driven Image Interpretation by Combining Implicit and Explicit Knowledge in a Bayesian Network.
IEEE Trans. Syst. Man Cybern. Part B, 2011

Spatiotemporal Localization and Categorization of Human Actions in Unsegmented Image Sequences.
IEEE Trans. Image Process., 2011

Utilizing Implicit User Feedback to Improve Interactive Video Retrieval.
Adv. Multim., 2011

ITI-CERTH participation to TRECVID 2011.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

An eye-tracking-based approach to facilitate interactive video search.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Continuous emotion detection in response to music videos.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Support tucker machines.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Max-Margin Semi-NMF.
Proceedings of the British Machine Vision Conference, 2011

Combining Multi-modal Features for Social Media Analysis.
Proceedings of the Social Media Modeling and Computing., 2011

2010
Coupled Prediction Classification for Robust Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

A Dynamic Texture-Based Approach to Recognition of Facial Actions and Their Temporal Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Exploiting implicit user feedback in interactive video retrieval.
Proceedings of the 11th International Workshop on Image Analysis for Multimedia Interactive Services, 2010

Discriminative space-time voting for joint recognition and localization of actions.
Proceedings of the 2nd international workshop on Social signal processing, 2010

Regression-Based Multi-view Facial Expression Recognition.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Pyramidal Model for Image Semantic Segmentation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multiplicative Update Rules for Multilinear Support Tensor Machines.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Coupled Gaussian Process Regression for Pose-Invariant Facial Expression Recognition.
Proceedings of the Computer Vision, 2010

Facial expression invariant head pose normalization using Gaussian Process Regression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Optimizing visual search with implicit user feedback in interactive video retrieval.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Relative Margin Support Tensor Machines for gait and action recognition.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Single Trial Classification of EEG and Peripheral Physiological Signals for Recognition of Emotions Induced by Music Videos.
Proceedings of the Brain Informatics, International Conference, 2010

2009
Face Pose Analysis.
Proceedings of the Encyclopedia of Biometrics, 2009

Sparse B-spline polynomial descriptors for human activity recognition.
Image Vis. Comput., 2009

Context awareness in graph-based image semantic segmentation via visual word distributions.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

The fast-3D spatio-temporal interest region detector.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

An Evidence-Driven Probabilistic Inference Framework for Semantic Image Understanding.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2009

Discriminative 3D human pose estimation from monocular images via topological preserving hierarchical affinity clustering.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

An implicit spatiotemporal shape model for human activity localization and recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Latent Semantics Local Distribution for CRF-based Image Semantic Segmentation.
Proceedings of the British Machine Vision Conference, 2009

EEG analysis for implicit tagging of video data.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
Incremental Refinement of Image Salient-Point Detection.
IEEE Trans. Image Process., 2008

On the role of structure in part-based object detection.
Proceedings of the International Conference on Image Processing, 2008

Incremental salient point detection.
Proceedings of the IEEE International Conference on Acoustics, 2008

B-spline polynomial descriptors for human activity recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

2007
Probabilistic Confidence Measures for Block Matching Motion Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2007

Regression-Based Template Tracking in Presence of Occlusions.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

Template Trackingwith Observation Relevance Determination.
Proceedings of the International Conference on Image Processing, 2007

Regression tracking with data relevance determination.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Trajectory-Based Representation of Human Actions.
Proceedings of the Artifical Intelligence for Human Computing, 2007

2006
Dynamics of Facial Expression: Recognition of Facial Actions and Their Temporal Segments From Face Profile Image Sequences.
IEEE Trans. Syst. Man Cybern. Part B, 2006

Spatiotemporal salient points for visual recognition of human actions.
IEEE Trans. Syst. Man Cybern. Part B, 2006

Combining color and shape information for illumination-viewpoint invariant object recognition.
IEEE Trans. Image Process., 2006

Kernel-based Recognition of Human Actions Using Spatiotemporal Salient Points.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Gaze Tracking by Using Factorized Likelihoods Particle Filtering and Stereo Vision.
Proceedings of the 3rd International Symposium on 3D Data Processing, 2006

2005
Tracking deformable motion.
Proceedings of the IEEE International Conference on Systems, 2005

Detecting facial actions and their temporal segments in nearly frontal-view face image sequences.
Proceedings of the IEEE International Conference on Systems, 2005

Spatiotemporal saliency for human action recognition.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Facial Action Unit Detection using Probabilistic Actively Learned Support Vector Machines on Tracked Facial Point Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2005

2004
Dense motion estimation using regularization constraints on local parametric models.
IEEE Trans. Image Process., 2004

Online globally consistent mosaicing using an efficient representation.
Proceedings of the IEEE International Conference on Systems, 2004

Motion history for facial action detection in video.
Proceedings of the IEEE International Conference on Systems, 2004

Temporal modeling of facial actions from face profile image sequences.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Particle Filtering with Factorized Likelihoods for Tracking Facial Features.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

2003
Semi-automatic object-based video segmentation with labeling of color segments.
Signal Process. Image Commun., 2003

2002
TREC Feature Extraction by Active Learning.
Proceedings of The Eleventh Text REtrieval Conference, 2002

Regularized Patch Motion Estimation.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Facial action recognition in face profile image sequences.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Confidence measures for block matching motion estimation.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
Video Segmentation by MAP Labeling of Watershed Segments.
IEEE Trans. Pattern Anal. Mach. Intell., 2001

1998
An Iterative Motion Estimation-Segmentation Method using Watershed Segments.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

1996
Joint disparity and motion field estimation in stereoscopic image sequences.
Proceedings of the 13th International Conference on Pattern Recognition, 1996


  Loading...