Nicu Sebe

Orcid: 0000-0002-6597-7248

Affiliations:
  • University of Trento, Italy


According to our database1, Nicu Sebe authored at least 621 papers between 1997 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A pure MLP-Mixer-based GAN framework for guided image translation.
Pattern Recognit., 2025

GraphMLP: A graph MLP-like architecture for 3D human pose estimation.
Pattern Recognit., 2025

2024
Spatial entropy as an inductive bias for vision transformers.
Mach. Learn., September, 2024

Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering.
Int. J. Comput. Vis., August, 2024

Bridge Gap in Pixel and Feature Level for Cross-Modality Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Graph Transformer GANs With Graph Masked Modeling for Architectural Layout Generation.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Cloth Interactive Transformer for Virtual Try-On.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Guest Editorial Introduction to the Issue on Pre-Trained Models for Multi-Modality Understanding.
IEEE Trans. Multim., 2024

Adaptive Log-Euclidean Metrics for SPD Matrix Learning.
IEEE Trans. Image Process., 2024

Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization.
Int. J. Comput. Vis., 2024

PMGNet: Disentanglement and entanglement benefit mutually for compositional zero-shot learning.
Comput. Vis. Image Underst., 2024

RMLR: Extending Multinomial Logistic Regression into General Geometries.
CoRR, 2024

Discriminative Anchor Learning for Efficient Multi-view Clustering.
CoRR, 2024

Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection.
CoRR, 2024

GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models.
CoRR, 2024

PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection.
CoRR, 2024

Global-Local Distillation Network-Based Audio-Visual Speaker Tracking with Incomplete Modalities.
CoRR, 2024

ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining.
CoRR, 2024

Large Language Models for Multimodal Deformable Image Registration.
CoRR, 2024

When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding.
CoRR, 2024

Masked Image Modeling: A Survey.
CoRR, 2024

Towards Localized Fine-Grained Control for Facial Expression Generation.
CoRR, 2024

Any Image Restoration with Efficient Automatic Degradation Adaptation.
CoRR, 2024

Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of Riemannian Geometry.
CoRR, 2024

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance.
CoRR, 2024

Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization.
CoRR, 2024

Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning.
CoRR, 2024

Product Geometries on Cholesky Manifolds with Applications to SPD Manifolds.
CoRR, 2024

TransferAttn: Transferable-guided Attention Is All You Need for Video Domain Adaptation.
CoRR, 2024

Sharing Key Semantics in Transformer Makes Efficient Image Restoration.
CoRR, 2024

Curriculum Direct Preference Optimization for Diffusion and Consistency Models.
CoRR, 2024

Deep Learning-Based Object Pose Estimation: A Comprehensive Survey.
CoRR, 2024

Socially Pertinent Robots in Gerontological Healthcare.
CoRR, 2024

Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery.
CoRR, 2024

An Efficient Learning-based Solver Comparable to Metaheuristics for the Capacitated Arc Routing Problem.
CoRR, 2024

Key-Graph Transformer for Image Restoration.
CoRR, 2024

Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation.
CoRR, 2024

Bilateral Reference for High-Resolution Dichotomous Image Segmentation.
CoRR, 2024

VASE: Object-Centric Appearance and Shape Manipulation of Real Videos.
CoRR, 2024

SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Improving Fairness using Vision-Language Driven Image Augmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Generalized Source-Free Domain-adaptive Segmentation via Reliable Knowledge Propagation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

UVMap-ID: A Controllable and Personalized UV Map Generative Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Cross-Class Domain Adaptive Semantic Segmentation with Visual Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SM<sup>4</sup>Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Mitigating robust overfitting via self-residual-calibration regularization (Abstract Reprint).
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Democratizing Fine-grained Visual Recognition with Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Lie Group Approach to Riemannian Batch Normalization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Denoising Diffusion Probabilistic Models for Action-Conditioned 3D Motion Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Federated Generalized Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OpenBias: Open-Set Bias Detection in Text-to-Image Generative Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Riemannian Multinomial Logistics Regression for SPD Neural Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Diversity-Authenticity Co-constrained Stylization for Federated Domain Generalization in Person Re-identification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Compositional Semantic Mix for Domain Adaptation in Point Cloud Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Introduction to the Special Issue on Trustworthy Multimedia Computing and Applications in Urban Scenes.
ACM Trans. Multim. Comput. Commun. Appl., November, 2023

A Memorizing and Generalizing Framework for Lifelong Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Nonlinear neurons with human-like apical dendrite activations.
Appl. Intell., November, 2023

Interactive Neural Painting.
Comput. Vis. Image Underst., October, 2023

Guest Editorial Introduction to the Special Issue on Video Transformers.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Orthogonal SVD Covariance Conditioning and Latent Disentanglement.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Self-training transformer for source-free domain adaptation.
Appl. Intell., July, 2023

Fast Differentiable Matrix Square Root and Inverse Square Root.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks.
IEEE Trans. Neural Networks Learn. Syst., April, 2023

Towards Robust Person Re-Identification by Defending Against Universal Attackers.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Mitigating robust overfitting via self-residual-calibration regularization.
Artif. Intell., April, 2023

On the Eigenvalues of Global Covariance Pooling for Fine-Grained Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis.
Int. J. Comput. Vis., March, 2023

Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Bidirectional Transformer GAN for Long-term Human Motion Prediction.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Deep Unsupervised Key Frame Extraction for Efficient Video Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Cross-View Panorama Image Synthesis.
IEEE Trans. Multim., 2023

ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation.
IEEE Trans. Multim., 2023

Interaction Transformer for Human Reaction Generation.
IEEE Trans. Multim., 2023

100-Driver: A Large-Scale, Diverse Dataset for Distracted Driver Classification.
IEEE Trans. Intell. Transp. Syst., 2023

Structure-Guided Cross-Attention Network for Cross-Domain OCT Fluid Segmentation.
IEEE Trans. Image Process., 2023

Logit Margin Matters: Improving Transferable Targeted Adversarial Attack by Logit Calibration.
IEEE Trans. Inf. Forensics Secur., 2023

Impact of Facial Landmark Localization on Facial Expression Recognition.
IEEE Trans. Affect. Comput., 2023

Local and Global GANs With Semantic-Aware Upsampling for Image Generation.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

HiEve: A Large-Scale Benchmark for Human-Centric Video Analysis in Complex Events.
Int. J. Comput. Vis., 2023

Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation.
CoRR, 2023

Diversified in-domain synthesis with efficient fine-tuning for few-shot classification.
CoRR, 2023

Zero-Shot Point Cloud Registration.
CoRR, 2023

RankFeat&RankWeight: Rank-1 Feature/Weight Removal for Out-of-distribution Detection.
CoRR, 2023

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.
CoRR, 2023

Flow Factorized Representation Learning.
CoRR, 2023

Budget-Aware Pruning: Handling Multiple Domains with Less Parameters.
CoRR, 2023

Turn Fake into Real: Adversarial Head Turn Attacks Against Deepfake Detection.
CoRR, 2023

Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation.
CoRR, 2023

Point-PC: Point Cloud Completion Guided by Prior Knowledge via Causal Inference.
CoRR, 2023

T2TD: Text-3D Generation Model based on Prior Knowledge Guidance.
CoRR, 2023

Federated Generalized Category Discovery.
CoRR, 2023

Riemannian Multiclass Logistics Regression for SPD Neural Networks.
CoRR, 2023

Latent Traversals in Generative Models as Potential Flows.
CoRR, 2023

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models.
CoRR, 2023

Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery.
CoRR, 2023

Adaptive Riemannian Metrics on SPD Manifolds.
CoRR, 2023

Modiff: Action-Conditioned 3D Motion Generation with Denoising Diffusion Probabilistic Models.
CoRR, 2023

Unleashing the Transferability Power of Unsupervised Pre-Training for Emotion Recognition in Masked and Unmasked Facial Images.
IEEE Access, 2023

Overlap-guided Gaussian Mixture Models for Point Cloud Registration.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Tightening Classification Boundaries in Open Set Domain Adaptation through Unknown Exploitation.
Proceedings of the 36th SIBGRAPI Conference on Graphics, Patterns and Images, 2023

Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Flow Factorized Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FedVQA: Personalized Federated Visual Question Answering over Heterogeneous Scenes.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi-Domain Lifelong Visual Question Answering via Self-Critical Distillation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Latent Traversals in Generative Models as Potential Flows.
Proceedings of the International Conference on Machine Learning, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Budget-Aware Pruning for Multi-domain Learning.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Householder Projector for Unsupervised Latent Semantics Discovery.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Dynamically Instance-Guided Adaptation: A Backward-free Approach for Test-Time Domain Adaptive Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Dynamic Conceptional Contrastive Learning for Generalized Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Graph Transformer GANs for Graph-Constrained House Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Spatio-Temporal Graph Diffusion for Text-Driven Human Motion Generation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

A Structure-Guided Diffusion Model for Large-Hole Image Completion.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Cross-Modality Earth Mover's Distance for Visible Thermal Person Re-identification.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Introduction to the Special Issue on Fine-Grained Visual Recognition and Re-Identification.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Editorial Deep Learning for Anomaly Detection.
IEEE Trans. Neural Networks Learn. Syst., 2022

Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes.
IEEE Trans. Multim., 2022

Unsupervised High-Resolution Portrait Gaze Correction and Animation.
IEEE Trans. Image Process., 2022

Quasi-Equilibrium Feature Pyramid Network for Salient Object Detection.
IEEE Trans. Image Process., 2022

Joint Representation Learning and Keypoint Detection for Cross-View Geo-Localization.
IEEE Trans. Image Process., 2022

Relation Regularized Scene Graph Generation.
IEEE Trans. Cybern., 2022

Source-Free Open Compound Domain Adaptation in Semantic Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2022

Guest Editorial Introduction to the Special Issue on Advanced Machine Learning Methodologies for Large-Scale Video Object Segmentation and Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Facial Expression Translation Using Landmark Guided GANs.
IEEE Trans. Affect. Comput., 2022

Automatic Prediction of Group Cohesiveness in Images.
IEEE Trans. Affect. Comput., 2022

Video anomaly detection using deep residual-spatiotemporal translation network.
Pattern Recognit. Lett., 2022

Cross-view panorama image synthesis with progressive attention GANs.
Pattern Recognit., 2022

Probabilistic Graph Attention Network With Conditional Kernels for Pixel-Wise Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Guest Editorial: Introduction to the Special Section on Fine-Grained Visual Categorization.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deep traffic sign detection and recognition without target domain real images.
Mach. Vis. Appl., 2022

solo-learn: A Library of Self-supervised Methods for Visual Representation Learning.
J. Mach. Learn. Res., 2022

Curriculum Learning: A Survey.
Int. J. Comput. Vis., 2022

Cross-domain object detection using unsupervised image translation.
Expert Syst. Appl., 2022

Low-budget label query through domain alignment enforcement.
Comput. Vis. Image Underst., 2022

Consistency-Aware Anchor Pyramid Network for Crowd Localization.
CoRR, 2022

Parameter Sharing in Budget-Aware Adapters for Multi-Domain Learning.
CoRR, 2022

Vision+X: A Survey on Multimodal Learning in the Light of Data.
CoRR, 2022

Smooth image-to-image translations with latent space interpolations.
CoRR, 2022

Rethinking the Learning Paradigm for Facial Expression Recognition.
CoRR, 2022

Training and Tuning Generative Neural Radiance Fields for Attribute-Conditional 3D-Aware Face Generation.
CoRR, 2022

Spatial Entropy Regularization for Vision Transformers.
CoRR, 2022

Breaking the Chain of Gradient Leakage in Vision Transformers.
CoRR, 2022

LeRaC: Learning Rate Curriculum.
CoRR, 2022

Federated and Generalized Person Re-identification through Domain and Feature Hallucinating.
CoRR, 2022

Cross-Modality Earth Mover's Distance for Visible Thermal Person Re-Identification.
CoRR, 2022

Dual-Head Contrastive Domain Adaptation for Video Action Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Image and Video Generation: A Deep Learning Approach.
Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods, 2022

Temporal Alignment for History Representation in Reinforcement Learning.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Unsupervised Domain Adaptation for Video Transformers in Action Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Fast Differentiable Matrix Square Root.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

3D-Aware Semantic-Guided Generative Model for Human Synthesis.
Proceedings of the Computer Vision - ECCV 2022, 2022

Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality.
Proceedings of the Computer Vision, 2022

Batch-Efficient EigenDecomposition for Small and Medium Matrices.
Proceedings of the Computer Vision - ECCV 2022, 2022

GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Uncertainty-Guided Source-Free Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Class-Incremental Novel Class Discovery.
Proceedings of the Computer Vision - ECCV 2022, 2022

Novel Class Discovery in Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Hyperbolic Vision Transformers: Combining Improvements in Metric Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Data Augmentation-free Unsupervised Learning for 3D Point Cloud Understanding.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Geometry-Contrastive Transformer for Generalized 3D Pose Transfer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Introduction to the Special Issue on Explainable AI on Multimedia Computing.
ACM Trans. Multim. Comput. Commun. Appl., 2021

When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data.
IEEE Trans. Neural Networks Learn. Syst., 2021

Embedding Perspective Analysis Into Multi-Column Convolutional Neural Network for Crowd Counting.
IEEE Trans. Image Process., 2021

Layout-to-Image Translation With Double Pooling Generative Adversarial Networks.
IEEE Trans. Image Process., 2021

Multi-View Spatial Attention Embedding for Vehicle Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2021

AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups.
IEEE Trans. Affect. Comput., 2021

Editorial for the special issue on the DAFNE project (DigitalAnastylosis of Frescoes challeNgE).
Pattern Recognit. Lett., 2021

Explainable deep learning for efficient and robust pattern recognition: A survey of recent developments.
Pattern Recognit., 2021

Appearance and Pose-Conditioned Human Image Generation Using Deformable GANs.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

TriGAN: image-to-image translation for multi-source domain adaptation.
Mach. Vis. Appl., 2021

Metric-Learning-Based Deep Hashing Network for Content-Based Retrieval of Remote Sensing Images.
IEEE Geosci. Remote. Sens. Lett., 2021

Viewpoint and Scale Consistency Reinforcement for UAV Vehicle Re-Identification.
Int. J. Comput. Vis., 2021

Curriculum self-paced learning for cross-domain object detection.
Comput. Vis. Image Underst., 2021

Expert and Crowd-Guided Affect Annotation and Prediction.
CoRR, 2021

Global and Local Alignment Networks for Unpaired Image-to-Image Translation.
CoRR, 2021

Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation.
CoRR, 2021

Efficient Training of Visual Transformers with Small-Size Datasets.
CoRR, 2021

Controllable Person Image Synthesis with Spatially-Adaptive Warped Normalization.
CoRR, 2021

Transformer-Based Source-Free Domain Adaptation.
CoRR, 2021

Cloth Interactive Transformer for Virtual Try-On.
CoRR, 2021

Transformers Solve the Limited Receptive Field for Monocular Depth Prediction.
CoRR, 2021

Deep traffic light detection by overlaying synthetic context on arbitrary natural images.
Comput. Graph., 2021

Coarse-to-Fine Gaze Redirection with Numerical and Pictorial Guidance.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Efficient Training of Visual Transformers with Small Datasets.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Whitening for Self-Supervised Representation Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Exploiting sample correlation for crowd counting with multi-expert network.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Neighborhood Contrastive Learning for Novel Class Discovery.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

AniFormer: Data-driven 3D Animation with Transformer.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Learning to Attack Real-World Models for Person Re-identification via Virtual-Guided Meta-Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learning How to Smile: Expression Video Generation With Conditional Adversarial Recurrent Nets.
IEEE Trans. Multim., 2020

Spatio-Temporal Attention Networks for Action Recognition and Detection.
IEEE Trans. Multim., 2020

Unified Generative Adversarial Networks for Controllable Image-to-Image Translation.
IEEE Trans. Image Process., 2020

Binary neural networks: A survey.
Pattern Recognit., 2020

Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline.
Int. J. Comput. Vis., 2020

Special Issue on Generating Realistic Visual Data of Human Behavior.
Int. J. Comput. Vis., 2020

DF-GAN: Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis.
CoRR, 2020

Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events.
CoRR, 2020

MGGR: MultiModal-Guided Gaze Redirection with Coarse-to-Fine Learning.
CoRR, 2020

Edge Guided GANs with Semantic Preserving for Semantic Image Synthesis.
CoRR, 2020

GMM-UNIT: Unsupervised Multi-Domain and Multi-Modal Image-to-Image Translation via Attribute Gaussian Mixture Modeling.
CoRR, 2020

Non-linear Neurons with Human-like Apical Dendrite Activations.
CoRR, 2020

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.
CoRR, 2020

Low-Budget Unsupervised Label Query through Domain Alignment Enforcement.
CoRR, 2020

Unsupervised Anomaly Detection and Localization Based on Deep Spatiotemporal Translation Network.
IEEE Access, 2020

Attention-based Fusion for Multi-source Human Image Generation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Latent World Models For Intrinsically Motivated Exploration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Class-Aware Modality Mix and Center-Guided Metric Learning for Visible-Thermal Person Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Retrieval Guided Unsupervised Multi-domain Image to Image Translation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

FATE/MM 20: 2nd International Workshop on Fairness, Accountability, Transparency and Ethics in MultiMedia.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Attention GANs for Semantic Image Synthesis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Motion-supervised Co-Part Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

An Online Deep Learning Based System for Defects Detection in Glass Panels.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

The Good, The Bad, and The Ugly: Neural Networks Straight From JPEG.
Proceedings of the IEEE International Conference on Image Processing, 2020

Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations.
Proceedings of the Computer Vision - ECCV 2020, 2020

XingGAN for Person Image Generation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Online Depth Learning Against Forgetting in Monocular Videos.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Reverse Perspective Network for Perspective-Aware Object Counting.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bipartite Graph Reasoning GANs for Person Image Generation.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

SF-UDA<sup>3D</sup>: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection.
Proceedings of the 8th International Conference on 3D Vision, 2020

2019
Increasing Image Memorability with Neural Style Transfer.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Special Section on Multimodal Understanding of Social, Affective, and Subjective Attributes.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Monocular Depth Estimation Using Multi-Scale Continuous CRFs as Sequential Deep Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Recurrent Face Aging with Hierarchical AutoRegressive Memory.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Self Paced Deep Learning for Weakly Supervised Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Fast and robust dynamic hand gesture recognition via key frames extraction and feature fusion.
Neurocomputing, 2019

Asymmetric Generative Adversarial Networks for Image-to-Image Translation.
CoRR, 2019

GazeCorrection: Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks.
CoRR, 2019

What is the relationship between face alignment and facial expression recognition?
CoRR, 2019

Online Adaptation through Meta-Learning for Stereo Depth Estimation.
CoRR, 2019

Temporal Spiking Recurrent Neural Network for Action Recognition.
IEEE Access, 2019

Training Adversarial Discriminators for Cross-Channel Abnormal Event Detection in Crowds.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Low-Shot Learning From Imaginary 3D Model.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Deep Micro-Dictionary Learning and Coding Network.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

CV-C3D: Action Recognition on Compressed Videos with Convolutional 3D Networks.
Proceedings of the 32nd SIBGRAPI Conference on Graphics, Patterns and Images, 2019

First Order Motion Model for Image Animation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

FAT/MM'19: 1st International Workshop on Fairness, Accountability, and Transparency in MultiMedia.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Effortless Deep Training for Traffic Sign Detection Using Templates and Arbitrary Natural Images.
Proceedings of the International Joint Conference on Neural Networks, 2019

Predicting Group Cohesiveness in Images.
Proceedings of the International Joint Conference on Neural Networks, 2019

Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night.
Proceedings of the International Joint Conference on Neural Networks, 2019

Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation.
Proceedings of the International Joint Conference on Neural Networks, 2019

Whitening and Coloring Batch Transform for GANs.
Proceedings of the 7th International Conference on Learning Representations, 2019

Expression Conditional Gan for Facial Expression-to-Expression Translation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Regularized Evolutionary Algorithm for Dynamic Neural Topology Search.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Unsupervised Domain Adaptation Using Full-Feature Whitening and Colouring.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Budget-Aware Adapters for Multi-Domain Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Attribute-Guided Sketch Generation.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Animating Arbitrary Objects via Deep Motion Transfer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Unsupervised Domain Adaptation Using Feature-Whitening and Consensus Loss.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Pattern-Affinitive Propagation Across Depth, Surface Normal and Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Anomaly Event Detection Using Generative Adversarial Network for Surveillance Videos.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Structured Coupled Generative Adversarial Networks for Unsupervised Monocular Depth Estimation.
Proceedings of the 2019 International Conference on 3D Vision, 2019

2018
Human-Centered Computing: Application to Multimedia.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Joint Attributes and Event Analysis for Multimedia Event Detection.
IEEE Trans. Neural Networks Learn. Syst., 2018

Dynamic Affinity Graph Construction for Spectral Clustering Using Multiple Features.
IEEE Trans. Neural Networks Learn. Syst., 2018

Cross-Paced Representation Learning With Partial Curricula for Sketch-Based Image Retrieval.
IEEE Trans. Image Process., 2018

Flexible Manifold Learning With Optimal Graph for Image and Video Representation.
IEEE Trans. Image Process., 2018

ASCERTAIN: Emotion and Personality Recognition Using Commercial Sensors.
IEEE Trans. Affect. Comput., 2018

Quantization-based hashing: a general framework for scalable image and video retrieval.
Pattern Recognit., 2018

A Survey on Learning to Hash.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Recurrent Convolutional Shape Regression.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Viewpoint-Consistent 3D Face Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Guest Editors' Introduction to the Special Section on Learning with Shared Information for Computer Vision and Multimedia Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Deep appearance and motion learning for egocentric activity recognition.
Neurocomputing, 2018

New Signals in Multimedia Systems and Applications.
IEEE Multim., 2018

Predicting Group Cohesiveness in Images.
CoRR, 2018

Whitening and Coloring transform for GANs.
CoRR, 2018

Every Smile is Unique: Landmark-Guided Diverse Smile Generation.
CoRR, 2018

Plug-and-Play CNN for Crowd Motion Analysis: An Application in Abnormal Event Detection.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Session details: Deep-1 (Image Translation).
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

EE-USAD: ACM MM 2018Workshop on UnderstandingSubjective Attributes of Data focus on Evoked Emotions.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

GestureGAN for Hand Gesture-to-Gesture Translation in the Wild.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

FontMatcher: Font Image Paring for Harmonious Digital Graphic Design.
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018

Deep Metric and Hash-Code Learning for Content-Based Retrieval of Remote Sensing Images.
Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium, 2018

Semantic-Fusion Gans for Semi-Supervised Satellite Image Classification.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Automatic Group Affect Analysis in Images via Visual Attribute and Feature Networks.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Depression Severity Estimation from Multiple Modalities.
Proceedings of the 20th IEEE International Conference on e-Health Networking, 2018

Deformable GANs for Pose-Based Human Image Generation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Group Consistent Similarity Learning via Deep CRF for Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Every Smile Is Unique: Landmark-Guided Diverse Smile Generation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Enhancing Perceptual Attributes with Bayesian Style Generation.
Proceedings of the Computer Vision - ACCV 2018, 2018

Dual Generator Generative Adversarial Networks for Multi-domain Image-to-Image Translation.
Proceedings of the Computer Vision - ACCV 2018, 2018

Unsupervised Adversarial Depth Estimation Using Cycled Generative Networks.
Proceedings of the 2018 International Conference on 3D Vision, 2018

Utilizing implicit user cues for multimedia analytics.
Proceedings of the Frontiers of Multimedia Research, 2018

Multimodal analysis of free-standing conversational groups.
Proceedings of the Frontiers of Multimedia Research, 2018

2017
Guest Editorial: Large-Scale Multimedia Data Retrieval, Classification, and Understanding.
IEEE Trans. Multim., 2017

The Many Shades of Negativity.
IEEE Trans. Multim., 2017

Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information.
Multim. Tools Appl., 2017

Detecting anomalous events in videos by learning deep representations of appearance and motion.
Comput. Vis. Image Underst., 2017

The S-Hock dataset: A new benchmark for spectator crowd analysis.
Comput. Vis. Image Underst., 2017

Indoor localization via multi-view images and videos.
Comput. Vis. Image Underst., 2017

AMIGOS: A dataset for Mood, personality and affect research on Individuals and GrOupS.
CoRR, 2017

Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

What your Facebook Profile Picture Reveals about your Personality.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

SAWACMMM'17: The 1st Workshop on Multi Media Applications within the South African Context.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

MUSA2: First ACM Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

How to Make an Image More Memorable?: A Deep Style Transfer Approach.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Simple, Efficient and Effective Encodings of Local Deep Features for Video Action Recognition.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Abnormal event detection in videos using generative adversarial nets.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A cross-modal adaptation approach for brain decoding.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multi-scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Cross-Modal Deep Representations for Robust Pedestrian Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Viraliency: Pooling Local Virality.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

SALSA: A Multimodal Dataset for the Automated Analysis of Free-Standing Social Interactions.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

Exploring Multitask and Transfer Learning Algorithms for Head Pose Estimation in Dynamic Multiview Scenarios.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016
Active domain adaptation with noisy labels for multimedia analysis.
World Wide Web, 2016

Learning Personalized Models for Facial Expression Analysis and Gesture Recognition.
IEEE Trans. Multim., 2016

A Distance-Computation-Free Search Scheme for Binary Code Databases.
IEEE Trans. Multim., 2016

Multimodal Personality Recognition in Collaborative Goal-Oriented Tasks.
IEEE Trans. Multim., 2016

Category Specific Dictionary Learning for Attribute Specific Feature Selection.
IEEE Trans. Image Process., 2016

Optimized Graph Learning Using Partial Tags and Multiple Features for Image and Video Annotation.
IEEE Trans. Image Process., 2016

Perceptual Attributes Optimization for Multivideo Summarization.
IEEE Trans. Cybern., 2016

Guest Editorial: Big Media Data: Understanding, Search, and Mining.
IEEE Trans. Big Data, 2016

A Multi-Task Learning Framework for Head Pose Estimation under Target Motion.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

SALSA: A Novel Dataset for Multimodal Group Behavior Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

A modified vector of locally aggregated descriptors approach for fast video classification.
Multim. Tools Appl., 2016

Emotion recognition in the wild.
J. Multimodal User Interfaces, 2016

Event-based media processing and analysis: A survey of the literature.
Image Vis. Comput., 2016

Deep and fast: Deep learning hashing with semi-supervised graph construction.
Image Vis. Comput., 2016

Special Issue on Event-based Media Processing and Analysis.
Image Vis. Comput., 2016

Where am I in the dark: Exploring active transfer learning on the use of indoor localization based on thermal imaging.
Neurocomputing, 2016

Collaborative Sparse Coding for Multiview Action Recognition.
IEEE Multim., 2016

Computational Modeling of Affective Qualities of Abstract Paintings.
IEEE Multim., 2016

Fisher Kernel Temporal Variation-based Relevance Feedback for video retrieval.
Comput. Vis. Image Underst., 2016

Emotion-Based Crowd Representation for Abnormality Detection.
CoRR, 2016

The Death and Life of Great Italian Cities: A Mobile Phone Data Perspective.
Proceedings of the 25th International Conference on World Wide Web, 2016

Academic Coupled Dictionary Learning for Sketch-based Image Retrieval.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Joint Graph Learning and Video Segmentation via Multiple Cues and Topology Calibration.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Are Safer Looking Neighborhoods More Lively?: A Multimodal Investigation into Urban Life.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Emerging Topics in Learning from Noisy and Missing Data.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

A Quality Adaptive Multimodal Affect Recognition System for User-Centric Multimedia Indexing.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Multi-Paced Dictionary Learning for cross-domain retrieval and recognition.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Sparse-coded cross-domain adaptation from the visual to the brain domain.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Boosting VLAD with double assignment using deep features for action recognition in videos.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Brain and music: Music genre classification using brain signals.
Proceedings of the 24th European Signal Processing Conference, 2016

The First 3D Face Alignment in the Wild (3DFAW) Challenge.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Recurrent Face Aging.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recognizing Emotions from Abstract Paintings Using Non-Linear Matrix Completion.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Histograms of Motion Gradients for real-time video classification.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

Projective Unsupervised Flexible Embedding with Optimal Graph.
Proceedings of the British Machine Vision Conference 2016, 2016

Recurrent Convolutional Face Alignment.
Proceedings of the Computer Vision - ACCV 2016, 2016

Sparse Code Filtering for Action Pattern Mining.
Proceedings of the Computer Vision - ACCV 2016, 2016

Graph-without-cut: An Ideal Graph Learning for Image Segmentation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2015

Compact Image Fingerprint Via Multiple Kernel Hashing.
IEEE Trans. Multim., 2015

Event Oriented Dictionary Learning for Complex Event Detection.
IEEE Trans. Image Process., 2015

Egocentric Daily Activity Recognition via Multitask Clustering.
IEEE Trans. Image Process., 2015

Affective Analysis of Professional and Amateur Abstract Paintings Using Statistical Analysis and Art Theory.
ACM Trans. Interact. Intell. Syst., 2015

Guest Editorial: Big Media Data: Understanding, Search, and Mining (Part 2).
IEEE Trans. Big Data, 2015

DECAF: MEG-Based Multimodal Database for Decoding Affective Physiological Responses.
IEEE Trans. Affect. Comput., 2015

Memory efficient large-scale image-based localization.
Multim. Tools Appl., 2015

Video classification with Densely extracted HOG/HOF/MBH features: an evaluation of the accuracy/computational efficiency trade-off.
Int. J. Multim. Inf. Retr., 2015

Cross-Lingual Cross-Media Content Linking: Annotations and Joint Representations (Dagstuhl Seminar 15201).
Dagstuhl Reports, 2015

Supervised Hashing with Pseudo Labels for Scalable Multimedia Retrieval.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Who's Afraid of Itten: Using the Art Theory of Color Combination to Analyze Emotions in Abstract Paintings.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2nd Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication (HCMC2015).
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Analyzing Free-standing Conversational Groups: A Multimodal Approach.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Attribute Guided Dictionary Learning.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Looking at Mondrian's Victory Boogie-Woogie: What Do I Feel?
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Inferring Painting Style with Multi-Task Dictionary Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Implicit User-centric Personality Recognition Based on Physiological Responses to Emotional Videos.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Beyond Bag-of-Words: Fast video classification with Fisher Kernel Vector of Locally Aggregated Descriptors.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

PET: An eye-tracking dataset for animal-centric Pascal object classes.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Real-life violent social interaction detection.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Cluster encoding for modelling temporal variation in video.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Emotions in Abstract Art: Does Texture Matter?
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

Movie Genre Classification by Exploiting MEG Brain Signals.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

FaceCept3D: Real Time 3D Face Tracking and Analysis.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Regressing a 3D Face Shape from a Single Image.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Localize Me Anywhere, Anytime: A Multi-task Point-Retrieval Approach.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Facial expression recognition under a wide range of head poses.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

The more the merrier: Analysing the affect of a group of people in images.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Inference of personality traits and affect schedule by analysis of spontaneous reactions to affective videos.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Optimal graph learning with partial tags and multiple features for image and video annotation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

The S-HOCK dataset: Analyzing crowds at the stadium.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning Deep Representations of Appearance and Motion for Anomalous Event Detection.
Proceedings of the British Machine Vision Conference 2015, 2015

Complex Event Detection via Event Oriented Dictionary Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Weakly Supervised Photo Cropping.
IEEE Trans. Multim., 2014

Image Attribute Adaptation.
IEEE Trans. Multim., 2014

Guest Editorial Special Section on Socio-Mobile Media Analysis and Retrieval.
IEEE Trans. Multim., 2014

Multitask Linear Discriminant Analysis for View Invariant Action Recognition.
IEEE Trans. Image Process., 2014

Faved! Biometrics: Tell Me Which Image You Like and I'll Tell You Who You Are.
IEEE Trans. Inf. Forensics Secur., 2014

Knowledge Adaptation with PartiallyShared Features for Event DetectionUsing Few Exemplars.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Special issue on contextual vision computing.
Mach. Vis. Appl., 2014

Exploring Transfer Learning Approaches for Head Pose Classification from Multi-view Surveillance Images.
Int. J. Comput. Vis., 2014

Harnessing Lab Knowledge for Real-World Action Recognition.
Int. J. Comput. Vis., 2014

Large-Scale Geosocial Multimedia [Guest editorial].
IEEE Multim., 2014

Special section on learning from multiple evidences for large scale multimedia analysis.
Comput. Vis. Image Underst., 2014

GLocal tells you more: Coupling GLocal structural for feature selection with sparsity for image and video classification.
Comput. Vis. Image Underst., 2014

You Talkin' to Me?: Recognizing Complex Human Interactions in Unconstrained Videos.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

We are not All Equal: Personalizing Models for Facial Expression Analysis with Transductive Parameter Transfer.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Multiple Features But Few Labels?: A Symbiotic Solution Exemplified for Video Analysis.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Temporal Dropout of Changes Approach to Convolutional Learning of Spatio-Temporal Features.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

A Multi-task Learning Framework for Time-continuous Emotion Estimation from Crowd Annotations.
Proceedings of the 2014 International ACM Workshop on Crowdsourcing for Multimedia, 2014

Realtime Video Classification using Dense HOF/HOG.
Proceedings of the International Conference on Multimedia Retrieval, 2014

The Mystery of Faces: Investigating Face Contribution for Multimedia Event Detection.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Simultaneous Ground Metric Learning and Matrix Factorization with Earth Mover's Distance.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Clustered Multi-task Linear Discriminant Analysis for View Invariant Color-Depth Action Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Robust Real-Time Extreme Head Pose Estimation.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Emotional Valence Recognition, Analysis of Salience and Eye Movements.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Unsupervised Domain Adaptation for Personalized Facial Emotion Recognition.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

It's all about habits: Exploiting multi-task clustering for activities of daily living analysis.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Minimizing dataset bias: Discriminative multi-task sparse coding through shared subspace learning for image classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Money walks: a human-centric study on the economics of personal mobile data.
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

Learning to Group Objects.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Knowing Where I Am: Exploiting Multi-Task Learning for Multi-view Indoor Image-based Localization.
Proceedings of the British Machine Vision Conference, 2014

Recognizing Daily Activities from First-Person Videos with Multi-task Clustering.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis.
IEEE Trans. Multim., 2013

Feature Selection for Multimedia Analysis by Sharing Information Among Multiple Tasks.
IEEE Trans. Multim., 2013

Multimedia Event Detection Using A Classifier-Specific Intermediate Representation.
IEEE Trans. Multim., 2013

A Prototype Learning Framework Using EMD: Application to Complex Scenes Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Guest Editorial: Human-Computer Interaction: Real-Time Vision Aspects of Natural User Interfaces.
Int. J. Comput. Vis., 2013

Unified Dictionary Learning and Region Tagging with Hierarchical Sparse Representation.
Comput. Vis. Image Underst., 2013

Nobody likes Mondays: foreground detection and behavioral patterns analysis in complex urban scenes.
Proceedings of the 4th ACM/IEEE international workshop on Analysis and retrieval of tracked events and motion in imagery stream, 2013

GLocal structural feature selection with sparsity for multimedia data understanding.
Proceedings of the ACM Multimedia Conference, 2013

Time matters!: capturing variation in time in video using fisher kernels.
Proceedings of the ACM Multimedia Conference, 2013

We are not equally negative: fine-grained labeling for multimedia event detection.
Proceedings of the ACM Multimedia Conference, 2013

Fisher kernel based relevance feedback for multimodal video retrieval.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Thinking of Images as What They Are: Compound Matrix Regression for Image Classification.
Proceedings of the IJCAI 2013, 2013

On the relationship between head pose, social attention and personality prediction for unstructured and dynamic group interactions.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Multi-task linear discriminant analysis for multi-view action recognition.
Proceedings of the IEEE International Conference on Image Processing, 2013

We like it! Mapping image preferences on the counting grid.
Proceedings of the IEEE International Conference on Image Processing, 2013

Daily Living Activities Recognition via Efficient High and Low Level Cues Combination and Fisher Kernel Representation.
Proceedings of the Image Analysis and Processing - ICIAP 2013, 2013

No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Feature Weighting via Optimal Thresholding for Video Analysis.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Decoding affect in videos employing the MEG brain signal.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Particles cross-influence for entity grouping.
Proceedings of the 21st European Signal Processing Conference, 2013

Exploiting visual search theory to infer social interactions.
Proceedings of the Multimedia Content and Mobile Devices 2013, 2013

Complex Event Detection via Multi-source Video Attributes.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Multimodal Engagement Classification for Affective Cinema.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

User-centric Affective Video Tagging from MEG and Peripheral Physiological Responses.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
Discriminating Joint Feature Analysis for Multimedia Data Understanding.
IEEE Trans. Multim., 2012

Web Image Annotation Via Subspace-Sparsity Collaborated Feature Selection.
IEEE Trans. Multim., 2012

Combining Head Pose and Eye Location Information for Gaze Estimation.
IEEE Trans. Image Process., 2012

Sparse Color Interest Points for Image Retrieval and Object Categorization.
IEEE Trans. Image Process., 2012

Connecting Meeting Behavior with Extraversion - A Systematic Study.
IEEE Trans. Affect. Comput., 2012

Societally connected multimedia across cultures.
J. Zhejiang Univ. Sci. C, 2012

What Are You Looking at? - Improving Visual Gaze Estimation by Saliency.
Int. J. Comput. Vis., 2012

The SocioMetric Badges Corpus: A Multilevel Behavioral Dataset for Social Behavior in Complex Organizations.
Proceedings of the 2012 International Conference on Privacy, 2012

In the eye of the beholder: employing statistical analysis and eye tracking for analyzing abstract paintings.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Knowledge adaptation for ad hoc multimedia event detection with few exemplars.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Distributional semantics with eyes: using image analysis to improve computational representations of word meaning.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Categorization of a collection of pictures into structured events.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Classifier-specific intermediate representation for multimedia tasks.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Enhanced semantic descriptors for functional scene categorization.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Active transfer learning for multi-view head-pose classification.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Friends don't lie: inferring personality traits from social network structure.
Proceedings of the 2012 ACM Conference on Ubiquitous Computing, 2012

Boosting-based transfer learning for multi-view head-pose classification from surveillance videos.
Proceedings of the 20th European Signal Processing Conference, 2012

Exploiting Sparse Representations for Robust Analysis of Noisy Complex Video Scenes.
Proceedings of the Computer Vision - ECCV 2012, 2012

Real Time Detection of Social Interactions in Surveillance Video.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

(Unseen) event recognition via semantic compositionality.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

An Adaptation Framework for Head-Pose Classification in Dynamic Multi-view Scenarios.
Proceedings of the Computer Vision, 2012

Tell Me What You Like and I'll Tell You What You Are: Discriminating Visual Preferences on Flickr Data.
Proceedings of the Computer Vision - ACCV 2012, 2012

UX_Mate: from facial expressions to UX evaluation.
Proceedings of the Designing Interactive Systems Conference 2012, 2012

2011
Building descriptive and discriminative visual codebook for large-scale image applications.
Multim. Tools Appl., 2011

Personalization in multimedia retrieval: A survey.
Multim. Tools Appl., 2011

Looking at the viewer: analysing facial activity to detect personal highlights of multimedia contents.
Multim. Tools Appl., 2011

Computer vision for ambient intelligence.
J. Ambient Intell. Smart Environ., 2011

Contextual Modeling of Personality States' Dynamics in Face-to-Face Interactions.
Proceedings of the PASSAT/SocialCom 2011, Privacy, 2011

Automatic modeling of personality states in small group interactions.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Can computers learn from humans to see better?: inferring scene semantics from viewers' eye movements.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Exploiting the entire feature space with sparsity for automatic image annotation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Please, tell me about yourself: automatic personality assessment using short self-presentations.
Proceedings of the 13th International Conference on Multimodal Interfaces, 2011

Sorting Atomic Activities for Discovering Spatio-temporal Patterns in Dynamic Scenes.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

2010
Human-Centered Computing.
Proceedings of the Encyclopedia of Software Engineering, 2010

Special Issue on Multimodal Affective Interaction.
IEEE Trans. Multim., 2010

Large-scale image and video search: Challenges, technologies, and trends.
J. Vis. Commun. Image Represent., 2010

Best of Automatic Face and Gesture Recognition 2008.
Image Vis. Comput., 2010

Sonify your face: facial expressions for sound generation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Behavior and properties of spatio-temporal local features under visual transformations.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

The use of non-conventional methods for content analysis and understanding: panel overview.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Human-centered multimedia systems: tutorial overview.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Putting the pieces together: multimodal analysis of social attention in meetings.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Making computers look the way we look: exploiting visual attention for image understanding.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Toward an automatically generated soundtrack from low-level cross-modal correlations for automotive scenarios.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Pervasive video analysis: workshop overview.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Visual Gaze Estimation by Joint Head and Eye Information.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Challenges of Human Behavior Understanding.
Proceedings of the Human Behavior Understanding, First International Workshop, 2010

Employing social gaze and speaking activity for automatic determination of the <i>Extraversion</i> trait.
Proceedings of the 12th International Conference on Multimodal Interfaces / 7. International Workshop on Machine Learning for Multimodal Interaction, 2010

An Eye Fixation Database for Saliency Detection in Images.
Proceedings of the Computer Vision, 2010

Systematic Evaluation of Spatio-Temporal Features on Comparative Video Challenges.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Human-centered Computing.
Proceedings of the Handbook of Ambient Intelligence and Smart Environments, 2010

2009
Human-centered Computing: Application to Multimedia.
Proceedings of the Encyclopedia of Database Systems, 2009

Multimodal interfaces: Challenges and perspectives.
J. Ambient Intell. Smart Environ., 2009

Isocentric color saliency in images.
Proceedings of the International Conference on Image Processing, 2009

Webcam-Based Visual Gaze Estimation.
Proceedings of the Image Analysis and Processing, 2009

Image saliency by isocentric curvedness and color.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Lonely but attractive: Sparse color salient points for object retrieval and categorization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Exploiting facial expressions for affective video summarisation.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

2008
Machine Learning Techniques for Face Analysis.
Proceedings of the Machine Learning Techniques for Multimedia, 2008

Special section from the ACM multimedia conference 2007.
ACM Trans. Multim. Comput. Commun. Appl., 2008

Distance Learning for Similarity Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Similarity Matching in Computer Vision and Multimedia.
Comput. Vis. Image Underst., 2008

3rd international workshop on human-centered computing (HCC '08).
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Facial expression recognition as a creative interface.
Proceedings of the 13th International Conference on Intelligent User Interfaces, 2008

Emotional valence categorization using holistic image features.
Proceedings of the International Conference on Image Processing, 2008

2007
Context-Based Object-Class Recognition and Retrieval by Generalized Correlograms.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Authentic facial expression analysis.
Image Vis. Comput., 2007

The age of human computer interaction.
Image Vis. Comput., 2007

Multimodal human-computer interaction: A survey.
Comput. Vis. Image Underst., 2007

Guest Editors' Introduction: Human-Centered Computing--Toward a Human Revolution.
Computer, 2007

International workshop on human-centered multimedia: overview.
Proceedings of the International Workshop on Human-Centered Multimedia, 2007

Human-centered multimedia systems: tutorial overview.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Personalized multimedia retrieval: the new trend?
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Do Colour Interest Points Improve Image Retrieval?
Proceedings of the International Conference on Image Processing, 2007

Cooperative Object Tracking with Multiple PTZ Cameras.
Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

Human-Centered Computing: Challenges and Perspectives.
Proceedings of the 23rd International Conference on Data Engineering Workshops, 2007

Human-Computer Intelligent Interaction: A Survey.
Proceedings of the Human-Computer Interaction, 2007

Integrating Relevance Feedback in Boosting for Content-Based Image Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2007

Two-Dimensional Adaptive Discriminant Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2007

Overview of the ImageCLEF 2007 Object Retrieval Task.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

Class-Specific Binary Correlograms for Object Recognition.
Proceedings of the British Machine Vision Conference 2007, 2007

2006
Content-based multimedia information retrieval: State of the art and challenges.
ACM Trans. Multim. Comput. Commun. Appl., 2006

Boosting the distance estimation: Application to the <i>K</i>-Nearest Neighbor Classifier.
Pattern Recognit. Lett., 2006

Human-centered computing: a multimedia perspective.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

The Role of Featural and Configural Information in Face Classification A Simulation of the Expertise Hypothesis.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Emotion Recognition Based on Joint Visual and Audio Cues.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A New Study on Distance Metrics as Similarity Measurement.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Toward Robust Distance Metric Analysis for Similarity Estimation.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Evaluation of Intensity and Color Corner Detectors for Affine Invariant Salient Regions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Corner Detectors for Affine Invariant Salient Regions: Is Color Important?.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

2005
Machine Learning in Computer Vision
Computational Imaging and Vision 29, Springer, ISBN: 978-1-4020-3275-2, 2005

How to Complete Performance Graphs in Content-Based Image Retrieval: Add Generality and Normalize Scope.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Learning probabilistic classifiers for human-computer interaction applications.
Multim. Syst., 2005

Systems and architectures for multimedia information retrieval.
Multim. Syst., 2005

Affective multimodal human-computer interaction.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Video Object Boundary Reconstruction by 2-Pass Voting.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Neighborhood issue in single-frame image super-resolution.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Affective Meeting Video Analysis.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Efficient Object-Class Recognition by Boosting Contextual Information.
Proceedings of the Pattern Recognition and Image Analysis, Second Iberian Conference, 2005

Semi-Supervised Face Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2005

Fast Spatial Pattern Discovery Integrating Boosting with Constellations of Contextual Descriptors.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Semisupervised Learning of Classifiers: Theory, Algorithms, and Their Application to Human-Computer Interaction.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Towards authentic emotion recognition.
Proceedings of the IEEE International Conference on Systems, 2004

Complete Performance Graphs in Probabilistic Information Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Boosting contextual information in content-based image retrieval.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Skin Detection: A Bayesian Network Approach.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

A new analysis of the value of unlabeled data in semi-supervised learning for image retrieval.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Toward an improved error metric.
Proceedings of the 2004 International Conference on Image Processing, 2004

Authentic Emotion Detection in Real-Time Video.
Proceedings of the Computer Vision in Human-Computer Interaction, 2004

The State-of-the-Art in Human-Computer Interaction.
Proceedings of the Computer Vision in Human-Computer Interaction, 2004

Robust Error Metric Analysis for Noise Estimation in Image Indexing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

2003
Robust Computer Vision - Theory and Applications
Computational Imaging and Vision 26, Springer, ISBN: 978-94-017-0295-9, 2003

Comparing salient point detectors.
Pattern Recognit. Lett., 2003

Evaluation of salient point techniques.
Image Vis. Comput., 2003

Video retrieval and summarization.
Comput. Vis. Image Underst., 2003

Facial expression recognition from video sequences: temporal and static modeling.
Comput. Vis. Image Underst., 2003

Semi-supervised learning for facial expression recognition.
Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003

Content-based indexing performance: size normalized precision, recall, generality evaluation.
Proceedings of the 2003 International Conference on Image Processing, 2003

Learning Bayesian Network Classifiers for Facial Expression Recognition using both Labeled and Unlabeled Data.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

The State of the Art in Image and Video Retrieval.
Proceedings of the Image and Video Retrieval, Second International Conference, 2003

Evaluation of Expression Recognition Techniques.
Proceedings of the Image and Video Retrieval, Second International Conference, 2003

2002
Detecting Automobiles and People for Semantic Video Retrieval.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Emotion Recognition Using a Cauchy Naive Bayes Classifier.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

An overcomplete discrete wavelet transform for video compression.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Facial expression recognition from video sequences.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Object Recognition for Video Retrieval.
Proceedings of the Image and Video Retrieval, International Conference, 2002

Robust Shape Matching.
Proceedings of the Image and Video Retrieval, International Conference, 2002

Challenges of Image and Video Retrieval.
Proceedings of the Image and Video Retrieval, International Conference, 2002

2001
Texture Features for Content-Based Retrieval.
Proceedings of the Principles of Visual Information Retrieval, 2001

Video Indexing and Understanding.
Proceedings of the Principles of Visual Information Retrieval, 2001

Color-based retrieval.
Pattern Recognit. Lett., 2001

Image retrieval using wavelet-based salient points.
J. Electronic Imaging, 2001

Content-based image retrieval using wavelet-based salient points.
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001

Extended Performance Graphs for Cluster Retrieval.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Salient Points for Content-Based Retrieval.
Proceedings of the British Machine Vision Conference 2001, 2001

2000
Toward Improved Ranking Metrics.
IEEE Trans. Pattern Anal. Mach. Intell., 2000

Wavelet-Based Salient Points: Applications to Image Retrieval Using Color and Texture Features.
Proceedings of the Advances in Visual Information Systems, 4th International Conference, 2000

A Ground-Truth Training Set for Hierarchical Clustering in Content-Based Image Retrieval.
Proceedings of the Advances in Visual Information Systems, 4th International Conference, 2000

Wavelet Based Texture Classification.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Maximum Likelihood Stereo Matching.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Color Based Retrieval and Recognition.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Wavelet-Based Salient Points for Image Retrieval.
Proceedings of the 2000 International Conference on Image Processing, 2000

Improving Visual Matching.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Visual Websearching Using Iconic Queries.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

1999
Adapting <i>k-d</i> Trees to Visual Retrieval.
Proceedings of the Visual Information and Information Systems, 1999

Multi-scale sub-image search.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Robust color indexing.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

1998
Towards optimal ranking metrics.
Proceedings of the XI Computer Graphics, 1998

Which ranking metric is optimal? With applications in image retrieval and stereo matching.
Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998

1997
Generation of the HDL-A-model of a micromembrane from its finite-element-description.
Proceedings of the European Design and Test Conference, 1997


  Loading...