2025
MadCLIP: Few-shot Medical Anomaly Detection with CLIP.
CoRR, June, 2025
Toward Modeling Commensal Interactions in Human Dyads.
Proceedings of the Companion Publication of the 2025 ACM Designing Interactive Systems Conference, 2025
2024
Co-Located Human-Human Interaction Analysis Using Nonverbal Cues: A Survey.
ACM Comput. Surv., May, 2024
Trajectory-based fish event classification through pre-training with diffusion models.
Ecol. Informatics, 2024
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection.
CoRR, 2024
Socially Pertinent Robots in Gerontological Healthcare.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Anticipating Next Active Objects for Egocentric Videos.
IEEE Access, 2024
Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Exploring Fine-Grained Retail Product Discrimination with Zero-Shot Object Classification Using Vision-Language Models.
Proceedings of the 8th IEEE Forum on Research and Technologies for Society and Industry Innovation, 2024
AL-GTD: Deep Active Learning for Gaze Target Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Automatic Recognition of Commensal Activities in Co-located and Online settings.
Proceedings of the Companion Proceedings of the 26th International Conference on Multimodal Interaction, 2024
Upper-Body Pose-Based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024
Diffusion-Based Unsupervised Pre-training for Automated Recognition of Vitality Forms.
Proceedings of the 2024 International Conference on Advanced Visual Interfaces, 2024
2023
Affect Recognition in Hand-Object Interaction Using Object-Sensed Tactile and Kinematic Data.
IEEE Trans. Haptics, 2023
Modeling Multiple Temporal Scales of Full-Body Movements for Emotion Classification.
IEEE Trans. Affect. Comput., 2023
SKELTER: unsupervised skeleton action denoising and recognition using transformers.
Frontiers Comput. Sci., 2023
Guided Attention for Next Active Object @ EGO4D STA Challenge.
CoRR, 2023
Unleashing the Transferability Power of Unsupervised Pre-Training for Emotion Recognition in Masked and Unmasked Facial Images.
IEEE Access, 2023
Exploring Diffusion Models for Unsupervised Video Anomaly Detection.
Proceedings of the IEEE International Conference on Image Processing, 2023
Enhancing Next Active Object-Based Egocentric Action Anticipation with Guided Attention.
Proceedings of the IEEE International Conference on Image Processing, 2023
Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023
Object-aware Gaze Target Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
2022
Face-to-Face Co-Located Human-Human Social Interaction Analysis using Nonverbal Cues: A Survey.
CoRR, 2022
Graph Laplacian-Improved Convolutional Residual Autoencoder for Unsupervised Human Action and Emotion Recognition.
IEEE Access, 2022
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Multimodal Across Domains Gaze Target Detection.
Proceedings of the International Conference on Multimodal Interaction, 2022
2021
RealVAD: A Real-World Dataset and A Method for Voice Activity Detection by Body Motion Analysis.
IEEE Trans. Multim., 2021
Personality Traits Classification Using Deep Visual Activity-Based Nonverbal Features of Key-Dynamic Images.
IEEE Trans. Affect. Comput., 2021
Estimating Presentation Competence using Multimodal Nonverbal Behavioral Cues.
CoRR, 2021
S-VVAD: Visual Voice Activity Detection by Motion Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Predicting Gaze from Egocentric Social Interaction Videos and IMU Data.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
2020
RealVAD: A Real-world Dataset for Voice Activity Detection.
Dataset, July, 2020
Editorial: Computational Approaches for Human-Human and Human-Robot Social Interactions.
Frontiers Robotics AI, 2020
Subspace Clustering for Action Recognition with Covariance Representations and Temporal Pruning.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Analysis of Face-Touching Behavior in Large Scale Social Interaction Dataset.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
2019
A Sequential Data Analysis Approach to Detect Emergent Leaders in Small Groups.
IEEE Trans. Multim., 2019
Comparisons of Visual Activity Primitives for Voice Activity Detection.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019
Voice Activity Detection by Upper Body Motion Analysis and Unsupervised Domain Adaptation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
2018
Prediction of the Leadership Style of an Emergent Leader Using Audio and Visual Nonverbal Features.
IEEE Trans. Multim., 2018
Extracting statistically significant behaviour from fish tracking data with and without large dataset cleaning.
IET Comput. Vis., 2018
Investigation of Small Group Social Interactions Using Deep Visual Activity-Based Nonverbal Features.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
A Multi-View Learning Approach to Deception Detection.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018
Filling the Gaps: Predicting Missing Joints of Human Poses Using Denoising Autoencoders.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
2017
Moving as a Leader: Detecting Emergent Leadership in Small Groups using Body Pose.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Multi-task learning of social psychology assessments and nonverbal features for automatic leadership identification.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017
2016
Proceedings of the Fish4Knowledge: Collecting and Analyzing Massive Coral Reef Fish Video Data, 2016
Detecting emergent leader in a meeting environment using nonverbal visual features only.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Identification of emergent leaders in a meeting scenario using multiple kernel learning.
Proceedings of the 2nd Workshop on Advancements in Social Signal Processing for Multimodal Interaction, 2016
2015
Classifying imbalanced data sets using similarity based hierarchical decomposition.
Pattern Recognit., 2015
Applying semi-synchronised task farming to large-scale computer vision problems.
Int. J. High Perform. Comput. Appl., 2015
2014
A rule-based event detection system for real-life underwater domain.
Mach. Vis. Appl., 2014
A research tool for long-term and continuous analysis of fish assemblage in coral-reefs using underwater camera footage.
Ecol. Informatics, 2014
2013
Detecting abnormal fish trajectories using clustered and labeled data.
Proceedings of the IEEE International Conference on Image Processing, 2013
Detection of Abnormal Fish Trajectories Using a Clustering Based Hierarchical Classifier.
Proceedings of the British Machine Vision Conference, 2013
2012
Event detection in underwater domain by exploiting fish trajectory clustering.
Proceedings of the 1st ACM international workshop on Multimedia analysis for ecological data, 2012
A filtering mechanism for normal fish trajectories.
Proceedings of the 21st International Conference on Pattern Recognition, 2012
2011
Fusion of thermal- and visible-band video for abandoned object detection.
J. Electronic Imaging, 2011