Tanaya Guha

Pattern Recognit., 2025

2024

On the effects of obfuscating speaker attributes in privacy-aware depression detection.

[BibT_eX]

[DOI]

Nujud Aloshban

Anna Esposito

Alessandro Vinciarelli

Pattern Recognit. Lett., 2024

Active Listener: Continuous Generation of Listener's Head Motion Response in Dyadic Interactions.

[BibT_eX]

[DOI]

Bishal Ghosh

Emma Li

CoRR, 2024

WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting.

[BibT_eX]

[DOI]

Olly Styles

Sam Miller

Patricio Cerda-Mardini

Bertie Vidgen

CoRR, 2024

CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification.

[BibT_eX]

[DOI]

Yiming Ma

CoRR, 2024

NAPE: Numbering as a Position Encoding in Graphs.

[BibT_eX]

[DOI]

Olayinka Ajayi

Hongkai Wen

IEEE Access, 2024

Detecting in-car VR Motion Sickness from Lower Face Action Units.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2024

Assessing Privacy Risks of Attribute Inference Attacks Against Speech-Based Depression Detection System.

[BibT_eX]

[DOI]

Basmah Alsenani

Anna Esposito

Alessandro Vinciarelli

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

2023

Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics.

[BibT_eX]

[DOI]

Ramanathan Subramanian

CoRR, 2023

Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack.

[BibT_eX]

[DOI]

Basmah Alsenani

Alessandro Vinciarelli

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Explainable Depression Detection via Head Motion Patterns.

[BibT_eX]

[DOI]

Monika Gahalawat

Raul Fernandez Rojas

Ramanathan Subramanian

Roland Goecke

Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Heterogeneous Graph Learning for Acoustic Event Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Dynamic Emotion Modeling With Learnable Graphs and Graph Inception Network.

[BibT_eX]

[DOI]

Subarna Tripathi

IEEE Trans. Multim., 2022

Multi-Camera Trajectory Forecasting With Trajectory Tensors.

[BibT_eX]

[DOI]

Olly Styles

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Self-Supervised Graphs for Audio Representation Learning With Limited Labeled Data.

[BibT_eX]

[DOI]

Krishna Somandepalli

IEEE J. Sel. Top. Signal Process., 2022

Real-Time Driver Monitoring Systems through Modality and View Analysis.

[BibT_eX]

[DOI]

CoRR, 2022

Visually-aware Acoustic Event Detection using Heterogeneous Graphs.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Fusioncount: Efficient Crowd Counting Via Multiscale Feature Fusion.

[BibT_eX]

[DOI]

Yiming Ma

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Self-Supervised Frontalization and Rotation Gan with Random Swap for Pose-Invariant Face Recognition.

[BibT_eX]

[DOI]

Jiashu Liao

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data.

[BibT_eX]

[DOI]

Proceedings of the Data Compression Conference, 2022

2021

Computational Media Intelligence: Human-Centered Machine Analysis of Media.

[BibT_eX]

[DOI]

Proc. IEEE, 2021

Learning Spatial-Temporal Graphs for Active Speaker Detection.

[BibT_eX]

[DOI]

CoRR, 2021

SG2Caps: Revisiting Scene Graphs for Image Captioning.

[BibT_eX]

[DOI]

CoRR, 2021

Graph-Based Transform Based on Neural Networks for Intra-Prediction of Imaging Data.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Head Matters: Explainable Human-centered Trait Prediction from Head Motion Dynamics.

[BibT_eX]

[DOI]

Surbhi Madan

Monika Gahalawat

Ramanathan Subramanian

Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

In Defense of Scene Graphs for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Compact Graph Architecture for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Towards Autism Screening through Emotion-guided Eye Gaze Response.

[BibT_eX]

[DOI]

Surjya Ghosh

Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

Graph Based Transforms based on Graph Neural Networks for Predictive Transform Coding.

[BibT_eX]

[DOI]

Proceedings of the 31st Data Compression Conference, 2021

2020

Dynamic character graph via online face clustering for movie analysis.

[BibT_eX]

[DOI]

Prakhar Kulshreshtha

Multim. Tools Appl., 2020

Learnable Graph Inception Network for Emotion Recognition.

[BibT_eX]

[DOI]

Subarna Tripathi

CoRR, 2020

Multiple Object Forecasting: Predicting Future Object Locations in Diverse Environments.

[BibT_eX]

[DOI]

Olly Styles

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zero-shot Classification and Retrieval of Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Attention Selective Network For Face Synthesis And Pose-Invariant Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2020

Ensemble Network For Ranking Images Based On Visual Appeal.

[BibT_eX]

[DOI]

Sachin Singh

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Variational Recurrent Sequence-to-Sequence Retrieval for Stepwise Illustration.

[BibT_eX]

[DOI]

Vishwash Batra

Aparajita Haldar

Yulan He

Hakan Ferhatosmanoglu

George Vogiatzis

Proceedings of the Advances in Information Retrieval, 2020

Multi-Camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing.

[BibT_eX]

[DOI]

Anurendra Kumar

Prasanta Kumar Ghosh

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos.

[BibT_eX]

[DOI]

CoRR, 2019

Learning Affective Correspondence between Music and Image.

[BibT_eX]

[DOI]

Gaurav Verma

Eeshan Gunesh Dhekane

Proceedings of the IEEE International Conference on Acoustics, 2019

Computational Analysis of Gaze Behavior in Autism During Interaction with Virtual Agents.

[BibT_eX]

[DOI]

Zeeshan Akhtar

Proceedings of the IEEE International Conference on Acoustics, 2019

Graph-Based Transform with Weighted Self-Loops for Predictive Transform Coding Based on Template Matching.

[BibT_eX]

[DOI]

Proceedings of the Data Compression Conference, 2019

2018

Unsupervised Discovery of Character Dictionaries in Animation Movies.

[BibT_eX]

[DOI]

Krishna Somandepalli

IEEE Trans. Multim., 2018

A Computational Study of Expressive Facial Dynamics in Children with Autism.

[BibT_eX]

[DOI]

Zhaojun Yang

Ruth B. Grossman

IEEE Trans. Affect. Comput., 2018

Multichannel Attention Network for Analyzing Visual Behavior in Public Speaking.

[BibT_eX]

[DOI]

Rahul Sharma

Gaurav Sharma

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Learning Spontaneity to Improve Emotion Recognition in Speech.

[BibT_eX]

[DOI]

Karttikeya Mangalam

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

An Online Algorithm for Constrained Face Clustering in Videos.

[BibT_eX]

[DOI]

Prakhar Kulshreshtha

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A Dynamic Latent Variable Model for Source Separation.

[BibT_eX]

[DOI]

Anurendra Kumar

Prasanta Ghosh

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Music Tempo Estimation Using Sub-Band Synchrony.

[BibT_eX]

[DOI]

Shreyan Chowdhury

Rajesh M. Hegde

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

On the role of head motion in affective expression.

[BibT_eX]

[DOI]

Atanu Samanta

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Novel affective features for multiscale prediction of emotion in music.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

Blind image quality assessment using subspace alignment.

[BibT_eX]

[DOI]

Indra Kiran

Gaurav Pandey

Proceedings of the Tenth Indian Conference on Computer Vision, 2016

A trajectory clustering approach to crowd flow segmentation in videos.

[BibT_eX]

[DOI]

Rahul Sharma

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Opening big in box office? Trailer content can help.

[BibT_eX]

[DOI]

Adarsh Tadimari

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A multimodal mixture-of-experts model for dynamic emotion prediction in movies.

[BibT_eX]

[DOI]

Ankit Goyal

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Gender Representation in Cinematic Content: A Multimodal Approach.

[BibT_eX]

[DOI]

Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

On quantifying facial expression-related atypicality of children with Autism Spectrum Disorder.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Computationally deconstructing movie narratives: An informatics approach.

[BibT_eX]

[DOI]

Stacy L. Smith

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Image Similarity Using Sparse Representation and Compression Distance.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Sparse representation-based image quality assessment.

[BibT_eX]

[DOI]

Ehsan Nezhadarya

Signal Process. Image Commun., 2014

Multimodal Prediction of Affective Dimensions and Depression in Human-Computer Interactions.

[BibT_eX]

[DOI]

Maarten Van Segbroeck

Matthew Black

Alexandros Potamianos

Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

Affective Feature Design and Predicting Continuous Affective Dimensions from Music.

[BibT_eX]

[DOI]

Maarten Van Segbroeck

Jangwon Kim

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Learning sparse models for image quality assessment.

[BibT_eX]

[DOI]

Ehsan Nezhadarya

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Image similarity measurement from sparse reconstruction errors.

[BibT_eX]

[DOI]

Tyseer Aboulnasr

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Learning Sparse Representations for Human Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2012

On Image Similarity, Sparse Representation and Kolmogorov Complexity

[BibT_eX]

[DOI]

CoRR, 2012

A sparse reconstruction based algorithm for image and video classification.

[BibT_eX]

[DOI]

Rabab Kreidieh Ward

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Action recognition by learnt class-specific overcomplete dictionaries.

[BibT_eX]

[DOI]

Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

2010

Differential Radon Transform for gait recognition.

[BibT_eX]

[DOI]