Tanaya Guha

Orcid: 0000-0003-2167-4891

According to our database1, Tanaya Guha authored at least 71 papers between 2010 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Self-supervised random mask attention GAN in tackling pose-invariant face recognition.
Pattern Recognit., 2025

2024
On the effects of obfuscating speaker attributes in privacy-aware depression detection.
Pattern Recognit. Lett., 2024

Active Listener: Continuous Generation of Listener's Head Motion Response in Dyadic Interactions.
CoRR, 2024

WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting.
CoRR, 2024

CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification.
CoRR, 2024

NAPE: Numbering as a Position Encoding in Graphs.
IEEE Access, 2024

Assessing Privacy Risks of Attribute Inference Attacks Against Speech-Based Depression Detection System.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

2023
Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics.
CoRR, 2023

Privacy Risks in Speech Emotion Recognition: A Systematic Study on Gender Inference Attack.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Explainable Depression Detection via Head Motion Patterns.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Heterogeneous Graph Learning for Acoustic Event Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Dynamic Emotion Modeling With Learnable Graphs and Graph Inception Network.
IEEE Trans. Multim., 2022

Multi-Camera Trajectory Forecasting With Trajectory Tensors.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Self-Supervised Graphs for Audio Representation Learning With Limited Labeled Data.
IEEE J. Sel. Top. Signal Process., 2022

Real-Time Driver Monitoring Systems through Modality and View Analysis.
CoRR, 2022

Visually-aware Acoustic Event Detection using Heterogeneous Graphs.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Fusioncount: Efficient Crowd Counting Via Multiscale Feature Fusion.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Self-Supervised Frontalization and Rotation Gan with Random Swap for Pose-Invariant Face Recognition.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Graph-based Transform based on 3D Convolutional Neural Network for Intra-Prediction of Imaging Data.
Proceedings of the Data Compression Conference, 2022

2021
Computational Media Intelligence: Human-Centered Machine Analysis of Media.
Proc. IEEE, 2021

Learning Spatial-Temporal Graphs for Active Speaker Detection.
CoRR, 2021

SG2Caps: Revisiting Scene Graphs for Image Captioning.
CoRR, 2021

Graph-Based Transform Based on Neural Networks for Intra-Prediction of Imaging Data.
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Head Matters: Explainable Human-centered Trait Prediction from Head Motion Dynamics.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

In Defense of Scene Graphs for Image Captioning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Compact Graph Architecture for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Towards Autism Screening through Emotion-guided Eye Gaze Response.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

Graph Based Transforms based on Graph Neural Networks for Predictive Transform Coding.
Proceedings of the 31st Data Compression Conference, 2021

2020
Dynamic character graph via online face clustering for movie analysis.
Multim. Tools Appl., 2020

Learnable Graph Inception Network for Emotion Recognition.
CoRR, 2020

Multiple Object Forecasting: Predicting Future Object Locations in Diverse Environments.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zero-shot Classification and Retrieval of Videos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Attention Selective Network For Face Synthesis And Pose-Invariant Face Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2020

Ensemble Network For Ranking Images Based On Visual Appeal.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Variational Recurrent Sequence-to-Sequence Retrieval for Stepwise Illustration.
Proceedings of the Advances in Information Retrieval, 2020

Multi-Camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos.
CoRR, 2019

Learning Affective Correspondence between Music and Image.
Proceedings of the IEEE International Conference on Acoustics, 2019

Computational Analysis of Gaze Behavior in Autism During Interaction with Virtual Agents.
Proceedings of the IEEE International Conference on Acoustics, 2019

Graph-Based Transform with Weighted Self-Loops for Predictive Transform Coding Based on Template Matching.
Proceedings of the Data Compression Conference, 2019

2018
Unsupervised Discovery of Character Dictionaries in Animation Movies.
IEEE Trans. Multim., 2018

A Computational Study of Expressive Facial Dynamics in Children with Autism.
IEEE Trans. Affect. Comput., 2018

Multichannel Attention Network for Analyzing Visual Behavior in Public Speaking.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Learning Spontaneity to Improve Emotion Recognition in Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

An Online Algorithm for Constrained Face Clustering in Videos.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

A Dynamic Latent Variable Model for Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Music Tempo Estimation Using Sub-Band Synchrony.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

On the role of head motion in affective expression.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Novel affective features for multiscale prediction of emotion in music.
Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

Blind image quality assessment using subspace alignment.
Proceedings of the Tenth Indian Conference on Computer Vision, 2016

A trajectory clustering approach to crowd flow segmentation in videos.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Opening big in box office? Trailer content can help.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A multimodal mixture-of-experts model for dynamic emotion prediction in movies.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Gender Representation in Cinematic Content: A Multimodal Approach.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

On quantifying facial expression-related atypicality of children with Autism Spectrum Disorder.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Computationally deconstructing movie narratives: An informatics approach.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Image Similarity Using Sparse Representation and Compression Distance.
IEEE Trans. Multim., 2014

Sparse representation-based image quality assessment.
Signal Process. Image Commun., 2014

Multimodal Prediction of Affective Dimensions and Depression in Human-Computer Interactions.
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

Affective Feature Design and Predicting Continuous Affective Dimensions from Music.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Learning sparse models for image quality assessment.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Image similarity measurement from sparse reconstruction errors.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Learning Sparse Representations for Human Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

On Image Similarity, Sparse Representation and Kolmogorov Complexity
CoRR, 2012

A sparse reconstruction based algorithm for image and video classification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Action recognition by learnt class-specific overcomplete dictionaries.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

2010
Differential Radon Transform for gait recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010


  Loading...