Athanasios Katsamanis

Konstantinos I. Diamantaras

Proceedings of the Intelligent Systems and Applications, 2022

Audio and ASR-based Filled Pause Detection.

[BibT_eX]

[DOI]

Aggelina Chatziagapi

Dimitris Sgouropoulos

Constantinos Karouzos

Thomas Melistas

Theodoros Giannakopoulos

Shrikanth Narayanan

Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022

2021

EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments.

[BibT_eX]

[DOI]

Emmanouil Zaranis

Georgios Paraskevopoulos

CoRR, 2021

AudioVisual Speech Synthesis: A brief literature review.

[BibT_eX]

[DOI]

Efthymios Georgiou

CoRR, 2021

2019

A behaviorally inspired fusion approach for computational audiovisual saliency modeling.

[BibT_eX]

[DOI]

Argiro Vatakis

Signal Process. Image Commun., 2019

Data Augmentation Using GANs for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Aggelina Chatziagapi

Georgios Paraskevopoulos

Dimitris Sgouropoulos

Georgios Pantazopoulos

Malvina Nikandrou

Theodoros Giannakopoulos

Shrikanth Narayanan

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Using Oliver API for emotion-aware movie content characterization.

[BibT_eX]

[DOI]

Theodoros Giannakopoulos

Spiros Dimopoulos

Georgios Pantazopoulos

Aggelina Chatziagapi

Dimitris Sgouropoulos

Proceedings of the 2019 International Conference on Content-Based Multimedia Indexing, 2019

2018

Multi-View Audio-Articulatory Features for Phonetic Recognition on RTMRI-TIMIT Database.

[BibT_eX]

[DOI]

Ioannis K. Douros

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Multiple Instance Learning for Behavioral Coding.

[BibT_eX]

[DOI]

James Gibson

Francisco Romero

Panagiotis Paraskevas Filntisis

IEEE Trans. Affect. Comput., 2017

Video-realistic expressive audio-visual speech synthesis for the Greek language.

[BibT_eX]

[DOI]

Pirros Tsiakoulis

Speech Commun., 2017

Room-localized spoken command recognition in multi-room, multi-microphone environments.

[BibT_eX]

[DOI]

Panagiotis Paraskevas Filntisis

Comput. Speech Lang., 2017

Demonstration of an HMM-based photorealistic expressive audio-visual speech synthesis system.

[BibT_eX]

[DOI]

Panagiotis Paraskevas Filntisis

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Photorealistic adaptation and interpolation of facial expressions using HMMS and AAMS for audio-visual speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Multimodal Gesture Recognition via Multiple Hypotheses Rescoring.

[BibT_eX]

[DOI]

Proceedings of the Gesture Recognition, 2017

Multimodal gesture recognition.

[BibT_eX]

[DOI]

Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2017

2016

A Phase-Based Time-Frequency Masking for Multi-Channel Speech Enhancement in Domestic Environments.

[BibT_eX]

[DOI]

Alessio Brutti

Georgia Panagiotaropoulou

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

FMRI-based perceptual validation of a computational model for visual and auditory saliency in videos.

[BibT_eX]

[DOI]

Athanasia Zlatintsi

Athanassios Protopapas

Efstratios Karavasilis

Nikolaos Smyrnis

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Towards a behaviorally-validated computational audiovisual saliency model.

[BibT_eX]

[DOI]

Argiro Vatakis

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multimodal human action recognition in assistive human-robot interaction.

[BibT_eX]

[DOI]

Nikolaos Kardaris

E. Mavroudi

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improved Dictionary Selection and Detection Schemes in Sparse-CNMF-Based Overlapping Acoustic Event Detection.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

On Shape Recognition and Language.

[BibT_eX]

[DOI]

Georgios Pavlakos

Proceedings of the Perspectives in Shape Analysis, 2016

2015

Multimodal gesture recognition via multiple hypotheses rescoring.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2015

Predicting audio-visual salient events based on visual, audio and text modalities for movie summarization.

[BibT_eX]

[DOI]

Athanasia Zlatintsi

Elias Iosif

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Multi-room speech activity detection using a distributed microphone network in domestic environments.

[BibT_eX]

[DOI]

Alessio Brutti

Marco Matassoni

Alberto Abad

Miguel Matos

Proceedings of the 23rd European Signal Processing Conference, 2015

Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).

[BibT_eX]

[DOI]

Martin Wöllmer

Florian Eyben

Björn W. Schuller

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014

Computing vocal entrainment: A signal-derived PCA-based quantification scheme with application to affect analysis in married couple interactions.

[BibT_eX]

[DOI]

Andrew Christensen

Comput. Speech Lang., 2014

ATHENA: a Greek multi-sensory database for home automation control uthor: isidoros rodomagoulakis (NTUA, Greece).

[BibT_eX]

[DOI]

Ramón Fernandez Astudillo

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

The DIRHA-GRID corpus: baseline and tools for multi-room distant speech recognition using distributed microphones.

[BibT_eX]

[DOI]

Marco Matassoni

Mirco Ravanelli

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Kinect-based multimodal gesture recognition using a two-pass fusion scheme.

[BibT_eX]

[DOI]

Georgios Pavlakos

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Robust far-field spoken command recognition for home automation combining adaptation and multichannel processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home.

[BibT_eX]

[DOI]

Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Predicting Eyes' Fixations in Movie Videos: Visual Saliency Experiments on a New Eye-Tracking Database.

[BibT_eX]

[DOI]

Proceedings of the Engineering Psychology and Cognitive Ergonomics, 2014

Experiments in acoustic source localization using sparse arrays in adverse indoors environments.

[BibT_eX]

[DOI]

Proceedings of the 22nd European Signal Processing Conference, 2014

Multi-microphone fusion for detection of speech and acoustic events in smart spaces.

[BibT_eX]

[DOI]

Proceedings of the 22nd European Signal Processing Conference, 2014

2013

Toward automating a human behavioral coding system for married couples' interactions using speech acoustic features.

[BibT_eX]

[DOI]

Speech Commun., 2013

Tracking continuous emotional trends of participants during affective dyadic interactions using body language and speech information.

[BibT_eX]

[DOI]

Image Vis. Comput., 2013

Multi-band long-term signal variability features for robust voice activity detection.

[BibT_eX]

[DOI]

Maarten Van Segbroeck

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012

Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification.

[BibT_eX]

[DOI]

Martin Wöllmer

Florian Eyben

Björn W. Schuller

IEEE Trans. Affect. Comput., 2012

The Twins Corpus of Museum Visitor Questions.

[BibT_eX]

[DOI]

Priti Aggarwal

Ron Artstein

Jillian Gerten

Angela Nazarian

David R. Traum

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Ada and Grace: Direct Interaction with Museum Visitors.

[BibT_eX]

[DOI]

Anton Leuski

Dan Noren

William R. Swartout

Proceedings of the Intelligent Virtual Agents - 12th International Conference, 2012

Based on Isolated Saliency or Causal Integration? Toward a Better Understanding of Human Annotation Process using Multiple Instance Learning and Sequential Probability Ratio Test.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A hierarchical framework for modeling multimodality and emotional evolution in affective dialogs.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

An acoustic analysis of shared enjoyment in ECA interactions of children with autism.

[BibT_eX]

[DOI]

Theodora Chaspari

Emily Mower Provost

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Using measures of vocal entrainment to inform outcome-related behaviors in marital conflicts.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Acoustic and Visual Cues of Turn-Taking Dynamics in Dyadic Interactions.

[BibT_eX]

[DOI]

Viktor Rozgic

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Automatic Data-Driven Learning of Articulatory Primitives from Real-Time MRI Data Using Convolutive NMF with Sparseness Constraints.

[BibT_eX]

[DOI]

Vikram Ramanarayanan

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences.

[BibT_eX]

[DOI]

Michael I. Proctor

Adam C. Lammert

Louis M. Goldstein

Christina Hagedorn

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Multimodal Real-Time MRI Articulatory Corpus for Speech Research.

[BibT_eX]

[DOI]

Erik Bresch

Prasanta Kumar Ghosh

Louis Goldstein

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Analysis of PCA-Based Vocal Entrainment Measures in Married Couples' Affective Spoken Interactions.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Morphological Variation in the Adult Vocal Tract: A Modeling Study of its Potential Acoustic Impact.

[BibT_eX]

[DOI]

Adam C. Lammert

Michael I. Proctor

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Validating rt-MRI Based Articulatory Representations via Articulatory Recognition.

[BibT_eX]

[DOI]

Erik Bresch

Vikram Ramanarayanan

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Automatic Identification of Salient Acoustic Instances in Couples' Behavioral Interactions Using Diverse Density Support Vector Machines.

[BibT_eX]

[DOI]

James Gibson

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

"You made me do it": Classification of Blame in Married Couples' Interactions by Fusing Automatically Derived Speech and Language Information.

[BibT_eX]

[DOI]

Matthew Black

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Estimation of ordinal approach-avoidance labels in dyadic interactions: Ordinal logistic regression approach.

[BibT_eX]

[DOI]

Viktor Rozgic

Proceedings of the IEEE International Conference on Acoustics, 2011

Tracking changes in continuous emotion states using body language and prosodic cues.

[BibT_eX]

[DOI]

Yun Wang

Proceedings of the IEEE International Conference on Acoustics, 2011

Affective State Recognition in Married Couples' Interactions Using PCA-Based Vocal Entrainment Measures with Multiple Instance Learning.

[BibT_eX]

[DOI]

Proceedings of the Affective Computing and Intelligent Interaction, 2011

Multiple Instance Learning for Classification of Human Behavior Observations.

[BibT_eX]

[DOI]

James Gibson

Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010

A new multichannel multi modal dyadic interaction database.

[BibT_eX]

[DOI]

Viktor Rozgic

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Rapid semi-automatic segmentation of real-time magnetic resonance images for parametric vocal tract analysis.

[BibT_eX]

[DOI]

Michael I. Proctor

Daniel Bone

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Quantification of prosodic entrainment in affective spontaneous spoken interactions of married couples.

[BibT_eX]

[DOI]

Matthew Black

Adam C. Lammert

Andrew Christensen

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Statistical multi-stream modeling of real-time MRI articulatory speech data.

[BibT_eX]

[DOI]

Erik Bresch

Louis Goldstein

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Automatic classification of married couples' behavior using audio features.

[BibT_eX]

[DOI]

Matthew Black

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Face Active Appearance Modeling and Speech Acoustic Information to Recover Articulation.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Tongue tracking in Ultrasound images with Active Appearance Models.

[BibT_eX]

[DOI]

Anastasios Roussos

Proceedings of the International Conference on Image Processing, 2009

Product-HMMs for automatic sign language recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Multisensor multiband cross-energy tracking for feature extraction and recognition.

[BibT_eX]

[DOI]

Stamatios Lefkimmiatis

Proceedings of the IEEE International Conference on Acoustics, 2008

Audiovisual-to-articulatory speech inversion using Active Appearance Models for the face and Hidden Markov Models for the dynamics.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Audiovisual speech inversion by switching dynamical modeling governed by a Hidden Markov process.

[BibT_eX]

[DOI]

Gopal Ananthakrishnan

Olov Engwall

Proceedings of the 2008 16th European Signal Processing Conference, 2008

Adaptive Multimodal Fusion by Uncertainty Compensation with Application to Audio-Visual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

Cross-Modal Integration for Performance Improving in Multimedia: A Review.

[BibT_eX]

[DOI]

Patrick Gros

Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

2007

Multimodal Fusion and Learning with Uncertain Features Applied to Audiovisual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Audiovisual-to-Articulatory Speech Inversion Using HMMs.

[BibT_eX]

[DOI]

Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

2006

Adaptive multimodal fusion by uncertainty compensation.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Multimodal fusion by adaptive compensation for feature uncertainty with application to audiovisual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 14th European Signal Processing Conference, 2006

2005

Advances in statistical estimation and tracking of AM-FM speech components.

[BibT_eX]

[DOI]