Carlos Busso

Orcid: 0000-0002-4075-4072

Affiliations:
  • University of Texas at Dallas


According to our database, Carlos Busso authored at least 202 papers between 2004 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Speech emotion recognition in real static and dynamic human-robot interaction scenarios.
Comput. Speech Lang., 2025

2024
Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Analyzing Continuous-Time and Sentence-Level Annotations for Speech Emotion Recognition.
IEEE Trans. Affect. Comput., 2024

Deep temporal clustering features for speech emotion recognition.
Speech Commun., 2024

Describe Where You Are: Improving Noise-Robustness for Speech Emotion Recognition with Text Description of the Environment.
CoRR, 2024

A Layer-Anchoring Strategy for Enhancing Cross-Lingual Speech Emotion Recognition.
CoRR, 2024

We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings.
CoRR, 2024

emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition.
CoRR, 2024

emoDARTS: Joint Optimization of CNN and Sequential Neural Network Architectures for Superior Speech Emotion Recognition.
IEEE Access, 2024

Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Toward Robust and Discriminative Emotional Speech Representations.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Driver Head Pose Estimation with Multimodal Temporal Fusion of Color and Depth Modeling Networks.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Generalization of Self-Supervised Learning-Based Representations for Cross-Domain Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Dynamic Speech Emotion Recognition Using A Conditional Neural Process.
Proceedings of the IEEE International Conference on Acoustics, 2024

Enhanced Facial Landmarks Detection for Patients with Repaired Cleft Lip and Palate.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

2023
Multimodal attention for lip synthesis using conditional generative adversarial networks.
Speech Commun., September, 2023

Unsupervised Scalable Multimodal Driving Anomaly Detection.
IEEE Trans. Intell. Veh., April, 2023

Estimation of Driver's Gaze Region From Head Position and Orientation Using Probabilistic Confidence Regions.
IEEE Trans. Intell. Veh., January, 2023

Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis.
IEEE J. Biomed. Health Informatics, 2023

Sequential Modeling by Leveraging Non-Uniform Distribution of Speech Emotion.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Chunk-Level Speech Emotion Recognition: A General Framework of Sequence-to-One Dynamic Temporal Modeling.
IEEE Trans. Affect. Comput., 2023

Quantifying Emotional Similarity in Speech.
IEEE Trans. Affect. Comput., 2023

Versatile Audio-Visual Learning for Handling Single and Multi Modalities in Emotion Regression and Classification Tasks.
CoRR, 2023

Example-Based Query To Identify Causes of Driving Anomaly with Few Labeled Samples.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Seatbelt Segmentation Using Synthetic Images.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

MSP-DISK: Naturalistic and Diverse In-Vehicle Database for Joint Pose and Seat Belt Detection.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

Preference Learning Labels by Anchoring on Consecutive Annotations.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Computation and Memory Efficient Noise Adaptation of Wav2Vec2.0 for Noisy Speech Emotion Recognition with Skip Connection Adapters.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Distant Speech Emotion Recognition in an Indoor Human-robot Interaction Scenario.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

The Importance of Calibration: Rethinking Confidence and Performance of Speech Multi-label Emotion Classifiers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhancing Resilience to Missing Data in Audio-Text Emotion Recognition with Multi-Scale Chunk Regularization.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Phonetic Anchor-Based Transfer Learning to Facilitate Unsupervised Cross-Lingual Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Unsupervised Domain Adaptation for Preference Learning Based Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Role of Lexical Boundary Information in Chunk-Level Segmentation for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Adapting a Self-Supervised Speech Representation for Noisy Speech Emotion Recognition by Using Contrastive Teacher-Student Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning Cross-Modal Audiovisual Representations with Ladder Networks for Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Combining Relative and Absolute Learning Formulations to Predict Emotional Attributes From Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

An Intelligent Infrastructure Toward Large Scale Naturalistic Affective Speech Corpora Collection.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, 2023

Analyzing the Effect of Affective Priming on Emotional Annotations.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, 2023

2022
The Multimodal Driver Monitoring Database: A Naturalistic Corpus to Study Driver Attention.
IEEE Trans. Intell. Transp. Syst., 2022

Temporal Head Pose Estimation From Point Cloud in Naturalistic Driving Conditions.
IEEE Trans. Intell. Transp. Syst., 2022

Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech.
IEEE Trans. Affect. Comput., 2022

Robust Audiovisual Emotion Recognition: Aligning Modalities, Capturing Temporal Information, and Handling Missing Features.
IEEE Trans. Affect. Comput., 2022

Mixed Emotion Modelling for Emotional Voice Conversion.
CoRR, 2022

Driving Anomaly Detection Using Conditional Generative Adversarial Network.
CoRR, 2022

Driving Anomaly Detection Using Contrastive Multiview Coding to Interpret Cause of Anomaly.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Improving Speech Emotion Recognition Using Self-Supervised Learning with Domain-Specific Audiovisual Tasks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Exploiting Co-occurrence Frequency of Emotions in Perceptual Evaluations To Train A Speech Emotion Classifier.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Privacy Preserving Personalization for Video Facial Expression Recognition Using Federated Learning.
Proceedings of the International Conference on Multimodal Interaction, 2022

Incorporating Gaze Behavior Using Joint Embedding With Scene Context for Driver Takeover Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Not All Features are Equal: Selection of Robust Features for Speech Emotion Recognition in Noisy Environments.
Proceedings of the IEEE International Conference on Acoustics, 2022

AuxFormer: Robust Approach to Audiovisual Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Exploiting Annotators' Typed Description of Emotion Perception to Maximize Utilization of Ratings for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Monologue versus Conversation: Differences in Emotion Perception and Acoustic Expressivity.
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022

2021
End-to-End Audiovisual Speech Recognition System With Multitask Learning.
IEEE Trans. Multim., 2021

Guided Generative Adversarial Neural Network for Representation Learning and Audio Generation Using Fewer Labelled Audio Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

The Ordinal Nature of Emotions: An Emerging Approach.
IEEE Trans. Affect. Comput., 2021

Speech-Driven Expressive Talking Lips with Conditional Sequential Generative Adversarial Networks.
IEEE Trans. Affect. Comput., 2021

Predicting Emotionally Salient Regions Using Qualitative Agreement of Deep Neural Network Regressors.
IEEE Trans. Affect. Comput., 2021

Over-Sampling Emotional Speech Data Based on Subjective Evaluations Provided by Multiple Individuals.
IEEE Trans. Affect. Comput., 2021

Deep Representation Learning for Affective Speech Signal Analysis and Processing: Preventing unwanted signal disparities.
IEEE Signal Process. Mag., 2021

Voice Activity Detection with Teacher-Student Domain Emulation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Separation of Emotional and Reconstruction Embeddings on Ladder Network to Improve Speech Emotion Recognition Robustness in Noisy Conditions.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

DeepEmoCluster: A Semi-Supervised Framework for Latent Cluster Representation of Speech Emotions.
Proceedings of the IEEE International Conference on Acoustics, 2021

End-to-End Neural Network for Feature Extraction and Cancer Diagnosis of In Vivo Fluorescence Lifetime Images of Oral Lesions.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

Generative Approach Using Soft-Labels to Learn Uncertainty in Predicting Emotional Attributes.
Proceedings of the 9th International Conference on Affective Computing and Intelligent Interaction, 2021

Multimodal Behavior Modeling for Socially Interactive Agents.
Proceedings of the Handbook on Socially Interactive Agents: 20 Years of Research on Embodied Conversational Agents, 2021

2020
Semi-Supervised Speech Emotion Recognition With Ladder Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multimodal emotion recognition: Understanding the production process before modeling multimodal behaviors.
Proceedings of the 2020 Workshop on Speech, Music and Mind, 2020

Robust Driver Head Pose Estimation in Naturalistic Conditions from Point-Cloud Data.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Use of Triplet-Loss Function to Improve Driving Anomaly Detection Using Conditional Generative Adversarial Network.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

Ensemble of Students Taught by Probabilistic Teachers to Improve Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The MSP-Conversation Corpus.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

An Efficient Temporal Modeling Approach for Speech Emotion Recognition by Mapping Varied Duration Sentences into Fixed Number of Chunks.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

MSP-Face Corpus: A Natural Audiovisual Emotional Database.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

Style Extractor For Facial Expression Recognition in the Presence of Speech.
Proceedings of the IEEE International Conference on Image Processing, 2020

Modeling Uncertainty in Predicting Emotional Attributes from Spontaneous Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Dynamic versus Static Facial Expressions in the Presence of Speech.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

2019
Curriculum Learning for Speech Emotion Recognition From Crowdsourced Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Building Naturalistic Emotionally Balanced Speech Corpus by Retrieving Emotional Speech from Existing Podcast Recordings.
IEEE Trans. Affect. Comput., 2019

End-to-end audiovisual speech activity detection with bimodal recurrent neural models.
Speech Commun., 2019

Speech-driven animation with meaningful behaviors.
Speech Commun., 2019

The Ambiguous World of Emotion Representation.
CoRR, 2019

Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps.
CoRR, 2019

Discriminative Features for Texture Retrieval Using Wavelet Packets.
IEEE Access, 2019

Lexical Dependent Emotion Detection Using Synthetic Speech Reference.
IEEE Access, 2019

Analysis of the Relationship Between Physiological Signals and Vehicle Maneuvers During a Naturalistic Driving Study.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Speech Emotion Recognition with a Reject Option.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Driving Anomaly Detection with Conditional Generative Adversarial Network using Physiological and CAN-Bus Data.
Proceedings of the International Conference on Multimodal Interaction, 2019

Estimation of Gaze Region Using Two Dimensional Probabilistic Maps Constructed Using Convolutional Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Retrieving Speech Samples with Similar Emotional Content Using a Triplet Loss Function.
Proceedings of the IEEE International Conference on Acoustics, 2019

Exploring the Intersection Between Speaker Verification and Emotion Recognition.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

Active Learning for Speech Emotion Recognition Using Deep Neural Network.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019

2018
Gating Neural Network for Large Vocabulary Audiovisual Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Domain Adversarial for Acoustic Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Calibration free, user-independent gaze estimation with tensor analysis.
Image Vis. Comput., 2018

Probabilistic Estimation of the Gaze Region of the Driver using Dense Classification.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Audiovisual Speech Activity Detection with Advanced Long Short-Term Memory.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Role of Regularization in the Prediction of Valence from Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Ladder Networks for Emotion Recognition: Using Unsupervised Auxiliary Tasks to Improve Predictions of Emotional Attributes.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Preference-Learning with Qualitative Agreement for Sentence Level Emotional Annotations.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Predicting Categorical Emotions by Jointly Learning Primary and Secondary Emotions through Multitask Learning.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Aligning Audiovisual Features for Audiovisual Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

FI-CAP: Robust Framework to Benchmark Head Pose Estimation in Challenging Environments.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Novel Realizations of Speech-Driven Head Movements with Generative Adversarial Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Study of Dense Network Approaches for Speech Emotion Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Expressive Speech-Driven Lip Movements with Multitask Learning.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

2017
The Cost of Dichotomizing Continuous Labels for Binary Classification Problems: Deriving a Bayesian-Optimal Classifier.
IEEE Trans. Affect. Comput., 2017

MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception.
IEEE Trans. Affect. Comput., 2017

Driver Modeling for Detection and Assessment of Driver Distraction: Examples from the UTDrive Test Bed.
IEEE Signal Process. Mag., 2017

Meaningful head movements driven by emotional synthetic speech.
Speech Commun., 2017

Assessment and classification of singing quality based on audio-visual features.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Joint Learning of Speech-Driven Facial Motion with Bidirectional Long-Short Term Memory.
Proceedings of the Intelligent Virtual Agents - 17th International Conference, 2017

Challenges in head pose estimation of drivers in naturalistic recordings using existing tools.
Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017

Probabilistic estimation of the driver's gaze from head orientation and position.
Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017

Bimodal Recurrent Neural Network for Audiovisual Voice Activity Detection.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Jointly Predicting Arousal, Valence and Dominance with Multi-Task Learning.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A Stepwise Analysis of Aggregated Crowdsourced Labels Describing Multimodal Emotional Behaviors.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A study of speaker verification performance with expressive speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Ranking emotional attributes with deep neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Incremental adaptation using active learning for acoustic emotion recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Ensemble feature selection for domain adaptation in speech emotion recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

The ordinal nature of emotions.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Predicting speaker recognition reliability by considering emotional content.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Formulating emotion perception as a probabilistic model with application to categorical emotion classification.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

2016
Detecting Drivers' Mirror-Checking Actions and Its Application to Maneuver and Secondary Task Recognition.
IEEE Trans. Intell. Transp. Syst., 2016

Using Agreement on Direction of Change to Build Rank-Based Emotion Classifiers.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Facial Expression Recognition in the Presence of Speech Using Blind Lexical Compensation.
IEEE Trans. Affect. Comput., 2016

The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.
IEEE Trans. Affect. Comput., 2016

Increasing the Reliability of Crowdsourcing Evaluations Using Online Quality Assessment.
IEEE Trans. Affect. Comput., 2016

The USC CreativeIT database of multimodal dyadic interactions: from speech and full body motion capture to continuous emotional annotations.
Lang. Resour. Evaluation, 2016

Analyzing the relationship between head pose and gaze to model driver visual attention.
Proceedings of the 19th IEEE International Conference on Intelligent Transportation Systems, 2016

Improving Boundary Estimation in Audiovisual Speech Activity Detection Using Bayesian Information Criterion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A Portable Automatic PA-TA-KA Syllable Detection System to Derive Biomarkers for Neurological Disorders.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Head Motion Generation with Synthetic Speech: A Data Driven Approach.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Defining Emotionally Salient Regions Using Qualitative Agreement Method.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Retrieving Categorical Emotions Using a Probabilistic Framework to Define Preference Learning Samples.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Practical considerations on the use of preference learning for ranking emotional speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A multimodal analysis of synchrony during dyadic interaction using a metric based on sequential pattern mining.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Automatic composition of broadcast news summaries using rank classifiers trained with acoustic and lexical features.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Tradeoff between quality and quantity of emotional annotations to characterize expressive behaviors.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Predicting Perceived Visual and Cognitive Distractions of Drivers With Multimodal Features.
IEEE Trans. Intell. Transp. Syst., 2015

UMEME: University of Michigan Emotional McGurk Effect Data Set.
IEEE Trans. Affect. Comput., 2015

Correcting Time-Continuous Emotional Labels by Modeling the Reaction Lag of Evaluators.
IEEE Trans. Affect. Comput., 2015

Challenges in Concussion Detection Using Vocal Acoustic Biomarkers.
IEEE Access, 2015

An unsupervised visual-only voice activity detection approach using temporal orofacial features.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Retrieving Target Gestures Toward Speech Driven Animation with Meaningful Behaviors.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Adjacent Vehicle Collision Warning System using Image Sensor and Inertial Measurement Unit.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Emotion recognition using synthetic speech as neutral reference.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Supervised domain adaptation for emotion recognition from speech.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

MSP-AVATAR corpus: Motion capture recordings to study the role of discourse functions in the design of intelligent virtual agents.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

2014
Compensating for speaker or lexical variabilities in speech for emotion recognition.
Speech Commun., 2014

Shape-based modeling of the fundamental frequency contour for emotion detection in speech.
Comput. Speech Lang., 2014

Evaluation of syllable rate estimation in expressive speech and its contribution to emotion recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Lipreading approach for isolated digits recognition under whisper and neutral speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Building a naturalistic emotional speech corpus by retrieving expressive behaviors from existing speech corpora.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speech-Driven Animation Constrained by Appropriate Discourse Functions.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

User Independent Gaze Estimation by Exploiting Similarity Measures in the Eye Pair Appearance Eigenspace.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Using Perceptual Evaluation to Quantify Cognitive and Visual Driver Distractions.
Proceedings of the Smart Mobile In-Vehicle Systems, Next Generation Advancements, 2014

2013
Modeling of Driver Behavior in Real World Scenarios Using Multiple Noninvasive Sensors.
IEEE Trans. Multim., 2013

Exploring Cross-Modality Affective Reactions for Audiovisual Emotion Recognition.
IEEE Trans. Affect. Comput., 2013

Iterative Feature Normalization Scheme for Automatic Emotion Detection from Speech.
IEEE Trans. Affect. Comput., 2013

Energy and F0 contour modeling with functional data analysis for emotional speech detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Evaluating the robustness of an appearance-based gaze estimation method for multimodal interfaces.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Analysis of facial features of drivers under cognitive and visual distractions.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Audiovisual corpus to analyze whisper speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Feature and model level compensation of lexical content for facial emotion recognition.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Analysis and Compensation of the Reaction Lag of Evaluators in Continuous Emotional Annotations.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
Generating Human-Like Behaviors Using Joint, Speech-Driven Models for Conversational Agents.
IEEE Trans. Speech Audio Process., 2012

Unveiling the Acoustic Properties that Describe the Valence Dimension.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Indoor robotic terrain classification via angular velocity based hierarchical classifier selection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Factorizing speaker, lexical and emotional variabilities observed in facial expressions.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

A personalized emotion recognition system using an unsupervised feature adaptation scheme.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Emotion recognition using a hierarchical binary decision tree approach.
Speech Commun., 2011

Detecting Sleepiness by Fusing Classifiers Trained with Novel Acoustic Features.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Analysis of driver behaviors during common tasks using frontal video camera and CAN-Bus information.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Iterative feature normalization for emotional speech detection.
Proceedings of the IEEE International Conference on Acoustics, 2011

Audio-visual isolated digit recognition for whispered speech.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Visual emotion recognition using compact facial representations and viseme information.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Analysis of Emotionally Salient Aspects of Fundamental Frequency for Emotion Detection.
IEEE Trans. Speech Audio Process., 2009

Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Interpreting ambiguous emotional expressions.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
IEMOCAP: interactive emotional dyadic motion capture database.
Lang. Resour. Evaluation, 2008

Scripted dialogs versus improvisation: lessons learned about emotional elicitation techniques from the IEMOCAP database.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

The expression and perception of emotions: comparing assessments of self versus others.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Interrelation Between Speech and Facial Gestures in Emotional Utterances: A Single Subject Study.
IEEE Trans. Speech Audio Process., 2007

Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis.
IEEE Trans. Speech Audio Process., 2007

Multimodal Meeting Monitoring: Improvements on Speaker Tracking and Segmentation through a Modified Mixture Particle Filter.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Joint Analysis of the Emotional Fingerprint in the Face and Speech: A single subject study.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Using neutral speech models for emotional speech analysis.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Real-Time Monitoring of Participants' Interaction in a Meeting using Audio-Visual Sensors.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition.
IEEE Trans. Speech Audio Process., 2006

2005
Natural head motion synthesis driven by acoustic prosodic features.
Comput. Animat. Virtual Worlds, 2005

Investigating the role of phoneme-level modifications in emotional speech resynthesis.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Smart room: participant and speaker localization and identification.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A real-time protocol for the Internet based on the least mean square algorithm.
IEEE Trans. Multim., 2004

Audio-based head motion synthesis for Avatar-based telepresence systems.
Proceedings of the 2004 ACM SIGMM Workshop on Effective Telepresence, 2004

An acoustic study of emotions expressed in speech.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Emotion recognition based on phoneme classes.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Analysis of emotion recognition using facial expressions, speech and multimodal information.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

