2025
UniDE: A multi-level and low-resource framework for automatic dialogue evaluation via LLM-based data augmentation and multitask learning.
Inf. Process. Manag., 2025
Reparameterization of Lightweight Transformer for On-Device Speech Emotion Recognition.
IEEE Internet Things J., 2025
2024
Refashioning Emotion Recognition Modeling: The Advent of Generalized Large Models.
IEEE Trans. Comput. Soc. Syst., October, 2024
An Angle-Oriented Approach to Transferring Speech to Gesture for Highly Anthropomorphized Embodied Conversational Agents.
Int. J. Comput. Intell. Appl., June, 2024
Discriminative Feature Learning-Based Federated Lightweight Distillation Against Multiple Attacks.
IEEE Internet Things J., May, 2024
Gradient-Level Differential Privacy Against Attribute Inference Attack for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2024
Individual mapping and asymmetric dual supervision for discrete cross-modal hashing.
Expert Syst. Appl., 2024
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis.
CoRR, 2024
ParaLBench: A Large-Scale Benchmark for Computational Paralinguistics over Acoustic Foundation Models.
CoRR, 2024
Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition.
CoRR, 2024
Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling.
CoRR, 2024
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition.
CoRR, 2024
LSTDial: Enhancing Dialogue Generation via Long- and Short-Term Measurement Feedback.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Open Vocabulary Emotion Prediction Based on Large Multimodal Models.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
Esihgnn: Event-State Interactions Infused Heterogeneous Graph Neural Network for Conversational Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
Customising General Large Language Models for Specialised Emotion Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2024
Adaptive Speech Emotion Representation Learning Based On Dynamic Graph.
Proceedings of the IEEE International Conference on Acoustics, 2024
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024
Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2024
EmoTransKG: An Innovative Emotion Knowledge Graph to Reveal Emotion Transformation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Diversifying Emotional Dialogue Generation via Selective Adversarial Training.
Sensors, July, 2023
ACG-EmoCluster: A Novel Framework to Capture Spatial and Temporal Information from Emotional Speech Enhanced by DeepCluster.
Sensors, 2023
Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models.
CoRR, 2023
Frequency Domain Feature Learning with Wavelet Transform for Image Translation.
Proceedings of the PRICAI 2023: Trends in Artificial Intelligence, 2023
Speaker-aware Cross-modal Fusion Architecture for Conversational Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
GCFormer: A Graph Convolutional Transformer for Speech Emotion Recognition.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023
Privacy-Enhanced Federated Learning Against Attribute Inference Attack for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes.
IEEE Trans. Multim., 2022
Rethinking Auditory Affective Descriptors Through Zero-Shot Emotion Recognition in Speech.
IEEE Trans. Comput. Soc. Syst., 2022
Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression.
CoRR, 2022
Deliberation Selector for Knowledge-Grounded Conversation Generation.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022
Automatic Respiratory Sound Classification Via Multi-Branch Temporal Convolutional Network.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Self-attention transfer networks for speech emotion recognition.
Virtual Real. Intell. Hardw., 2021
Can Machine Learning Assist Locating the Excitation of Snore Sound? A Review.
IEEE J. Biomed. Health Informatics, 2021
EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings.
IEEE Trans. Affect. Comput., 2021
Artificial Intelligence Internet of Things for the Elderly: From Assisted Living to Health-Care Monitoring.
IEEE Signal Process. Mag., 2021
Deep Learning for Mobile Mental Health: Challenges and recent advances.
IEEE Signal Process. Mag., 2021
Combining a parallel 2D CNN with a self-attention Dilated Residual Network for CTC-based discrete speech emotion recognition.
Neural Networks, 2021
Computer Audition for Fighting the SARS-CoV-2 Corona Crisis - Introducing the Multitask Speech Corpus for COVID-19.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
IEEE Internet Things J., 2021
Internet of emotional people: Towards continual affective computing cross cultures via audiovisual signals.
Future Gener. Comput. Syst., 2021
Learning audio sequence representations for acoustic event classification.
Expert Syst. Appl., 2021
Exploring Perception Uncertainty for Emotion Recognition in Dyadic Conversation and Music Listening.
Cogn. Comput., 2021
Identifying surgical-mask speech using deep neural networks on low-level aggregation.
Proceedings of the SAC '21: The 36th ACM/SIGAPP Symposium on Applied Computing, 2021
2020
Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data.
IEEE J. Biomed. Health Informatics, 2020
Guest Editorial Special Issue on Adversarial Learning in Computational Intelligence.
IEEE Trans. Emerg. Top. Comput. Intell., 2020
Exploiting time-frequency patterns with LSTM-RNNs for low-bitrate audio restoration.
Neural Comput. Appl., 2020
Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders.
IEEE J. Sel. Top. Signal Process., 2020
Robust Semisupervised Generative Adversarial Networks for Speech Emotion Recognition via Distribution Smoothness.
IEEE Access, 2020
An Early Study on Intelligent Analysis of Speech Under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Hierarchical Attention Transfer Networks for Depression Assessment from Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Generating and Protecting Against Adversarial Attacks for Deep Speech-Based Emotion Recognition Models.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Dynamic Difficulty Awareness Training for Continuous Emotion Prediction.
IEEE Trans. Multim., 2019
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives [Review Article].
IEEE Comput. Intell. Mag., 2019
Exploring Deep Spectrum Representations via Attention-Based Recurrent and Convolutional Neural Networks for Speech Emotion Recognition.
IEEE Access, 2019
Automatic Detection of Major Depressive Disorder via a Bag-of-Behaviour-Words Approach.
Proceedings of the Third International Symposium on Image Computing and Digital Medicine, 2019
Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
VCMNet: Weakly Supervised Learning for Automatic Infant Vocalisation Maturity Analysis.
Proceedings of the International Conference on Multimodal Interaction, 2019
Compact Convolutional Recurrent Neural Networks via Binarization for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Implicit Fusion by Joint Audiovisual Training for Emotion Recognition in Mono Modality.
Proceedings of the IEEE International Conference on Acoustics, 2019
Attention-augmented End-to-end Multi-task Learning for Emotion Prediction from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019
Teaching Machines to Know Your Depressive State: On Physical Activity in Health and Major Depressive Disorder.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019
2018
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
ACM Trans. Intell. Syst. Technol., 2018
Semisupervised Autoencoders for Speech Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Deep Scalogram Representations for Acoustic Scene Classification.
IEEE CAA J. Autom. Sinica, 2018
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives.
CoRR, 2018
Snoring classified: The Munich-Passau Snore Sound Corpus.
,
,
,
,
,
,
,
,
,
,
Comput. Biol. Medicine, 2018
Leveraging Unlabeled Data for Emotion Recognition With Enhanced Collaborative Semi-Supervised Learning.
IEEE Access, 2018
Exploring Spatio-Temporal Representations by Integrating Attention-based Bidirectional-LSTM-RNNs and FCNs for Speech Emotion Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Automated Classification of Children's Linguistic versus Non-Linguistic Vocalisations.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Evolving Learning for Analysing Mood-Related Infant Vocalisation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Exploring A New Method for Food Likability Rating Based on DT-CWT Theory.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Towards Conditional Adversarial Training for Predicting Emotions from Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multifeature Analysis.
IEEE Trans. Biomed. Eng., 2017
A Two-Dimensional Framework of Multiple Kernel Subspace Learning for Recognizing Emotion in Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Advanced Data Exploitation in Speech Analysis: An overview.
IEEE Signal Process. Mag., 2017
Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2017
Strength modelling for real-worldautomatic continuous affect recognition from audiovisual signals.
Image Vis. Comput., 2017
Learning Audio Sequence Representations for Acoustic Event Classification.
CoRR, 2017
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
CoRR, 2017
Recognizing Emotions From Whispered Speech Based on Acoustic Feature Transfer Learning.
IEEE Access, 2017
From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Towards intoxicated speech recognition.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Prediction-based learning for continuous emotion recognition in speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Reconstruction-error-based learning for continuous emotion recognition in speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Deep Sequential Image Features on Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Wavelets Revisited for the Classification of Acoustic Scenes.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
2016
Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition.
IEEE Access, 2016
Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Fisher Kernels on Phase-Based Features for Speech Emotion Recognition.
Proceedings of the Dialogues with Social Robots, 2016
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Multiscale kernel locally penalised discriminant analysis exemplified by emotion recognition in speech.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Enhanced semi-supervised learning for multimodal emotion recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Wavelet features for classification of vote snore sounds.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
The University of Passau Open Emotion Recognition System for the Multimodal Emotion Challenge.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
2015
Semi-Autonomous Data Enrichment and Optimisation for Intelligent Speech Analysis.
PhD thesis, 2015
Cooperative Learning and its Application to Emotion Recognition from Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Dynamic Active Learning Based on Agreement and Applied to Emotion Recognition in Spoken Interactions.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015
Bird sounds classification by large scale acoustic features and extreme learning machine.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015
On rater reliability and agreement based dynamic active learning.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Channel mapping using bidirectional long short-term memory for dereverberation in hands-free voice controlled devices.
IEEE Trans. Consumer Electron., 2014
Distributing Recognition in Computational Paralinguistics.
IEEE Trans. Affect. Comput., 2014
Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2014
Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Linked Source and Target Domain Subspace Feature Transfer Learning - Exemplified by Speech Emotion Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014
Introducing shared-hidden-layer autoencoders for transfer learning and their application in acoustic emotion recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Active learning by label uncertainty for acoustic emotion recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Co-training succeeds in Computational Paralinguistics.
Proceedings of the IEEE International Conference on Acoustics, 2013
Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2013
Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013
2012
Synthesized speech for model training in cross-corpus recognition of human emotion.
Int. J. Speech Technol., 2012
Towards distributed recognition of emotion from speech.
Proceedings of the 5th International Symposium on Communications, 2012
Active Learning by Sparse Instance Tracking and Classifier Confidence in Acoustic Emotion Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Semi-supervised learning helps in sound event classification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Automatic recognition of emotion evoked by general sound events.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Towards Automatic Intoxication Detection from Speech in Real-Life Acoustic Environments.
Proceedings of the 10th ITG Conference on Speech Communication, 2012
2011
Using Multiple Databases for Training in Emotion Recognition: To Unite or to Vote?
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Unsupervised learning in cross-corpus acoustic emotion recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011