Björn W. Schuller
Orcid: 0000-0002-6478-8699Affiliations:
- Imperial College London, GLAM, UK
- University of Augsburg, Department of Computer Science, Germany
- University of Passau, Faculty of Computer Science and Mathematics, Germany (former)
According to our database1,
Björn W. Schuller
authored at least 1,089 papers
between 2001 and 2025.
Collaborative distances:
Collaborative distances:
Awards
ACM Fellow
ACM Fellow 2023, "For empirical and theoretical contributions to the development of computer audition, affective computing, and health informatics".
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on twitter.com
-
on orkg.org
-
on orcid.org
-
on id.loc.gov
-
on d-nb.info
-
on schuller.one
-
on doc.ic.ac.uk
On csauthors.net:
Bibliography
2025
Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers [Research Frontier].
IEEE Comput. Intell. Mag., February, 2025
An On-Board Executable Multi-Feature Transfer-Enhanced Fusion Model for Three-Lead EEG Sensor-Assisted Depression Diagnosis.
IEEE J. Biomed. Health Informatics, January, 2025
IEEE Signal Process. Lett., 2025
2024
Challenges in Observing the Emotions of Children with Autism Interacting with a Social Robot.
Int. J. Soc. Robotics, December, 2024
Attention-Based Temporal Graph Representation Learning for EEG-Based Emotion Recognition.
IEEE J. Biomed. Health Informatics, October, 2024
IEEE Trans. Comput. Soc. Syst., October, 2024
Heart Sound Abnormality Detection From Multi-Institutional Collaboration: Introducing a Federated Learning Framework.
IEEE Trans. Biomed. Eng., October, 2024
Fed-MStacking: Heterogeneous Federated Learning With Stacking Misaligned Labels for Abnormal Heart Sound Detection.
IEEE J. Biomed. Health Informatics, September, 2024
DepressionMLP: A Multi-Layer Perceptron Architecture for Automatic Depression Level Prediction via Facial Keypoints and Action Units.
IEEE Trans. Circuits Syst. Video Technol., September, 2024
IEEE Comput. Intell. Mag., August, 2024
Audio Enhancement for Computer Audition - An Iterative Training Paradigm Using Sample Importance.
J. Comput. Sci. Technol., July, 2024
Audiovisual Affect Recognition for Autonomous Vehicles: Applications and Future Agendas.
IEEE Trans. Intell. Transp. Syst., June, 2024
Multichannel Speech Enhancement Based on Neural Beamforming and a Context-Focused Post-Filtering Network.
IEEE Trans. Cogn. Dev. Syst., June, 2024
Automatic Bird Sound Source Separation Based on Passive Acoustic Devices in Wild Environment.
IEEE Internet Things J., May, 2024
Multi-view domain-adaptive representation learning for EEG-based emotion recognition.
Inf. Fusion, April, 2024
Patterns, March, 2024
Patterns, March, 2024
COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024
Introducing the COVID-19 YouTube (COVYT) speech dataset featuring the same speakers with and without infection.
Biomed. Signal Process. Control., February, 2024
LEPCNet: A Lightweight End-to-End PCG Classification Neural Network Model for Wearable Devices.
IEEE Trans. Instrum. Meas., 2024
A Non-Invasive Speech Quality Evaluation Algorithm for Hearing Aids With Multi-Head Self-Attention and Audiogram-Based Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities?
IEEE Trans. Affect. Comput., 2024
Contrastive Learning Based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition With Missing Modalities.
IEEE Trans. Affect. Comput., 2024
IEEE Trans. Affect. Comput., 2024
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers.
Nat. Mac. Intell., 2024
Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment.
CoRR, 2024
Detecting Machine-Generated Music with Explainability - A Challenge and Early Benchmarks.
CoRR, 2024
Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features.
CoRR, 2024
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.
CoRR, 2024
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis.
CoRR, 2024
M6: Multi-generator, Multi-domain, Multi-lingual and cultural, Multi-genres, Multi-instrument Machine-Generated Music Detection Databases.
CoRR, 2024
From Audio Deepfake Detection to AI-Generated Music Detection - A Pathway and Overview.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning.
CoRR, 2024
Trading through Earnings Seasons using Self-Supervised Contrastive Representation Learning.
CoRR, 2024
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models.
CoRR, 2024
Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation.
CoRR, 2024
Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset.
CoRR, 2024
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition.
CoRR, 2024
This Paper Had the Smartest Reviewers - Flattery Detection Utilising an Audio-Textual Transformer-Based Approach.
CoRR, 2024
CoRR, 2024
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets.
CoRR, 2024
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition.
CoRR, 2024
ParaCLAP - Towards a general language-audio model for computational paralinguistic tasks.
CoRR, 2024
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition.
CoRR, 2024
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition.
CoRR, 2024
CoRR, 2024
Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine.
CoRR, 2024
emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition.
CoRR, 2024
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition.
CoRR, 2024
emoDARTS: Joint Optimization of CNN and Sequential Neural Network Architectures for Superior Speech Emotion Recognition.
IEEE Access, 2024
Domain Adapting Deep Reinforcement Learning for Real-World Speech Emotion Recognition.
IEEE Access, 2024
IEEE Access, 2024
Robust Robotic Search and Rescue in Harsh Environments: An Example and Open Challenges.
Proceedings of the IEEE International Symposium on Robotic and Sensors Environments, 2024
Proceedings of the 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2024
MuSe '24: The 5th Multimodal Sentiment Analysis Challenge and Workshop: Social Perception & Humor.
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition.
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
MRAC'24 Track 2: 2nd International Workshop on Multimodal and Responsible Affective Computing.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
The 2nd Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA) 2024: Dataset and Results.
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024
Deep Neural Quality of Service Prediction for Unmanned Aircraft System Communications.
Proceedings of the International Wireless Communications and Mobile Computing, 2024
Dense Coordinate Channel Attention Network for Depression Level Estimation from Speech.
Proceedings of the Pattern Recognition - 27th International Conference, 2024
Proceedings of the Pattern Recognition - 27th International Conference, 2024
EVAC 2024 - Empathic Virtual Agent Challenge: Appraisal-based Recognition of Affective States.
Proceedings of the 26th International Conference on Multimodal Interaction, 2024
Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Bringing the Discussion of Minima Sharpness to the Audio Domain: A Filter-Normalised Evaluation for Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024
Personalised Anomaly Detectors and Prototypical Representations for Relapse Detection from Wearable-Based Digital Phenotyping.
Proceedings of the IEEE International Conference on Acoustics, 2024
Improving Speaker-Independent Speech Emotion Recognition using Dynamic Joint Distribution Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Task Selection and Assignment for Multi-Modal Multi-Task Dialogue Act Classification with Non-Stationary Multi-Armed Bandits.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2024
An Automatic Analysis of Ultrasound Vocalisations for the Prediction of Interaction Context in Captive Egyptian Fruit Bats.
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
Audio-Based Step-Count Estimation for Running - Windowing and Neural Network Baselines.
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024
Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024
Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024
Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024
EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Expert Syst. Appl., December, 2023
Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy.
EURASIP J. Audio Speech Music. Process., December, 2023
Guest Editorial Trustworthy and Collaborative AI for Personalised Healthcare Through Edge-of-Things.
IEEE J. Biomed. Health Informatics, November, 2023
Patterns, November, 2023
Proc. IEEE, October, 2023
A weakly supervised spatial group attention network for fine-grained visual recognition.
Appl. Intell., October, 2023
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023
IEEE Trans. Cybern., June, 2023
Comput. Speech Lang., June, 2023
Int. J. Hum. Comput. Interact., May, 2023
Frontiers Digit. Health, May, 2023
Can a Holistic View Facilitate the Development of Intelligent Traditional Chinese Medicine? A Survey.
IEEE Trans. Comput. Soc. Syst., April, 2023
Biomed. Signal Process. Control., April, 2023
Frontiers Digit. Health, March, 2023
IEEE Trans. Comput. Soc. Syst., February, 2023
IEEE Trans. Affect. Comput., 2023
IEEE Trans. Affect. Comput., 2023
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements.
IEEE Trans. Affect. Comput., 2023
Dual Attention and Element Recalibration Networks for Automatic Depression Level Prediction.
IEEE Trans. Affect. Comput., 2023
Multitask Learning From Augmented Auxiliary Data for Improving Speech Emotion Recognition.
IEEE Trans. Affect. Comput., 2023
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition.
IEEE Trans. Affect. Comput., 2023
IEEE Trans. Affect. Comput., 2023
IEEE Trans. Affect. Comput., 2023
IEEE Trans. Affect. Comput., 2023
Guest Editorial: Special Issue on Affective Speech and Language Synthesis, Generation, and Conversion.
IEEE Trans. Affect. Comput., 2023
IEEE Signal Process. Lett., 2023
Automated composition of Galician Xota - tuning RNN-based composers for specific musical styles using deep Q-learning.
PeerJ Comput. Sci., 2023
Multistage linguistic conditioning of convolutional layers for speech emotion recognition.
Frontiers Comput. Sci., 2023
Computational charisma - A brick by brick blueprint for building charismatic artificial intelligence.
Frontiers Comput. Sci., 2023
IEEE Intell. Syst., 2023
Will Affective Computing Emerge From Foundation Models and General Artificial Intelligence? A First Evaluation of ChatGPT.
IEEE Intell. Syst., 2023
CoRR, 2023
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model.
CoRR, 2023
CoRR, 2023
Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers.
CoRR, 2023
Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions.
CoRR, 2023
Improving Speech Emotion Recognition Performance using Differentiable Architecture Search.
CoRR, 2023
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation.
CoRR, 2023
CoRR, 2023
Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT.
CoRR, 2023
Toward Detecting and Addressing Corner Cases in Deep Learning Based Medical Image Segmentation.
IEEE Access, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023
Personalised Speech-Based Heart Rate Categorisation Using Weighted-Instance Learning.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023
An Overview of the ICASSP Special Session on AI Security and Privacy in Speech and Audio Processing.
Proceedings of the ACM Multimedia Asia Workshops, 2023
The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
MRAC'23: 1st International Workshop on Multimodal and Responsible Affective Computing.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
"Do touch!" - 3D Scanning and Printing Technologies for the Haptic Representation of Cultural Assets: A Study with Blind Target Users.
Proceedings of the 5th Workshop on analySis, 2023
MuSe 2023 Challenge: Multimodal Prediction of Mimicked Emotions, Cross-Cultural Humour, and Personalised Recognition of Affects.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023
Analysis and automatic prediction of exertion from speech: Contrasting objective and subjective measures collected while running.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
The effect of clinical intervention on the speech of individuals with PTSD: features and recognition performances.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Nkululeko: Machine Learning Experiments on Speaker Characteristics Without Programming.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
SWRR: Feature Map Classifier Based on Sliding Window Attention and High-Response Feature Reuse for Multimodal Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2023
SMILENets: Audio Representation Learning via Neural Knowledge Distillation of Traditional Audio-Feature Extractors.
Proceedings of the 8th International Conference on Frontiers of Signal Processing, 2023
An End-to-End Model for Mental Disorders Detection by Spontaneous Physical Activity Data.
Proceedings of the IEEE International Conference on Data Mining, 2023
Crossmodal Transformer on Multi-Physical Signals for Personalised Daily Mental Health Prediction.
Proceedings of the IEEE International Conference on Data Mining, 2023
An Investigation on Data Augmentation and Multiple Instance Learning for Diagnosis of COVID-19 from Speech and Cough Sound.
Proceedings of the International Conference on Consumer Electronics - Taiwan, 2023
Hierarchical Network with Decoupled Knowledge Distillation for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Daily Mental Health Monitoring from Speech: A Real-World Japanese Dataset and Multitask Learning Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023
Positive-Pair Redundancy Reduction Regularisation for Speech-Based Asthma Diagnosis Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Knowledge Transfer for on-Device Speech Emotion Recognition With Neural Structured Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
AMNet: Introducing an Adaptive Mel-Spectrogram End-to-End Neural Network for Heart Sound Classification.
Proceedings of the IEEE International Conference on E-health Networking, 2023
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023
Applying Speech Derived Breathing Patterns to Automatically Classify Human Confidence.
Proceedings of the 31st European Signal Processing Conference, 2023
Multimodal Recognition of Valence, Arousal and Dominance via Late-Fusion of Text, Audio and Facial Expressions.
Proceedings of the 31st European Symposium on Artificial Neural Networks, 2023
Less is More: A Novel Feature Extraction Method for Heart Sound Classification via Fractal Transformation.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Cutting Weights of Deep Learning Models for Heart Sound Classification: Introducing a Knowledge Distillation Approach.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
A novel and simple approach to regularise attention frameworks and its efficacy in segmentation.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
NeuroCellCentreDB: Exploring a Novel Dataset for Neuron-like Cell Centre Detection with Deep Neural Networks.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
How does Music Affect Your Brain? A Pilot Study on EEG and Music Features for Automatic Analysis.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Deep Modelling Strategies for Human Confidence Classification using Audio-visual Data.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Noise Robust Recognition of Depression Status and Treatment Response from Speech via Unsupervised Feature Aggregation.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Somatisation Disorder Detection via Speech: Introducing a Self-Supervised Learning Model.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Universal Lesion Detection Utilising Cascading R-CNNs and a Novel Video Pretraining Method.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
2022
Introducing the COVID-19 YouTube (COVYT) speech dataset featuring the same speakers with and without infection.
Dataset, September, 2022
Guest Editorial: Introduction to the Special Section on Efficient Network Design for Convergence of Deep Learning and Edge Computing.
IEEE Trans. Netw. Sci. Eng., 2022
Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes.
IEEE Trans. Multim., 2022
IEEE Trans. Intell. Transp. Syst., 2022
Capturing Time Dynamics From Speech Using Neural Networks for Surgical Mask Detection.
IEEE J. Biomed. Health Informatics, 2022
Selective Element and Two Orders Vectorization Networks for Automatic Depression Severity Diagnosis via Facial Changes.
IEEE Trans. Circuits Syst. Video Technol., 2022
Rethinking Auditory Affective Descriptors Through Zero-Shot Emotion Recognition in Speech.
IEEE Trans. Comput. Soc. Syst., 2022
IEEE Trans. Comput. Soc. Syst., 2022
IEEE Trans. Comput. Soc. Syst., 2022
Psychological Field Versus Physiological Field: From Qualitative Analysis to Quantitative Modeling of the Mental Status.
IEEE Trans. Comput. Soc. Syst., 2022
Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Holistic Affect Recognition Using PaNDA: Paralinguistic Non-Metric Dimensional Analysis.
IEEE Trans. Affect. Comput., 2022
IEEE Trans. Affect. Comput., 2022
IEEE Trans. Affect. Comput., 2022
Face mask recognition from audio: The MASC database and an overview on the mask challenge.
Pattern Recognit., 2022
Fitbeat: COVID-19 estimation based on wristband heart rate using a contrastive convolutional auto-encoder.
Pattern Recognit., 2022
Pattern Recognit., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
MEDAS: an open-source platform as a service to help break the walls between medicine and informatics.
Neural Comput. Appl., 2022
IEEE J. Sel. Top. Signal Process., 2022
Correction to: The perception of emotional cues by children in artificial background noise.
Int. J. Speech Technol., 2022
DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing From Decentralized Data.
Frontiers Artif. Intell., 2022
Future Gener. Comput. Syst., 2022
Is Speech the New Blood? Recent Progress in AI-Based Disease Detection From Audio in a Nutshell.
Frontiers Digit. Health, 2022
Personalised depression forecasting using mobile sensor data and ecological momentary assessment.
Frontiers Digit. Health, 2022
Voice Analysis for Neurological Disorder Recognition-A Systematic Review and Perspective on Emerging Trends.
Frontiers Digit. Health, 2022
Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 From Audio Challenges.
Frontiers Digit. Health, 2022
An Estimation of Online Video User Engagement From Features of Time- and Value-Continuous, Dimensional Emotions.
Frontiers Comput. Sci., 2022
Evaluating the Impact of Voice Activity Detection on Speech Emotion Recognition for Autistic Children.
Frontiers Comput. Sci., 2022
Frontiers Comput. Sci., 2022
Frontiers Comput. Sci., 2022
ACM Comput. Surv., 2022
Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19.
CoRR, 2022
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers.
CoRR, 2022
CoRR, 2022
Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression.
CoRR, 2022
CoRR, 2022
Self-Supervised Attention Networks and Uncertainty Loss Weighting for Multi-Task Emotion Recognition on Vocal Bursts.
CoRR, 2022
Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts.
CoRR, 2022
The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression.
CoRR, 2022
Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression.
CoRR, 2022
COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection.
CoRR, 2022
Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression.
CoRR, 2022
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction.
CoRR, 2022
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts.
CoRR, 2022
Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition.
CoRR, 2022
Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet.
CoRR, 2022
Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition.
CoRR, 2022
Predicting Sex and Stroke Success - Computer-aided Player Grunt Analysis in Tennis Matches.
CoRR, 2022
Normalise for Fairness: A Simple Normalisation Technique for Fairness in Regression Machine Learning Problems.
CoRR, 2022
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
A Personalised Approach to Audiovisual Humour Recognition and its Individual-level Fairness.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022
Improving Exertion and Wellbeing Prediction in Outdoor Running Conditions using Audio-based Surface Recognition.
Proceedings of the MMSports@MM 2022: Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports, 2022
The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
A Comparative Cross Language View On Acted Databases Portraying Basic Emotions Utilising Machine Learning.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022
Proceedings of the 7th International Workshop on Sensor-based Activity Recognition and Artificial Intelligence, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Quantifying Cognitive Load from Voice using Transformer-Based Models and a Cross-Dataset Evaluation.
Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Convoluational Transformer With Adaptive Position Embedding For Covid-19 Detection From Cough Sounds.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
CNN-Based Heart Sound Classification with an Imbalance-Compensating Weighted Loss Function.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Time-Continuous Audiovisual Fusion with Recurrence vs Attention for In-The-Wild Affect Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the 15th International Congress on Image and Signal Processing, 2022
COVID-19 Detection Exploiting Self-Supervised Learning Representations of Respiratory Sounds.
Proceedings of the IEEE-EMBS International Conference on Biomedical and Health Informatics, 2022
Proceedings of the IEEE-EMBS International Conference on Biomedical and Health Informatics, 2022
A Novel Policy for Pre-trained Deep Reinforcement Learning for Speech Emotion Recognition.
Proceedings of the ACSW 2022: Australasian Computer Science Week 2022, Brisbane, Australia, February 14, 2022
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, ACII 2022, 2022
Ist Stimme das neue Blut? KI und Stimmbiomarker zu früheren Diagnose - für jedermann, überall und jederzeit.
Proceedings of the Künstliche Intelligenz im Gesundheitswesen: Entwicklungen, 2022
2021
Virtual Real. Intell. Hardw., 2021
Frustration recognition from speech during game interaction using wide residual networks.
Virtual Real. Intell. Hardw., 2021
Predictable Robots for Autistic Children - Variance in Robot Behaviour, Idiosyncrasies in Autistic Children's Characteristics, and Child-Robot Engagement.
ACM Trans. Comput. Hum. Interact., 2021
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification.
IEEE Trans. Multim., 2021
IEEE J. Biomed. Health Informatics, 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Guided Generative Adversarial Neural Network for Representation Learning and Audio Generation Using Fewer Labelled Audio Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
A Deep Adaptation Network for Speech Enhancement: Combining a Relativistic Discriminator With Multi-Kernel Maximum Mean Discrepancy.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
An Online Robot Collision Detection and Identification Scheme by Supervised Learning and Bayesian Decision Theory.
IEEE Trans Autom. Sci. Eng., 2021
EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings.
IEEE Trans. Affect. Comput., 2021
IEEE Signal Process. Mag., 2021
Artificial Intelligence Internet of Things for the Elderly: From Assisted Living to Health-Care Monitoring.
IEEE Signal Process. Mag., 2021
IEEE Signal Process. Mag., 2021
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild.
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Combining a parallel 2D CNN with a self-attention Dilated Residual Network for CTC-based discrete speech emotion recognition.
Neural Networks, 2021
Multim. Tools Appl., 2021
Computer Audition for Fighting the SARS-CoV-2 Corona Crisis - Introducing the Multitask Speech Corpus for COVID-19.
IEEE Internet Things J., 2021
Can Appliances Understand the Behavior of Elderly Via Machine Learning? A Feasibility Study.
IEEE Internet Things J., 2021
Inf. Fusion, 2021
Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data.
IEEE Multim., 2021
Internet of emotional people: Towards continual affective computing cross cultures via audiovisual signals.
Future Gener. Comput. Syst., 2021
COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis.
Frontiers Digit. Health, 2021
CovNet: A Transfer Learning Framework for Automatic COVID-19 Detection From Crowd-Sourced Cough Sounds.
Frontiers Digit. Health, 2021
Frontiers Big Data, 2021
Frontiers Comput. Sci., 2021
An Evaluation of Speech-Based Recognition of Emotional and Physiological Markers of Stress.
Frontiers Comput. Sci., 2021
IEEE Intell. Syst., 2021
Expert Syst. Appl., 2021
Capturing dynamics of post-earnings-announcement drift using a genetic algorithm-optimized XGBoost.
Expert Syst. Appl., 2021
Conversational Agent as Trustworthy Autonomous System (Trust-CA) (Dagstuhl Seminar 21381).
Dagstuhl Reports, 2021
Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech.
Comput. Speech Lang., 2021
CoRR, 2021
CoRR, 2021
The EIHW-GLAM Deep Attentive Multi-model Fusion System for Cough-based COVID-19 Recognition in the DiCOVA 2021 Challenge.
CoRR, 2021
CoRR, 2021
DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing from Decentralised Data.
CoRR, 2021
On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era.
CoRR, 2021
Remote smartphone-based speech collection: acceptance and barriers in individuals with major depressive disorder.
CoRR, 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
Exploring Perception Uncertainty for Emotion Recognition in Dyadic Conversation and Music Listening.
Cogn. Comput., 2021
Robot-Based Intervention for Children With Autism Spectrum Disorder: A Systematic Literature Review.
IEEE Access, 2021
An Enhanced Adversarial Network with Combined Latent Features for Spatio-temporal Facial Affect Estimation in the Wild.
Proceedings of the 16th International Joint Conference on Computer Vision, 2021
Emotion Recognition in Public Speaking Scenarios Utilising An LSTM-RNN Approach with Attention.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Identifying surgical-mask speech using deep neural networks on low-level aggregation.
Proceedings of the SAC '21: The 36th ACM/SIGAPP Symposium on Applied Computing, 2021
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021
Proceedings of the 23rd International Workshop on Multimedia Signal Processing, 2021
Proceedings of the 23rd International Workshop on Multimedia Signal Processing, 2021
MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021
MuSe 2021 Challenge: Multimodal Emotion, Sentiment, Physiological-Emotion, and Stress Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021
Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021
Proceedings of the Conversational AI for Natural Human-Centric Interaction, 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Coughing-Based Recognition of Covid-19 with Spatial Attentive ConvLSTM Recurrent Neural Networks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Cough-Based COVID-19 Detection with Contextual Attention Convolutional Neural Networks and Gender Information.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Recognising Covid-19 from Coughing Using Ensembles of SVMs and LSTMs with Handcrafted and Deep Audio Features.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Remote Smartphone-Based Speech Collection: Acceptance and Barriers in Individuals with Major Depressive Disorder.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
The DiCOVA 2021 Challenge - An Encoder-Decoder Approach for COVID-19 Recognition from Coughing Audio.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the ICT for Health, Accessibility and Wellbeing, 2021
Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021
Predicting Group Work Performance from Physical Handwriting Features in a Smart English Classroom.
Proceedings of the ICDSP 2021: 5th International Conference on Digital Signal Processing, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
The Role of Task and Acoustic Similarity in Audio Transfer Learning: Insights from the Speech Emotion Recognition Case.
Proceedings of the IEEE International Conference on Acoustics, 2021
A Novel Attention-Based Gated Recurrent Unit and its Efficacy in Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Hierarchical Attention-Based Temporal Convolutional Networks for Eeg-Based Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Universal Access in Human-Computer Interaction. Design Methods and User Experience, 2021
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021
Sensing the Sounds of Silence: A Pilot Study on the Detection of Model Mice of Autism Spectrum Disorder from Ultrasonic Vocalisations.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
COVID-19 Detection with a Novel Multi-Type Deep Fusion Method using Breathing and Coughing Information.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Transferring Cross-Corpus Knowledge: An Investigation on Data Augmentation for Heart Sound Classification.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Transformer-based CNNs: Mining Temporal Context Information for Multi-sound COVID-19 Diagnosis.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Fairness and Underspecification in Acoustic Scene Classification: The Case for Disaggregated Evaluations.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021
Proceedings of the 34th IEEE International Symposium on Computer-Based Medical Systems, 2021
Proceedings of the Seventh IEEE International Conference on Multimedia Big Data, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Embracing and Exploiting Annotator Emotional Subjectivity: An Affective Rater Ensemble Model.
Proceedings of the 2021 9th International Conference on Affective Computing and Intelligent Interaction, 2021
2020
IEEE J. Biomed. Health Informatics, 2020
Machine Listening for Heart Status Monitoring: Introducing and Benchmarking HSS - The Heart Sounds Shenzhen Corpus.
IEEE J. Biomed. Health Informatics, 2020
IEEE Trans. Emerg. Top. Comput. Intell., 2020
IEEE Trans. Cybern., 2020
"Are You Playing a Shooter Again?!" Deep Representation Learning for Audio-Based Video Game Genre Recognition.
IEEE Trans. Games, 2020
Neural Comput. Appl., 2020
Validity of machine learning in biology and medicine increased through collaborations across fields of expertise.
Nat. Mach. Intell., 2020
Knowl. Inf. Syst., 2020
Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders.
IEEE J. Sel. Top. Signal Process., 2020
I see it in your eyes: Training the shallowest-possible CNN to recognise emotions and pain from muted web-assisted in-the-wild video-chats in real-time.
Inf. Process. Manag., 2020
Int. J. Speech Technol., 2020
Frontiers Digit. Health, 2020
Considerations for a More Ethical Approach to Data in AI: On Data Representation and Infrastructure.
Frontiers Big Data, 2020
Towards cross-modal pre-training and learning tempo-spatial characteristics for audio recognition with convolutional and recurrent neural networks.
EURASIP J. Audio Speech Music. Process., 2020
CoRR, 2020
CoRR, 2020
Capturing dynamics of post-earnings-announcement drift using genetic algorithm-optimised supervised learning.
CoRR, 2020
MeDaS: An open-source platform as service to help break the walls between medicine and informatics.
CoRR, 2020
Go-CaRD - Generic, Optical Car Part Recognition and Detection: Collection, Insights, and Applications.
CoRR, 2020
Deep Reinforcement Learning with Pre-training for Time-efficient Training of Automatic Speech Recognition.
CoRR, 2020
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech.
CoRR, 2020
ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition.
CoRR, 2020
CoRR, 2020
MuSe 2020 - The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop.
CoRR, 2020
Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL.
CoRR, 2020
Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data.
CoRR, 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends.
CoRR, 2020
Classification of Lung Nodules Based on Deep Residual Networks and Migration Learning.
Comput. Intell. Neurosci., 2020
High-Fidelity Audio Generation and Representation Learning With Guided Adversarial Autoencoder.
IEEE Access, 2020
Proceedings of the PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare, 2020
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020
Summary of MuSe 2020: Multimodal Sentiment Analysis, Emotion-target Engagement and Trustworthiness Detection in Real-life Media.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
MuSe 2020 Challenge and Workshop: Multimodal Sentiment Analysis, Emotion-target Engagement and Trustworthiness Detection in Real-life Media: Emotional Car Reviews in-the-wild.
Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020
Unsupervised Representation Learning with Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech.
Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020
Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020
Emotion and Themes Recognition in Music with Convolutional and Recurrent Attention-Blocks.
Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020
Average Jane, Where Art Thou? - Recent Avenues in Efficient Machine Learning Under Subjectivity Uncertainty.
Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Computer Audition for Continuous Rainforest Occupancy Monitoring: The Case of Bornean Gibbons' Call Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
The INTERSPEECH 2020 Computational Paralinguistics Challenge: Elderly Emotion, Breathing & Masks.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Enhancing Transferability of Black-Box Adversarial Attacks via Lifelong Learning for Speech Emotion Recognition Models.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
An Investigation of Cross-Cultural Semi-Supervised Learning for Continuous Affect Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-Corpus Setting for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Learning Higher Representations from Pre-Trained Deep Models with Data Augmentation for the COMPARE 2020 Challenge Mask Task.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Towards Silent Paralinguistics: Deriving Speaking Mode and Speaker ID from Electromyographic Signals.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Toward Silent Paralinguistics: Speech-to-EMG - Retrieving Articulatory Muscle Activity from Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
An Evaluation of the Effect of Anxiety on Speech - Computational Prediction of Anxiety from Sustained Vowels.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
An Early Study on Intelligent Analysis of Speech Under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Hierarchical Component-attention Based Speaker Turn Embedding for Emotion Recognition.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
X-AWARE: ConteXt-AWARE Human-Environment Attention Fusion for Driver Gaze Prediction in the Wild.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Stargan for Emotional Speech Conversion: Validated by Data Augmentation of End-To-End Emotion Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Generating and Protecting Against Adversarial Attacks for Deep Speech-Based Emotion Recognition Models.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Interaction with the Soundscape: Exploring Emotional Audio Generation for Improved Individual Wellbeing.
Proceedings of the Artificial Intelligence in HCI, 2020
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020
A Curriculum Learning Approach for Pain Intensity Recognition from Facial Expressions.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020
Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
2019
Proceedings of the Innovations in Big Data Mining and Embedded Knowledge, 2019
IEEE Trans. Multim., 2019
Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition.
IEEE Trans. Multim., 2019
IEEE Trans. Games, 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
IEEE Trans. Affect. Comput., 2019
Deep Affect Prediction in-the-Wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond.
Int. J. Comput. Vis., 2019
Large-scale Data Collection and Analysis via a Gamified Intelligent Crowdsourcing Platform.
Int. J. Autom. Comput., 2019
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.
Comput. Speech Lang., 2019
CoRR, 2019
Poisson CNN: Convolutional Neural Networks for the Solution of the Poisson Equation with Varying Meshes and Dirichlet Boundary Conditions.
CoRR, 2019
CoRR, 2019
Presenting the Acoustic Sounds for Wellbeing Dataset and Baseline Classification Results.
CoRR, 2019
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech.
CoRR, 2019
Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability.
CoRR, 2019
On Many-to-Many Mapping Between Concordance Correlation Coefficient and Mean Square Error.
CoRR, 2019
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild.
CoRR, 2019
Microexpressions: A Chance for Computers to Beat Humans at Detecting Hidden Emotions?
Computer, 2019
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives [Review Article].
IEEE Comput. Intell. Mag., 2019
Exploring Deep Spectrum Representations via Attention-Based Recurrent and Convolutional Neural Networks for Speech Emotion Recognition.
IEEE Access, 2019
From Speech to Facial Activity: Towards Cross-modal Sequence-to-Sequence Attention Networks.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019
Can Deep Generative Audio be Emotional? Towards an Approach for Personalised Emotional Audio Generation.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019
Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and Results.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition.
Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Emotion and Themes Recognition in Music Utilising Convolutional and Recurrent Neural Networks.
Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019
A Comparison of AI-Based Throughput Prediction for Cellular Vehicle-To-Server Communication.
Proceedings of the 15th International Wireless Communications & Mobile Computing Conference, 2019
Proceedings of the 2019 International Symposium on Intelligent Signal Processing and Communication Systems, 2019
A Diplomatic Edition of Il Lauro Secco: Ground Truth for OMR of White Mensural Notation.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Automatic Detection of Major Depressive Disorder via a Bag-of-Behaviour-Words Approach.
Proceedings of the Third International Symposium on Image Computing and Digital Medicine, 2019
Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Towards Robust Speech Emotion Recognition Using Deep Residual Networks for Speech Enhancement.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
The INTERSPEECH 2019 Computational Paralinguistics Challenge: Styrian Dialects, Continuous Sleepiness, Baby Sounds & Orca Activity.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
A Hierarchical Attention Network-Based Approach for Depression Detection from Transcribed Clinical Interviews.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Using Speech to Predict Sequentially Measured Cortisol Levels During a Trier Social Stress Test.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Analysing and Inferring of Intimacy Based on fNIRS Signals and Peripheral Physiological Signals.
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the International Conference on Multimodal Interaction, 2019
VCMNet: Weakly Supervised Learning for Automatic Infant Vocalisation Maturity Analysis.
Proceedings of the International Conference on Multimodal Interaction, 2019
Proceedings of the 2019 IEEE International Conference on Connected Vehicles and Expo, 2019
Context Modelling Using Hierarchical Attention Networks for Sentiment and Self-assessed Emotion Detection in Spoken Narratives.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes.
Proceedings of the IEEE International Conference on Acoustics, 2019
Implicit Fusion by Joint Audiovisual Training for Emotion Recognition in Mono Modality.
Proceedings of the IEEE International Conference on Acoustics, 2019
Attention-augmented End-to-end Multi-task Learning for Emotion Prediction from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019
Time-series Clustering with Jointly Learning Deep Representations, Clusters and Temporal Boundaries.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019
Performance Analysis of Unimodal and Multimodal Models in Valence-Based Empathy Recognition.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019
Proceedings of the 27th European Signal Processing Conference, 2019
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
Teaching Machines to Know Your Depressive State: On Physical Activity in Health and Major Depressive Disorder.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019
Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data.
Proceedings of the 9th International Conference on Digital Public Health, 2019
Personalized Estimation of Engagement From Videos Using Active Learning With Deep Reinforcement Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Augment to Prevent: Short-Text Data Augmentation in Deep Learning for Hate-Speech Classification.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019
I Know How you Feel Now, and Here's why!: Demystifying Time-Continuous High Resolution Text-Based Affect Predictions in the Wild.
Proceedings of the 32nd IEEE International Symposium on Computer-Based Medical Systems, 2019
Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019
2018
Introduction to the Special Section on Multimedia Computing and Applications of Socio-Affective Behaviors in the Wild.
ACM Trans. Multim. Comput. Commun. Appl., 2018
IEEE Trans. Multim., 2018
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
ACM Trans. Intell. Syst. Technol., 2018
Guest Editorial Special Issue on Computational Intelligence for End-to-End Audio Processing.
IEEE Trans. Emerg. Top. Comput. Intell., 2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
IEEE Trans. Affect. Comput., 2018
Asynchronous and Event-Based Fusion Systems for Affect Recognition on Naturalistic Data in Comparison to Conventional Approaches.
IEEE Trans. Affect. Comput., 2018
A closed-form solution to the graph total variation problem for continuous emotion profiling in noisy environment.
Speech Commun., 2018
Personalized machine learning for robot perception of affect and engagement in autism therapy.
Sci. Robotics, 2018
Deep Canonical Time Warping for Simultaneous Alignment and Representation Learning of Sequences.
IEEE Trans. Pattern Anal. Mach. Intell., 2018
Three recent trends in Paralinguistics on the way to omniscient machine intelligence.
J. Multimodal User Interfaces, 2018
IEEE CAA J. Autom. Sinica, 2018
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives.
CoRR, 2018
Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora.
CoRR, 2018
CoRR, 2018
What Affective Computing Reveals about Autistic Children's Facial Expressions of Joy or Fear.
Computer, 2018
Comput. Biol. Medicine, 2018
Speech emotion recognition: two decades in a nutshell, benchmarks, and ongoing trends.
Commun. ACM, 2018
Leveraging Unlabeled Data for Emotion Recognition With Enhanced Collaborative Semi-Supervised Learning.
IEEE Access, 2018
Trustability-Based Dynamic Active Learning for Crowdsourced Labelling of Emotional Audio Data.
IEEE Access, 2018
Analysing communication requirements for crowd sourced backend generation of HD Maps used in automated driving.
Proceedings of the 2018 IEEE Vehicular Networking Conference, 2018
How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers.
Proceedings of the Speech and Computer - 20th International Conference, 2018
Proceedings of the Speech and Computer - 20th International Conference, 2018
Proceedings of the 2018 Workshop on Speech, Music and Mind, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
AVEC 2018 Workshop and Challenge: Bipolar Disorder and Cross-Cultural Affect Recognition.
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Passive monitoring and geo-based prediction of mobile network vehicle-to-server communication.
Proceedings of the 14th International Wireless Communications & Mobile Computing Conference, 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
CultureNet: A Deep Learning Approach for Engagement Intensity Estimation from Face Images of Children with Autism.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018
Automated Classification of Children's Linguistic versus Non-Linguistic Vocalisations.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
The INTERSPEECH 2018 Computational Paralinguistics Challenge: Atypical & Self-Assessed Affect, Crying & Heart Beats.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
How Did You like 2017? Detection of Language Markers of Depression and Narcissism in Personal Narratives.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Annotator Trustability-based Cooperative Learning Solutions for Intelligent Audio Analysis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
The Perception and Analysis of the Likeability and Human Likeness of Synthesized Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the 1st IEEE/ACM International Workshop on Software Engineering for AI in Autonomous Systems, 2018
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Proceedings of the IEEE International Conference on Data Mining, 2018
Introducing an Emotion-Driven Assistance System for Cognitively Impaired Individuals.
Proceedings of the Computers Helping People with Special Needs, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
What is my Dog Trying to Tell Me? the Automatic Recognition of the Context and Perceived Emotion of Dog Barks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
A Cnn-Gru Approach to Capture Time-Frequency Pattern Interdependence for Snore Sound Classification.
Proceedings of the 26th European Signal Processing Conference, 2018
A Fusion of Deep Convolutional Generative Adversarial Networks and Sequence to Sequence Autoencoders for Acoustic Scene Classification.
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018
Proceedings of the 2018 International Conference on Digital Health, 2018
Proceedings of the 2018 International Conference on Digital Health, 2018
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
Proceedings of the Audio Mostly 2018 on Sound in Immersion and Emotion, 2018
Evaluation of the Pain Level from Speech: Introducing a Novel Pain Database and Benchmarks.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018
Perspectives on predictive power of multimodal deep learning: surprises and future directions.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018
2017
Proceedings of the Emotions and Personality in Personalized Services, 2017
WIREs Data Mining Knowl. Discov., 2017
Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multifeature Analysis.
IEEE Trans. Biomed. Eng., 2017
A Two-Dimensional Framework of Multiple Kernel Subspace Learning for Recognizing Emotion in Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE Trans. Affect. Comput., 2017
IEEE Trans. Affect. Comput., 2017
IEEE Signal Process. Mag., 2017
IEEE Signal Process. Lett., 2017
IEEE Trans. Pattern Anal. Mach. Intell., 2017
IEEE J. Sel. Top. Signal Process., 2017
J. Mach. Learn. Res., 2017
auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks.
J. Mach. Learn. Res., 2017
Image Vis. Comput., 2017
Strength modelling for real-worldautomatic continuous affect recognition from audiovisual signals.
Image Vis. Comput., 2017
Frontiers Robotics AI, 2017
CoRR, 2017
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
CoRR, 2017
DeepCoder: Semi-parametric Variational Autoencoders for Facial Action Unit Intensity Estimation.
CoRR, 2017
Complex., 2017
Comput. Intell. Neurosci., 2017
Recognizing Emotions From Whispered Speech Based on Acoustic Feature Transfer Learning.
IEEE Access, 2017
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017
Proceedings of the Speech and Computer - 19th International Conference, 2017
Proceedings of the AES International Conference Semantic Audio 2017, 2017
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
An Image-based Deep Spectrum Feature Representation for the Recognition of Emotional Speech.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
The Perception of Emotion in the Singing Voice: The Understanding of Music Mood for Music Organisation.
Proceedings of the 4th International Workshop on Digital Libraries for Musicology, 2017
The SEILS Dataset: Symbolically Encoded Scores in Modern-Early Notation for Computational Musicology.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Implementing Gender-Dependent Vowel-Level Analysis for Boosting Speech-Based Depression Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
The INTERSPEECH 2017 Computational Paralinguistics Challenge: Addressee, Cold & Snoring.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Automatic Classification of Autistic Child Vocalisations: A Novel Database and Results.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Deep recurrent music writer: Memory-enhanced variational autoencoder-based musical score composition and an objective measure.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the 14th International Conference on Natural Language Processing, 2017
Stimulation of psychological listener experiences by semi-automatically composed electroacoustic environments.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding.
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Automatic multi-lingual arousal detection from voice applied to real product testing applications.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 19th IEEE International Conference on e-Health Networking, 2017
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017
Proceedings of the 47. Jahrestagung der Gesellschaft für Informatik, 2017
Proceedings of the 47. Jahrestagung der Gesellschaft für Informatik, 2017
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017
"You sound ill, take the day off": Automatic recognition of speech affected by upper respiratory tract infection.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017
Speech-based Diagnosis of Autism Spectrum Condition by Generative Adversarial Network Representations.
Proceedings of the 2017 International Conference on Digital Health, 2017
Contextual Bidirectional Long Short-Term Memory Recurrent Neural Network Language Models: A Generative Approach to Sentiment Analysis.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Sequence to Sequence Autoencoders for Unsupervised Representation Learning from Audio.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Reading the Author and Speaker: Towards a Holistic and Deep Approach on Automatic Assessment of What is in One's Words.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017
Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, 2017
Enhancing Speech-Based Depression Detection Through Gender Dependent Vowel-Level Formant Features.
Proceedings of the Artificial Intelligence in Medicine, 2017
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017
The effect of personality trait, age, and gender on the performance of automatic speech valence recognition.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
VoicePlay - An affective sports game operated by speech emotion recognition based on the component process model.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017
CAST a database: Rapid targeted large-scale big data acquisition via small-world modelling of social media platforms.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
Proceedings of the Social Signal Processing, 2017
Proceedings of the Social Signal Processing, 2017
Proceedings of the Social Signal Processing, 2017
2016
Proceedings of the Recent Advances in Nonlinear Speech Processing, 2016
IEEE Trans. Intell. Veh., 2016
IEEE Trans. Affect. Comput., 2016
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.
IEEE Trans. Affect. Comput., 2016
Knowl. Based Syst., 2016
Int. J. Speech Technol., 2016
Computer, 2016
The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations.
IEEE Access, 2016
IEEE Access, 2016
Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, 2016
Summary for AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Tendencies regarding the effect of emotional intensity in inter corpus phoneme-level speech emotion modelling.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016
Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the Dialogues with Social Robots, 2016
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Manual versus Automated: The Challenging Routine of Infant Vocalisation Segmentation in Home Videos to Study Neuro(mal)development.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Does She Speak RTT? Towards an Earlier Identification of Rett Syndrome Through Intelligent Pre-Linguistic Vocalisation Analysis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion Utilizing Complex Many-to-Many Alignments.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Convolutional Neural Networks with Data Augmentation for Classifying Speakers' Native Language.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Multiscale kernel locally penalised discriminant analysis exemplified by emotion recognition in speech.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Language proficiency assessment of English L2 speakers based on joint analysis of prosody and native language.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Semi-autonomous data enrichment based on cross-task labelling of missing targets for holistic speech analysis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Cross lingual speech emotion recognition using canonical correlation analysis on principal component subspace.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the IEEE 5th Global Conference on Consumer Electronics, 2016
Pairwise Decomposition with Deep Neural Networks and Multiscale Kernel Subspace Learning for Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
SenticNet 4: A Semantic Resource for Sentiment Analysis Based on Conceptual Primitives.
Proceedings of the COLING 2016, 2016
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
The University of Passau Open Emotion Recognition System for the Multimodal Emotion Challenge.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016
Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016
Proceedings of the 12th ITG Symposium on Speech Communication, 2016
2015
Proceedings of the Advances in Neural Networks: Computational and Theoretical Issues, 2015
WIREs Data Mining Knowl. Discov., 2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data.
Pattern Recognit. Lett., 2015
J. Mach. Learn. Res., 2015
Emotion in the singing voice - a deeperlook at acoustic features in the light ofautomatic classification.
EURASIP J. Audio Speech Music. Process., 2015
A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge.
Comput. Speech Lang., 2015
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models.
CoRR, 2015
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015
Exploring the Importance of Individual Differences to the Automatic Estimation of Emotions Induced by Music.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015
AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
The ICL-TUM-PASSAU Approach for the MediaEval 2015 "Affective Impact of Movies" Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Automatically Estimating Emotion in Music with Deep Long-Short Term Memory Recurrent Neural Networks.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Modelling User Affect and Sentiment in Intelligent User Interfaces: A Tutorial Overview.
Proceedings of the 20th International Conference on Intelligent User Interfaces, 2015
IDGEI 2015: 3rd International Workshop on Intelligent Digital Games for Empowerment and Inclusion.
Proceedings of the 20th International Conference on Intelligent User Interfaces, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, parkinson's & eating condition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015
Dynamic Active Learning Based on Agreement and Applied to Emotion Recognition in Spoken Interactions.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015
ERM4CT 2015: Workshop on Emotion Representations and Modelling for Companion Systems.
Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies, 2015
A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
Bird sounds classification by large scale acoustic features and extreme learning machine.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
iHEARu-PLAY: Introducing a game for crowdsourced data collection for affective computing.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Real-time robust recognition of speakers' emotions and characteristics on mobile platforms.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Proceedings of the 12th International Conference on Advances in Computer Entertainment Technology, 2015
2014
Channel mapping using bidirectional long short-term memory for dereverberation in hands-free voice controlled devices.
IEEE Trans. Consumer Electron., 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE Trans. Affect. Comput., 2014
IEEE Signal Process. Lett., 2014
Neural Networks, 2014
The TUM Gait from Audio, Image and Depth (GAID) database: Multimodal recognition of subjects and traits.
J. Vis. Commun. Image Represent., 2014
Probabilistic speech feature extraction with context-sensitive Bottleneck neural networks.
Neurocomputing, 2014
Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments.
Comput. Speech Lang., 2014
Medium-term speaker states - A review on intoxication, sleepiness and the first challenge.
Comput. Speech Lang., 2014
Comput. Speech Lang., 2014
CoRR, 2014
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions.
CoRR, 2014
On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014
Proceedings of the AES International Conference on Semantic Audio 2014, 2014
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
IDGEI 2014: 2nd international workshop on intelligent digital games for empowerment and inclusion.
Proceedings of the 19th International Conference on Intelligent User Interfaces, 2014
The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Audio onset detection: A wavelet packet based approach with recurrent neural networks.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Linked Source and Target Domain Subspace Feature Transfer Learning - Exemplified by Speech Emotion Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Emotion Recognition in the Wild: Incorporating Voice and Lip Activity in Multimodal Decision-Level Fusion.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014
ERM4HCI 2014: The 2nd Workshop on Emotion Representation and Modelling in Human-Computer-Interaction-Systems.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, 2014
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, 2014
MAPTRAITS 2014 - The First Audio/Visual Mapping Personality Traits Challenge - An Introduction: Perceived Personality and Social Dimensions.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014
CCA based feature selection with application to continuous depression recognition from acoustic speech features.
Proceedings of the IEEE International Conference on Acoustics, 2014
Introducing shared-hidden-layer autoencoders for transfer learning and their application in acoustic emotion recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Discriminatively trained recurrent neural networks for single-channel speech separation.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
2013
Image Vis. Comput., 2013
Categorical and dimensional affect analysis in continuous input: Current trends and future directions.
Image Vis. Comput., 2013
Image Vis. Comput., 2013
Int. J. Distance Educ. Technol., 2013
IEEE Intell. Syst., 2013
IEEE Intell. Syst., 2013
Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory.
Comput. Speech Lang., 2013
Comput. Speech Lang., 2013
Introduction to the special issue on Paralinguistics in Naturalistic Speech and Language.
Comput. Speech Lang., 2013
A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios.
Cogn. Comput., 2013
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge, 2013
Workshop summary for the 3rd international audio/visual emotion challenge and workshop (AVEC'13).
Proceedings of the ACM Multimedia Conference, 2013
Recent developments in openSMILE, the munich open-source multimedia feature extractor.
Proceedings of the ACM Multimedia Conference, 2013
The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Affect recognition in real-life acoustic conditions - a new perspective on feature selection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification.
Proceedings of the Man-Machine Interactions 3, 2013
ERM4HCI 2013: the 1st workshop on emotion representation and modelling in human-computer-interaction-systems.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013
The acoustics of eye contact: detecting visual attention from conversational audio cues.
Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2013
Probabilistic asr feature extraction applying context-sensitive connectionist temporal classification networks.
Proceedings of the IEEE International Conference on Acoustics, 2013
Speaker trait characterization in web videos: Uniting speech, language, and facial features.
Proceedings of the IEEE International Conference on Acoustics, 2013
A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2013
Acoustic Geo-Sensing: Recognising cyclists' route, route direction, and route progress from cell-phone audio.
Proceedings of the IEEE International Conference on Acoustics, 2013
Automatic recognition of physiological parameters in the human voice: Heart rate and skin conductance.
Proceedings of the IEEE International Conference on Acoustics, 2013
A comparative study on sparsity penalties for NMF-based speech separation: Beyond LP-norms.
Proceedings of the IEEE International Conference on Acoustics, 2013
Integrating noise estimation and factorization-based speech separation: A novel hybrid approach.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Gait-based person identification by spectral, cepstral and energy-related audio features.
Proceedings of the IEEE International Conference on Acoustics, 2013
Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies.
Proceedings of the IEEE International Conference on Acoustics, 2013
Hierarchical neural networks and enhanced class posteriors for social signal classification.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013
Signals and communication technology, Springer, ISBN: 978-3-642-36805-9, 2013
2012
Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit.
J. Signal Process. Syst., 2012
IEEE Trans. Neural Networks Learn. Syst., 2012
A multitask approach to continuous five-dimensional affect sensing in natural speech.
ACM Trans. Interact. Intell. Syst., 2012
The Voice of Leadership: Models and Performances of Automatic Analysis in Online Speeches.
IEEE Trans. Affect. Comput., 2012
Guest Editorial: Special Section on Naturalistic Affect Resources for System Building and Evaluation.
IEEE Trans. Affect. Comput., 2012
IEEE Trans. Affect. Comput., 2012
IEEE Trans. Affect. Comput., 2012
IEEE Signal Process. Mag., 2012
Int. J. Speech Technol., 2012
Applying multiple classifiers and non-linear dynamics features for detecting sleepiness from speech.
Neurocomputing, 2012
EURASIP J. Adv. Signal Process., 2012
Cogn. Comput., 2012
Cogn. Comput., 2012
Emotion in the speech of children with autism spectrum conditions: prosody and everything else.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012
Speech, Emotion, Age, Language, Task, and Typicality: Trying to Disentangle Performance and Feature Relevance.
Proceedings of the 2012 International Conference on Privacy, 2012
Dimensional and continuous analysis of emotions for multimedia applications: a tutorial overview.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
Proceedings of the Advances in Neural Networks - ISNN 2012, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the 5th International Symposium on Communications, 2012
Active Learning by Sparse Instance Tracking and Classifier Confidence in Acoustic Emotion Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Convolutive Non-Negative Sparse Coding and New Features for Speech Overlap Handling in Speaker Diarization.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the International Conference on Multimodal Interaction, 2012
Proceedings of the International Conference on Multimodal Interaction, 2012
Preserving actual dynamic trend of emotion in dimensional speech emotion recognition.
Proceedings of the International Conference on Multimodal Interaction, 2012
Proceedings of the International Conference on Multimodal Interaction, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Supervised and semi-supervised suppression of background music in monaural speech recordings.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Speech overlap detection and attribution using convolutive non-negative sparse coding.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Fine-tuning HMMS for nonverbal vocalizations in spontaneous speech: A multicorpus perspective.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Latent Variable Analysis and Signal Separation, 2012
Speech overlap detection using convolutive non-negative sparse coding: New improvements and insights.
Proceedings of the 20th European Signal Processing Conference, 2012
Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines.
Proceedings of the Multimodal Music Processing, 2012
Towards Automatic Intoxication Detection from Speech in Real-Life Acoustic Environments.
Proceedings of the 10th ITG Conference on Speech Communication, 2012
Proceedings of the 10th ITG Conference on Speech Communication, 2012
Sparse, Hierarchical and Semi-Supervised Base Learning for Monaural Enhancement of Conversational Speech.
Proceedings of the 10th ITG Conference on Speech Communication, 2012
Exploring Nonnegative Matrix Factorization for Audio Classification: Application to Speaker Recognition.
Proceedings of the 10th ITG Conference on Speech Communication, 2012
Proceedings of the 10th ITG Conference on Speech Communication, 2012
2011
Tandem decoding of children's speech for keyword detection in a child-robot interaction scenario.
ACM Trans. Speech Lang. Process., 2011
IEEE Trans. Intell. Transp. Syst., 2011
IEEE Trans. Affect. Comput., 2011
Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge.
Speech Commun., 2011
Introduction to the special issue on sensing emotion and affect - Facing realism in speech processing.
Speech Commun., 2011
Künstliche Intell., 2011
Int. J. Speech Technol., 2011
Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization.
EURASIP J. Adv. Signal Process., 2011
Whodunnit - Searching for the most important feature types signalling emotion-related user states in speech.
Comput. Speech Lang., 2011
Proceedings of the AES International Conference Semantic Audio 2011, 2011
Proceedings of the Advances in Nonlinear Speech Processing, 2011
Proceedings of the Advances in Nonlinear Speech Processing, 2011
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents.
Proceedings of the Advances in Neural Networks - ISNN 2011, 2011
Automatic Assessment of Singer Traits in Popular Music: Gender, Age, Height and Race.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Multi-Modal Non-Prototypical Music Mood Analysis in Continuous Space: Reliability and Performances.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2011
Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Feature Frame Stacking in RNN-Based Tandem ASR Systems - Learned vs. Predefined Context.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory.
Proceedings of the IEEE International Conference on Acoustics, 2011
Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations.
Proceedings of the IEEE International Conference on Acoustics, 2011
OpenBliSSART: Design and evaluation of a research toolkit for Blind Source Separation in Audio Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2011
Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2011
Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011
String-based audiovisual fusion of behavioural events for the assessment of dimensional affect.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011
Proceedings of the Cognitive Behavioural Systems, 2011
Proceedings of the Cognitive Behavioural Systems, 2011
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
Proceedings of the Affective Computing and Intelligent Interaction, 2011
Proceedings of the Affective Computing and Intelligent Interaction, 2011
Proceedings of the Computer Analysis of Human Behavior., 2011
2010
IEEE Trans. Affect. Comput., 2010
Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening.
IEEE J. Sel. Top. Signal Process., 2010
On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues.
J. Multimodal User Interfaces, 2010
EURASIP J. Audio Speech Music. Process., 2010
Determination of Nonprototypical Valence and Arousal in Popular Music: Features and Performances.
EURASIP J. Audio Speech Music. Process., 2010
Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework.
Cogn. Comput., 2010
Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car.
Adv. Hum. Comput. Interact., 2010
Segmenting into Adequate Units for Automatic Recognition of Emotion-Related Episodes: A Speech-Based Approach.
Adv. Hum. Comput. Interact., 2010
Proceedings of the 18th International Conference on Multimedia 2010, 2010
3d gesture recognition applying long short-term memory and contextual knowledge in a CAVE.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010
CINEMO - A French Spoken Language Resource for Complex Emotions: Facts and Baselines.
Proceedings of the International Conference on Language Resources and Evaluation, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Recognition of spontaneous conversational speech using long short-term memory phoneme predictions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Incremental acoustic valence recognition: an inter-corpus perspective on features, matching, and performance in a gating paradigm.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder.
Proceedings of the IEEE International Conference on Acoustics, 2010
Non-negative matrix factorization as noise-robust feature extractor for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010
Discrimination of speech and non-linguistic vocalizations by Non-Negative Matrix Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2010
Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces. Theoretical and Practical Issues, 2010
Real Time Person Tracking and Behavior Interpretation in Multi Camera Scenarios Applying Homography and Coupled HMMs.
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010
Switching Linear Dynamic Models for Recognition of Emotionally Colored and Noisy Speech.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010
2009
Being bored? Recognising natural interest by extensive audiovisual integration for real-life application.
Image Vis. Comput., 2009
A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams.
Neurocomputing, 2009
Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement.
EURASIP J. Audio Speech Music. Process., 2009
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2009), 2009
Proceedings of the Advances in Nonlinear Speech Processing, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Recognising interest in conversational speech - comparing bag of frames and supra-segmental features.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Proceedings of the 16th International Conference on Digital Signal Processing, 2009
Resolving partial occlusions in crowded environments utilizing range data and video cameras.
Proceedings of the 16th International Conference on Digital Signal Processing, 2009
Proceedings of the 16th International Conference on Digital Signal Processing, 2009
"The Godfather" vs. "Chaos": Comparing Linguistic Analysis Based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009
GMs in On-Line Handwritten Whiteboard Note Recognition: The Influence of Implementation and Modeling.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009
Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
From speech to letters - using a novel neural network architecture for grapheme based ASR.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the Affective Computing and Intelligent Interaction, 2009
Proceedings of the Affective Computing and Intelligent Interaction, 2009
Proceedings of the Affective Computing and Intelligent Interaction, 2009
2008
EURASIP J. Audio Speech Music. Process., 2008
Proceedings of the First Workshop on Child, Computer and Interaction, 2008
Proceedings of the First Workshop on Child, Computer and Interaction, 2008
Low-Level Fusion of Audio, Video Feature for Multi-Modal Emotion Recognition.
Proceedings of the VISAPP 2008: Proceedings of the Third International Conference on Computer Vision Theory and Applications, Funchal, Madeira, Portugal, January 22-25, 2008, 2008
Emotion sensitive speech control for human-robot interaction in minimal invasive surgery.
Proceedings of the 17th IEEE International Symposium on Robot and Human Interactive Communication, 2008
Proceedings of the Perception in Multimodal Dialogue Systems, 2008
Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008
Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Balancing spoken content adaptation and unit length in the recognition of emotion and interest.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Proceedings of the 2008 Second ACM/IEEE International Conference on Distributed Smart Cameras, 2008
Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space?
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the Pattern Recognition, 2008
Proceedings of the Adaptive Multimedia Retrieval. Identifying, 2008
2007
VDM, ISBN: 978-3-8364-1522-4, 2007
Combining frame and turn-level information for robust recognition of emotions within speech.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Wearable Assistance for the Ballroom-Dance Hobbyist - Holistic Rhythm Analysis and Dance-Style Classification.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Suspicious Behavior Detection in Public Transport by Fusion of Low-Level Video Descriptors.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Fast and Robust Meter and Tempo Recognition for the Automatic Discrimination of Ballroom Dance Styles.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Comparing one and two-stage acoustic modeling in the recognition of emotion in speech.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Frame vs. Turn-Level: Emotion Recognition from Speech Considering Static and Dynamic Processing.
Proceedings of the Affective Computing and Intelligent Interaction, 2007
Proceedings of the Affective Computing and Intelligent Interaction, 2007
On the Necessity and Feasibility of Detecting a Driver's Emotional State While Driving.
Proceedings of the Affective Computing and Intelligent Interaction, 2007
2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Efficient Recognition of Authentic Dynamic Facial Expressions on the Feedtum Database.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Segmentation and Recognition of Meeting Events using a Two-Layered HMM and a Combined MLP-HMM Approach.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the International Conference on Image Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
PhD thesis, 2005
Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Feature Selection and Stacking for Robust Discrimination of Speech, Monophonic Singing, and Polyphonic Music.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Discrimination of speech and monophonic singing in continuous audio streams applying multi-layer support vector machines.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Applying Bayesian belief networks in approximate string matching for robust keyword-based retrieval.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
HMM-based music retrieval using stereophonic feature information and framelength adaptation.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Aspekte effizienten Usability Engineerings (Aspects of Efficient Usability Engineering).
Informationstechnik Tech. Inform., 2002
Proceedings of the IEEE International Conference on Systems, Man and Cybernetics: Bridging the Digital Divide, Yasmine Hammamet, Tunisia, October 6-9, 2002, 2002
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
A new technique for adjusting distraction moments in multitasking non-field usability tests.
Proceedings of the Extended abstracts of the 2002 Conference on Human Factors in Computing Systems, 2002
Experimental evaluation of user errors at the skill-based level in an automative environment.
Proceedings of the Extended abstracts of the 2002 Conference on Human Factors in Computing Systems, 2002
2001
Proceedings of the 2001 workshop on Perceptive user interfaces, 2001