Florian Eyben
According to our database, Florian Eyben authored at least 134 papers between 2007 and 2024.
Bibliography
2024
VocDoc, what happened to my voice? Towards automatically capturing vocal fatigue in the wild.
Biomed. Signal Process. Control., February, 2024
Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition.
CoRR, 2024
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition.
CoRR, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023
Multistage linguistic conditioning of convolutional layers for speech emotion recognition.
Frontiers Comput. Sci., 2023
Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions.
CoRR, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Nkululeko: Machine Learning Experiments on Speaker Characteristics Without Programming.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Multimodal Recognition of Valence, Arousal and Dominance via Late-Fusion of Text, Audio and Facial Expressions.
Proceedings of the 31st European Symposium on Artificial Neural Networks, 2023
2022
Voice Analysis for Neurological Disorder Recognition - A Systematic Review and Perspective on Emerging Trends.
Frontiers Digit. Health, 2022
A Comparative Cross Language View On Acted Databases Portraying Basic Emotions Utilising Machine Learning.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Quantifying Cognitive Load from Voice using Transformer-Based Models and a Cross-Dataset Evaluation.
Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022
2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
Neural Comput. Appl., 2020
2019
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.
Comput. Speech Lang., 2019
CoRR, 2019
2018
Proceedings of the 1st IEEE/ACM International Workshop on Software Engineering for AI in Autonomous Systems, 2018
Proceedings of the 2018 International Conference on Digital Health, 2018
2017
Proceedings of the AES International Conference Semantic Audio 2017, 2017
A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Automatic multi-lingual arousal detection from voice applied to real product testing applications.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017
VoicePlay - An affective sports game operated by speech emotion recognition based on the component process model.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017
2016
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.
IEEE Trans. Affect. Comput., 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data.
Pattern Recognit. Lett., 2015
Emotion in the singing voice - a deeper look at acoustic features in the light of automatic classification.
EURASIP J. Audio Speech Music. Process., 2015
A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge.
Comput. Speech Lang., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015
A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
iHEARu-PLAY: Introducing a game for crowdsourced data collection for affective computing.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Real-time robust recognition of speakers' emotions and characteristics on mobile platforms.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
IEEE Signal Process. Lett., 2014
Medium-term speaker states - A review on intoxication, sleepiness and the first challenge.
Comput. Speech Lang., 2014
CoRR, 2014
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Audio onset detection: A wavelet packet based approach with recurrent neural networks.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Emotion Recognition in the Wild: Incorporating Voice and Lip Activity in Multimodal Decision-Level Fusion.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, 2014
MAPTRAITS 2014 - The First Audio/Visual Mapping Personality Traits Challenge - An Introduction: Perceived Personality and Social Dimensions.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014
CCA based feature selection with application to continuous depression recognition from acoustic speech features.
Proceedings of the IEEE International Conference on Acoustics, 2014
A frequency-weighted post-filtering transform for compensation of the over-smoothing effect in HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Image Vis. Comput., 2013
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013
Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge, 2013
Recent developments in openSMILE, the Munich open-source multimedia feature extractor.
Proceedings of the ACM Multimedia Conference, 2013
The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Affect recognition in real-life acoustic conditions - a new perspective on feature selection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
The acoustics of eye contact: detecting visual attention from conversational audio cues.
Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction, 2013
Automatic recognition of physiological parameters in the human voice: Heart rate and skin conductance.
Proceedings of the IEEE International Conference on Acoustics, 2013
Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
A multitask approach to continuous five-dimensional affect sensing in natural speech.
ACM Trans. Interact. Intell. Syst., 2012
IEEE Trans. Affect. Comput., 2012
IEEE Trans. Affect. Comput., 2012
Cogn. Comput., 2012
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the International Conference on Multimodal Interaction, 2012
Preserving actual dynamic trend of emotion in dimensional speech emotion recognition.
Proceedings of the International Conference on Multimodal Interaction, 2012
Proceedings of the International Conference on Multimodal Interaction, 2012
Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Latent Variable Analysis and Signal Separation, 2012
Proceedings of the 10th ITG Conference on Speech Communication, 2012
2011
Künstliche Intell., 2011
Proceedings of the AES International Conference Semantic Audio 2011, 2011
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2011
Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011
String-based audiovisual fusion of behavioural events for the assessment of dimensional affect.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011
Proceedings of the Affective Computing and Intelligent Interaction, 2011
2010
IEEE Trans. Affect. Comput., 2010
Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening.
IEEE J. Sel. Top. Signal Process., 2010
On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues.
J. Multimodal User Interfaces, 2010
Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework.
Cogn. Comput., 2010
Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car.
Adv. Hum. Comput. Interact., 2010
Proceedings of the 18th International Conference on Multimedia 2010, 2010
3D gesture recognition applying long short-term memory and contextual knowledge in a CAVE.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Recognition of spontaneous conversational speech using long short-term memory phoneme predictions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder.
Proceedings of the IEEE International Conference on Acoustics, 2010
Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Being bored? Recognising natural interest by extensive audiovisual integration for real-life application.
Image Vis. Comput., 2009
A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams.
Neurocomputing, 2009
Proceedings of the Advances in Nonlinear Speech Processing, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
From speech to letters - using a novel neural network architecture for grapheme based ASR.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the Affective Computing and Intelligent Interaction, 2009
Proceedings of the Affective Computing and Intelligent Interaction, 2009
2008
EURASIP J. Audio Speech Music. Process., 2008
Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008
Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the Adaptive Multimedia Retrieval. Identifying, 2008
2007
Wearable Assistance for the Ballroom-Dance Hobbyist - Holistic Rhythm Analysis and Dance-Style Classification.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Fast and Robust Meter and Tempo Recognition for the Automatic Discrimination of Ballroom Dance Styles.
Proceedings of the IEEE International Conference on Acoustics, 2007