Florian Eyben

According to our database, Florian Eyben authored at least 134 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
VocDoc, what happened to my voice? Towards automatically capturing vocal fatigue in the wild.
Biomed. Signal Process. Control., February, 2024

Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition.
CoRR, 2024

Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition.
CoRR, 2024

2023
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Multistage linguistic conditioning of convolutional layers for speech emotion recognition.
Frontiers Comput. Sci., 2023

Testing Speech Emotion Recognition Machine Learning Models.
CoRR, 2023

Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions.
CoRR, 2023

Speech-based Age and Gender Prediction with Transformers.
CoRR, 2023

audb - Sharing and Versioning of Audio and Annotation Data in Python.
CoRR, 2023

Towards Supporting an Early Diagnosis of Multiple Sclerosis using Vocal Features.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Nkululeko: Machine Learning Experiments on Speaker Characteristics Without Programming.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Masking Speech Contents by Random Splicing: is Emotional Expression Preserved?
Proceedings of the IEEE International Conference on Acoustics, 2023

Multimodal Recognition of Valence, Arousal and Dominance via Late-Fusion of Text, Audio and Facial Expressions.
Proceedings of the 31st European Symposium on Artificial Neural Networks, 2023

2022
Voice Analysis for Neurological Disorder Recognition - A Systematic Review and Perspective on Emerging Trends.
Frontiers Digit. Health, 2022

A Comparative Cross Language View On Acted Databases Portraying Basic Emotions Utilising Machine Learning.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Nkululeko: A Tool For Rapid Speaker Characteristics Detection.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Probing speech emotion recognition transformers for linguistic knowledge.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Quantifying Cognitive Load from Voice using Transformer-Based Models and a Cross-Dataset Evaluation.
Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022

2021
Speaking Corona? Human and Machine Recognition of COVID-19 from Voice.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Exploiting time-frequency patterns with LSTM-RNNs for low-bitrate audio restoration.
Neural Comput. Appl., 2020

The voice of COVID-19: Acoustic correlates of infection.
CoRR, 2020

2019
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.
Comput. Speech Lang., 2019

On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction.
CoRR, 2019

2018
audEERING's approach to the One-Minute-Gradual Emotion Challenge.
CoRR, 2018

Emotion-Awareness for Intelligent Vehicle Assistants: A Research Agenda.
Proceedings of the 1st IEEE/ACM International Workshop on Software Engineering for AI in Autonomous Systems, 2018

Robust Laughter Detection for Wearable Wellbeing Sensing.
Proceedings of the 2018 International Conference on Digital Health, 2018

2017
Enhancing LSTM RNN-Based Speech Overlap Detection by Artificially Mixed Data.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Seeking the SuperStar: Automatic assessment of perceived singing quality.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Automatic multi-lingual arousal detection from voice applied to real product testing applications.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Detecting Vocal Irony.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017

VoicePlay - An affective sports game operated by speech emotion recognition based on the component process model.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

Deep neural networks for anger detection from real life speech data.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

2016
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.
IEEE Trans. Affect. Comput., 2016

Real-Time Tracking of Speakers' Emotions, States, and Traits on Mobile Platforms.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data.
Pattern Recognit. Lett., 2015

Emotion in the singing voice - a deeper look at acoustic features in the light of automatic classification.
EURASIP J. Audio Speech Music. Process., 2015

A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge.
Comput. Speech Lang., 2015

Does my speech rock? automatic assessment of public speaking skills.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Building autonomous sensitive artificial listeners (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

iHEARu-PLAY: Introducing a game for crowdsourced data collection for affective computing.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Real-time robust recognition of speakers' emotions and characteristics on mobile platforms.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2014

Medium-term speaker states - A review on intoxication, sleepiness and the first challenge.
Comput. Speech Lang., 2014

A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems.
CoRR, 2014

AVEC 2014: 3D Dimensional Affect and Depression Recognition Challenge.
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

Emotional Analysis of Music: A Comparison of Methods.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Audio onset detection: A wavelet packet based approach with recurrent neural networks.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Emotion Recognition in the Wild: Incorporating Voice and Lip Activity in Multimodal Decision-Level Fusion.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

MAPTRAITS 2014: The First Audio/Visual Mapping Personality Traits Challenge.
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, 2014

MAPTRAITS 2014 - The First Audio/Visual Mapping Personality Traits Challenge - An Introduction: Perceived Personality and Social Dimensions.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

On-line continuous-time music mood regression with deep recurrent neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Single-channel speech separation with memory-enhanced recurrent neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

CCA based feature selection with application to continuous depression recognition from acoustic speech features.
Proceedings of the IEEE International Conference on Acoustics, 2014

A frequency-weighted post-filtering transform for compensation of the over-smoothing effect in HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework.
Image Vis. Comput., 2013

Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

AVEC 2013: the continuous audio/visual emotion and depression recognition challenge.
Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge, 2013

Recent developments in openSMILE, the Munich open-source multimedia feature extractor.
Proceedings of the ACM Multimedia Conference, 2013

The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Detecting overlapping speech with long short-term memory recurrent neural networks.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Using linguistic information to detect overlapping speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Affect recognition in real-life acoustic conditions - a new perspective on feature selection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

The acoustics of eye contact: detecting visual attention from conversational audio cues.
Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction, 2013

Automatic recognition of physiological parameters in the human voice: Heart rate and skin conductance.
Proceedings of the IEEE International Conference on Acoustics, 2013

Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A multitask approach to continuous five-dimensional affect sensing in natural speech.
ACM Trans. Interact. Intell. Syst., 2012

Building Autonomous Sensitive Artificial Listeners.
IEEE Trans. Affect. Comput., 2012

Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification.
IEEE Trans. Affect. Comput., 2012

Real-Time Activity Detection in a Multi-Talker Reverberated Environment.
Cogn. Comput., 2012

Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature Sets.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

The INTERSPEECH 2012 Speaker Trait Challenge.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

AVEC 2012: the continuous audio/visual emotion challenge.
Proceedings of the International Conference on Multimodal Interaction, 2012

Preserving actual dynamic trend of emotion in dimensional speech emotion recognition.
Proceedings of the International Conference on Multimodal Interaction, 2012

Improving generalisation and robustness of acoustic affect recognition.
Proceedings of the International Conference on Multimodal Interaction, 2012

Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Audiovisual vocal outburst classification in noisy acoustic conditions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Unsupervised clustering of emotion and voice styles for expressive TTS.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Real-Time Speech Separation by Semi-supervised Nonnegative Matrix Factorization.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Fully Automatic Audiovisual Emotion Recognition: Voice, Words, and the Face.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

2011
Computational Assessment of Interest in Speech - Facing the Real-Life Challenge.
Künstliche Intell., 2011

Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

Interacting with Emotional Virtual Agents.
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2011

Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A multi-stream ASR framework for BLSTM modeling of conversational speech.
Proceedings of the IEEE International Conference on Acoustics, 2011

Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Deep neural networks for acoustic emotion recognition: Raising the benchmarks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Come and have an emotional workout with sensitive artificial listeners!
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

String-based audiovisual fusion of behavioural events for the assessment of dimensional affect.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

AVEC 2011 - The First International Audio/Visual Emotion Challenge.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies.
IEEE Trans. Affect. Comput., 2010

Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening.
IEEE J. Sel. Top. Signal Process., 2010

On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues.
J. Multimodal User Interfaces, 2010

Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework.
Cogn. Comput., 2010

Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car.
Adv. Hum. Comput. Interact., 2010

openSMILE: the Munich versatile and fast open-source audio feature extractor.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

3d gesture recognition applying long short-term memory and contextual knowledge in a CAVE.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

Vocalist Gender Recognition in Recorded Popular Music.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Long short-term memory networks for noise robust speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Recognition of spontaneous conversational speech using long short-term memory phoneme predictions.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Emotion recognition using imperfect speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder.
Proceedings of the IEEE International Conference on Acoustics, 2010

Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Being bored? Recognising natural interest by extensive audiovisual integration for real-life application.
Image Vis. Comput., 2009

A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams.
Neurocomputing, 2009

Improving Keyword Spotting with a Tandem BLSTM-DBN Architecture.
Proceedings of the Advances in Nonlinear Speech Processing, 2009

Robust in-car spelling recognition - a tandem BLSTM-HMM approach.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks.
Proceedings of the IEEE International Conference on Acoustics, 2009

Robust vocabulary independent keyword spotting with graphical models.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Acoustic emotion recognition: A benchmark comparison of performances.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

From speech to letters - using a novel neural network architecture for grapheme based ASR.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

A demonstration of audiovisual sensitive artificial listeners.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

OpenEAR - Introducing the Munich open-source emotion and affect recognition toolkit.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
Tango or Waltz?: Putting Ballroom Dance Style into Tempo Detection.
EURASIP J. Audio Speech Music. Process., 2008

Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Music Thumbnailing Incorporating Harmony- and Rhythm Structure.
Proceedings of the Adaptive Multimedia Retrieval. Identifying, 2008

2007
Wearable Assistance for the Ballroom-Dance Hobbyist - Holistic Rhythm Analysis and Dance-Style Classification.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Fast and Robust Meter and Tempo Recognition for the Automatic Discrimination of Ballroom Dance Styles.
Proceedings of the IEEE International Conference on Acoustics, 2007

