Felix Weninger
According to our database1,
Felix Weninger
authored at least 95 papers
between 2010 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Improving Speed/Accuracy Tradeoff for Online Streaming ASR via Real-Valued and Trainable Strides.
Proceedings of the IEEE International Conference on Acoustics, 2024
2022
Holistic Affect Recognition Using PaNDA: Paralinguistic Non-Metric Dimensional Analysis.
IEEE Trans. Affect. Comput., 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
ChannelAugment: Improving Generalization of Multi-Channel ASR by Training with Input Channel Randomization.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
IEEE Trans. Cybern., 2020
CoRR, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child Multimodal Interaction Dataset.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
2019
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.
Comput. Speech Lang., 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Three recent trends in Paralinguistics on the way to omniscient machine intelligence.
J. Multimodal User Interfaces, 2018
2017
A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
IEEE Trans. Intell. Veh., 2016
Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Language proficiency assessment of English L2 speakers based on joint analysis of prosody and native language.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
2015
J. Mach. Learn. Res., 2015
A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge.
Comput. Speech Lang., 2015
The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, parkinson's & eating condition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments.
Comput. Speech Lang., 2014
Medium-term speaker states - A review on intoxication, sleepiness and the first challenge.
Comput. Speech Lang., 2014
CoRR, 2014
On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Discriminatively trained recurrent neural networks for single-channel speech separation.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
2013
Int. J. Distance Educ. Technol., 2013
IEEE Intell. Syst., 2013
Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory.
Comput. Speech Lang., 2013
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013
Recent developments in openSMILE, the munich open-source multimedia feature extractor.
Proceedings of the ACM Multimedia Conference, 2013
The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Affect recognition in real-life acoustic conditions - a new perspective on feature selection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification.
Proceedings of the Man-Machine Interactions 3, 2013
The acoustics of eye contact: detecting visual attention from conversational audio cues.
Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction, 2013
Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2013
Speaker trait characterization in web videos: Uniting speech, language, and facial features.
Proceedings of the IEEE International Conference on Acoustics, 2013
A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2013
A comparative study on sparsity penalties for NMF-based speech separation: Beyond LP-norms.
Proceedings of the IEEE International Conference on Acoustics, 2013
Integrating noise estimation and factorization-based speech separation: A novel hybrid approach.
Proceedings of the IEEE International Conference on Acoustics, 2013
Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit.
J. Signal Process. Syst., 2012
The Voice of Leadership: Models and Performances of Automatic Analysis in Online Speeches.
IEEE Trans. Affect. Comput., 2012
Int. J. Speech Technol., 2012
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
Proceedings of the 5th International Symposium on Communications, 2012
Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Supervised and semi-supervised suppression of background music in monaural speech recordings.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Latent Variable Analysis and Signal Separation, 2012
Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines.
Proceedings of the Multimodal Music Processing, 2012
Towards Automatic Intoxication Detection from Speech in Real-Life Acoustic Environments.
Proceedings of the 10th ITG Conference on Speech Communication, 2012
Proceedings of the 10th ITG Conference on Speech Communication, 2012
Sparse, Hierarchical and Semi-Supervised Base Learning for Monaural Enhancement of Conversational Speech.
Proceedings of the 10th ITG Conference on Speech Communication, 2012
2011
Künstliche Intell., 2011
Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization.
EURASIP J. Adv. Signal Process., 2011
Automatic Assessment of Singer Traits in Popular Music: Gender, Age, Height and Race.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Multi-Modal Non-Prototypical Music Mood Analysis in Continuous Space: Reliability and Performances.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory.
Proceedings of the IEEE International Conference on Acoustics, 2011
Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations.
Proceedings of the IEEE International Conference on Acoustics, 2011
OpenBliSSART: Design and evaluation of a research toolkit for Blind Source Separation in Audio Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2011
Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Cognitive Behavioural Systems, 2011
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Non-negative matrix factorization as noise-robust feature extractor for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010
Discrimination of speech and non-linguistic vocalizations by Non-Negative Matrix Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2010