Erik Marchi
Orcid: 0000-0002-5335-6356
According to our database1,
Erik Marchi
authored at least 60 papers
between 2011 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models.
CoRR, 2023
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.
CoRR, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Detecting Emotion Primitives from Speech and Their Use in Discerning Categorical Emotions.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Automatic Emotion Recognition in the Voice of Children with Autism Spectrum Conditions.
PhD thesis, 2019
IEEE Trans. Games, 2019
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.
Comput. Speech Lang., 2019
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Generalised Discriminative Transform via Curriculum Learning for Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Comput. Intell. Neurosci., 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
2016
The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations.
IEEE Access, 2016
Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Pairwise Decomposition with Deep Neural Networks and Multiscale Kernel Subspace Learning for Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016
2015
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models.
CoRR, 2015
AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015
The ICL-TUM-PASSAU Approach for the MediaEval 2015 "Affective Impact of Movies" Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015
A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Real-time robust recognition of speakers' emotions and characteristics on mobile platforms.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
Proceedings of the 12th International Conference on Advances in Computer Entertainment Technology, 2015
2014
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions.
CoRR, 2014
The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Audio onset detection: A wavelet packet based approach with recurrent neural networks.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013
2012
Emotion in the speech of children with autism spectrum conditions: prosody and everything else.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012
Speech, Emotion, Age, Language, Task, and Typicality: Trying to Disentangle Performance and Feature Relevance.
Proceedings of the 2012 International Conference on Privacy, 2012
Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents.
Proceedings of the Advances in Neural Networks - ISNN 2011, 2011