Slim Essid
Orcid: 0000-0002-0028-327X
According to our database1,
Slim Essid
authored at least 135 papers
between 2002 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Speech self-supervised representations benchmarking: A case for larger probing heads.
Comput. Speech Lang., 2025
2024
Self-Supervised Learning of Multi-Level Audio Representations for Music Segmentation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
TACO: Training-free Sound Prompted Segmentation via Deep Audio-visual CO-factorization.
CoRR, 2024
CoRR, 2024
An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment.
CoRR, 2024
A sound description: Exploring prompt templates and class descriptions to enhance zero-shot audio classification.
CoRR, 2024
Less Forgetting for Better Generalization: Exploring Continual-learning Fine-tuning Methods for Speech Self-supervised Representations.
CoRR, 2024
Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
An eye for an ear: zero-shot audio description leveraging an image captioner with audio-visual token distribution matching.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
A Contrastive Self-Supervised Learning Scheme for Beat Tracking Amenable to Few-Shot Learning.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Using Pairwise Link Prediction and Graph Attention Networks for Music Structure Analysis.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
A Lightweight Dual-Stage Framework for Personalized Speech Enhancement Based on Deepfilternet2.
Proceedings of the IEEE International Conference on Acoustics, 2024
On The Choice of the Optimal Temporal Support for Audio Classification with Pre-Trained Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Fine-Tuning Strategies for Faster Inference Using Speech Self-Supervised Models: A Comparative Study.
Proceedings of the IEEE International Conference on Acoustics, 2023
Cosmopolite Sound Monitoring (CoSMo): A Study of Urban Sound Event Detection Systems Generalizing to Multiple Cities.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
IEEE J. Sel. Top. Signal Process., 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Latent and Adversarial Data Augmentations for Sound Event Detection and Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
2021
DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Early Detection of User Engagement Breakdown in Spontaneous Human-Humanoid Interaction.
IEEE Trans. Affect. Comput., 2021
Pretext Tasks selection for multitask self-supervised speech representation learning.
CoRR, 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Conditional Independence for Pretext Task Selection in Self-Supervised Speech Representation Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF.
Proceedings of the IEEE International Conference on Acoustics, 2021
Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes.
Proceedings of the 29th European Signal Processing Conference, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
On-the-fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction, International Journal of Social Robotics, 2019.
CoRR, 2020
DNN-based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
IEEE Signal Process. Mag., 2019
On-the-Fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction Using Recurrent and Deep Neural Networks.
Int. J. Soc. Robotics, 2019
Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic music.
Proceedings of the 2019 Workshop on Speech, Music and Mind, 2019
SAMBASET: A Dataset of Historical Samba de Enredo Recordings for Computational Music Analysis.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Tracking Beats and Microtiming in Afro-Latin American Music Using Conditional Random Fields and Deep Learning.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
A Music Structure Informed Downbeat Tracking System Using Skip-chain Conditional Random Fields and Deep Learning.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
2018
Biomed. Signal Process. Control., 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Structured Output Learning with Abstention: Application to Accurate Opinion Prediction.
Proceedings of the 35th International Conference on Machine Learning, 2018
An Ensemble Learning Approach to Detect Epileptic Seizures from Long Intracranial EEG Recordings.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Attitude Classification in Adjacency Pairs of a Human-Agent Interaction with Hidden Conditional Random Fields.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization.
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Leveraging deep neural networks with nonnegative representations for improved environmental sound classification.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
UE-HRI: a new dataset for the study of user engagement in spontaneous human-robot interactions.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017
Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
EMOEEG: A new multimodal dataset for dynamic EEG-based emotion recognition with audiovisual elicitation.
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
2016
Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Acoustic scene classification with matrix factorization for unsupervised feature learning.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Traitement du Signal, 2015
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 23rd European Signal Processing Conference, 2015
2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEEE Trans. Multim., 2013
Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring.
IEEE Trans. Multim., 2013
IEEE Trans. Speech Audio Process., 2013
A multi-modal dance corpus for research into interaction between humans in virtual environments.
J. Multimodal User Interfaces, 2013
Multimodal classification of dance movements using body joint trajectories and step sounds.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013
Soft nonnegative matrix co-factorizationwith application to multimodal speaker diarization.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Decomposing the video editing structure of a talk-show using nonnegative matrix factorization.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
A regressive boosting approach to automatic audio tagging based on soft annotator fusion.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Multimodal Music Processing, 2012
2011
A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation.
Proceedings of the AES International Conference Semantic Audio 2011, 2011
Enhanced visualisation of dance performance from automatically synchronised multimodal recordings.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Multimedia Semantics: Metadata, Analysis and Interaction, 2011
Proceedings of the Multimedia Semantics: Metadata, Analysis and Interaction, 2011
2010
Proceedings of the 18th International Conference on Multimedia 2010, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows.
Proceedings of the International Conference on Image Processing, 2010
A comparative study of tonal acoustic features for a symbolic level music-to-score alignment.
Proceedings of the IEEE International Conference on Acoustics, 2010
A multimodal approach to initialisation for top-down speaker diarization of television shows.
Proceedings of the 18th European Signal Processing Conference, 2010
2009
Temporal Integration for Audio Classification With Application to Musical Instrument Classification.
IEEE Trans. Speech Audio Process., 2009
Incorporating prior knowledge on the digital media creation process into audio classifiers.
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008
Proceedings of the International Conference on Image Processing, 2008
Proceedings of the 2008 16th European Signal Processing Conference, 2008
Alignment kernels for audio classification with application to music instrument recognition.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
IEEE Trans. Circuits Syst. Video Technol., 2007
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007
Combined Supervised and Unsupervised Approaches for Automatic Segmentation of Radiophonic Audio Streams.
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
IEEE Trans. Speech Audio Process., 2006
IEEE Trans. Speech Audio Process., 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Classification automatique des signaux audio-fréquences : reconnaissance des instruments de musique. (Automatic Classification of Audio Signals: Machine Recognition of Musical Instruments).
PhD thesis, 2005
Inferring Efficient Hierarchical Taxonomies for MIR Tasks: Application to Musical Instruments.
Proceedings of the ISMIR 2005, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Proceedings of the ISMIR 2004, 2004
Proceedings of the 2004 12th European Signal Processing Conference, 2004
2002
Dynamic temporal segmentation in parametric non-stationary modeling for percussive musical signals.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Transient modeling with a frequency-transform subspace algorithm and "transient+sinusoidal" scheme.
Proceedings of the 14th International Conference on Digital Signal Processing, 2002