Romain Serizel

Alfonso Ortega

CoRR, 2024

Diffusion-based Unsupervised Audio-visual Speech Enhancement.

[BibT_eX]

[DOI]

Jean-Eudes Ayilo

Xavier Alameda-Pineda

CoRR, 2024

A decade of DCASE: Achievements, practices, evaluations and future challenges.

[BibT_eX]

[DOI]

CoRR, 2024

Energy Consumption Trends in Sound Event Detection Systems.

[BibT_eX]

[DOI]

Constance Douwes

CoRR, 2024

Domain-Invariant Representation Learning of Bird Sounds.

[BibT_eX]

[DOI]

CoRR, 2024

From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems.

[BibT_eX]

[DOI]

Constance Douwes

CoRR, 2024

Latent Watermarking of Audio Generative Models.

[BibT_eX]

[DOI]

CoRR, 2024

DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels.

[BibT_eX]

[DOI]

CoRR, 2024

A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms.

[BibT_eX]

[DOI]

Nasser-Eddine Monir

Paul Magron

CoRR, 2024

Normalizing Energy Consumption for Hardware-Independent Evaluation.

[BibT_eX]

[DOI]

Constance Douwes

Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Posterior Sampling Algorithms for Unsupervised Speech Enhancement with Recurrent Variational Autoencoder.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems.

[BibT_eX]

[DOI]

Francesca Ronchini

Proceedings of the IEEE International Conference on Acoustics, 2024

Unsupervised Speech Enhancement with Diffusion-Based Generative Models.

[BibT_eX]

[DOI]

Berné Nortier

Proceedings of the IEEE International Conference on Acoustics, 2024

Self-Supervised Learning for Few-Shot Bird Sound Classification.

[BibT_eX]

[DOI]

Ilyass Moummad

Nicolas Farrugia

Proceedings of the IEEE International Conference on Acoustics, 2024

A Weighted-Variance Variational Autoencoder Model for Speech Enhancement.

[BibT_eX]

[DOI]

Ali Golmakani

Xavier Alameda-Pineda

Proceedings of the IEEE International Conference on Acoustics, 2024

Diffusion-Based Speech Enhancement with a Weighted Generative-Supervised Learning Loss.

[BibT_eX]

[DOI]

Jean-Eudes Ayilo

Proceedings of the IEEE International Conference on Acoustics, 2024

Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds.

[BibT_eX]

[DOI]

Proceedings of the 32nd European Signal Processing Conference, 2024

RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot.

[BibT_eX]

[DOI]

Mickaël Rouvier

Théophile Gonos

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection.

[BibT_eX]

[DOI]

Ilyass Moummad

Nicolas Farrugia

CoRR, 2023

Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning.

[BibT_eX]

[DOI]

Ilyass Moummad

Nicolas Farrugia

CoRR, 2023

SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays.

[BibT_eX]

[DOI]

CoRR, 2023

Post-Processing Independent Evaluation of Sound Event Detection Systems.

[BibT_eX]

[DOI]

Janek Ebbers

Reinhold Haeb-Umbach

CoRR, 2023

From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-supervised learning with Diffusion-based multichannel speech enhancement for speaker verification under noisy conditions.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Performance Above All? Energy Consumption vs. Performance, a Study on Sound Event Detection with Heterogeneous Data.

[BibT_eX]

[DOI]

Samuele Cornell

Proceedings of the IEEE International Conference on Acoustics, 2023

Fast and Efficient Speech Enhancement with Variational Autoencoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Spice+: Evaluation of Automatic Audio Captioning Systems with Pre-Trained Language Models.

[BibT_eX]

[DOI]

Félix Gontier

Christophe Cerisara

Proceedings of the IEEE International Conference on Acoustics, 2023

Audio-Visual Speech Enhancement with a Deep Kalman Filter Generative Model.

[BibT_eX]

[DOI]

Ali Golmakani

Proceedings of the IEEE International Conference on Acoustics, 2023

Lightweight Annotation and Class Weight Training for Automatic Estimation of Alarm Audibility in Noise.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

BinauRec: A dataset to test the influence of the use of room impulse responses on binaural speech enhancement.

[BibT_eX]

[DOI]

Louis Delebecque

Proceedings of the 31st European Signal Processing Conference, 2023

2022

Weighted variance variational autoencoder for speech enhancement.

[BibT_eX]

[DOI]

Ali Golmakani

Xavier Alameda-Pineda

CoRR, 2022

How to Leverage DNN-based speech enhancement for multi-channel speaker verification?

[BibT_eX]

[DOI]

CoRR, 2022

Joint Optimization of Diffusion Probabilistic-Based Multichannel Speech Enhancement with Far-Field Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Learning Noise Robust ResNet-Based Speaker Embedding for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Barlow Twins self-supervised learning for robust speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Benchmark of State-of-the-Art Sound Event Detection Systems Evaluated on Synthetic Soundscapes.

[BibT_eX]

[DOI]

Francesca Ronchini

Proceedings of the IEEE International Conference on Acoustics, 2022

Threshold Independent Evaluation of Sound Event Detection Scores.

[BibT_eX]

[DOI]

Janek Ebbers

Reinhold Haeb-Umbach

Proceedings of the IEEE International Conference on Acoustics, 2022

A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems.

[BibT_eX]

[DOI]

Jean-François Bonatsre

Proceedings of the 30th European Signal Processing Conference, 2022

Description and Analysis of Novelties Introduced in DCASE Task 4 2022 on the Baseline System.

[BibT_eX]

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Convolutional Neural Network for Audibility Assessment of Acoustic Alarms.

[BibT_eX]

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Integrating Isolated Examples with Weakly-Supervised Sound Event Detection: A Direct Approach.

[BibT_eX]

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Contributions to speech processing and ambient sound analysis.

[BibT_eX]

[DOI]

, 2022

2021

DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

UIAI System for Short-Duration Speaker Verification Challenge 2020.

[BibT_eX]

[DOI]

Md. Sahidullah

Achintya Kumar Sarkar

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

What's all the Fuss about Free Universal Sound Separation Data?

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Distributed Speech Separation in Spatially Unconstrained Microphone Arrays.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Sound Event Detection Metrics: Insights from DCASE 2020.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Compensate multiple distortions for speaker recognition systems.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

The Impact of Non-Target Events in Synthetic Soundscapes for Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Automated Audio Captioning by Fine-Tuning BART with AudioSet Tags.

[BibT_eX]

[DOI]

Félix Gontier

Christophe Cerisara

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020

Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Limitations of Weak Labels for Embedding and Tagging.

[BibT_eX]

[DOI]

Emmanuel Vincent

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sound Event Detection in Synthetic Domestic Environments.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DNN-based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Foreground-Background Ambient Sound Scene Separation.

[BibT_eX]

[DOI]

Proceedings of the 28th European Signal Processing Conference, 2020

Improving Sound Event Detection in Domestic Environments using Sound Separation.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Training Sound Event Detection on a Heterogeneous Dataset.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019

Audio-Based Search and Rescue With a Drone: Highlights From the IEEE Signal Processing Cup 2019 Student Competition [SP Competitions].

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2019

CRNN-Based Multiple DoA Estimation Using Acoustic Intensity Features for Ambisonics Recordings.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2019

Joint DNN-Based Multichannel Reduction of Acoustic Echo, Reverberation and Noise.

[BibT_eX]

[DOI]

CoRR, 2019

The Speed Submission to DIHARD II: Contributions & Lessons Learned.

[BibT_eX]

[DOI]

CoRR, 2019

Audio-Based Search and Rescue with a Drone: Highlights from the IEEE Signal Processing Cup 2019 Student Competition.

[BibT_eX]

[DOI]

CoRR, 2019

Regression Versus Classification for Neural Network Based Audio Source Localization.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Semi-supervised Triplet Loss Based Learning of Ambient Audio Embeddings.

[BibT_eX]

[DOI]

Emmanuel Vincent

Proceedings of the IEEE International Conference on Acoustics, 2019

Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018

Rank-1 constrained Multichannel Wiener Filter for speech recognition in noisy environments.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2018

CRNN-based Joint Azimuth and Elevation Localization with the Ambisonics Intensity Vector.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Multichannel Speech Separation with Recurrent Neural Networks from High-Order Ambisonics Recordings.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multiple-Input Neural Network-Based Residual Echo Suppression.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Large-scale weakly labeled semi-supervised sound event detection in domestic environments.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017

Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Deep-neural network approaches for speech recognition with heterogeneous groups of speakers including children.

[BibT_eX]

[DOI]

Diego Giuliani

Nat. Lang. Eng., 2017

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Nonnegative Feature Learning Methods for Acoustic Scene Classification.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016

Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence.

[BibT_eX]

[DOI]

Slim Essid

Gaël Richard

Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Machine listening techniques as a complement to video image analysis in forensics.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification.

[BibT_eX]

[DOI]

Slim Essid

Gaël Richard

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Acoustic scene classification with matrix factorization for unsupervised feature learning.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2014

Low-rank Approximation Based Multichannel Wiener Filter Algorithms for Noise Reduction with Application in Cochlear Implants.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Vocal tract length normalisation approaches to DNN-based children's and adults' speech recognition.

[BibT_eX]

[DOI]