Romain Serizel

According to our database1, Romain Serizel authored at least 92 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Diffusion-based Unsupervised Audio-visual Speech Enhancement.
CoRR, 2024

Energy Consumption Trends in Sound Event Detection Systems.
CoRR, 2024

Domain-Invariant Representation Learning of Bird Sounds.
CoRR, 2024

Normalizing Energy Consumption for Hardware-Independent Evaluation.
CoRR, 2024

From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems.
CoRR, 2024

Latent Watermarking of Audio Generative Models.
CoRR, 2024

DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels.
CoRR, 2024

A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms.
CoRR, 2024

Posterior Sampling Algorithms for Unsupervised Speech Enhancement with Recurrent Variational Autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2024

Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems.
Proceedings of the IEEE International Conference on Acoustics, 2024

Unsupervised Speech Enhancement with Diffusion-Based Generative Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Self-Supervised Learning for Few-Shot Bird Sound Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Weighted-Variance Variational Autoencoder Model for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2024

Diffusion-Based Speech Enhancement with a Weighted Generative-Supervised Learning Loss.
Proceedings of the IEEE International Conference on Acoustics, 2024

Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds.
Proceedings of the 32nd European Signal Processing Conference, 2024

RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection.
CoRR, 2023

Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning.
CoRR, 2023

SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays.
CoRR, 2023

Post-Processing Independent Evaluation of Sound Event Detection Systems.
CoRR, 2023

From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-supervised learning with Diffusion-based multichannel speech enhancement for speaker verification under noisy conditions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Performance Above All? Energy Consumption vs. Performance, a Study on Sound Event Detection with Heterogeneous Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

Fast and Efficient Speech Enhancement with Variational Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2023

Spice+: Evaluation of Automatic Audio Captioning Systems with Pre-Trained Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Audio-Visual Speech Enhancement with a Deep Kalman Filter Generative Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lightweight Annotation and Class Weight Training for Automatic Estimation of Alarm Audibility in Noise.
Proceedings of the IEEE International Conference on Acoustics, 2023

BinauRec: A dataset to test the influence of the use of room impulse responses on binaural speech enhancement.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Weighted variance variational autoencoder for speech enhancement.
CoRR, 2022

How to Leverage DNN-based speech enhancement for multi-channel speaker verification?
CoRR, 2022

Joint Optimization of Diffusion Probabilistic-Based Multichannel Speech Enhancement with Far-Field Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Learning Noise Robust ResNet-Based Speaker Embedding for Speaker Recognition.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Barlow Twins self-supervised learning for robust speaker recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Benchmark of State-of-the-Art Sound Event Detection Systems Evaluated on Synthetic Soundscapes.
Proceedings of the IEEE International Conference on Acoustics, 2022

Threshold Independent Evaluation of Sound Event Detection Scores.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems.
Proceedings of the 30th European Signal Processing Conference, 2022

Description and Analysis of Novelties Introduced in DCASE Task 4 2022 on the Baseline System.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Convolutional Neural Network for Audibility Assessment of Acoustic Alarms.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Integrating Isolated Examples with Weakly-Supervised Sound Event Detection: A Direct Approach.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Contributions to speech processing and ambient sound analysis.
, 2022

2021
DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

UIAI System for Short-Duration Speaker Verification Challenge 2020.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

What's all the Fuss about Free Universal Sound Separation Data?
Proceedings of the IEEE International Conference on Acoustics, 2021

Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes.
Proceedings of the IEEE International Conference on Acoustics, 2021

Distributed Speech Separation in Spatially Unconstrained Microphone Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Sound Event Detection Metrics: Insights from DCASE 2020.
Proceedings of the IEEE International Conference on Acoustics, 2021

Compensate multiple distortions for speaker recognition systems.
Proceedings of the 29th European Signal Processing Conference, 2021

Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes.
Proceedings of the 29th European Signal Processing Conference, 2021

The Impact of Non-Target Events in Synthetic Soundscapes for Sound Event Detection.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Automated Audio Captioning by Fine-Tuning BART with AudioSet Tags.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Limitations of Weak Labels for Embedding and Tagging.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sound Event Detection in Synthetic Domestic Environments.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DNN-based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Foreground-Background Ambient Sound Scene Separation.
Proceedings of the 28th European Signal Processing Conference, 2020

Improving Sound Event Detection in Domestic Environments using Sound Separation.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Training Sound Event Detection on a Heterogeneous Dataset.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Audio-Based Search and Rescue With a Drone: Highlights From the IEEE Signal Processing Cup 2019 Student Competition [SP Competitions].
IEEE Signal Process. Mag., 2019

CRNN-Based Multiple DoA Estimation Using Acoustic Intensity Features for Ambisonics Recordings.
IEEE J. Sel. Top. Signal Process., 2019

Joint DNN-Based Multichannel Reduction of Acoustic Echo, Reverberation and Noise.
CoRR, 2019

The Speed Submission to DIHARD II: Contributions & Lessons Learned.
CoRR, 2019

Audio-Based Search and Rescue with a Drone: Highlights from the IEEE Signal Processing Cup 2019 Student Competition.
CoRR, 2019

Regression Versus Classification for Neural Network Based Audio Source Localization.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Semi-supervised Triplet Loss Based Learning of Ambient Audio Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2019

Sound Event Detection in Domestic Environments with Weakly Labeled Data and Soundscape Synthesis.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
Rank-1 constrained Multichannel Wiener Filter for speech recognition in noisy environments.
Comput. Speech Lang., 2018

CRNN-based Joint Azimuth and Elevation Localization with the Ambisonics Intensity Vector.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Multichannel Speech Separation with Recurrent Neural Networks from High-Order Ambisonics Recordings.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multiple-Input Neural Network-Based Residual Echo Suppression.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Large-scale weakly labeled semi-supervised sound event detection in domestic environments.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Deep-neural network approaches for speech recognition with heterogeneous groups of speakers including children.
Nat. Lang. Eng., 2017

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Nonnegative Feature Learning Methods for Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016
Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Machine listening techniques as a complement to video image analysis in forensics.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Acoustic scene classification with matrix factorization for unsupervised feature learning.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2014
Low-rank Approximation Based Multichannel Wiener Filter Algorithms for Noise Reduction with Application in Cochlear Implants.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Vocal tract length normalisation approaches to DNN-based children's and adults' speech recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

FBK @ IWSLT 2014 - ASR track.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

2013
Binaural Integrated Active Noise Control and Noise Reduction in Hearing Aids.
IEEE Trans. Speech Audio Process., 2013

A speech distortion weighting based approach to integrated active noise control and noise reduction in hearing aids.
Signal Process., 2013

Rank-1 approximation based multichannel wiener filtering algorithms for noise reduction in cochlear implants.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A Zone-of-Quiet Based Approach to Integrated Active Noise Control and Noise Reduction for Speech Enhancement in Hearing Aids.
IEEE Trans. Speech Audio Process., 2012

2011
Output SNR analysis of integrated active noise control and noise reduction in hearing aids under a single speech source scenario.
Signal Process., 2011

2010
Integrated Active Noise Control and Noise Reduction in Hearing Aids.
IEEE Trans. Speech Audio Process., 2010

2009
A zone of quiet based approach to integrated active noise control and noise reduction in hearing AIDS.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

A weighted approach for integrated active noise control and noise reduction in hearing aids.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Accuracy Constraint Determination in Fixed-Point System Design.
EURASIP J. Embed. Syst., 2008


  Loading...