Jonathan Le Roux
Orcid: 0000-0002-3451-171X
According to our database1,
Jonathan Le Roux
authored at least 162 papers
between 2002 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Trans. Int. Soc. Music. Inf. Retr., January, 2024
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
CoRR, 2024
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers.
CoRR, 2024
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Improving Audio Captioning Models with Fine-Grained Audio Features, Text Embedding Supervision, and LLM Mix-Up Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
NeuroHeed+: Improving Neuro-Steered Speaker Extraction with Joint Auditory Attention Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT.
CoRR, 2023
Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Paᗧ-HuBERT: Self-Supervised Music Source Separation Via Primitive Auditory Clustering And Hidden-Unit Bert.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Momentum Pseudo-Labeling: Semi-Supervised ASR With Continuously Improving Pseudo-Labels.
IEEE J. Sel. Top. Signal Process., 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Improved Domain Generalization via Disentangled Multi-Task Learning in Unsupervised Anomalous Sound Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE Signal Process. Lett., 2021
CoRR, 2021
CoRR, 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training.
Proceedings of the IEEE International Conference on Acoustics, 2021
Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2021
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Proceedings of the 30th IEEE International Workshop on Machine Learning for Signal Processing, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
IEEE J. Sel. Top. Signal Process., 2019
CoRR, 2019
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2019
The Phasebook: Building Complex Masks via Discrete Representations for Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Phase Reconstruction with Learned Time-Frequency Representations for Single-Channel Speech Separation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Multi-Channel Deep Clustering: Discriminative Spectral and Spatial Embeddings for Speaker-Independent Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Prior-based Binary Masking and Discriminative Methods for Reverberant and Noisy Speech Recognition Using Distant Stereo Microphones.
J. Inf. Process., 2017
Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend.
Comput. Speech Lang., 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Coupled Initialization of Multi-Channel Non-Negative Matrix Factorization Based on Spatial and Spectral Information.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
Deep Recurrent Networks for Separation and Recognition of Single-Channel Speech in Nonstationary Background Audio.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
2015
IEEE Signal Process. Mag., 2015
Micbots: Collecting large realistic datasets for speech and audio research using mobile robots.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Sequential maximum mutual information linear discriminant analysis for speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Ensemble integration of calibrated speaker localization and statistical speech detection in domestic environments.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
Discriminatively trained recurrent neural networks for single-channel speech separation.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
2013
IEEE Signal Process. Lett., 2013
Proceedings of the 1st International Conference on Learning Representations, 2013
Hierarchical and coupled non-negative dynamical systems with application to audio modeling.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013
The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
2011
Computational auditory induction as a missing-data model-fitting problem with Bregman divergence.
Speech Commun., 2011
Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Proceedings of the Advances in Music Information Retrieval, 2010
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010
Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Statistical Model of Speech Signals Based on Composite Autoregressive System with Application to Blind Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
Single and Multiple F<sub>0</sub> Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments.
IEEE Trans. Speech Audio Process., 2007
Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error.
IEEE Trans. Speech Audio Process., 2007
Harmonic-Temporal Clustering of Speech for Single and Multiple F0 Contour Estimation in Noisy Environments.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Speech analyzer using a joint estimation model of spectral envelope and fine structure.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2002
Proceedings of the 10-th International Conference in Central Europe on Computer Graphics, 2002