Gaël Richard

Orcid: 0000-0002-4960-0010

Affiliations:
  • Télécom Paris, Paris, France


According to our database, Gaël Richard authored at least 264 papers between 1993 and 2024.

Awards

IEEE Fellow 2017, "For contributions to analysis, indexing and decomposition of audio and music signals".

Bibliography

2024
Tackling Interpretability in Audio Classification Networks With Non-negative Matrix Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Learning Source Disentanglement in Neural Audio Codec.
CoRR, 2024

Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing.
CoRR, 2024

Speech dereverberation constrained on room impulse response characteristics.
CoRR, 2024

Episodic Fine-Tuning Prototypical Networks for Optimization-Based Few-Shot Learning: Application to Audio Classification.
Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Wavetransfer: A Flexible End-to-End Multi-Instrument Timbre Transfer with Diffusion.
Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Winner-takes-all learners are geometry-aware conditional density estimators.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Fully Differentiable Model for Unsupervised Singing Voice Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024

GLA-GRAD: A Griffin-Lim Extended Waveform Generation Diffusion Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

Structure-Informed Positional Encoding for Music Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Invariance-Based Layer Regularization for Sound Event Detection.
Proceedings of the 32nd European Signal Processing Conference, 2024

Using Random Codebooks for Audio Neural AutoEncoders.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
Audio Signal Processing in the 21st Century: The important outcomes of the past 25 years.
IEEE Signal Process. Mag., July, 2023

Video-to-Music Recommendation Using Temporal Alignment of Segments.
IEEE Trans. Multim., 2023

Unsupervised Music Source Separation Using Differentiable Parametric Source Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Transfer Learning and Bias Correction With Pre-Trained Audio Embeddings.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Singer Identity Representation Learning Using Self-Supervised Techniques.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Learning Interpretable Filters In Wav-UNet For Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Unsupervised Audio Source Separation Using Differentiable Parametric Source Models.
CoRR, 2022

Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Phase Shifted Bedrosian Filterbank: An Interpretable Audio Front-End for Time-Domain Audio Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Latent and Adversarial Data Augmentations for Sound Event Detection and Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

2021
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Probabilistic semi-nonnegative matrix factorization: a Skellam-based framework.
CoRR, 2021

VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

User-Guided One-Shot Deep Model Adaptation for Music Source Separation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

The Words Remain the Same: Cover Detection with Lyrics Transcription.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Is there a "language of music-video clips"? A qualitative and quantitative study.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis With GANs.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Cross-Modal Music-Video Recommendation: A Study of Design Choices.
Proceedings of the International Joint Conference on Neural Networks, 2021

Relative Positional Encoding for Transformers with Linear Complexity.
Proceedings of the 38th International Conference on Machine Learning, 2021

Self-Supervised VQ-VAE for One-Shot Music Style Transfer.
Proceedings of the IEEE International Conference on Acoustics, 2021

Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF.
Proceedings of the IEEE International Conference on Acoustics, 2021

Deep Learning for Audio and Music.
Proceedings of the Multi-faceted Deep Learning - Models and Data, 2021

2020
Groove2Groove MIDI Dataset: synthetic accompaniments in 3k styles.
Dataset, August, 2020

Weakly Supervised Representation Learning for Audio-Visual Scene Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Groove2Groove: One-Shot Music Style Transfer With Supervision From Synthetic Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Augmented skew-symmetric system for shallow-water system with surface tension allowing large gradient of density.
J. Comput. Phys., 2020

Matrix Factorization for High Frequency Non Intrusive Load Monitoring: Definitions and Algorithms.
Proceedings of the NILM '20, 2020

Confidence-based Weighted Loss for Multi-label Classification with Missing Labels.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

The POTUS Corpus, a Database of Weekly Addresses for the Study of Stance in Politics and Virtual Agents.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Multilingual lyrics-to-audio alignment.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

DRUMGAN: Synthesis of Drum Sounds with Timbral Feature Conditioning Using Generative Adversarial Networks.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Should we consider the users in contextual music auto-tagging models?
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Audio-Based Detection of Explicit Content in Music.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Joint Phoneme Alignment and Text-Informed Speech Separation on Highly Corrupted Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning to Rank Music Tracks Using Triplet Loss.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Audio-Based Auto-Tagging With Contextual Tags for Music.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Neutral to Lombard Speech Conversion with Deep Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speech Intelligibility Enhancement by Equalization for in-Car Applications.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Comparing Representations for Audio Synthesis Using Generative Adversarial Networks.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Audiovisual Analysis of Music Performances: Overview of an Emerging Field.
IEEE Signal Process. Mag., 2019

Independent-Variation Matrix Factorization With Application to Energy Disaggregation.
IEEE Signal Process. Lett., 2019

On the Heavy-Tailed Theory of Stochastic Gradient Descent for Deep Neural Networks.
CoRR, 2019

Augmented Skew-Symmetric System for Shallow-Water System with Surface Tension Allowing Large Gradient of Density.
CoRR, 2019

Weakly Informed Audio Source Separation.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic music.
Proceedings of the 2019 Workshop on Speech, Music and Mind, 2019

First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Supervised Symbolic Music Style Translation Using Synthetic Data.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Student's t Source and Mixing Models for Multichannel Audio Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Efficient Bayesian Model Selection in PARAFAC via Stochastic Thermodynamic Integration.
IEEE Signal Process. Lett., 2018

A Generative Model for Non-Intrusive Load Monitoring in Commercial Buildings.
CoRR, 2018

Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization.
Proceedings of the 35th International Conference on Machine Learning, 2018

Alpha-Stable Low-Rank Plus Residual Decomposition for Speech Enhancement.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Introduction to the Special Section on Sound Scene and Event Analysis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Robust Downbeat Tracking Using an Ensemble of Convolutional Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Speech intelligibility improvement in car noise environment by voice transformation.
Speech Commun., 2017

Reassigned time-frequency representations of discrete time signals and application to the Constant-Q Transform.
Signal Process., 2017

Règles d'associations temporelles de signaux sociaux pour la synthèse d'agents conversationnels animés. Application aux attitudes sociales.
Rev. d'Intelligence Artif., 2017

Guiding audio source separation by video object information.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Synthetic dataset generation for non-intrusive load monitoring in commercial buildings.
Proceedings of the 4th ACM International Conference on Systems for Energy-Efficient Built Environments, 2017

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Parallelized Stochastic Gradient Markov Chain Monte Carlo algorithms for non-negative matrix factorization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Motion informed audio source separation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Alpha-stable multichannel audio source separation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multichannel audio source separation: Variational inference of time-frequency sources from time-domain observations.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Drum extraction in single channel audio signals using multi-layer Non negative Matrix Factor Deconvolution.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Overlapping sound event detection with supervised Nonnegative Matrix Factorization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Semi-blind student's t source separation for multichannel audio convolutive mixtures.
Proceedings of the 25th European Signal Processing Conference, 2017

Nonnegative Feature Learning Methods for Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016
Multichannel Audio Source Separation With Probabilistic Reverberation Priors.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Fusion Methods for Speech Enhancement and Audio Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Stochastic Gradient Richardson-Romberg Markov Chain Monte Carlo.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Using Temporal Association Rules for the Synthesis of Embodied Conversational Agents with a Specific Stance.
Proceedings of the Intelligent Virtual Agents - 16th International Conference, 2016

Genre Specific Dictionaries for Harmonic/Percussive Source Separation.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Stochastic Quasi-Newton Langevin Monte Carlo.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Machine listening techniques as a complement to video image analysis in forensics.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Stochastic thermodynamic integration: Efficient Bayesian model selection via stochastic gradient MCMC.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Formant shifting for speech intelligibility improvement in car noise environment.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Feature adapted convolutional neural networks for downbeat tracking.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Acoustic scene classification with matrix factorization for unsupervised feature learning.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Autoregressive moving average modeling of late reverberation in the frequency domain.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
TPT-Dance&Actions : un corpus multimodal d'activités humaines.
Traitement du Signal, 2015

Late Reverberation Synthesis: From Radiance Transfer to Feedback Delay Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Multichannel audio source separation with probabilistic reverberation modeling.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Geometric-based reverberator using acoustic rendering networks.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Multipitch estimation using a PLCA-based model: Impact of partial user annotation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Downbeat tracking with multiple features and deep neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A structured nonnegative matrix factorization for source separation.
Proceedings of the 23rd European Signal Processing Conference, 2015

HOG and subband power distribution image features for acoustic scene classification.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Melody Extraction from Polyphonic Music Signals: Approaches, applications, and challenges.
IEEE Signal Process. Mag., 2014

Blind Denoising with Random Greedy Pursuits.
IEEE Signal Process. Lett., 2014

Variational Bayesian model averaging for audio source separation.
Proceedings of the IEEE Workshop on Statistical Signal Processing, 2014

Informed Audio Source Separation.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014

Template Adaptation for Improving Automatic Music Transcription.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Multiple-order non-negative matrix factorization for speech enhancement.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes.
Proceedings of the IEEE International Conference on Acoustics, 2014

Single channel reverberation suppression based on sparse linear prediction.
Proceedings of the IEEE International Conference on Acoustics, 2014

Enhancing downbeat detection when facing different music styles.
Proceedings of the IEEE International Conference on Acoustics, 2014

Controlling the convergence rate to help parameter estimation in a PLCA-based model.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach.
IEEE Trans. Speech Audio Process., 2013

Learning Optimal Features for Polyphonic Audio-to-Score Alignment.
IEEE Trans. Speech Audio Process., 2013

Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription.
IEEE Trans. Speech Audio Process., 2013

Parametric Audio Coding With Exponentially Damped Sinusoids.
IEEE Trans. Speech Audio Process., 2013

An Overview on Perceptually Motivated Audio Indexing and Classification.
Proc. IEEE, 2013

A multi-modal dance corpus for research into interaction between humans in virtual environments.
J. Multimodal User Interfaces, 2013

Multimodal classification of dance movements using body joint trajectories and step sounds.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

An overview of informed audio source separation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Exploring new features for music classification.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Blending real with virtual in 3DLife.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Modeling early reflections of room impulse responses using a radiance transfer method.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Introducing a simple fusion framework for audio source separation.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

An Extended Audio Fingerprint Method with Capabilities for Similar Music Detection.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Low bitrate informed source separation of realistic mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2013

Does dereverberation help multichannel source separation? A case study.
Proceedings of the 21st European Signal Processing Conference, 2013

Recognition of Acoustic Emotion.
Proceedings of the Emotion-Oriented Systems, 2013

2012
Multiclass Feature Selection With Kernel Gram-Matrix-Based Criteria.
IEEE Trans. Neural Networks Learn. Syst., 2012

Matching Pursuits with random sequential subdictionaries.
Signal Process., 2012

Informed source separation through spectrogram coding and data embedding.
Signal Process., 2012

Random time-frequency subdictionary design for sparse representations with greedy algorithms.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Adaptive filtering for music/voice separation exploiting the repeating musical structure.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A probabilistic approach to simultaneous extraction of beats and downbeats.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Probabilistic model for main melody extraction using Constant-Q transform.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A regressive boosting approach to automatic audio tagging based on soft annotator fusion.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Audio source separation informed by redundancy with greedy multiscale decompositions.
Proceedings of the 20th European Signal Processing Conference, 2012

Spatial coding-based Informed Source Separation.
Proceedings of the 20th European Signal Processing Conference, 2012

Informed audio source separation: A comparative study.
Proceedings of the 20th European Signal Processing Conference, 2012

Blind Harmonic Adaptive Decomposition applied to supervised source separation.
Proceedings of the 20th European Signal Processing Conference, 2012

A framework for fingerprint-based detection of repeating objects in multimedia streams.
Proceedings of the 20th European Signal Processing Conference, 2012

Fusion of Multimodal Information in Music Content Analysis.
Proceedings of the Multimodal Music Processing, 2012

2011
Gaussian Processes for Underdetermined Source Separation.
IEEE Trans. Signal Process., 2011

A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Introduction to the Special Issue on Music Signal Processing.
IEEE J. Sel. Top. Signal Process., 2011

Signal Processing for Music Analysis.
IEEE J. Sel. Top. Signal Process., 2011

A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation.
IEEE J. Sel. Top. Signal Process., 2011

Greedy sparse decompositions: a comparative study.
EURASIP J. Adv. Signal Process., 2011

Informed source separation: Source coding meets source separation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

Tutorial on multimedia music signal processing.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

An audio-driven virtual dance-teaching assistant.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

An Interactive System for Electro-Acoustic Music Analysis.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Multi-scale temporal fusion by boosting for music classification.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Audio Signal Representations for Factorization in the Sparse Domain.
Proceedings of the IEEE International Conference on Acoustics, 2011

Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment.
Proceedings of the IEEE International Conference on Acoustics, 2011

Adaptive harmonic time-frequency decomposition of audio using shift-invariant PLCA.
Proceedings of the IEEE International Conference on Acoustics, 2011

Entropy-constrained quantization of exponentially damped sinusoids parameters.
Proceedings of the IEEE International Conference on Acoustics, 2011

Machine Learning Techniques for Multimedia Analysis.
Proceedings of the Multimedia Semantics: Metadata, Analysis and Interaction, 2011

Feature Extraction for Multimedia Analysis.
Proceedings of the Multimedia Semantics: Metadata, Analysis and Interaction, 2011

2010
Audio Signal Representations for Indexing in the Transform Domain.
IEEE Trans. Speech Audio Process., 2010

Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals.
IEEE Trans. Speech Audio Process., 2010

Explicit modeling of temporal dynamics within musical signals for acoustical unit similarity.
Pattern Recognit. Lett., 2010

A conditional random field viewpoint of symbolic audio-to-score matching.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

YAAFE, an Easy to Use and Efficient Audio Feature Extraction Software.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

An Improved Hierarchical Approach for Music-to-symbolic Score Alignment.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows.
Proceedings of the International Conference on Image Processing, 2010

Robust similarity metrics between audio signals based on asymmetrical spectral envelope matching.
Proceedings of the IEEE International Conference on Acoustics, 2010

A comparative study of tonal acoustic features for a symbolic level music-to-score alignment.
Proceedings of the IEEE International Conference on Acoustics, 2010

Multimodal similarity between musical streams for cover version detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

Robust frequency-based Audio Fingerprinting.
Proceedings of the IEEE International Conference on Acoustics, 2010

Informed Source Separation Using Latent Components.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

A multimodal approach to initialisation for top-down speaker diarization of television shows.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Audio Indexing.
Proceedings of the Encyclopedia of Data Warehousing and Mining, Second Edition (4 Volumes), 2009

Temporal Integration for Audio Classification With Application to Musical Instrument Classification.
IEEE Trans. Speech Audio Process., 2009

Automatic Generation of Lead Sheets from Polyphonic Music Signals.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Incorporating prior knowledge on the digital media creation process into audio classifiers.
Proceedings of the IEEE International Conference on Acoustics, 2009

An iterative approach to monaural musical mixture de-soloing.
Proceedings of the IEEE International Conference on Acoustics, 2009

Comparison of different strategies for a SVM-based audio segmentation.
Proceedings of the 17th European Signal Processing Conference, 2009

Main instrument separation from stereophonic audio signals using a source/filter model.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Estimation of Frequency for AM/FM Models Using the Phase Vocoder Framework.
IEEE Trans. Signal Process., 2008

Fast and Stable YAST Algorithm for Principal and Minor Subspace Tracking.
IEEE Trans. Signal Process., 2008

Performance of ESPRIT for Estimating Mixtures of Complex Exponentials Modulated by Polynomials.
IEEE Trans. Signal Process., 2008

Cramér-Rao Bounds for Multiple Poles and Coefficients of Quasi-Polynomials in Colored Noise.
IEEE Trans. Signal Process., 2008

Union of MDCT Bases for Audio Coding.
IEEE Trans. Speech Audio Process., 2008

Instrument-Specific Harmonic Atoms for Mid-Level Music Representation.
IEEE Trans. Speech Audio Process., 2008

Transcription and Separation of Drum Signals From Polyphonic Music.
IEEE Trans. Speech Audio Process., 2008

A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo.
IEEE Trans. Speech Audio Process., 2008

Fear-type emotion recognition for future audio-based surveillance systems.
Speech Commun., 2008

Fast MIR in a Sparse Transform Domain.
Proceedings of the ISMIR 2008, 2008

Vocal detection in music with support vector machines.
Proceedings of the IEEE International Conference on Acoustics, 2008

Singer melody extraction in polyphonic signals using source separation methods.
Proceedings of the IEEE International Conference on Acoustics, 2008

On the robustness of audio features for musical instrument classification.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Matching pursuit in adaptive dictionaries for scalable audio coding.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Alignment kernels for audio classification with application to music instrument recognition.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
On the Correlation of Automatic Audio and Visual Segmentations of Music Videos.
IEEE Trans. Circuits Syst. Video Technol., 2007

Accurate tempo estimation based on harmonic + noise decomposition.
EURASIP J. Adv. Signal Process., 2007

Supervised and Unsupervised Sequence Modelling for Drum Transcription.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Combined Supervised and Unsupervised Approaches for Automatic Segmentation of Radiophonic Audio Streams.
Proceedings of the IEEE International Conference on Acoustics, 2007

Detection and Analysis of Abnormal Situations Through Fear-Type Acoustic Manifestations.
Proceedings of the IEEE International Conference on Acoustics, 2007

Blind Signal Decompositions for Automatic Transcription of Polyphonic Music: NMF and K-SVD on the Benchmark.
Proceedings of the IEEE International Conference on Acoustics, 2007

Conjugate Gradient Algorithms for Minor Subspace Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2007

Visual analysis for drum sequence transcription.
Proceedings of the 15th European Signal Processing Conference, 2007

2006
High-resolution spectral analysis of mixtures of complex exponentials modulated by polynomials.
IEEE Trans. Signal Process., 2006

A new perturbation analysis for signal enumeration in rotational invariance techniques.
IEEE Trans. Signal Process., 2006

Musical instrument recognition by pairwise classification strategies.
IEEE Trans. Speech Audio Process., 2006

Instrument recognition in polyphonic music based on automatic taxonomies.
IEEE Trans. Speech Audio Process., 2006

A new quantization optimization algorithm for the MPEG advanced audio coder using a statistical subband model of the quantization noise.
IEEE Trans. Speech Audio Process., 2006

De la construction du corpus émotionnel au système de détection. Le point de vue applicatif de la surveillance dans les lieux publics.
Rev. d'Intelligence Artif., 2006

Fear-type emotions of the SAFE Corpus: annotation issues.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

ENST-Drums: an extensive audio-visual database for drum signals processing.
Proceedings of the ISMIR 2006, 2006

Comparing Audio and Video Segmentations for Music Videos Indexing.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Hierarchical Classification of Musical Instruments on Solo Recordings.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

HRHATRAC Algorithm for Spectral Line Tracking of Musical Signals.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

YAST Algorithm for Minor Subspace Tracking.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Frequency estimation based on adjacent DFT bins.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Fast approximated power iteration subspace tracking.
IEEE Trans. Signal Process., 2005

Drum Loops Retrieval from Spoken Queries.
J. Intell. Inf. Syst., 2005

Drum Track Transcription of Polyphonic Music Using Noise Subspace Projection.
Proceedings of the ISMIR 2005, 2005

Inferring Efficient Hierarchical Taxonomies for MIR Tasks: Application to Musical Instruments.
Proceedings of the ISMIR 2005, 2005

Events Detection for an Audio-Based Surveillance System.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Extracting note onsets from musical recordings.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Iterative algorithms for multichannel equalization in sound reproduction systems.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Automatic transcription of drum sequences using audiovisual features.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Instrument recognition in polyphonic music.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Yet another subspace tracker.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Sliding window adaptive SVD algorithms.
IEEE Trans. Signal Process., 2004

Musical instrument recognition based on class pairwise feature selection.
Proceedings of the ISMIR 2004, 2004

Methodology and Tools for the evaluation of automatic onset detection algorithms in music.
Proceedings of the ISMIR 2004, 2004

Tempo And Beat Estimation Of Musical Signals.
Proceedings of the ISMIR 2004, 2004

Automatic transcription of drum loops.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Selecting the modeling order for the ESPRIT high resolution method: an alternative approach.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Musical instrument recognition on solo performances.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Approximated power iterations for fast subspace tracking.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Automatic labeling of tabla signals.
Proceedings of the ISMIR 2003, 2003

Adaptive ESPRIT algorithm based on the PAST subspace tracker.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Sliding window orthonormal PAST algorithm.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2001
Annotation in the SpeechDat Projects.
Int. J. Speech Technol., 2001

2000
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

1999
The speechdat-car multilingual speech databases for in-car applications: some first validation results.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999


Compensating for variable recording conditions in frontal face authentication algorithms.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1997
Voice mimic system using an articulatory codebook for estimation of vocal tract shape.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Analysis/synthesis and modification of the speech aperiodic component.
Speech Commun., 1996

1995
Numerical simulations of fluid flow in the vocal tract.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Time-domain analysis/synthesis of the aperiodic component of speech signals.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

1993
A speech formant synthesizer based on harmonic + random formant-waveforms representations.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

