Mark D. Plumbley

Orcid: 0000-0002-9708-1075

Affiliations:
  • University of Surrey, Guildford, UK
  • Queen Mary University of London, UK (2002 - 2014)
  • King's College London, UK (1991 - 2001)


According to our database1, Mark D. Plumbley authored at least 318 papers between 1991 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Awards

IEEE Fellow

IEEE Fellow 2015, "For contributions to latent variable analysis".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift Evaluation Dataset.
Dataset, March, 2024

IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift Development Dataset.
Dataset, February, 2024

Acoustic Scene Classification Across Cities and Devices via Feature Disentanglement.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Towards Generating Diverse Audio Captions via Adversarial Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Selective-Memory Meta-Learning With Environment Representations for Sound Event Localization and Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

ASiT: Local-Global Audio Spectrogram Vision Transformer for Event Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

A decade of DCASE: Achievements, practices, evaluations and future challenges.
CoRR, 2024

The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection.
CoRR, 2024

FlowSep: Language-Queried Sound Separation with Rectified Flow Matching.
CoRR, 2024

Exploring Differences between Human Perception and Model Inference in Audio Event Recognition.
CoRR, 2024

Integrating IP Broadcasting with Audio Tags: Workflow and Challenges.
CoRR, 2024

Efficient Audio Captioning with Encoder-Level Knowledge Distillation.
CoRR, 2024

Improving Audio Generation with Visual Enhanced Caption.
CoRR, 2024

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound.
CoRR, 2024

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift.
CoRR, 2024

Separation of the Aortic and Pulmonary Components of the Second Heart Sound via Alternating Optimization.
IEEE Access, 2024

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining.
Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning.
Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

Retrieval-Augmented Text-to-Audio Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Audiosr: Versatile Audio Super-Resolution at Scale.
Proceedings of the IEEE International Conference on Acoustics, 2024

Universal Sound Separation with Self-Supervised Audio Masked Autoencoder.
Proceedings of the 32nd European Signal Processing Conference, 2024

Working with AI Sound: Exploring the Future of Workplace AI Sound Technologies.
Proceedings of the 3rd Annual Meeting of the Symposium on Human-Computer Interaction for Work, 2024

Learning Temporal Resolution in Spectrogram for Audio Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Numerical evaluation on sub-Nyquist spectrum reconstruction methods.
Frontiers Comput. Sci., December, 2023

META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection.
CoRR, 2023

Separate Anything You Describe.
CoRR, 2023

WavJourney: Compositional Audio Creation with Large Language Models.
CoRR, 2023

Text-Driven Foley Sound Generation With Latent Diffusion Model.
CoRR, 2023

Audio Tagging on an Embedded Hardware Platform.
CoRR, 2023

E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks.
CoRR, 2023

Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7.
CoRR, 2023

Universal Source Separation with Weakly Labelled Data.
CoRR, 2023

Efficient CNNs via Passive Filter Pruning.
CoRR, 2023

Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study.
CoRR, 2023

Compressing Audio CNNS with Graph Centrality Based Filter Pruning.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Ontology-aware Learning and Evaluation for Audio Tagging.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adapting Language-Audio Models as Few-Shot Audio Learners.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models.
Proceedings of the International Conference on Machine Learning, 2023

Efficient Similarity-Based Passive Filter Pruning for Compressing CNNS.
Proceedings of the IEEE International Conference on Acoustics, 2023

Simple Pooling Front-Ends for Efficient Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Automated audio captioning: an overview of recent progress and new challenges.
EURASIP J. Audio Speech Music. Process., 2022

ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation.
CoRR, 2022

Automated Audio Captioning via Fusion of Low- and High- Dimensional Features.
CoRR, 2022

Learning the Spectrogram Temporal Resolution for Audio Classification.
CoRR, 2022

Low-complexity CNNs for Acoustic Scene Classification.
CoRR, 2022

Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning.
CoRR, 2022

Continual Learning For On-Device Environmental Sound Classification.
CoRR, 2022

Fish Feeding Intensity Assessment in Aquaculture: A New Audio Dataset AFFIA3K and a Deep Learning Algorithm.
Proceedings of the 32nd IEEE International Workshop on Machine Learning for Signal Processing, 2022

A Passive Similarity based CNN Filter Pruning for Efficient Acoustic Scene Classification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

On Metric Learning for Audio-Text Cross-Modal Retrieval.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Separate What You Describe: Language-Queried Audio Source Separation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Diverse Audio Captioning Via Adversarial Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Deep Neural Decision Forest for Acoustic Scene Classification.
Proceedings of the 30th European Signal Processing Conference, 2022

Leveraging Pre-trained BERT for Audio Captioning.
Proceedings of the 30th European Signal Processing Conference, 2022

Continual Learning for On-Ddevice Environmental Sound Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Low-Complexity CNNs for Acoustic Scene Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

DCASE2021 UAD-S UMAP Data.
Dataset, July, 2021

CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification.
IEEE Trans. Multim., 2021

Deep Learning-Based Energy Disaggregation and On/Off Detection of Household Appliances.
ACM Trans. Knowl. Discov. Data, 2021

Sound Event Detection: A tutorial.
IEEE Signal Process. Mag., 2021

Sparse Analysis Model Based Dictionary Learning for Signal Declipping.
IEEE J. Sel. Top. Signal Process., 2021

Sub-Nyquist spectrum sensing and learning challenge.
Frontiers Comput. Sci., 2021

Federated Learning With Highly Imbalanced Audio Data.
CoRR, 2021

An Audio-Based Deep Learning Framework ForBBC Television Programme Classification.
CoRR, 2021

Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning.
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Weighted Magnitude-Phase Loss for Speech Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Audio-Based Deep Learning Framework For BBC Television Programme Classification.
Proceedings of the 29th European Signal Processing Conference, 2021

Gender Bias in Depression Detection Using Audio Features.
Proceedings of the 29th European Signal Processing Conference, 2021

Audio Captioning Transformer.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

CL4AC: A Contrastive Loss for Audio Captioning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

ARCA23K: An Audio Dataset for Investigating Open-Set Label Noise.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Using UMAP to Inspect Audio Data for Unsupervised Anomaly Detection Under Domain-Shift Conditions.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Source Separation of the Second Heart Sound via Alternating Optimization.
Proceedings of the Computing in Cardiology, CinC 2021, Brno, 2021

2020
Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Raw Audio for Depression Detection Can Be More Robust Against Gender Imbalance than Mel-Spectrogram Features.
CoRR, 2020

Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning With Out-of-Distribution Data for Audio Classification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation.
Proceedings of the 28th European Signal Processing Conference, 2020

Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

Open-Window: A Sound Event Dataset for Window State Detection and Recognition.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Event-Independent Network for Polyphonic Sound Event Localization and Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Sparse Recovery and Dictionary Learning From Nonlinear Compressive Measurements.
IEEE Trans. Signal Process., 2019

Weakly Labelled AudioSet Tagging With Attention Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Musical Source Separation: An Introduction.
IEEE Signal Process. Mag., 2019

Deep Learning Based Energy Disaggregation and On/Off Detection of Household Appliances.
CoRR, 2019

Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems.
CoRR, 2019

Weakly labelled AudioSet Classification with Attention Neural Networks.
CoRR, 2019

Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes.
Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Event Detection from Weakly Labeled Data Using Auditory Salience.
Proceedings of the IEEE International Conference on Acoustics, 2019

Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Scene Generation with Conditional Samplernn.
Proceedings of the IEEE International Conference on Acoustics, 2019

Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2019

Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks.
Proceedings of the 27th European Signal Processing Conference, 2019

Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data.
Proceedings of the 9th International Conference on Digital Public Health, 2019

Sound Event Localization and Detection Using CRNN on Pairs of Microphones.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
Malicious User Detection Based on Low-Rank Matrix Completion in Wideband Spectrum Sensing.
IEEE Trans. Signal Process., 2018

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

A Contextual Study of Semantic Speech Editing in Radio Production.
Int. J. Hum. Comput. Stud., 2018

DCASE 2018 Challenge baseline with convolutional neural networks.
CoRR, 2018

BSS Eval or Peass? Predicting the Perception of Singing-Voice Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Orthogonality-Regularized Masked NMF for Learning on Weakly Labeled Audio Data.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Inexact Proximal Operators for 𝓁<sub>p</sub>-Quasinorm Minimization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Audio Set Classification with Attention Model: A Probabilistic Perspective.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Synthesis of Images by Two-Stage Generative Adversarial Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Consistent Dictionary Learning for Signal Declipping.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018

Supporting Audiography: Design of a System for Sentimental Sound Recording, Classification and Playback.
Proceedings of the HCI International 2018, 2018

A Hierarchical Latent Mixture Model for Polyphonic Music Analysis.
Proceedings of the 26th European Signal Processing Conference, 2018

Raw Multi-Channel Audio Source Separation using Multi- Resolution Convolutional Auto-Encoders.
Proceedings of the 26th European Signal Processing Conference, 2018

Attention-based convolutional neural networks for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

General-purpose audio tagging from noisy labels using convolutional neural networks.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Predicting the perceived level of reverberation using machine learning.
Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

2017

Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Polyphonic Sound Event Tracking Using Linear Dynamical Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Surrey-cvssp system for DCASE2017 challenge task4.
CoRR, 2017

Binaural and log-power spectra features with deep neural networks for speech-noise separation.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017

Clustering Expressive Timing with Regressed Polynomial Coefficients Demonstrated by a Model Selection Test.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Learning the Mapping Function from Voltage Amplitudes to Sensor Positions in 3D-EMA Using Deep Neural Networks.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Convolutional gated recurrent neural network incorporating spatial features for audio tagging.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

A greedy algorithm with learned statistics for sparse signal reconstruction.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A joint detection-classification model for audio tagging of weakly labelled data.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Fast tagging of natural sounds using marginal co-regularization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Assessment of musical noise using localization of isolated peaks in time-frequency domain.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Psychophysical Evaluation of Audio Source Separation Methods.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017

Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017

Single channel audio source separation using convolutional denoising autoencoders.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Masked non-negative matrix factorization for eire detection using weakly labeled data.
Proceedings of the 25th European Signal Processing Conference, 2017

Multivariate iterative hard thresholding for sparse decomposition with flexible sparsity patterns.
Proceedings of the 25th European Signal Processing Conference, 2017

Automatic music transcription using low rank non-negative matrix decomposition.
Proceedings of the 25th European Signal Processing Conference, 2017

Joint detection and classification convolutional neural network on weakly labelled bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017

Neuroevolution for Sound Event Detection in Real Life Audio: A Pilot Study.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Using deep neural networks to estimate tongue movements from speech face motion.
Proceedings of the 14th International Conference on Auditory-Visual Speech Processing, 2017

2016
Wideband Spectrum Sensing on Real-Time Signals at Sub-Nyquist Sampling Rates in Single and Cooperative Multiple Nodes.
IEEE Trans. Signal Process., 2016

Analysis SimCO Algorithms for Sparse Analysis Model Based Dictionary Learning.
IEEE Trans. Signal Process., 2016

Automatic Environmental Sound Recognition: Performance Versus Computational Cost.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Non-Negative Group Sparsity with Subspace Note Modelling for Polyphonic Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Hierachical learning for DNN-based acoustic scene classification.
CoRR, 2016

Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging.
CoRR, 2016

Combining Mask Estimates for Single Channel Audio Source Separation Using Deep Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Detection of overlapping acoustic events using a temporally-constrained probabilistic model.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Evaluation of audio source separation models using hypothesis-driven non-parametric statistical methods.
Proceedings of the 24th European Signal Processing Conference, 2016

Hierarchical Learning for DNN-Based Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Fully DNN-Based Multi-Label Regression for Audio Tagging.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Coupled Sparse NMF vs. Random Forest Classification for Real Life Acoustic Event Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Deep Neural Network Baseline for DCASE Challenge 2016.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2015
Learning Incoherent Subspaces: Classification via Incoherent Dictionary Learning.
J. Signal Process. Syst., 2015

Detection and Classification of Acoustic Scenes and Events.
IEEE Trans. Multim., 2015

Acoustic Scene Classification: Classifying environments from the sounds they produce.
IEEE Signal Process. Mag., 2015

Deep Remix: Remixing Musical Mixtures Using a Convolutional Deep Neural Network.
CoRR, 2015

Keynote addresses: Reverse engineering the neural mechanisms involved in robust speech processing.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Chime-home: A dataset for sound source recognition in a domestic environment.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Non-negative matrix factorisation incorporating greedy hellinger sparse coding applied to polyphonic music transcription.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A dynamic programming variant of non-negative matrix deconvolution for the transcription of struck string instruments.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

The Clustering of Expressive Timing Within a Phrase in Classical Piano Performances by Gaussian Mixture Models.
Proceedings of the Music, Mind, and Embodiment - 11th International Symposium, 2015

2014
Multichannel High-Resolution NMF for Modeling Convolutive Mixtures of Non-Stationary Signals in the Time-Frequency Domain.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Score-Informed Source Separation for Musical Audio Recordings: An overview.
IEEE Signal Process. Mag., 2014

Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning.
CoRR, 2014

Acoustic Scene Classification.
CoRR, 2014

An Open Dataset for Research on Audio Field Recording Archives: freefield1010.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014

Harmonic Motion: A Toolkit for Processing Gestural Data for Interactive Sound.
Proceedings of the 14th International Conference on New Interfaces for Musical Expression, 2014


Phase-based harmonic/percussive separation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Polyphonic piano transcription using non-negative Matrix Factorisation with group sparsity.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improving instrument recognition in polyphonic music through system integration.
Proceedings of the IEEE International Conference on Acoustics, 2014

Accounting for phase cancellations in non-negative matrix factorization using weighted distances.
Proceedings of the IEEE International Conference on Acoustics, 2014

Efficient compressive spectrum sensing algorithm for M2M devices.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Audio-only Bird Classification Using Unsupervised Feature Learning.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

2013
Learning Incoherent Dictionaries for Sparse Approximation Using Iterative Projections and Rotations.
IEEE Trans. Signal Process., 2013

On Theorem 10 in "On Polar Polytopes and the Recovery of Sparse Representations" [Sep 07 3188-3195].
IEEE Trans. Inf. Theory, 2013

Segregating event streams and noise with a Markov renewal process model.
J. Mach. Learn. Res., 2013

Large-scale analysis of frequency modulation in birdsong databases.
CoRR, 2013

Synchronizing Sequencing Software to a Live Drummer.
Comput. Music. J., 2013

Detection and classification of acoustic scenes and events: An IEEE AASP challenge.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Multichannel HR-NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Structured sparsity using backwards elimination for Automatic Music Transcription.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Learning incoherent subspaces for classification via supervised iterative projections and rotations.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Improved multiple birdsong tracking with distribution derivative method and Markov renewal process clustering.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic Music Transcription using row weighted decompositions.
Proceedings of the IEEE International Conference on Acoustics, 2013

Behavior of greedy sparse representation algorithms on nested supports.
Proceedings of the IEEE International Conference on Acoustics, 2013

Recognition of harmonic sounds in polyphonic audio using a missing feature approach.
Proceedings of the IEEE International Conference on Acoustics, 2013

Score informed audio source separation using constrained nonnegative matrix factorization and score synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013

Predictive Information in Gaussian Processes with Application to Music Analysis.
Proceedings of the Geometric Science of Information - First International Conference, 2013

Low-rank matrix completion based malicious user detection in cooperative spectrum sensing.
Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

Learning overcomplete dictionaries with ℓ0-sparse Non-negative Matrix Factorisation.
Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

Dictionary learning via projected maximal exploration.
Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

A database and challenge for acoustic scene classification and event detection.
Proceedings of the 21st European Signal Processing Conference, 2013

Probabilistic time-frequency source-filter decomposition of non-stationary signals.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
Performance Following: Real-Time Prediction of Musical Sequences Without a Score.
IEEE Trans. Speech Audio Process., 2012

Reliability-Informed Beat Tracking of Musical Signals.
IEEE Trans. Speech Audio Process., 2012

Audio Inpainting.
IEEE Trans. Speech Audio Process., 2012

Best Practices for Scientific Computing
CoRR, 2012

Structured sparsity for automatic music transcription.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

INK-SVD: Learning incoherent dictionaries for sparse representations.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Instrumentation-based music similarity using sparse representations.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Analysis-based sparse reconstruction with synthesis-based solvers.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Sound Software: Towards software reuse in audio and music research.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Dictionary Learning with Large Step Gradient Descent for Sparse Representations.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Group Polytope Faces Pursuit for Recovery of Block-Sparse Signals.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Framewise heterodyne chirp analysis of birdsong.
Proceedings of the 20th European Signal Processing Conference, 2012

An alternating descent algorithm for the off-grid DOA estimation problem with sparsity constraints.
Proceedings of the 20th European Signal Processing Conference, 2012

Choosing analysis or synthesis recovery for sparse reconstruction.
Proceedings of the 20th European Signal Processing Conference, 2012

Denoising and segmentation of the second heart sound using matching pursuit.
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012

Cognitive music modelling: An information dynamics approach.
Proceedings of the 3rd International Workshop on Cognitive Information Processing, 2012

Using Oracle Analysis for Decomposition-Based Automatic Music Transcription.
Proceedings of the From Sounds to Music and Emotions - 9th International Symposium, 2012

The Melody Triangle: Exploring Pattern and Predictability in Music.
Proceedings of the Musical Metacreation, 2012

2011
Measuring the Performance of Beat Tracking Algorithms Using a Beat Error Histogram.
IEEE Signal Process. Lett., 2011

Fast Dictionary Learning for Sparse Representations of Speech Signals.
IEEE J. Sel. Top. Signal Process., 2011

Onset Event Decoding Exploiting the Rhythmic Structure of Polyphonic Music.
IEEE J. Sel. Top. Signal Process., 2011

On the disjointess of sources in music using different time-frequency representations.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

The Medium is the Message: Composing Instruments and Performing Mappings.
Proceedings of the 11th International Conference on New Interfaces for Musical Expression, 2011

Causal Prediction of Continuous-Valued Music Features.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Blind source separation of periodic sources from sequentially recorded instantaneous mixtures.
Proceedings of the 7th International Symposium on Image and Signal Processing and Analysis, 2011

Real-time Visual Beat Tracking using a Comb Filter Matrix.
Proceedings of the 2011 International Computer Music Conference, 2011

Separating sources from sequentially acquired mixtures of heart signals.
Proceedings of the IEEE International Conference on Acoustics, 2011

Dictionary learning of convolved signals.
Proceedings of the IEEE International Conference on Acoustics, 2011

A constrained matching pursuit approach to audio declipping.
Proceedings of the IEEE International Conference on Acoustics, 2011

Sequential minimal eigenvalues - an approach to analysis dictionary learning.
Proceedings of the 19th European Signal Processing Conference, 2011

Structure-aware dictionary learning with harmonic atoms.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Sparse Representations in Audio and Music: From Coding to Source Separation.
Proc. IEEE, 2010

Cross-associating unlabelled timbre distributions to create expressive musical mappings.
Proceedings of the First Workshop on Applications of Pattern Analysis, 2010

A measure of statistical complexity based on predictive information
CoRR, 2010

Performance following: Tracking a performance without a score.
Proceedings of the IEEE International Conference on Acoustics, 2010

An L1 criterion for dictionary learning by subspace identification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Gradient Polytope Faces Pursuit for large scale sparse recovery problems.
Proceedings of the IEEE International Conference on Acoustics, 2010

Note onset detection using rhythmic structure.
Proceedings of the IEEE International Conference on Acoustics, 2010

A Multichannel Spatial Compressed Sensing Approach for Direction of Arrival Estimation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

2009
Fast Multidimensional Entropy Estimation by k -d Partitioning.
IEEE Signal Process. Lett., 2009

Evaluation of live human-computer music-making: Quantitative and qualitative approaches.
Int. J. Hum. Comput. Stud., 2009

Information dynamics: patterns of expectation and surprise in the perception of music.
Connect. Sci., 2009

Towards a musical beat emphasis function.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Estimating Phase Linearity in the Frequency-Domain ICA Demixing Matrix.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

Real-Time Chord Recognition for Live Performance.
Proceedings of the 2009 International Computer Music Conference, 2009

Post-Processing Fiddle~: A Real-Time Multi-Pitch Tracking Technique Using Harmonic Partial Subtraction for Use within Live Performance Systems.
Proceedings of the 2009 International Computer Music Conference, 2009

Sparse reconstruction for compressed sensing using Stagewise Polytope Faces Pursuit.
Proceedings of the 16th International Conference on Digital Signal Processing, 2009

Using phase linearity in frequency-domain ICA to tackle the permutation problem.
Proceedings of the IEEE International Conference on Acoustics, 2009

Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation.
Proceedings of the IEEE International Conference on Acoustics, 2009

Speech denoising based on a greedy adaptive dictionary algorithm.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Efficient Bayesian inference for harmonic models via adaptive posterior factorization.
Neurocomputing, 2008

An adaptive stereo basis method for convolutive blind audio source separation.
Neurocomputing, 2008

Theorems on Positive Data: On the Uniqueness of NMF.
Comput. Intell. Neurosci., 2008

Real Time Gesture Learning and Recognition: Towards Automatic Categorization.
Proceedings of the 8th International Conference on New Interfaces for Musical Expression, 2008

Discourse Analysis Evaluation Method for Expressive Musical Interfaces.
Proceedings of the 8th International Conference on New Interfaces for Musical Expression, 2008

A Turing Test for B-Keeper: Evaluating an Interactive.
Proceedings of the 8th International Conference on New Interfaces for Musical Expression, 2008

Rhythmic Analysis for Real-Time audio effects.
Proceedings of the 2008 International Computer Music Conference, 2008

Oracle estimation of adaptive cosine packet transforms for underdetermined audio source separation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Natural Conjugate Gradient on Complex Flag Manifolds for Complex Independent Subspace Analysis.
Proceedings of the Artificial Neural Networks, 2008

Separation of stereo speech signals based on a sparse dictionary algorithm.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Exploring the effect of rhythmic style classification on automatic tempo estimation.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
On Polar Polytopes and the Recovery of Sparse Representations.
IEEE Trans. Inf. Theory, 2007

Low Bit-Rate Object Coding of Musical Audio Using Bayesian Harmonic Models.
IEEE Trans. Speech Audio Process., 2007

Context-Dependent Beat Tracking of Musical Audio.
IEEE Trans. Speech Audio Process., 2007

Oracle estimators for the benchmarking of source separation algorithms.
Signal Process., 2007

Audio source separation with a signal-adaptive local cosine transform.
Signal Process., 2007

Real-Time Beat-Synchronous Audio Effects.
Proceedings of the Seventh International Conference on New Interfaces for Musical Expression, 2007

B-Keeper: A Beat-Tracker for Live Performance.
Proceedings of the Seventh International Conference on New Interfaces for Musical Expression, 2007

Adaptive whitening for Improved Real-Time audio onset Detection.
Proceedings of the 2007 International Computer Music Conference, 2007

Geometry and Manifolds for Independent Component Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2007

Flag Manifolds for Subspace ICA Problems.
Proceedings of the IEEE International Conference on Acoustics, 2007

On the Use of Entropy for Beat Tracking Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Dictionary Learning for L1-Exact Sparse Coding.
Proceedings of the Independent Component Analysis and Signal Separation, 2007

The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals.
Proceedings of the Independent Component Analysis and Signal Separation, 2007

Blind Source Separation using Space-Time Independent Component Analysis.
Proceedings of the Blind Speech Separation, 2007

2006
Unsupervised analysis of polyphonic music by sparse coding.
IEEE Trans. Neural Networks, 2006

Sparse representations of polyphonic music.
Signal Process., 2006

Single-Channel Mixture Decomposition Using Bayesian Harmonic Models.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Recovery of Sparse Representations by Polytope Faces Pursuit.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Riemannian Optimization Method on the Flag Manifold for Independent Subspace Analysis.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Sparse Coding for Convolutive Blind Audio Source Separation.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Source extraction from two-channel mixtures by joint cosine packet analysis.
Proceedings of the 14th European Signal Processing Conference, 2006

A spectral difference approach to downbeat extraction in musical audio.
Proceedings of the 14th European Signal Processing Conference, 2006

2005

Geometrical methods for non-negative ICA: Manifolds, Lie groups and toral subalgebras.
Neurocomputing, 2005

Beat tracking with a two state model [music applications].
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A "nonnegative PCA" algorithm for independent component analysis.
IEEE Trans. Neural Networks, 2004

Blind Separation of Positive Sources by Globally Convergent Gradient Search.
Neural Comput., 2004

Causal Tempo Tracking of Audio.
Proceedings of the ISMIR 2004, 2004

Fast labelling of notes in music signals.
Proceedings of the ISMIR 2004, 2004

Polyphonic transcription by non-negative sparse coding of power spectra.
Proceedings of the ISMIR 2004, 2004

Real-time temporal segmentation of note objects in music signals.
Proceedings of the 2004 International Computer Music Conference, 2004

Lie Group Methods for Optimization with Orthogonality Constraints.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Optimization Using Fourier Expansion over a Geodesic for Non-negative ICA.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Application of Geometric Dependency Analysis to the Separation of Convolved Mixtures.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

2003
Algorithms for nonnegative independent component analysis.
IEEE Trans. Neural Networks, 2003

2002
Conditions for nonnegative independent component analysis.
IEEE Signal Process. Lett., 2002

Automatic Music Transcription and Audio Source Separation.
Cybern. Syst., 2002

2000
On-Line Connectionist Q-Learning Produces Unreliable Performance with A Synonym Finding Task.
Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks, 2000

Information Density and Cortical Magnification Factors.
Proceedings of the Information Theory and the Brain, 2000

1997
Communications and neural networks: theory and practice.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Unsupervised Neural Network Learning Procedures for Feature Extraction and Classification.
Appl. Intell., 1996

1995
Lyapunov functions for convergence of principal component algorithms.
Neural Networks, 1995

1993
Efficient information transfer and anti-Hebbian neural networks.
Neural Networks, 1993

Generation and Adaptation of Neural Networks by Evolutionary Techniques (GANNET).
Neural Comput. Appl., 1993

1991
The effect of receptor signal-to-noise levels on optimal filtering in a sensory system.
Proceedings of the 1991 International Conference on Acoustics, 1991


  Loading...