Emmanouil Benetos

ORCID: 0000-0002-6820-6764

Affiliations:
  • Queen Mary University of London, UK


According to our database, Emmanouil Benetos authored at least 209 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Few-Shot Class-Incremental Audio Classification Using Dynamically Expanded Classifier With Self-Attention Modified Prototypes.
IEEE Trans. Multim., 2024

ATGNN: Audio Tagging Graph Neural Network.
IEEE Signal Process. Lett., 2024

A Data-Driven Analysis of Robust Automatic Piano Transcription.
IEEE Signal Process. Lett., 2024

ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization.
CoRR, 2024

GraFPrint: A GNN-Based Approach for Audio Identification.
CoRR, 2024

OmniBench: Towards The Future of Universal Omni-Language Models.
CoRR, 2024

LC-Protonets: Multi-label Few-shot learning for world music audio tagging.
CoRR, 2024

Acoustic identification of individual animals with hierarchical contrastive learning.
CoRR, 2024

Domain-Invariant Representation Learning of Bird Sounds.
CoRR, 2024

Foundation Models for Music: A Survey.
CoRR, 2024

MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models.
CoRR, 2024

Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation.
CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.
CoRR, 2024

Explaining models relating objects and privacy.
CoRR, 2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs.
CoRR, 2024

MuPT: A Generative Symbolic Music Pretrained Transformer.
CoRR, 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.
CoRR, 2024

MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

YourMT3+: Multi-Instrument Music Transcription with Enhanced Transformer Architectures and Cross-Dataset STEM Augmentation.
Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Learning from Taxonomy: Multi-Label Few-Shot Classification for Everyday Sound Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Mertech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model with Multi-Task Finetuning.
Proceedings of the IEEE International Conference on Acoustics, 2024

Mind the Domain Gap: A Systematic Analysis on Bioacoustic Sound Event Detection.
Proceedings of the 32nd European Signal Processing Conference, 2024

Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model.
Proceedings of the 32nd European Signal Processing Conference, 2024


2023
YourMT3 dataset (Part 1).
Dataset, October, 2023

PiJAMA: Piano Jazz with Automatic MIDI Annotations.
Dataset, September, 2023

MusicNet-16k + EM for YourMT3.
Dataset, April, 2023

Slakh2100-16k for YourMT3.
Dataset, March, 2023

PiJAMA: Piano Jazz with Automatic MIDI Annotations.
Trans. Int. Soc. Music. Inf. Retr., January, 2023

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation.
CoRR, 2023

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.
CoRR, 2023

Perceptual Musical Similarity Metric Learning with Graph Neural Networks.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Leveraging Synthetic Data for Improving Chamber Ensemble Separation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

MARBLE: Music Audio Representation Benchmark for Universal Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

From West to East: Who Can Understand the Music of the Others Better?
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

On the Effectiveness of Speech Self-Supervised Learning for Music.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Adapting Language-Audio Models as Few-Shot Audio Learners.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning Music Representations with wav2vec 2.0.
Proceedings of the 31st Irish Conference on Artificial Intelligence and Cognitive Science, 2023

2022
FSD-FS.
Dataset, December, 2022

EnsembleSet.
Dataset, May, 2022

Adaptive Scattering Transforms for Playing Technique Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Comparison of Feature Extraction Methods for Sound-Based Classification of Honey Bee Activity.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Humanities and engineering perspectives on music transcription.
Digit. Scholarsh. Humanit., 2022

MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning.
CoRR, 2022

Contrastive Audio-Language Learning for Music.
CoRR, 2022

Anomalous behaviour in loss-gradient based interpretability methods.
CoRR, 2022

Deep Conditional Representation Learning for Drum Sample Retrieval by Vocalisation.
CoRR, 2022

EnsembleSet: a new high quality synthesised dataset for chamber ensemble separation.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Contrastive Audio-Language Learning for Music.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Performance MIDI-to-score conversion by neural beat tracking.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Exploring Transformer's Potential on Automatic Piano Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2022

Learning Music Audio Representations Via Weak Language Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Lyrics Alignment Through Joint Pitch Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Scattering for Automatic Chick Call Recognition.
Proceedings of the 30th European Signal Processing Conference, 2022

Hypernetworks for Sound event Detection: a Proof-of-Concept.
Proceedings of the 30th European Signal Processing Conference, 2022

Explaining the Decision of Anomalous Sound Detectors.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Leveraging Label Hierarchies for Few-Shot Everyday Sound Recognition.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation.
IEEE Signal Process. Lett., 2021

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations.
Proceedings of the 13th International Conference on Quality of Multimedia Experience, 2021

Detecting Cover Songs with Pitch Class Key-Invariant Networks.
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Agreement Among Human and Automated Transcriptions of Global Songs.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Pitch-Informed Instrument Assignment using a Deep Convolutional Network with Multiple Kernel Shapes.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Vocal Harmony Separation Using Time-Domain Neural Networks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Evaluation of Data Augmentation Methods for Sound Scene Geotagging.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

MusCaps: Generating Captions for Music Audio.
Proceedings of the International Joint Conference on Neural Networks, 2021

Revisiting the Onsets and Frames Model with Additive Attention.
Proceedings of the International Joint Conference on Neural Networks, 2021

Prototypical Networks for Domain Adaptation in Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Joint Multi-Pitch Detection and Score Transcription for Polyphonic Piano Music.
Proceedings of the IEEE International Conference on Acoustics, 2021

Violinist identification based on vibrato features.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
CBFdataset: A Dataset of Chinese Bamboo Flute Performances.
Dataset, May, 2020

A large joint sound scene and sound event dataset for source separation of foreground sound events.
Dataset, February, 2020

Speech endpoint annotations and artefact details for ASVspoof 2017 version 2.0 dataset.
Dataset, January, 2020

Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription.
Trans. Int. Soc. Music. Inf. Retr., 2020

Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction With LSTMs.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Dataset Artefacts in Anti-Spoofing Systems: A Case Study on the ASVspoof 2017 Benchmark.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Deep generative variational autoencoding for replay spoof detection in automatic speaker verification.
Comput. Speech Lang., 2020

Musical Features for Automatic Music Transcription Evaluation.
CoRR, 2020

Audio Impairment Recognition using a Correlation-Based Feature Representation.
Proceedings of the Twelfth International Conference on Quality of Multimedia Experience, 2020

Subband Modeling for Spoofing Detection in Automatic Speaker Verification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Development of a Speech Quality Database Under Uncontrolled Conditions.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Memory Controlled Sequential Self Attention for Sound Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Reliable Local Explanations for Machine Listening.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Playing Technique Recognition by Joint Time-Frequency Scattering.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Study on the Transferability of Adversarial Attacks in Sound Event Classification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Modeling Plate and Spring Reverberation Using A DSP-Informed Deep Neural Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A-CRNN: A Domain Adaptation Model for Sound Event Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
CBFdataset: A Dataset of Chinese Bamboo Flute Performances.
Dataset, November, 2019

Audio-Based identification of Beehive states: The dataset.
Dataset, February, 2019

Automatic Music Transcription: An Overview.
IEEE Signal Process. Mag., 2019

Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF.
Sensors, 2019

Adversarial Attacks in Sound Event Classification.
CoRR, 2019

A general-purpose deep learning approach to model time-varying audio effects.
CoRR, 2019

GAN-based Generation and Automatic Selection of Explanations for Neural Networks.
CoRR, 2019

Polyphonic Sound Event and Sound Activity Detection: A Multi-Task Approach.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Investigating Kernel Shapes and Skip Connections for Deep Learning-Based Harmonic-Percussive Separation.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

City Classification from Multiple Real-World Sound Scenes.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Adapting the Quality of Experience Framework for Audio Archive Evaluation.
Proceedings of the 11th International Conference on Quality of Multimedia Experience QoMEX 2019, 2019

A Comparative Study of Neural Models for Polyphonic Music Sequence Transduction.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Blending Acoustic and Language Model Predictions for Automatic Music Transcription.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Adaptive Time-Frequency Scattering for Periodic Modulation Recognition in Music Signals.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Automatic Music Transcription and Ethnomusicology: a User Study.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Ensemble Models for Spoofing Detection in Automatic Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Towards Joint Sound Scene and Polyphonic Sound Event Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

SubSpectralNet - Using Sub-spectrogram Based Convolutional Neural Networks for Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Audio-based Identification of Beehive States.
Proceedings of the IEEE International Conference on Acoustics, 2019

Automatic Transcription of Diatonic Harmonica Recordings.
Proceedings of the IEEE International Conference on Acoustics, 2019

Robustness of Adversarial Attacks in Sound Event Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Audio Tagging using Linear Noise Modelling Layer.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Onsets, Activity, and Events: A Multi-task Approach for Polyphonic Sound Event Modelling.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2018
To bee or not to bee: An annotated dataset for beehive sound recognition.
Dataset, July, 2018

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Speaker recognition with hybrid features from a deep belief network.
Neural Comput. Appl., 2018

A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing.
CoRR, 2018

Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting.
CoRR, 2018

Analysing The Predictions Of a CNN-Based Replay Spoofing Detection System.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Analysing Replay Spoofing Countermeasure Performance under varied conditions.
Proceedings of the 28th IEEE International Workshop on Machine Learning for Signal Processing, 2018

Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

To bee or not to bee: Investigating machine learning approaches for beehive sound recognition.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

An extensible cluster-graph taxonomy for open set sound scene analysis.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017

On-Bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Polyphonic Sound Event Tracking Using Linear Dynamical Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

The Digital Music Lab: A Big Data Infrastructure for Digital Musicology.
ACM Journal on Computing and Cultural Heritage, 2017

Sound event detection in synthetic audio: Analysis of the dcase 2016 task results.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Assessing the Relevance of Onset Information for Note Tracking in Piano Music Transcription.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

Automatic Transcription of a Cappella recordings from Multiple Singers.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

Polyphonic Note and Instrument Tracking Using Linear Dynamical Systems.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

A Study on LSTM Networks for Polyphonic Music Sequence Modelling.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Multi-Pitch Detection and Voice Assignment for A Cappella Recordings of Multiple Singers.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

On the memory properties of recurrent neural models.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

2016
An End-to-End Neural Network for Polyphonic Piano Music Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

A Morphological Model for Simulating Acoustic Scenes and Its Application to Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Learning a Feature Space for Similarity in World Music.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

The Sousta Corpus: Beat-Informed Automatic Transcription of Traditional Dance Tunes.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

An Attack/Decay Model for Piano Transcription.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Detection of overlapping acoustic events using a temporally-constrained probabilistic model.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Digital music lab: A framework for analysing big music data.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Detection and Classification of Acoustic Scenes and Events.
IEEE Trans. Multim., 2015

An End-to-End Neural Network for Polyphonic Music Transcription.
CoRR, 2015

An evaluation framework for event detection using a morphological model of acoustic scenes.
CoRR, 2015

An Efficient Temporally-Constrained Probabilistic Model for Multiple-Instrument Music Transcription.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

A hybrid recurrent neural network for music transcription.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Alternate level clustering for drum transcription.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Improving Automatic Music Transcription Through Key Detection.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014

Incremental Dataset Definition for Large Scale Musicological Research.
Proceedings of the 1st International Workshop on Digital Libraries for Musicology, 2014


An RNN-based Music Language Model for Improving Automatic Music Transcription.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Template Adaptation for Improving Automatic Music Transcription.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Improving instrument recognition in polyphonic music through system integration.
Proceedings of the IEEE International Conference on Acoustics, 2014

Automatic transcription of pitched and unpitched sounds from polyphonic music.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Automatic music transcription: challenges and future directions.
J. Intell. Inf. Syst., 2013

Detection and classification of acoustic scenes and events: An IEEE AASP challenge.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

A Machine Learning Approach to Voice Separation in Lute Tablature.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Explicit Duration Hidden Markov Models for Multiple-Instrument Polyphonic Music Transcription.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Automatic Transcription of Turkish Makam Music.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

A database and challenge for acoustic scene classification and event detection.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
A Shift-Invariant Latent Variable Model for Automatic Music Transcription.
Comput. Music. J., 2012

Automatic Music Transcription: Breaking the Glass Ceiling.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Score-informed transcription for automatic piano tutoring.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription.
IEEE J. Sel. Top. Signal Process., 2011

A temporally-constrained convolutive probabilistic model for pitch detection.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

The Temperament Police: The Truth, the Ground Truth, and Nothing but the Truth.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Polyphonic music transcription using note onset and offset detection.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Auditory Spectrum-Based Pitched Instrument Onset Detection.
IEEE Trans. Speech Audio Process., 2010

Non-Negative Tensor Factorization Applied to Music Genre Classification.
IEEE Trans. Speech Audio Process., 2010

Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010

2009
Pitched Instrument Onset Detection based on Auditory Spectra.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

2008
Computationally Efficient and Robust BIC-Based Speaker Segmentation.
IEEE Trans. Speech Audio Process., 2008

Music Genre Classification: A Multilinear Approach.
Proceedings of the ISMIR 2008, 2008

A tensor-based approach for automatic music genre classification.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Movie Analysis with Emphasis to Dialogue and Action Scene Detection.
Proceedings of the Multimodal Processing and Interaction, Audio, Video, Text, 2008

2007
A neural network approach to audio-assisted movie dialogue detection.
Neurocomputing, 2007

Systematic comparison of BIC-based speaker segmentation systems.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

2006
Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Musical instrument classification using non-negative matrix factorization algorithms.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Musical Instrument Classification using Non-Negative Matrix Factorization Algorithms and Subset Feature Selection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification.
Proceedings of the 14th European Signal Processing Conference, 2006

