Tuomas Virtanen
ORCID: 0000-0002-4604-9729
Affiliations:
- Tampere University of Technology, Finland
According to our database, Tuomas Virtanen authored at least 268 papers between 2000 and 2025.
Bibliography
2025
The Accuracy Cost of Weakness: A Theoretical Analysis of Fixed-Segment Weak Labeling for Events in Time.
CoRR, February, 2025
CoRR, January, 2025
IEEE Signal Process. Lett., 2025
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Language-based machine perception: linguistic perspectives on the compilation of captioning datasets.
Digit. Scholarsh. Humanit., 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Proceedings of the 2024 IEEE 5th International Symposium on the Internet of Sounds (IS2), Erlangen, Germany, September 30, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning.
Proceedings of the 32nd European Signal Processing Conference, 2024
Automatic Live Music Song Identification Using Multi-level Deep Sequence Similarity Learning.
Proceedings of the 32nd European Signal Processing Conference, 2024
Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement.
Proceedings of the 32nd European Signal Processing Conference, 2024
2023
Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 31st European Signal Processing Conference, 2023
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints.
Proceedings of the 31st European Signal Processing Conference, 2023
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System.
Proceedings of the 31st European Signal Processing Conference, 2023
Position Tracking of a Varying Number of Sound Sources with Sliding Permutation Invariant Training.
Proceedings of the 31st European Signal Processing Conference, 2023
2022
Self-Supervised Learning of Audio Representations From Audio-Visual Data Using Spatial Alignment.
IEEE J. Sel. Top. Signal Process., 2022
IEEE J. Sel. Top. Signal Process., 2022
Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022
Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 30th European Signal Processing Conference, 2022
Proceedings of the 30th European Signal Processing Conference, 2022
Proceedings of the 30th European Signal Processing Conference, 2022
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary.
Comput. Speech Lang., 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Towards Sonification in Multimodal and User-friendly Explainable Artificial Intelligence.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Deep Neural Network Based Low-Latency Speech Separation with Asymmetric Analysis-Synthesis Window Pair.
Proceedings of the 29th European Signal Processing Conference, 2021
WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
Proceedings of the 29th European Signal Processing Conference, 2021
Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments.
Proceedings of the 29th European Signal Processing Conference, 2021
Proceedings of the 29th European Signal Processing Conference, 2021
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
Low-Complexity Acoustic Scene Classification for Multi-Device Audio: Analysis of DCASE 2021 Challenge Systems.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
2020
Dataset used in COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
Dataset, June, 2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
IEEE Signal Process. Lett., 2020
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information.
CoRR, 2020
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations.
CoRR, 2020
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters.
Proceedings of the 28th European Signal Processing Conference, 2020
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Acoustic Scene Classification in DCASE 2020 Challenge: Generalization Across Devices and Low Complexity Solutions.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
2019
Code of the method presented in the paper: Drossos et al, "Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling," in proceedings of DCASE 2019.
Dataset, November, 2019
J. Supercomput., 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks.
IEEE J. Sel. Top. Signal Process., 2019
Digit. Signal Process., 2019
CoRR, 2019
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters.
CoRR, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks.
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 27th European Signal Processing Conference, 2019
Acoustic Scene Classification in DCASE 2019 Challenge: Closed and Open Set Classification and Data Mismatch Setups.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
EURASIP J. Audio Speech Music. Process., 2018
CoRR, 2018
Proceedings of the 28th IEEE International Workshop on Machine Learning for Signal Processing, 2018
An Active Learning Method Using Clustering and Committee-Based Sample Selection for Sound Event Classification.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Deep Neural Network Based Speech Separation Optimizing an Objective Estimator of Intelligibility for Low Latency Applications.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Estimation of Time-Varying Room Impulse Responses of Multiple Sound Sources from Observed Mixture and Isolated Source Signals.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network.
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
CoRR, 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Assessment of human and machine performance in acoustic scene classification: DCASE 2016 case study.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Sound event detection using spatial features and convolutional recurrent neural network.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation.
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the 25th European Signal Processing Conference, 2017
ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms.
Proceedings of the Data Driven Approaches in Digital Education, 2017
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
Sound Event Detection Using Weakly Labeled Dataset with Stacked Convolutional and Recurrent Neural Network.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017
2016
Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Speech Commun., 2016
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016
Recurrent neural networks for polyphonic sound event detection in real life recordings.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016
Proceedings of the 24th European Signal Processing Conference, 2016
Proceedings of the 24th European Signal Processing Conference, 2016
Proceedings of the 24th European Signal Processing Conference, 2016
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
2015
Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Compositional Models for Audio Processing: Uncovering the structure of sound mixtures.
IEEE Signal Process. Mag., 2015
Digit. Signal Process., 2015
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015
Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Low-latency sound-source-separation using non-negative matrix factorisation with coupled analysis and synthesis dictionaries.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Exemplar-based speech enhancement for deep neural network based automatic speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Speaker Verification Using Adaptive Dictionaries in Non-negative Spectrogram Deconvolution.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
Automatic recognition of environmental sound events using all-pole group delay features.
Proceedings of the 23rd European Signal Processing Conference, 2015
Multi-label vs. combined single-label sound event detection with deep neural networks.
Proceedings of the 23rd European Signal Processing Conference, 2015
2014
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
EURASIP J. Audio Speech Music. Process., 2014
Exemplar-based noise robust automatic speech recognition using modulation spectrogram features.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2014
Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the 22nd European Signal Processing Conference, 2014
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014
2013
IEEE Trans. Speech Audio Process., 2013
On the human ability to discriminate audio ambiances from similar locations of an urban environment.
Pers. Ubiquitous Comput., 2013
EURASIP J. Audio Speech Music. Process., 2013
Modelling non-stationary noise with spectral factorisation in automatic speech recognition.
Comput. Speech Lang., 2013
Music self-similarity modeling using augmented nonnegative matrix factorization of block and stripe patterns.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Supervised model training for overlapping sound events based on unsupervised source separation.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Acquiring variable length speech bases for factorisation-based noise robust speech recognition.
Proceedings of the 21st European Signal Processing Conference, 2013
Proceedings of the 21st European Signal Processing Conference, 2013
Proceedings of the Sound, Music, and Motion - 10th International Symposium, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
IEEE Trans. Speech Audio Process., 2012
Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Proceedings of the 5th International Symposium on Communications, 2012
Human sound perception - what can we learn from it when developing audio analysis algorithms?
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012
Group Sparsity for Speaker Identity Discrimination in Factorisation-based Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Modelling spectro-temporal dynamics in factorisation-based noise-robust automatic speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Multiple Instrument Mixtures Source Separation Evaluation Using Instrument-Dependent NMF Models.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012
Permutation alignment of frequency-domain ICA by the maximization of intra-source envelope correlations.
Proceedings of the 20th European Signal Processing Conference, 2012
Detection, separation and recognition of speech from continuous signals using spectral factorisation.
Proceedings of the 20th European Signal Processing Conference, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization.
IEEE J. Sel. Top. Signal Process., 2011
Multichannel audio upmixing based on non-negative tensor factorization representation.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 19th European Signal Processing Conference, 2011
2010
IEEE Trans. Speech Audio Process., 2010
IEEE Trans. Speech Audio Process., 2010
EURASIP J. Audio Speech Music. Process., 2010
Audio Query by Example Using Similarity Measures between Probability Density Functions of Features.
EURASIP J. Audio Speech Music. Process., 2010
State-based labelling for a sparse representation of speech and its application to robust speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Non-negative matrix factorization based compensation of music for automatic speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Sound source separation in monaural music signals using excitation-filter model and EM algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the 18th European Signal Processing Conference, 2010
Proceedings of the 18th European Signal Processing Conference, 2010
Proceedings of the 18th European Signal Processing Conference, 2010
2009
Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009
Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation.
Proceedings of the Independent Component Analysis and Signal Separation, 2009
Interpolating hidden Markov model and its application to automatic instrument recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
Spectral covariance in prior distributions of non-negative matrix factorization based speech separation.
Proceedings of the 17th European Signal Processing Conference, 2009
Proceedings of the 17th European Signal Processing Conference, 2009
Proceedings of the 17th European Signal Processing Conference, 2009
2008
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008
Accompaniment separation and karaoke application based on automatic melody transcription.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Voice activity detection in the presence of breathing noise using neural network and hidden Markov model.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria.
IEEE Trans. Speech Audio Process., 2007
Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Query by Example of Audio Signals using Euclidean Distance Between Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Speech recognition using factorial hidden Markov models for separation in the feature space.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
Proceedings of the 13th European Signal Processing Conference, 2005
Proceedings of the 13th European Signal Processing Conference, 2005
Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine.
Proceedings of the 13th European Signal Processing Conference, 2005
2004
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004
2003
Proceedings of the 2003 International Computer Music Conference, 2003
2002
Proceedings of the IEEE International Conference on Acoustics, 2002
2000
Comput. Methods Programs Biomed., 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
Recognition of acoustic noise mixtures by combined bottom-up and top-down processing.
Proceedings of the 10th European Signal Processing Conference, 2000