Kazuyoshi Yoshii

Orcid: 0000-0001-8387-8609

According to our database, Kazuyoshi Yoshii authored at least 180 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Joint Audio Source Localization and Separation with Distributed Microphone Arrays Based on Spatially-Regularized Multichannel NMF.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Direction.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Source Positions.
CoRR, 2023

Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Neural Band-to-Piano Score Arrangement with Stepless Difficulty Control.
Proceedings of the IEEE International Conference on Acoustics, 2023

Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation.
Proceedings of the 31st European Signal Processing Conference, 2023

Multimodal Multifaceted Music Emotion Recognition Based on Self-Attentive Fusion of Psychology-Inspired Symbolic and Acoustic Features.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

CTC2: End-to-End Drum Transcription Based on Connectionist Temporal Classification With Constant Tempo Constraint.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Joint Drum Transcription and Metrical Analysis Based on Periodicity-Aware Multi-Task Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Learning Multifaceted Self-Similarity for Musical Structure Analysis.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Computationally-Efficient Overdetermined Blind Source Separation Based on Iterative Source Steering.
IEEE Signal Process. Lett., 2022

Joint Localization and Synchronization of Distributed Camera-Attached Microphone Arrays for Indoor Scene Analysis.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

DNN-free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Tracking the Evolution of a Band's Live Performances over Decades.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

End-to-End Lyrics Transcription Informed by Pitch and Onset Estimation.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Difficulty-Aware Neural Band-to-Piano Score Arrangement based on Note- and Statistic-Level Criteria.
Proceedings of the IEEE International Conference on Acoustics, 2022

Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation.
IEEE Signal Process. Lett., 2021

MirrorNet: A Deep Reflective Approach to 2D Pose Estimation for Single-Person Images.
J. Inf. Process., 2021

Non-local musical statistics as guides for audio-to-score piano transcription.
Inf. Sci., 2021

Musical rhythm transcription based on Bayesian piece-specific score models capturing repetitions.
Inf. Sci., 2021

Global Structure-Aware Drum Transcription Based on Self-Attention Mechanisms.
CoRR, 2021

A Real-Time Drum-Wise Volume Visualization System for Learning Volume-Balanced Drum Performance.
Proceedings of the Entertainment Computing - ICEC 2021, 2021

Phase-Aware Joint Beat and Downbeat Estimation Based on Periodicity of Metrical Structure.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Joint Estimation of Note Values and Voices for Audio-to-Score Piano Transcription.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On VAE-Based Metric Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Statistical Correction of Transcribed Melody Notes Based on Probabilistic Integration of a Music Language Model and a Transcription Error Model.
Proceedings of the IEEE International Conference on Acoustics, 2021

Gamma Process FastMNMF for Separating an Unknown Number of Sound Sources.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Semi-Supervised Neural Chord Estimation Based on a Variational Autoencoder With Latent Chord Labels and Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Bayesian Melody Harmonization Based on a Tree-Structured Generative Model of Chord Sequences and Melodies.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Fast Multichannel Nonnegative Matrix Factorization With Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Flow-Based Deep Latent Variable Model for Speech Spectrogram Modeling and Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Bayesian Singing Transcription Based on a Hierarchical Generative Model of Keys, Musical Notes, and F0 Trajectories.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Flow-Based Independent Vector Analysis for Blind Source Separation.
IEEE Signal Process. Lett., 2020

Statistical learning and estimation of piano fingering.
Inf. Sci., 2020

Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Discrete Labels and Continuous Textures of Chords.
CoRR, 2020

MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images.
CoRR, 2020

A Method for Analysis of Shared Structure in Large Music Collections using Techniques from Genetic Sequencing and Graph Theory.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Multi-Instrument Music Transcription Based on Deep Spherical Clustering of Spectrograms and Pitchgrams.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Music Structure Analysis Based on an LSTM-HSMM Hybrid Model.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

The MIDI Degradation Toolkit: Symbolic Music Augmentation and Correction.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Adaptive Neural Speech Enhancement with a Denoising Variational Autoencoder.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fast Multichannel Correlated Tensor Factorization for Blind Source Separation.
Proceedings of the 28th European Signal Processing Conference, 2020

Semi-supervised Multichannel Speech Separation Based on a Phone- and Speaker-Aware Deep Generative Model of Speech Spectrograms.
Proceedings of the 28th European Signal Processing Conference, 2020

A Variational Autoencoder for Joint Chord and Key Estimation from Audio Chromagrams.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

End-to-end Music-mixed Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Integration of Semi-Blind Speech Source Separation and Voice Activity Detection for Flexible Spoken Dialogue.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Tatum-Level Drum Transcription Based on a Convolutional Recurrent Neural Network with Language Model-Based Regularized Training.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Semi-Supervised Multichannel Speech Enhancement With a Deep Speech Prior.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multi-Step Chord Sequence Prediction Based on Aggregated Multi-Scale Encoder-Decoder Network.
CoRR, 2019

Music Transcription Based on Bayesian Piece-Specific Score Models Capturing Repetitions.
CoRR, 2019

End-To-End Melody Note Transcription Based on a Beat-Synchronous Attention Mechanism.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Joint Singing Pitch Estimation and Voice Separation Based on a Neural Harmonic Structure Renderer.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Audio-Visual SLAM towards Human Tracking and Human-Robot Interaction in Indoor Environments.
Proceedings of the 28th IEEE International Conference on Robot and Human Interactive Communication, 2019

Multi-Step Chord Sequence Prediction Based On Aggregated Multi-Scale Encoder-Decoder Networks.
Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

Deep Bayesian Unsupervised Source Separation Based On A Complex Gaussian Mixture Model.
Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019

Blending Acoustic and Language Model Predictions for Automatic Music Transcription.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Statistical Music Structure Analysis Based on a Homogeneity-, Repetitiveness-, and Regularity-Aware Hierarchical Hidden Semi-Markov Model.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Bayesian Drum Transcription Based on Nonnegative Matrix Factor Decomposition with a Deep Score Prior.
Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Transcription of Lead, Bass, and Rhythm Guitars Based on a Factorial Hidden Semi-Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Deep Generative Model of Speech Complex Spectrograms.
Proceedings of the IEEE International Conference on Acoustics, 2019

Automatic Singing Transcription Based on Encoder-decoder Recurrent Neural Networks with a Weakly-supervised Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2019

Unsupervised Melody Style Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2019

Improved Metrical Alignment of MIDI Performance Based on a Repetition-aware Online-adapted Grammar.
Proceedings of the IEEE International Conference on Acoustics, 2019

Automatic Chord Estimation Based on a Frame-wise Convolutional Recurrent Neural Network with Non-Aligned Annotations.
Proceedings of the 27th European Signal Processing Conference, 2019

Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices.
Proceedings of the 27th European Signal Processing Conference, 2019

Cauchy Multichannel Speech Enhancement with a Deep Speech Prior.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Statistical Piano Reduction Controlling Performance Difficulty.
CoRR, 2018

Interactive Arrangement of Chords and Melodies Based on a Tree-Structured Generative Model.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Correlated Tensor Factorization for Audio Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Independent Low-Rank Tensor Analysis for Audio Source Separation.
Proceedings of the 26th European Signal Processing Conference, 2018

Sequential Generation of Singing F0 Contours from Musical Note Sequences Based on WaveNet.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Bayesian Multichannel Speech Enhancement with a Deep Speech Prior.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Probabilistic Sequential Patterns for Singing Transcription.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Note Value Recognition for Piano Transcription Using Markov Random Fields.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Simultaneous Identification and Localization of Still and Mobile Speakers Based on Binaural Robot Audition.
J. Robotics Mechatronics, 2017

Layout Optimization of Cooperative Distributed Microphone Arrays Based on Estimation of Source Separation Performance.
J. Robotics Mechatronics, 2017

Audio-Visual Beat Tracking Based on a State-Space Model for a Robot Dancer Performing with a Human Dancer.
J. Robotics Mechatronics, 2017

Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot.
J. Robotics Mechatronics, 2017

Generative Statistical Models with Self-Emergent Grammar of Chord Sequences.
CoRR, 2017

Note Value Recognition for Rhythm Transcription Using a Markov Random Field Model for Musical Scores and Performances of Piano Music.
CoRR, 2017

Infinite probabilistic latent component analysis for audio source separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Semi-Blind speech enhancement based on recurrent neural network for source separation and dereverberation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

A diagonal plus low-rank covariance model for computationally efficient source separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Function- and Rhythm-Aware Melody Harmonization Based on Tree-Structured Parsing and Split-Merge Sampling of Chord Sequences.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Scale- and Rhythm-Aware Musical Note Estimation for Vocal F0 Trajectories Based on a Semi-Tatum-Synchronous Hierarchical Hidden Semi-Markov Model.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Performance Error Detection and Post-Processing for Fast and Accurate Symbolic Music Alignment.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Bayesian multichannel nonnegative matrix factorization for audio source separation and localization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models of Musical Elements.
Int. J. Semantic Comput., 2016

Sound-based online localization for an in-pipe snake robot.
Proceedings of the 2016 IEEE International Symposium on Safety, 2016

Student's t multichannel nonnegative matrix factorization for blind source separation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

A Hierarchical Bayesian Model of Chords, Pitches, and Spectrograms for Multipitch Analysis.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Musical Note Estimation for F0 Trajectories of Singing Voices Based on a Bayesian Semi-Beat-Synchronous HMM.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Musical Typicality: How Many Similar Songs Exist?
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Online simultaneous localization and mapping of multiple sound sources and asynchronous microphone arrays.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Student's t nonnegative matrix factorization and positive semidefinite tensor factorization for single-channel audio source separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Tree-structured probabilistic model of monophonic written music based on the generative theory of tonal music.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Rhythm transcription of MIDI performances based on hierarchical Bayesian modelling of repetition and modification of musical note patterns.
Proceedings of the 24th European Signal Processing Conference, 2016

A unified Bayesian model of time-frequency clustering and low-rank approximation for multi-channel source separation.
Proceedings of the 24th European Signal Processing Conference, 2016

Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Toward a quizmaster robot for speech-based multiparty interaction.
Adv. Robotics, 2015

Unified inter- and intra-recording duration model for multiple music audio alignment.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Human-voice enhancement based on online RPCA for a hose-shaped rescue robot with a microphone array.
Proceedings of the 2015 IEEE International Symposium on Safety, 2015

Identification and Localization of One or Two Concurrent Speakers in a Binaural Robotic Context.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Infinite Superimposed Discrete All-Pole Modeling for Multipitch Analysis of Wavelet Spectrograms.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Songle Widget: Making Animation and Physical Devices Synchronized with Music Videos on the Web.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Optimizing the layout of multiple mobile robots for cooperative sound source separation.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Audio-visual beat tracking based on a state-space model for a music robot dancing with humans.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Microphone-accelerometer based 3D posture estimation for a hose-shaped rescue robot.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Bayesian integration of sound source separation and speech recognition: a new approach to simultaneous speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A feedback framework for improved chord recognition based on NMF-based approximate note transcription.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Singing voice analysis and editing based on mutually dependent F0 estimation and source separation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Recognition of In-Field Frog Chorusing Using Bayesian Nonparametric Microphone Array Processing.
Proceedings of the Computational Sustainability, 2015

2014
Nonparametric Bayesian dereverberation of power spectrograms based on infinite-order autoregressive processes.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

AutoMashUpper: automatic creation of multi-song music mashups.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

A sound-based online method for estimating the time-varying posture of a hose-shaped robot.
Proceedings of the 2014 IEEE International Symposium on Safety, 2014

LyricsRadar: A Lyrics Retrieval System Based on Latent Topics of Lyrics.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Spotting a Query Phrase from Polyphonic Music Audio Signals Based on Semi-supervised Nonnegative Matrix Factorization.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Bayesian Audio Alignment based on a Unified Model of Music Composition and Performance.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Cultivating vocal activity detection for music audio signals in a circulation-type crowdsourcing ecosystem.
Proceedings of the IEEE International Conference on Acoustics, 2014

Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity.
Proceedings of the IEEE International Conference on Acoustics, 2014

Timbre replacement of harmonic and drum components for music audio signals.
Proceedings of the IEEE International Conference on Acoustics, 2014

A robot quizmaster that can localize, separate, and recognize simultaneous utterances for a fastest-voice-first quiz game.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

2013
A nested infinite Gaussian mixture model for identifying known and unknown audio events.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Beyond NMF: Time-Domain Audio Source Separation without Phase Reconstruction.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Transfer Learning In MIR: Sharing Learned Latent Representations For Music Audio Classification And Similarity.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Chord-Sequence-Factory: A Chord Arrangement System Modifying Factorized Chord Sequence Probabilities.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

AutoMashUpper: An Automatic Multi-Song Mashup System.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Nested iGMM recognition and multiple hypothesis tracking of moving sound sources for mobile robot audition.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals.
Proceedings of the 30th International Conference on Machine Learning, 2013

Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation.
IEEE Trans. Speech Audio Process., 2012

PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content.
Proceedings of the First International Workshop on Crowdsourcing Web Search, 2012

PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

PodCastle and songle: Crowdsourcing-based web services for spoken document retrieval and active music listening.
Proceedings of the 2012 Information Theory and Applications Workshop, 2012

Infinite Composite Autoregressive Models for Music Signal Analysis.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Unsupervised music understanding based on nonparametric Bayesian models.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
A Vocabulary-Free Infinity-Gram Model for Nonparametric Bayesian Chord Progression Analysis.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Timbre and Melody Features for the Recognition of Vocal Activity and Instrumental Solos in Polyphonic Music.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Songle: A Web Service for Active Music Listening Improved by User Contributions.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

2010
Infinite Latent Harmonic Allocation: A Nonparametric Bayesian Approach to Multipitch Analysis.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

2009
MusicCommentator: Generating Comments Synchronized with Musical Audio Signals by a Joint Probabilistic Model of Acoustic and Textual Features.
Proceedings of the Entertainment Computing, 2009

Continuous pLSI and Smoothing Techniques for Hybrid Music Recommendation.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

2008
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.
IEEE Trans. Speech Audio Process., 2008

Music Thumbnailer: Visualizing Musical Pieces in Thumbnail Images Based on Acoustic Features.
Proceedings of the ISMIR 2008, 2008

Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation.
Proceedings of the ISMIR 2008, 2008

A Robot Singer with Music Recognition Based on Real-Time Beat Tracking.
Proceedings of the ISMIR 2008, 2008

A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

A robot listens to music and counts its beats aloud by separating music from counting voice.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

2007
Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression.
IEEE Trans. Speech Audio Process., 2007

Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening.
Inf. Media Technol., 2007

Improving Efficiency and Scalability of Model-Based Music Recommender System Based on Incremental Training.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A biped robot that keeps steps in time with musical beats while listening to music with its own ears.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

2006
Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences.
Proceedings of the ISMIR 2006, 2006

An Error Correction Framework Based on Drum Pattern Periodicity for Improving Drum Sound Detection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2004
Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods.
Proceedings of the ISMIR 2004, 2004

Drum sound identification for polyphonic music using template adaptation and matching methods.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

