Kazuyoshi Yoshii
Orcid: 0000-0001-8387-8609
According to our database1,
Kazuyoshi Yoshii
authored at least 180 papers
between 2004 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Joint Audio Source Localization and Separation with Distributed Microphone Arrays Based on Spatially-Regularized Multichannel NMF.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Direction.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Source Positions.
CoRR, 2023
Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 31st European Signal Processing Conference, 2023
Multimodal Multifaceted Music Emotion Recognition Based on Self-Attentive Fusion of Psychology-Inspired Symbolic and Acoustic Features.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
CTC2: End-to-End Drum Transcription Based on Connectionist Temporal Classification With Constant Tempo Constraint.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Joint Drum Transcription and Metrical Analysis Based on Periodicity-Aware Multi-Task Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Audio-to-Score Singing Transcription Based on Joint Estimation of Pitches, Onsets, and Metrical Positions With Tatum-Level CTC Loss.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Computationally-Efficient Overdetermined Blind Source Separation Based on Iterative Source Steering.
IEEE Signal Process. Lett., 2022
Joint Localization and Synchronization of Distributed Camera-Attached Microphone Arrays for Indoor Scene Analysis.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
DNN-free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online Fastmnmf.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Difficulty-Aware Neural Band-to-Piano Score Arrangement based on Note- and Statistic-Level Criteria.
Proceedings of the IEEE International Conference on Acoustics, 2022
Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022
Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization.
Proceedings of the 30th European Signal Processing Conference, 2022
2021
IEEE Signal Process. Lett., 2021
MirrorNet: A Deep Reflective Approach to 2D Pose Estimation for Single-Person Images.
J. Inf. Process., 2021
Inf. Sci., 2021
Musical rhythm transcription based on Bayesian piece-specific score models capturing repetitions.
Inf. Sci., 2021
CoRR, 2021
A Real-Time Drum-Wise Volume Visualization System for Learning Volume-Balanced Drum Performance.
Proceedings of the Entertainment Computing - ICEC 2021, 2021
Phase-Aware Joint Beat and Downbeat Estimation Based on Periodicity of Metrical Structure.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On Vae-Based Metric Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021
Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Statistical Correction of Transcribed Melody Notes Based on Probabilistic Integration of a Music Language Model and a Transcription Error Model.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 29th European Signal Processing Conference, 2021
2020
Semi-Supervised Neural Chord Estimation Based on a Variational Autoencoder With Latent Chord Labels and Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Bayesian Melody Harmonization Based on a Tree-Structured Generative Model of Chord Sequences and Melodies.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Fast Multichannel Nonnegative Matrix Factorization With Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
A Flow-Based Deep Latent Variable Model for Speech Spectrogram Modeling and Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Bayesian Singing Transcription Based on a Hierarchical Generative Model of Keys, Musical Notes, and F0 Trajectories.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
IEEE Signal Process. Lett., 2020
Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Discrete Labels and Continuous Textures of Chords.
CoRR, 2020
MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images.
CoRR, 2020
A Method for Analysis of Shared Structure in Large Music Collections using Techniques from Genetic Sequencing and Graph Theory.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Multi-Instrument Music Transcription Based on Deep Spherical Clustering of Spectrograms and Pitchgrams.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 28th European Signal Processing Conference, 2020
Semi-supervised Multichannel Speech Separation Based on a Phone- and Speaker-Aware Deep Generative Model of Speech Spectrograms.
Proceedings of the 28th European Signal Processing Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Integration of Semi-Blind Speech Source Separation and Voice Activity Detection for Flexible Spoken Dialogue.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Tatum-Level Drum Transcription Based on a Convolutional Recurrent Neural Network with Language Model-Based Regularized Training.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Multi-Step Chord Sequence Prediction Based on Aggregated Multi-Scale Encoder-Decoder Network.
CoRR, 2019
Music Transcription Based on Bayesian Piece-Specific Score Models Capturing Repetitions.
CoRR, 2019
End-To-End Melody Note Transcription Based on a Beat-Synchronous Attention Mechanism.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Joint Singing Pitch Estimation and Voice Separation Based on a Neural Harmonic Structure Renderer.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Audio-Visual SLAM towards Human Tracking and Human-Robot Interaction in Indoor Environments.
Proceedings of the 28th IEEE International Conference on Robot and Human Interactive Communication, 2019
Multi-Step Chord Sequence Prediction Based On Aggregated Multi-Scale Encoder-Decoder Networks.
Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019
Deep Bayesian Unsupervised Source Separation Based On A Complex Gaussian Mixture Model.
Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing, 2019
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Statistical Music Structure Analysis Based on a Homogeneity-, Repetitiveness-, and Regularity-Aware Hierarchical Hidden Semi-Markov Model.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Bayesian Drum Transcription Based on Nonnegative Matrix Factor Decomposition with a Deep Score Prior.
Proceedings of the IEEE International Conference on Acoustics, 2019
Joint Transcription of Lead, Bass, and Rhythm Guitars Based on a Factorial Hidden Semi-Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Automatic Singing Transcription Based on Encoder-decoder Recurrent Neural Networks with a Weakly-supervised Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Improved Metrical Alignment of Midi Performance Based on a Repetition-aware Online-adapted Grammar.
Proceedings of the IEEE International Conference on Acoustics, 2019
Automatic Chord Estimation Based on a Frame-wise Convolutional Recurrent Neural Network with Non-Aligned Annotations.
Proceedings of the 27th European Signal Processing Conference, 2019
Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices.
Proceedings of the 27th European Signal Processing Conference, 2019
Proceedings of the 27th European Signal Processing Conference, 2019
2018
Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Interactive Arrangement of Chords and Melodies Based on a Tree-Structured Generative Model.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 26th European Signal Processing Conference, 2018
Sequential Generation of Singing F0 Contours from Musical Note Sequences Based on WaveNet.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Simultaneous Identification and Localization of Still and Mobile Speakers Based on Binaural Robot Audition.
J. Robotics Mechatronics, 2017
Layout Optimization of Cooperative Distributed Microphone Arrays Based on Estimation of Source Separation Performance.
J. Robotics Mechatronics, 2017
Audio-Visual Beat Tracking Based on a State-Space Model for a Robot Dancer Performing with a Human Dancer.
J. Robotics Mechatronics, 2017
Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot.
J. Robotics Mechatronics, 2017
CoRR, 2017
Note Value Recognition for Rhythm Transcription Using a Markov Random Field Model for Musical Scores and Performances of Piano Music.
CoRR, 2017
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Semi-Blind speech enhancement basedon recurrent neural network for source separation and dereverberation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
A diagonal plus low-rank covariance model for computationally efficient source separation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Function- and Rhythm-Aware Melody Harmonization Based on Tree-Structured Parsing and Split-Merge Sampling of Chord Sequences.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Scale- and Rhythm-Aware Musical Note Estimation for Vocal F0 Trajectories Based on a Semi-Tatum-Synchronous Hierarchical Hidden Semi-Markov Model.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Performance Error Detection and Post-Processing for Fast and Accurate Symbolic Music Alignment.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Bayesian multichannel nonnegative matrix factorization for audio source separation and localization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models of Musical Elements.
Int. J. Semantic Comput., 2016
Proceedings of the 2016 IEEE International Symposium on Safety, 2016
Student's t multichannel nonnegative matrix factorization for blind source separation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
A Hierarchical Bayesian Model of Chords, Pitches, and Spectrograms for Multipitch Analysis.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Musical Note Estimation for F0 Trajectories of Singing Voices Based on a Bayesian Semi-Beat-Synchronous HMM.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Online simultaneous localization and mapping of multiple sound sources and asynchronous microphone arrays.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016
Student's T nonnegative matrix factorization and positive semidefinite tensor factorization for single-channel audio source separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Tree-structured probabilistic model of monophonic written music based on the generative theory of tonal music.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Rhythm transcription of MIDI performances based on hierarchical Bayesian modelling of repetition and modification of musical note patterns.
Proceedings of the 24th European Signal Processing Conference, 2016
A unified Bayesian model of time-frequency clustering and low-rank approximation for multi-channel source separation.
Proceedings of the 24th European Signal Processing Conference, 2016
Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array.
Proceedings of the 24th European Signal Processing Conference, 2016
2015
Adv. Robotics, 2015
Unified inter- and intra-recording duration model for multiple music audio alignment.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Human-voice enhancement based on online RPCA for a hose-shaped rescue robot with a microphone array.
Proceedings of the 2015 IEEE International Symposium on Safety, 2015
Identification and Localization of One or Two Concurrent Speakers in a Binaural Robotic Context.
Proceedings of the 2015 IEEE International Conference on Systems, 2015
Infinite Superimposed Discrete All-Pole Modeling for Multipitch Analysis of Wavelet Spectrograms.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015
Songle Widget: Making Animation and Physical Devices Synchronized with Music Videos on the Web.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015
Optimizing the layout of multiple mobile robots for cooperative sound source separation.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015
Audio-visual beat tracking based on a state-space model for a music robot dancing with humans.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015
Bayesian integration of sound source separation and speech recognition: a new approach to simultaneous speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
A feedback framework for improved chord recognition based on NMF-based approximate note transcription.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Singing voice analysis and editing based on mutually dependent F0 estimation and source separation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Recognition of In-Field Frog Chorusing Using Bayesian Nonparametric Microphone Array Processing.
Proceedings of the Computational Sustainability, 2015
2014
Nonparametric Bayesian dereverberation of power spectrograms based on infinite-order autoregressive processes.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
A sound-based online method for estimating the time-varying posture of a hose-shaped robot.
Proceedings of the 2014 IEEE International Symposium on Safety, 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Spotting a Query Phrase from Polyphonic Music Audio Signals Based on Semi-supervised Nonnegative Matrix Factorization.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Bayesian Audio Alignment based on a Unified Model of Music Composition and Performance.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Cultivating vocal activity detection for music audio signals in a circulation-type crowdsourcing ecosystem.
Proceedings of the IEEE International Conference on Acoustics, 2014
Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
A robot quizmaster that can localize, separate, and recognize simultaneous utterances for a fastest-voice-first quiz game.
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014
2013
A nested infinite Gaussian mixture model for identifying known and unknown audio events.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Transfer Learning In Mir: Sharing Learned Latent Representations For Music Audio Classification And Similarity.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Chord-Sequence-Factory: A Chord Arrangement System Modifying Factorized Chord Sequence Probabilities.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Nested iGMM recognition and multiple hypothesis tracking of moving sound sources for mobile robot audition.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013
Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals.
Proceedings of the 30th International Conference on Machine Learning, 2013
Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation.
IEEE Trans. Speech Audio Process., 2012
PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content.
Proceedings of the First International Workshop on Crowdsourcing Web Search, 2012
PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012
PodCastle and songle: Crowdsourcing-based web services for spoken document retrieval and active music listening.
Proceedings of the 2012 Information Theory and Applications Workshop, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
A Vocabulary-Free Infinity-Gram Model for Nonparametric Bayesian Chord Progression Analysis.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Timbre and Melody Features for the Recognition of Vocal Activity and Instrumental Solos in Polyphonic Music.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
2010
Infinite Latent Harmonic Allocation: A Nonparametric Bayesian Approach to Multipitch Analysis.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
2009
MusicCommentator: Generating Comments Synchronized with Musical Audio Signals by a Joint Probabilistic Model of Acoustic and Textual Features.
Proceedings of the Entertainment Computing, 2009
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009
2008
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.
IEEE Trans. Speech Audio Process., 2008
Music Thumbnailer: Visualizing Musical Pieces in Thumbnail Images Based on Acoustic Features.
Proceedings of the ISMIR 2008, 2008
Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation.
Proceedings of the ISMIR 2008, 2008
Proceedings of the ISMIR 2008, 2008
A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008
A robot listens to music and counts its beats aloud by separating music from counting voice.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008
2007
Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression.
IEEE Trans. Speech Audio Process., 2007
Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening.
Inf. Media Technol., 2007
Improving Efficiency and Scalability of Model-Based Music Recommender System Based on Incremental Training.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
A biped robot that keeps steps in time with musical beats while listening to music with its own ears.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007
2006
Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences.
Proceedings of the ISMIR 2006, 2006
An Error Correction Framework Based on Drum Pattern Periodicity for Improving Drum Sound Detection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2004
Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods.
Proceedings of the ISMIR 2004, 2004
Drum sound identification for polyphonic music using template adaptation and matching methods.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004