Masataka Goto
Orcid: 0000-0003-1167-0977Affiliations:
- National Institute of Advanced Industrial Science and Technology, Ibaraki, Japan
According to our database1,
Masataka Goto
authored at least 288 papers
between 1994 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on orcid.org
On csauthors.net:
Bibliography
2025
Kiite World: Socializing Map-Based Music Exploration Through Playlist Sharing and Synchronized Listening.
Proceedings of the MultiMedia Modeling, 2025
2024
Rail DRAGON: Long-Reach Bendable Modularized Rail Structure for Constant Observation Inside PCV.
IEEE Robotics Autom. Lett., 2024
DanceUnisoner: A Parametric, Visual, and Interactive Simulation Interface for Choreographic Composition of Group Dance.
IEICE Trans. Inf. Syst., 2024
MDX-Mixer: Music Demixing by Leveraging Source Signals Separated by Existing Demixing Models.
IEICE Trans. Inf. Syst., 2024
2023
Kiite Cafe: A Web Service Enabling Users to Listen to the Same Song at the Same Moment While Reacting to the Song.
IEICE Trans. Inf. Syst., November, 2023
IEICE Trans. Inf. Syst., September, 2023
IEICE Trans. Inf. Syst., April, 2023
Content-Based Music-Image Retrieval Using Self- and Cross-Modal Feature Embedding Memory.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
IteraTTA: An Interface for Exploring Both Text Prompts and Audio Priors in Generating Music With Text-to-Audio Models.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Unveiling the Impact of Musical Factors in Judging a Song on First Listen: Insights From a User Survey.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Decoding Drums, Instrumentals, Vocals, and Mixed Sources in Music Using Human Brain Activity With fMRI.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Transformer-Based Beat Tracking With Low-Resolution Encoder and High-Resolution Decoder.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
CatAlyst: Domain-Extensible Intervention for Preventing Task Procrastination Using Large Generative Models.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
Lyric App Framework: A Web-based Framework for Developing Interactive Lyric-driven Musical Applications.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023
2022
User Model. User Adapt. Interact., 2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
IEEE Access, 2022
BO as Assistant: Using Bayesian Optimization for Asynchronously Generating Design Suggestions.
Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022
Proceedings of the IUI 2022: 27th International Conference on Intelligent User Interfaces, Helsinki, Finland, March 22, 2022
An Analysis of Using Fuzzy Annotations in CRNN-Based Joint Beat and Downbeat Tracking.
Proceedings of the 30th European Signal Processing Conference, 2022
2021
MirrorNet: A Deep Reflective Approach to 2D Pose Estimation for Single-Person Images.
J. Inf. Process., 2021
Vocal-Accompaniment Compatibility Estimation Using Self-Supervised and Joint-Embedding Techniques.
IEEE Access, 2021
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021
Proceedings of the IUI '21: 26th International Conference on Intelligent User Interfaces, 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Toward an Understanding of Lyrics-viewing Behavior While Listening to Music on a Smartphone.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Tool- and Domain-Agnostic Parameterization of Style Transfer Effects Leveraging Pretrained Perceptual Metrics.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
2020
ACM Trans. Graph., 2020
Trans. Int. Soc. Music. Inf. Retr., 2020
Bayesian Singing Transcription Based on a Hierarchical Generative Model of Keys, Musical Notes, and F0 Trajectories.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Modeling N-th Order Derivative Creation Based on Content Attractiveness and Time-Dependent Popularity.
IEICE Trans. Inf. Syst., 2020
CoRR, 2020
MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images.
CoRR, 2020
Query/Task Satisfaction and Grid-based Evaluation Metrics Under Different Image Search Intents.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the IUI '20: 25th International Conference on Intelligent User Interfaces, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Unsupervised Disentanglement of Pitch and Timbre for Isolated Musical Instrument Sounds.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
Enhancing Participation Experience in VR Live Concerts by Improving Motions of Virtual Audience Avatars.
Proceedings of the 2020 IEEE International Symposium on Mixed and Augmented Reality, 2020
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020
2019
Vis. Comput., 2019
Music Interfaces Based on Automatic Music Signal Analysis: New Ways to Create and Listen to Music.
IEEE Signal Process. Mag., 2019
End-To-End Melody Note Transcription Based on a Beat-Synchronous Attention Mechanism.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Joint Singing Pitch Estimation and Voice Separation Based on a Neural Harmonic Structure Renderer.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
ABCPRec: Adaptively Bridging Consumer and Producer Roles for User-Generated Content Recommendation.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019
DualDiv: diversifying items and explanation styles in explainable hybrid recommendation.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019
Audio-Based Automatic Generation of a Piano Reduction Score by Considering the Musical Structure.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019
Autocomplete vocal-<i>f</i><sub>o</sub> annotation of songs using musical repetitions.
Proceedings of the 24th International Conference on Intelligent User Interfaces: Companion, 2019
Query-by-Blending: A Music Exploration System Blending Latent Vector Representations of Lyric Word, Song Audio, and Artist.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
AIST Dance Video Database: Multi-Genre, Multi-Dancer, and Multi-Camera Database for Dance Information Processing.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Intelligent User Interfaces for Music Discovery: The Past 20 Years and What's to Come.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Joint Transcription of Lead, Bass, and Rhythm Guitars Based on a Factorial Hidden Semi-Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Automatic Singing Transcription Based on Encoder-decoder Recurrent Neural Networks with a Weakly-supervised Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2019
Zero-mean Convolutional Network with Data Augmentation for Sound Level Invariant Singing Voice Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Songrium Derivation Factor Analysis: A Web Service for Browsing Derivation Factors by Modeling N-th Order Derivative Creation.
IEICE Trans. Inf. Syst., 2018
Comput. Graph. Forum, 2018
DeployGround: A Framework for Streamlined Programming from API playgrounds to Application Deployment.
Proceedings of the 2018 IEEE Symposium on Visual Languages and Human-Centric Computing, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Songle Sync: A Large-Scale Web-based Platform for Controlling Various Devices in Synchronization with Music.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018
Listener Anonymizer: Camouflaging Play Logs to Preserve User's Demographic Anonymity.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Instrudive: A Music Visualization System Based on Automatically Recognized Instrumentation.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Proceedings of the Twelfth International Conference on Web and Social Media, 2018
Chordscanner: Browsing Chord Progressions Based on Musical Typicality and Intra-composer Consistency.
Proceedings of the 2018 International Computer Music Conference, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Instlistener: An Expressive Parameter Estimation System Imitating Human Performances of Monophonic Musical Instruments.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Music Structure Boundary Detection and Labelling by a Deconvolution of Path-Enhanced Self-Similarity Matrix.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018
Proceedings of the Companion Publication of the 19th International ACM SIGACCESS Conference on Computers and Accessibility, 2018
2017
QueryShare: Working Together to Facilitate Exploratory Multimedia Searches without Skill in Creating.
Proceedings of the 13th International Symposium on Open Collaboration, 2017
User-Generated Variables: Streamlined Interaction Design for Feature Requests and Implementations.
Proceedings of the Companion to the first International Conference on the Art, 2017
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2017
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Proceedings of the 22nd International Conference on Intelligent User Interfaces, 2017
Lyric Jumper: A Lyrics-Based Music Exploratory Web Service by Modeling Lyrics Generative Process.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Multi-Part Pattern Analysis: Combining Structure Analysis and Source Separation to Discover Intra-Part Repeated Sequences.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Scale- and Rhythm-Aware Musical Note Estimation for Vocal F0 Trajectories Based on a Semi-Tatum-Synchronous Hierarchical Hidden Semi-Markov Model.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Song2Guitar: A Difficulty-Aware Arrangement System for Generating Guitar Solo Covers from Polyphonic Audio of Popular Music.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the 2017 International Computer Music Conference, 2017
Proceedings of the Companion of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, 2017
f3.js: A Parametric Design Tool for Physical Computing Devices for Both Interaction Designers and End-users.
Proceedings of the 2017 Conference on Designing Interactive Systems, 2017
Proceedings of the Advances in Computer Entertainment Technology, 2017
OngaCREST Project: Building a Similarity-Aware Information Environment for a Content-Symbiotic Society.
Proceedings of the Human-Harmonized Information Technology, Volume 2, 2017
2016
Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models of Musical Elements.
Int. J. Semantic Comput., 2016
Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion.
IEICE Trans. Inf. Syst., 2016
A choreographic authoring system for character dance animation reflecting a user's preference.
Proceedings of the Poster Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation, 2016
Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016
PlaylistPlayer: An Interface Using Multiple Criteria to Change the Playback Order of a Music Playlist.
Proceedings of the 21st International Conference on Intelligent User Interfaces, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Proceedings of the IEEE International Conference on Data Mining Workshops, 2016
Student's T nonnegative matrix factorization and positive semidefinite tensor factorization for single-channel audio source separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the COLING 2016, 2016
Why Did You Cover That Song?: Modeling N-th Order Derivative Creation with Content Popularity.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016
2015
AutoGuitarTab: Computer-Aided Composition of Rhythm and Lead Guitar Parts in the Tablature Space.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Expression Control in Singing Voice Synthesis: Features, approaches, evaluation, and challenges.
IEEE Signal Process. Mag., 2015
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Form Follows Function(): An IDE to Create Laser-cut Interfaces and Microcontroller Programs from Single Code Base.
Proceedings of the 28th Annual ACM Symposium on User Interface Software & Technology, 2015
A music video authoring system synchronizing climax of video clips and music via rearrangement of musical bars.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2015
Infinite Superimposed Discrete All-Pole Modeling for Multipitch Analysis of Wavelet Spectrograms.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Song2Quartet: A System for Generating String Quartet Cover Songs from Polyphonic Audio of Popular Music.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
ExploratoryVideoSearch: A Music Video Search System Based on Coordinate Terms and Diversification.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015
Musical Similarity and Commonness Estimation Based on Probabilistic Generative Models.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015
Songle Widget: Making Animation and Physical Devices Synchronized with Music Videos on the Web.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015
A feedback framework for improved chord recognition based on NMF-based approximate note transcription.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 International Conference on Culture and Computing, 2015
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEICE Trans. Inf. Syst., 2014
Songrium: a music browsing assistance service with interactive visualization and exploration of protect a web of music.
Proceedings of the 23rd International World Wide Web Conference, 2014
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014
Proceedings of the 14th International Conference on New Interfaces for Musical Expression, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Spotting a Query Phrase from Polyphonic Music Audio Signals Based on Semi-supervised Nonnegative Matrix Factorization.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Unisoner: An Interactive Interface for Derivative Chorus Creation from Various Singing Voices on the Web.
Proceedings of the Music Technology meets Philosophy, 2014
Proceedings of the Music Technology meets Philosophy, 2014
An Automatic Singing Impression Estimation Method Using Factor Analysis and Multiple Regression.
Proceedings of the Music Technology meets Philosophy, 2014
Proceedings of the Music Technology meets Philosophy, 2014
AutoChorusCreator: Four-Part Chorus Generator with Musical Feature Control, Using Search Spaces Constructed from Rules of Music Theory.
Proceedings of the Music Technology meets Philosophy, 2014
Cultivating vocal activity detection for music audio signals in a circulation-type crowdsourcing ecosystem.
Proceedings of the IEEE International Conference on Acoustics, 2014
Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the second international conference on Human-agent interaction, 2014
Proceedings of the 5th Augmented Human International Conference, 2014
Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Automated choreography synthesis using a Gaussian process leveraging consumer-generated dance motions.
Proceedings of the 11th Conference on Advances in Computer Entertainment Technology, 2014
2013
Songrium: a music browsing assistance service based on visualization of massive open collaboration within music content creation community.
Proceedings of the 9th International Symposium on Open Collaboration, Hong Kong, China, August 05, 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Transfer Learning In Mir: Sharing Learned Latent Representations For Music Audio Classification And Similarity.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Chord-Sequence-Factory: A Chord Arrangement System Modifying Factorized Chord Sequence Probabilities.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013
An investigation of acoustic features for singing voice conversion based on perceptual age.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals.
Proceedings of the 30th International Conference on Machine Learning, 2013
Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation.
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Speech Audio Process., 2012
PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content.
Proceedings of the First International Workshop on Crowdsourcing Web Search, 2012
PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012
PodCastle and songle: Crowdsourcing-based web services for spoken document retrieval and active music listening.
Proceedings of the 2012 Information Theory and Applications Workshop, 2012
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
PodCastle: Collaborative Training of Language Models on the Basis of Wisdom of Crowds.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Multimodal Music Processing, 2012
Proceedings of the Multimodal Music Processing, 2012
Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics.
IEEE J. Sel. Top. Signal Process., 2011
Music Listening in the Future: Augmented Music-Understanding Interfaces and Crowd Music Listening.
Proceedings of the AES International Conference Semantic Audio 2011, 2011
Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies, Scottsdale, AZ, USA, November 28, 2011
A Vocabulary-Free Infinity-Gram Model for Nonparametric Bayesian Chord Progression Analysis.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Timbre and Melody Features for the Recognition of Vocal Activity and Instrumental Solos in Polyphonic Music.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011
PodCastle: Recent Advances of a Spoken Document Retrieval Service Improved by Anonymous User Contributions.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Vocalistener2: A singing synthesis system able to mimic a user's singing in terms of voice timbre changes as well as pitch and dynamics.
Proceedings of the IEEE International Conference on Acoustics, 2011
Polyphonic audio-to-score alignment based on Bayesian Latent Harmonic Allocation Hidden Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2011
Simultaneous processing of sound source separation and musical instrument identification using Bayesian spectral modeling.
Proceedings of the IEEE International Conference on Acoustics, 2011
Concurrent estimation of singing voice F0 and phonemes by using spectral envelopes estimated from polyphonic music.
Proceedings of the IEEE International Conference on Acoustics, 2011
Gradient-based musical feature extraction based on scale-invariant feature transform.
Proceedings of the 19th European Signal Processing Conference, 2011
Proceedings of the 2011 ACM Conference on Computer Supported Cooperative Work, 2011
2010
A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval.
IEEE Trans. Speech Audio Process., 2010
Editorial for the Special Issue on Signal Models and Representations of Musical and Environmental Sounds.
IEEE Trans. Speech Audio Process., 2010
PodCastle: A Spoken Document Retrieval Service Improved by Anonymous User Contributions.
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, 2010
Infinite Latent Harmonic Allocation: A Nonparametric Bayesian Approach to Multipitch Analysis.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Query-by-conducting: An Interface to Retrieve Classical-music Interpretations by Real-time Tempo Input.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions.
J. Inf. Process., 2009
Musicream: Integrated Music-Listening Interface for Active, Flexible, and Unexpected Encounters with Musical Pieces.
J. Inf. Process., 2009
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
PodCastle: a spoken document retrieval system for podcasts and its performance improvement by anonymous user contributions.
Proceedings of the third workshop on Searching spontaneous conversational speech, 2009
MusicCommentator: Generating Comments Synchronized with Musical Audio Signals by a Joint Probabilistic Model of Acoustic and Textual Features.
Proceedings of the Entertainment Computing, 2009
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Podcastle: collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
The use of acoustically detected filled and silent pauses in spontaneous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.
IEEE Trans. Speech Audio Process., 2008
IEEE Trans. Speech Audio Process., 2008
Proc. IEEE, 2008
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008
Music Thumbnailer: Visualizing Musical Pieces in Thumbnail Images Based on Acoustic Features.
Proceedings of the ISMIR 2008, 2008
Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source Separation Based on Integrated Harmonic and Inharmonic Models.
Proceedings of the ISMIR 2008, 2008
Hyperlinking Lyrics: A Method for Creating Hyperlinks Between Phrases in Song Lyrics.
Proceedings of the ISMIR 2008, 2008
Three techniques for improving automatic synchronization between music and lyrics: Fricative detection, filler model, and novel feature vectors for vocal activity detection.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression.
IEEE Trans. Speech Audio Process., 2007
Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening.
Inf. Media Technol., 2007
Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music.
Inf. Media Technol., 2007
Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps.
EURASIP J. Adv. Signal Process., 2007
EURASIP J. Adv. Signal Process., 2007
Improving Efficiency and Scalability of Model-Based Music Recommender System Based on Incremental Training.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Presentation sensei: a presentation training system using speech and image processing.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007
Integration and Adaptation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Signals.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
A chorus section detection method for musical audio signals and its application to a music listening station.
IEEE Trans. Speech Audio Process., 2006
Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences.
Proceedings of the ISMIR 2006, 2006
MusicRainbow: A New User Interface to Discover Artists Using Audio-based Similarity and Web-based Labeling.
Proceedings of the ISMIR 2006, 2006
AIST Annotation for the RWC Music Database.
Proceedings of the ISMIR 2006, 2006
Musical Instrument Recognizer "Instrogram" and Its Application to Music Retrieval Based on Instrumentation Similarity.
Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006
Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals.
Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006
An automatic singing skill evaluation method for unknown melodies using pitch interval accuracy and vibrato features.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
An Error Correction Framework Based on Drum Pattern Periodicity for Improving Drum Sound Detection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Instrogram: A New Musical Instrument Recognition Technique Without Using Onset Detection NOR F0 Estimation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
F0 Estimation Method for Singing Voice in Polyphonic Audio Signal Based on Statistical Vocal Model and Viterbi Search.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 2006 Conference on Human Factors in Computing Systems, 2006
2005
Proceedings of the 2005 IEEE International Conference on Wireless And Mobile Computing, 2005
Instrument Identification in Polyphonic Music: Feature Weighting with Mixed Sounds, Pitch-Dependent Timbre Modeling, and Use of Musical Context.
Proceedings of the ISMIR 2005, 2005
Musicream: New Music Playback Interface for Streaming, Sticking, Sorting, and Recalling Musical Pieces.
Proceedings of the ISMIR 2005, 2005
Singer Identification Based on Accompaniment Sound Reduction and Reliable Frame Selection.
Proceedings of the ISMIR 2005, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Speech repair: quick error correction just by using selection operation for speech input interfaces.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
An Auto-Regressive, Non-Stationary Excited Signal Parameter Estimation Method and an Evaluation of a Singing-Voice Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals.
Speech Commun., 2004
Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods.
Proceedings of the ISMIR 2004, 2004
Proceedings of the ISMIR 2004, 2004
Speech-Recognition Interfaces for Music Information Retrieval: 'Speech Completion' and 'Speech Spotter'.
Proceedings of the ISMIR 2004, 2004
Drum sound identification for polyphonic music using template adaptation and matching methods.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004
Speech spotter: on-demand speech recognition in human-human conversation on the telephone or in face-to-face situations.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 16th Annual ACM Symposium on User Interface Software and Technology, 2003
Proceedings of the ISMIR 2003, 2003
Proceedings of the ISMIR 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Speech shift: direct speech-input-mode switching through intentional control of voice pitch.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the IJCAI-03, 2003
Pitch-Dependent Musical Instrument Identification and Its Application to Musical Sound Ontology.
Proceedings of the Developments in Applied Artificial Intelligence, 2003
Proceedings of the 2003 International Computer Music Conference, 2003
Musical instrument identification based on F0-dependent multivariate normal distribution.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Proceedings of the ISMIR 2002, 2002
Speech completion: on-demand completion assistance using filled pauses for speech input interfaces.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
2001
Evaluation of exchanging time for mechanism of exchanging parts of programs during execution.
Syst. Comput. Jpn., 2001
Real-time sound source localization and separation system and its application to automatic speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the 2001 International Computer Music Conference, 2001
A predominant-F<sub>0</sub> estimation method for CD recordings: MAP estimation using EM algorithm for adaptive tone models.
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings.
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
Real-time beat tracking for drumless audio signals: Chord change detection for musical decisions.
Speech Commun., 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Proceedings of the 1998 International Computer Music Conference, 1998
Proceedings of the 1998 International Computer Music Conference, 1998
1997
Proceedings of the 1997 International Computer Music Conference, 1997
1996
A Jazz Session System for Interplay Among All Players - VirJa Session (Virtual Jazz Session System).
Proceedings of the 1996 International Computer Music Conference, 1996
Localization by harmonic structure and its application to harmonic sound stream segregation.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Proceedings of the 1995 International Computer Music Conference, 1995
Proceedings of the 1995 International Computer Music Conference, 1995
1994
Proceedings of the Second ACM International Conference on Multimedia '94, 1994
Proceedings of the 1994 International Computer Music Conference, 1994