Shigeki Sagayama
Affiliations: Meiji University, Tokyo, Japan
According to our database, Shigeki Sagayama authored at least 217 papers between 1986 and 2022.
Use of Nods Less Synchronized with Turn-Taking and Prosody During Conversations in Adults with Autism.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Entrainment Analysis for Assessment of Autistic Speech Prosody Using Bottleneck Features of Deep Neural Network.
Proceedings of the IEEE International Conference on Acoustics, 2022
Semi-automatic music piece creation based on impression words extracted from object and background in color image.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
IEICE Trans. Inf. Syst., 2020
Music Recreation in Nursing Home using Automatic Music Accompaniment System and Score of VLN.
Proceedings of the 2nd IEEE Global Conference on Life Sciences and Technologies, 2020
Automatic Music Completion Based on Joint Optimization of Harmony Progression and Voicing.
J. Inf. Process., 2019
Autism Spectrum Disorder Discrimination Based on Voice Activities Related to Fillers and Laughter.
Proceedings of the 53rd Annual Conference on Information Sciences and Systems, 2019
Proceedings of the 53rd Annual Conference on Information Sciences and Systems, 2019
Proceedings of the 53rd Annual Conference on Information Sciences and Systems, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Composite Wavelet Model for Stability-Oriented Speech Synthesis from Cepstral Features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Multiresolutional Hierarchical Bayesian NMF for Detailed Audio Analysis of Music Performances.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Proceedings of the 22nd International Conference on Digital Signal Processing, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Real-Time Audio-to-Score Alignment of Music Performances Containing Errors and Arbitrary Repeats and Skips.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques.
J. Inf. Process., 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Blind Suppression of Nonstationary Diffuse Acoustic Noise Based on Spatial Covariance Matrix Decomposition.
J. Signal Process. Syst., 2015
Autoregressive Hidden Semi-Markov Model of Symbolic Music Performance for Score Following.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Automatic Piano Reduction from Ensemble Scores Based on Merged-Output Hidden Markov Model.
Proceedings of the Looking Back, 2015
Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Harmonic/percussive sound separation based on anisotropic smoothness of spectrograms.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
CoRR, 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Merged-Output Hidden Markov Model for Score Following of MIDI Performance with Ornaments, Desynchronized Voices, Repeats and Skips.
Proceedings of the Music Technology meets Philosophy, 2014
HMM-Based Automatic Arrangement for Guitars with Transposition and its Implementation.
Proceedings of the Music Technology meets Philosophy, 2014
An auxiliary-function approach to online independent vector analysis for real-time blind source separation.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
IEEE Trans. Speech Audio Process., 2013
J. Inf. Process., 2013
Bayesian Nonparametric Approach to Blind Separation of Infinitely Many Sparse Sources.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013
Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
General algorithms for estimating spectrogram and transfer functions of target signal for blind suppression of diffuse noise.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Probabilistic model of two-dimensional rhythm tree structure representation for automatic transcription of polyphonic MIDI signals.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Guide to Computing for Expressive Music Performance, 2013
Introduction to the Special Section on Deep Learning for Speech and Language Processing.
IEEE Trans. Speech Audio Process., 2012
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012
Context-free 2D Tree Structure Model of Musical Notes for Bayesian Modeling of Polyphonic Spectrograms.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012
Proceedings of the International Symposium on Communications and Information Technologies, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the Non-Cochlear Sound: Proceedings of the 38th International Computer Music Conference, 2012
Comparative evaluations of various harmonic/percussive sound separation algorithms based on anisotropic continuity of spectrogram.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Explicit beat structure modeling for non-negative matrix factorization-based multipitch analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Constrained and regularized variants of non-negative matrix factorization incorporating music-specific constraints.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
IEEE Trans. Speech Audio Process., 2011
Computational auditory induction as a missing-data model-fitting problem with Bregman divergence.
Speech Commun., 2011
Polyphonic Pitch Estimation and Instrument Identification by Joint Modeling of Sustained and Attack Sounds.
IEEE J. Sel. Top. Signal Process., 2011
IEEE J. Sel. Top. Signal Process., 2011
Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011
Polyhymnia: An Automatic Piano Performance System with Statistical Modeling of Polyphonic Expression and Musical Symbol Interpretation.
Proceedings of the 11th International Conference on New Interfaces for Musical Expression, 2011
Using Spectral Fluctuation of Speech in Multi-Feature HMM-Based Voice Activity Detection.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Concurrent Optimization of Context Clustering and GMM for Offline Handwritten Word Recognition Using HMM.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Automatic video annotation via Hierarchical Topic Trajectory Model considering cross-modal correlations.
Proceedings of the IEEE International Conference on Acoustics, 2011
Multichannel harmonic and percussive component separation by joint modeling of spatial and spectral continuity.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 33rd Annual Meeting of the Cognitive Science Society, 2011
Musical Instrument Identification Based on New Boosting Algorithm with Probabilistic Decisions.
Proceedings of the Speech, Sound and Music Processing: Embracing Research in India, 2011
Proceedings of the Advances in Music Information Retrieval, 2010
Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency.
IEEE Trans. Speech Audio Process., 2010
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the Entertainment Computing - ICEC 2010, 9th International Conference, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Monophonic Instrument Sound Segregation by Clustering NMF Components Based on Basis Similarity and Gain Disjointness.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Melody line estimation in homophonic music audio signals based on temporal-variability of melodic source.
Proceedings of the IEEE International Conference on Acoustics, 2010
R-means localization: A simple iterative algorithm for range-difference-based source localization.
Proceedings of the IEEE International Conference on Acoustics, 2010
A sparse component model of source signals and its application to blind source separation.
Proceedings of the IEEE International Conference on Acoustics, 2010
Designing the Wiener post-filter for diffuse noise suppression using imaginary parts of inter-channel cross-spectra.
Proceedings of the IEEE International Conference on Acoustics, 2010
Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Crystal-MUSIC: Accurate Localization of Multiple Sources in Diffuse Noise Environments Using Crystal-Shaped Microphone Arrays.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Proceedings of the Latent Variable Analysis and Signal Separation, 2010
Note detection with dynamic Bayesian networks as a postanalysis step for NMF-based multiple pitch estimation techniques.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
Proceedings of the Entertainment Computing, 2009
Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009
Stereo-input speech recognition using sparseness-based time-frequency masking in a reverberant environment.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Audio genre classification using percussive pattern clustering combined with timbral features.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Extending Nonnegative Matrix Factorization - A discussion in the context of multiple frequency estimation of musical signals.
Proceedings of the 17th European Signal Processing Conference, 2009
IEEE Trans. Speech Audio Process., 2008
Sound Source Localization with Front-Back Judgement by Two Microphones Asymmetrically Mounted on a Sphere.
J. Multim., 2008
Proceedings of the ISMIR 2008, 2008
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008
On-line handwritten Kanji string recognition based on grammar description of character structures.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Harmonic-Temporal-Timbral Clustering (HTTC) for the analysis of multi-instrument polyphonic music signals.
Proceedings of the IEEE International Conference on Acoustics, 2008
Auxiliary function approach to parameter estimation of constrained sinusoidal model for monaural speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2008
A blind noise decorrelation approach with crystal arrays on designing post-filters for diffuse noise suppression.
Proceedings of the IEEE International Conference on Acoustics, 2008
Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments.
IEEE Trans. Speech Audio Process., 2007
IEEE Trans. Speech Audio Process., 2007
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007
Proceedings of the 8th International Conference on Music Information Retrieval, 2007
Proceedings of the IJCAI 2007, 2007
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Harmonic-Temporal Clustering of Speech for Single and Multiple F0 Contour Estimation in Noisy Environments.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Speech analyzer using a joint estimation model of spectral envelope and fine structure.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Model Adaptation for Long Convolutional Distortion by Maximum Likelihood Based State Filtering Approach.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 28th International Conference of the IEEE Engineering in Medicine and Biology Society, 2006
Specmurt Analysis of Multi-Pitch Music Signals with Adaptive Estimation of Common Harmonic Structure.
Proceedings of the ISMIR 2005, 2005
Harmonic-Temporal Clustering via Deterministic Annealing EM Algorithm for Audio Feature Extraction.
Proceedings of the ISMIR 2005, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Audio stream segregation of multi-pitch music signal based on time-space clustering using Gaussian kernel 2-dimensional model.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents.
Proceedings of the Life-like characters - tools, affective functions, and applications., 2004
Proceedings of the ISMIR 2004, 2004
Complex spectrum circle centroid for microphone-array-based noisy speech recognition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Specmurt anasylis: a piano-roll-visualization of polyphonic music signal by deconvolution of log-frequency spectrum.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004
Model composition by Lagrange polynomial approximation for robust speech recognition in noisy environment.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Multi-pitch trajectory estimation of concurrent speech based on harmonic GMM and nonlinear Kalman filtering.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Separation of harmonic structures based on tied Gaussian mixture model and information criterion for concurrent sounds.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Proceedings of the ISMIR 2003, 2003
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2003
Generation of Hierarchical Dictionary for Stroke-order Free Kanji Handwriting Recognition Based on Substroke HMM.
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2003
Pen Pressure Features for Writer-Independent On-Line Handwriting Recognition Based on Substroke HMM.
Proceedings of the 16th International Conference on Pattern Recognition, 2002
Proceedings of the Eighth International Workshop on Frontiers in Handwriting Recognition, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR 2001), 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
Speaker adaptation of acoustic models using correlations of training transfer vectors.
Syst. Comput. Jpn., 2000
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000
Jacobian adaptation of HMM with initial model selection for noisy speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
An address data entry system with a multimodal interface including speech recognition.
Syst. Comput. Jpn., 1999
Two-step generation of variable-word-length language model integrating local and global constraints.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Speech recognition and synthesis technology development at NTT for telecommunications services.
Int. J. Speech Technol., 1997
Vector-field-smoothed Bayesian learning for fast and incremental speaker/telephone-channel adaptation.
Comput. Speech Lang., 1997
Fast adaptation of acoustic models to environmental noise using Jacobian adaptation algorithm.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
A speaker-adaptation technique for context-dependent models represented by hidden Markov networks.
Syst. Comput. Jpn., 1996
Comput. Speech Lang., 1996
LR-parser-driven Viterbi search with hypotheses merging mechanism using context-dependent phone models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Minimum classification error training for a small amount of data enhanced by vector-field-smoothed Bayesian learning.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
Speech Commun., 1995
IEICE Trans. Inf. Syst., 1995
IEICE Trans. Inf. Syst., 1995
Automatic Determination of the Number of Mixture Components for Continuous HMMs Based on a Uniform Variance Criterion.
IEICE Trans. Inf. Syst., 1995
Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams.
IEICE Trans. Inf. Syst., 1995
Fast and accurate beam search using forward heuristic functions in HMM-LR speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Tree-structured speaker clustering for speaker-independent continuous speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Suprasegmental duration control with matrix parsing in continuous speech recognition.
Speech Commun., 1993
Speech Commun., 1993
Syst. Comput. Jpn., 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
A dynamic approach to speaker adaptation of hidden Markov networks for speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Spoken Language Translation System.
Proceedings of the 13th International Joint Conference on Artificial Intelligence. Chambéry, France, August 28, 1993
Proceedings of the IEEE International Conference on Acoustics, 1993
Proceedings of the IEEE International Conference on Acoustics, 1993
Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Appropriate error criterion selection for continuous speech HMM minimum error training.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
The SSS-LR continuous speech recognition system: integrating SSS-derived allophone models and a phoneme-context-dependent LR parser.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
Phoneme-context-dependent LR parsing algorithms for HMM-based continuous speech recognition.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
A pairwise discriminant approach to robust phoneme recognition by time-delay neural networks.
Proceedings of the 1991 International Conference on Acoustics, 1991
Proceedings of the 1991 International Conference on Acoustics, 1991
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
Proceedings of the IEEE International Conference on Acoustics, 1989
Proceedings of the IEEE International Conference on Acoustics, 1986