Seiichi Nakagawa
ORCID: 0000-0002-6533-5536
According to our database, Seiichi Nakagawa authored at least 242 papers between 1978 and 2024.
Bibliography
2024
Significance of relative phase features for shouted and normal speech classification.
EURASIP J. Audio Speech Music. Process., December, 2024
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024
2023
A Study of Speech Recognition, Speech Translation, and Speech Summarization of TED English Lectures.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023
Attention-based CNN and Relative Phase Feature Modeling for Improved Imagined Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition.
Speech Commun., 2022
Summarization of Spoken Lectures Based on MMR Method and Important/Unimportant Sentence Classification Using BERT.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022
2021
Replay attack detection using variable-frequency resolution phase and magnitude features.
Comput. Speech Lang., 2021
Classification of Imagined and Heard Speech Using Amplitude Spectrum and Relative Phase of EEG.
Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021
2020
Effectiveness of Fine Linear Frequency Spectral Feature for Acoustic Event Detection.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020
2019
Discriminative Learning of Filterbank Layer within Deep Neural Network Based Speech Recognition for Speaker Adaptation.
IEICE Trans. Inf. Syst., 2019
EURASIP J. Audio Speech Music. Process., 2019
Replay Attack Detection Using Linear Prediction Analysis-Based Relative Phase Features.
IEEE Access, 2019
Replay Attack Detection Using Magnitude and Phase Information with Attention-based Adaptive Filters.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019
2018
Multim. Tools Appl., 2018
Rapid Speaker Adaptation of Neural Network Based Filterbank Layer for Automatic Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Replay Attacks Detection Using Phase and Magnitude Features with Various Frequency Resolutions.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018
2017
IEEE J. Sel. Top. Signal Process., 2017
Noise robust voice activity detection using joint phase and magnitude based feature enhancement.
J. Ambient Intell. Humaniz. Comput., 2017
Automatic Explanation Spot Estimation Method Targeted at Text and Figures in Lecture Slides.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017
Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017
2016
DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Investigation of glottal features and annotation procedures for speech emotion recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Domain adaptation of a speech translation system for lectures by utilizing frequently appearing parallel phrases in-domain.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Robust speech recognition using DNN-HMM acoustic model combining noise-aware training with spectral subtraction.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Deep neural network based acoustic model using speaker-class information for short time utterance.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Speech recognition for mixed speech and music by NMF using various cost functions and noise adaptive training methods.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
2014
Single-channel dereverberation by feature mapping using cascade neural networks for robust distant speaker identification and speech recognition.
EURASIP J. Audio Speech Music. Process., 2014
Comput. Speech Lang., 2014
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014
Spoken Term Detection Based on a Syllable N-gram Index at the NTCIR-11 SpokenQuery&Doc Task.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014
Speech recognition based on Itakura-Saito divergence and dynamics/sparseness constraints from mixed sound of speech and music by non-negative matrix factorization.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric.
Speech Commun., 2013
Development and Evaluation of Spoken Dialog Systems with One or Two Agents through Two Domains.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Robust/fast out-of-vocabulary spoken term detection by N-gram index with exact distance through text/speech input.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Single channel dereverberation method in log-melspectral domain using limited stereo data for distant speaker identification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Speaker identification using pseudo pitch synchronized phase information in noisy environments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Speech Audio Process., 2012
Class-Based N-Gram Language Model for New Words Using Out-of-Vocabulary to In-Vocabulary Similarity.
IEICE Trans. Inf. Syst., 2012
Risk-Based Semi-Supervised Discriminative Language Modeling for Broadcast Transcription.
IEICE Trans. Inf. Syst., 2012
IEICE Trans. Inf. Syst., 2012
Improving the Readability of ASR Results for Lectures Using Multiple Hypotheses and Sentence-Level Knowledge.
IEICE Trans. Inf. Syst., 2012
Development of large vocabulary continuous speech recognition system for Mongolian language.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Fast NMF based approach and improved VQ based approach for speech recognition from mixed sound.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
An online evaluation system for English pronunciation intelligibility for Japanese English learners.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Soft-clustering technique for training data in age- and gender-independent speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm.
IEICE Trans. Inf. Syst., 2011
High speed spoken term detection by combination of n-gram array of a syllable lattice and LVCSR result for NTCIR-SpokenDoc.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011
Speech Recognition in Mixed Sound of Speech and Music Based on Vector Quantization and Non-Negative Matrix Factorization.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
New Feature Parameters for Pronunciation Evaluation in English Presentations at International Conferences.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Hidden Boosted MMI and Hierarchical State Posterior Feature for Automatic Speech Recognition Based on Hidden Conditional Neural Fields.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Efficient out-of-vocabulary term detection by n-gram array indices with distance from a syllable lattice.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
ACM Trans. Asian Lang. Inf. Process., 2010
IEICE Trans. Inf. Syst., 2010
Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training.
IEICE Trans. Inf. Syst., 2010
IEICE Trans. Inf. Syst., 2010
Topic dependent class based language model evaluation on automatic speech recognition.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Out-of-vocabulary term detection by n-gram array with distance from continuous syllable recognition results.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Integration of cache-based model and topic dependent class model with soft clustering and soft voting.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Automatic evaluation of English pronunciation by Japanese speakers using various acoustic features and pattern recognition techniques.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Speaker identification by combining MFCC and phase information in noisy environments.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Estimating the position and orientation of an acoustic source with a microphone array network.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
High improvement of speaker identification and verification by combining MFCC and phase information.
Proceedings of the IEEE International Conference on Acoustics, 2009
Language Model Based on Word Order Sensitive Matrix Representation in Latent Semantic Analysis for Speech Recognition.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009
Response timing generation and response type selection for a spontaneous spoken dialog system.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009
Proceedings of the Fifth International Conference on Information Assurance and Security, 2009
2008
Robust Speech Recognition by Combining Short-Term and Long-Term Spectrum Based Position-Dependent CMN with Conventional CMN.
IEICE Trans. Inf. Syst., 2008
Linear Discriminant Analysis Using a Generalized Mean of Class Covariances and Its Application to Speech Recognition.
IEICE Trans. Inf. Syst., 2008
Noisy Speech Recognition Based on Integration/Selection of Multiple Noise Suppression Methods Using Noise GMMs.
IEICE Trans. Inf. Syst., 2008
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Blind dereverberation based on CMN and spectral subtraction by multi-channel LMS algorithm.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Evaluating spoken language model based on filler prediction model in speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Analysis of relationship between impression of human-to-human conversations and prosodic change and its modeling.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Class lecture summarization taking into account consecutiveness of important sentences.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the ACL 2008, 2008
2007
Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM.
Speech Commun., 2007
Inf. Media Technol., 2007
A Machine Learning Approach for an Indonesian-English Cross Language Question Answering System.
IEICE Trans. Inf. Syst., 2007
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007
Analysis of effect of compensation parameter estimation for CMN on speech/speaker recognition.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007
One-pass LVCSR algorithm using linear lexicon search and 1-best approximation tree-structured lexicon search.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007
Selection of optimal dimensionality reduction method using Chernoff bound for segmental unit input HMM.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Construction of spoken language model including fillers using filler prediction model.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
A statistical method of evaluating pronunciation proficiency for presentation in English.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Robust Distant Speech Recognition by Combining Position-Dependent CMN with Conventional CMN.
Proceedings of the IEEE International Conference on Acoustics, 2007
Generalization of Linear Discriminant Analysis used in Segmental Unit Input HMM for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007
Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
A machine learning approach for Indonesian question answering system.
Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, 2007
Proceedings of the ACL 2007, 2007
2006
Response Timing Detection Using Prosodic and Linguistic Information for Human-friendly Spoken Dialog Systems.
Inf. Media Technol., 2006
Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM.
IEICE Trans. Inf. Syst., 2006
Robust Distant Speech Recognition by Combining Multiple Microphone-Array Processing with Position-Dependent CMN.
EURASIP J. Adv. Signal Process., 2006
Summarization of Spoken Lectures Based on Linguistic Surface and Prosodic Information.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
Syst. Comput. Jpn., 2005
Large-vocabulary continuous speech recognition using linear lexicon search and 1-best approximation tree-structured lexicon search.
Syst. Comput. Jpn., 2005
Detection and recognition of correction utterances on misrecognition of spoken dialog system.
Syst. Comput. Jpn., 2005
An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems.
IEICE Trans. Inf. Syst., 2005
Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task.
IEICE Trans. Inf. Syst., 2005
Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Robust distant speaker recognition based on position dependent cepstral mean normalization.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Multimodal interface for organization name input based on combination of isolated word recognition and continuous base-word recognition.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the Information Retrieval Technology, 2005
2004
Estimating high-confidence portions based on agreement among outputs of multiple LVCSR models.
Syst. Comput. Jpn., 2004
Robust spoken document retrieval methods for misrecognition and out-of-vocabulary keywords.
Syst. Comput. Jpn., 2004
Confidence measure and rejection based on correctness probability of recognition candidates.
Syst. Comput. Jpn., 2004
A Statistical Method of Evaluating Pronunciation Proficiency for English Words Spoken by Japanese.
IEICE Trans. Inf. Syst., 2004
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004
Unsupervised speaker adaptation using high confidence portion recognition results by multiple recognition systems.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Speech interface for name input based on combination of recognition methods using syllable-based n-gram and word dictionary.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Keyword recognition and extraction by multiple-LVCSRs with 60,000 words in speech-driven WEB retrieval task.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Integrating Cross-Lingually Relevant News Articles and Monolingual Web Documents in Bilingual Lexicon Acquisition.
Proceedings of the COLING 2004, 2004
2003
Syst. Comput. Jpn., 2003
Proceedings of the SIGDIAL 2003 Workshop, 2003
Generation of natural response timing using decision tree based on prosodic and linguistic information.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Text-independent speaker recognition by speaker-specific GMM and speaker adapted syllable-based HMM.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Evaluating multiple LVCSR model combination in NTCIR-3 speech-driven web retrieval task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Comparison of effects of acoustic and language knowledge on spontaneous speech perception/recognition between human and automatic speech recognizer.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Syst. Comput. Jpn., 2002
Differences of speech rate, interphoneme distance and likelihood caused by speaking style, their relationship, and recognition performance.
Syst. Comput. Jpn., 2002
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002
A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
2001
Proceedings of the Text, Speech and Dialogue, 4th International Conference, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciation.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
A fast calculation method in LVCSRS by time-skipping and clustering of probability density distributions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Experimental evaluation on confidence of agreement among multiple Japanese LVCSR models.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
A Semantic Interpreter and a Cooperative Response Generator for a Robust Spoken Dialogue System.
Int. J. Pattern Recognit. Artif. Intell., 2000
Relationship among speaking style, inter-phoneme's distance and speech recognition performance.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
A system for retrieving broadcast news speech documents using voice input keywords and similarity between words.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Instantaneous estimation of prosodic pronunciation habits for Japanese students to learn English pronunciation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Performance comparison among HMM, DTW, and human abilities in terms of identifying stress patterns of word utterances.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Overview of an intelligent system for information retrieval based on human-machine dialogue through spoken language.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Usability of Browser-Based Pen-Touch/Speech User Interfaces for Form-Based Application in Mobile Environment.
Proceedings of the Advances in Multimodal Interfaces, 2000
1999
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Text-independent speaker recognition using non-linear frame likelihood transformation.
Speech Commun., 1998
Comparison of continuous speech recognition systems with unknown-word processing for speech disfluencies.
Syst. Comput. Jpn., 1998
Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Discriminative training of GMM using a modified EM algorithm for speaker recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1997
Syst. Comput. Jpn., 1997
An English conversation and pronunciation CAI system using speech recognition technology.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
A Robust Dialogue System with Spontaneous Speech Understanding and Cooperative Response.
Proceedings of the Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications@ACL/EACL 1997, 1997
1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
IEICE Trans. Inf. Syst., 1995
Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System.
IEICE Trans. Inf. Syst., 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Investigation on unknown word processing and strategies for spontaneous speech understanding.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
1994
Estimation of the probability density function and a posteriori probability by neural networks, and applications to vowel recognition.
Syst. Comput. Jpn., 1994
A context-free grammar-driven, one-pass HMM-based continuous speech recognition method.
Syst. Comput. Jpn., 1994
A comparison study of output probability functions in HMMs through spoken digit recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Concept and grammar acquisition based on combining with visual and auditory information.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the IEEE International Conference on Acoustics, 1993
1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Relationship among phoneme/word recognition rate, perplexity and sentence recognition and comparison of language models.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
1991
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
1990
Syst. Comput. Jpn., 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Comparison among time-delay neural networks, LVQ2 discrete parameter HMM and continuous parameter HMM.
Proceedings of the 1990 International Conference on Acoustics, 1990
1989
Proceedings of the First European Conference on Speech Communication and Technology, 1989
Proceedings of the First European Conference on Speech Communication and Technology, 1989
1988
Proceedings of the 9th International Conference on Pattern Recognition, 1988
1987
Speaker-independent word recognition by less cost and stochastic dynamic time warping method.
Proceedings of the European Conference on Speech Technology, 1987
Spoken sentence recognition by time-synchronous parsing algorithm of context-free grammar.
Proceedings of the IEEE International Conference on Acoustics, 1987
1986
Syllable-based connected spoken word recognition by two pass O(n) DP matching and hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 1986
Proceedings of the IEEE International Conference on Acoustics, 1986
1985
Syst. Comput. Jpn., 1985
1984
Connected spoken word recognition algorithms by constant time delay DP, O(n) DP and augmented continuous DP matching.
Inf. Sci., 1984
1983
A Recognition Method of Connected Spoken Words With Syntactical Constraints by Augmented Continuous DP Algorithm.
Proceedings of the 8th International Joint Conference on Artificial Intelligence. Karlsruhe, 1983
A connected spoken word recognition method by O(n) dynamic programming pattern matching algorithm.
Proceedings of the IEEE International Conference on Acoustics, 1983
1979
A Parallel Tree Search Method.
Proceedings of the Sixth International Joint Conference on Artificial Intelligence, 1979
1978
A word recognition method from a classified phoneme string in the LITHAN speech understanding system.
Proceedings of the IEEE International Conference on Acoustics, 1978