Seiichi Nakagawa

Orcid: 0000-0002-6533-5536

According to our database1, Seiichi Nakagawa authored at least 242 papers between 1978 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Significance of relative phase features for shouted and normal speech classification.
EURASIP J. Audio Speech Music. Process., December, 2024

Elderly Speech Recognition Using Whisper and Speaker Adaptation.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

2023
A Study of Speech Recognition, Speech Translation, and Speech Summarization of TED English Lectures.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Attention-based CNN and Relative Phase Feature Modeling for Improved Imagined Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition.
Speech Commun., 2022

Summarization of Spoken Lectures Based on MMR Method and Important/Unimportant Sentence Classification Using BERT.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021
Replay attack detection using variable-frequency resolution phase and magnitude features.
Comput. Speech Lang., 2021

Classification of Imagined and Heard Speech Using Amplitude Spectrum and Relative Phase of EEG.
Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021

Improvement of Elderly Speech Recognition Using Gammatone Filterbank Adaptation.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

2020
Effectiveness of Fine Linear Frequency Spectral Feature for Acoustic Event Detection.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

2019
Discriminative Learning of Filterbank Layer within Deep Neural Network Based Speech Recognition for Speaker Adaptation.
IEICE Trans. Inf. Syst., 2019

Replay attack detection with auditory filter-based relative phase features.
EURASIP J. Audio Speech Music. Process., 2019

Replay Attack Detection Using Linear Prediction Analysis-Based Relative Phase Features.
IEEE Access, 2019

Replay Attack Detection Using Magnitude and Phase Information with Attention-based Adaptive Filters.
Proceedings of the IEEE International Conference on Acoustics, 2019

Evaluation of Real Robot Agent Interface for Spoken Dialogue System.
Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

2018
Phase and reverberation aware DNN for distant-talking speech enhancement.
Multim. Tools Appl., 2018

Rapid Speaker Adaptation of Neural Network Based Filterbank Layer for Automatic Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Replay Attacks Detection Using Phase and Magnitude Features with Various Frequency Resolutions.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Multiple Phase Information Combination for Replay Attacks Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech Recognition for English Uttered by Japanese with Various Proficiency Levels.
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

2017
Spoofing Speech Detection Using Modified Relative Phase Information.
IEEE J. Sel. Top. Signal Process., 2017

Noise robust voice activity detection using joint phase and magnitude based feature enhancement.
J. Ambient Intell. Humaniz. Comput., 2017

Automatic Explanation Spot Estimation Method Targeted at Text and Figures in Lecture Slides.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Phase aware deep neural network for noise robust voice activity detection.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

A deep neural network integrated with filterbank learning for speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Lyric recognition in monophonic singing using pitch-dependent DNN.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Detection of overlapping acoustic events based on NMF with shared basis vectors.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

2016
DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speech analysis of sung-speech and lyric recognition in monophonic singing.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Investigation of glottal features and annotation procedures for speech emotion recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Domain adaptation of a speech translation system for lectures by utilizing frequently appearing parallel phrases in-domain.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Relative phase information for detecting human speech and spoofed speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Robust speech recognition using DNN-HMM acoustic model combining noise-aware training with spectral subtraction.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Deep neural network based acoustic model using speaker-class information for short time utterance.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Speech recognition for mixed speech and music by NMF using various cost functions and noise adaptive training methods.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Single-channel dereverberation by feature mapping using cascade neural networks for robust distant speaker identification and speech recognition.
EURASIP J. Audio Speech Music. Process., 2014

Effect of acoustic and linguistic contexts on human and machine speech recognition.
Comput. Speech Lang., 2014

Speaker Identification by Combining Various Vocal Tract and Vocal Source Features.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Sopoken Term Detection Based on a Syllable N-gram Index at the NTCIR-11 SpokenQuery&Doc Task.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

Speech recognition based on Itakura-Saito divergence and dynamics/sparseness constraints from mixed sound of speech and music by non-negative matrix factorization.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Elimination of person names in spoken documents for privacy protection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric.
Speech Commun., 2013

Development and Evaluation of Spoken Dialog Systems with One or Two Agents through Two Domains.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Spoken Term Detection by N-gram Index with Exact Distance for NTCIR-SpokenDoc2.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Overview of the NTCIR-10 SpokenDoc-2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Development and evaluation of spoken dialog systems with one or two agents.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Robust/fast out-of-vocabulary spoken term detection by N-gram index with exact distance through text/speech input.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Single channel dereverberation method in log-melspectral domain using limited stereo data for distant speaker identification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Speaker identification using pseudo pitch synchronized phase information in noisy environments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Topic-Dependent-Class-Based $n$-Gram Language Model.
IEEE Trans. Speech Audio Process., 2012

Speaker Identification and Verification by Combining MFCC and Phase Information.
IEEE Trans. Speech Audio Process., 2012

Class-Based N-Gram Language Model for New Words Using Out-of-Vocabulary to In-Vocabulary Similarity.
IEICE Trans. Inf. Syst., 2012

Risk-Based Semi-Supervised Discriminative Language Modeling for Broadcast Transcription.
IEICE Trans. Inf. Syst., 2012

Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition.
IEICE Trans. Inf. Syst., 2012

Improving the Readability of ASR Results for Lectures Using Multiple Hypotheses and Sentence-Level Knowledge.
IEICE Trans. Inf. Syst., 2012

Development of large vocabulary continuous speech recognition system for Mongolian language.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Developing Partially-Transcribed Speech Corpus from Edited Transcriptions.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Multi-objective optimization for semi-supervised discriminative language modeling.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fast NMF based approach and improved VQ based approach for speech recognition from mixed sound.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An online evaluation system for English pronunciation intelligibility for Japanese English learners.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Soft-clustering technique for training data in Age-and gender-independent speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm.
IEICE Trans. Inf. Syst., 2011

High speed spoken term detection by combination of n-gram array of a syllable lattice and LVCSR result for NTCIR-SpokenDoc.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Speech Recognition in Mixed Sound of Speech and Music Based on Vector Quantization and Non-Negative Matrix Factorization.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Lattice-Based Risk Minimization Training for Unsupervised Language Model Adaptation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

New Feature Parameters for Pronunciation Evaluation in English Presentations at International Conferences.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Hidden Boosted MMI and Hierarchical State Posterior Feature for Automatic Speech Recognition Based on Hidden Conditional Neural Fields.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Efficient out-of-vocabulary term detection by n-gram array indices with distance from a syllable lattice.
Proceedings of the IEEE International Conference on Acoustics, 2011

Automatic speech recognition using Hidden Conditional Neural Fields.
Proceedings of the IEEE International Conference on Acoustics, 2011

Detection of precisely transcribed parts from inexact transcribed corpus.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Topic-Dependent Language Model with Voting on Noun History.
ACM Trans. Asian Lang. Inf. Process., 2010

Speaker Recognition by Combining MFCC and Phase Information in Noisy Conditions.
IEICE Trans. Inf. Syst., 2010

Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training.
IEICE Trans. Inf. Syst., 2010

Distant Speech Recognition Using a Microphone Array Network.
IEICE Trans. Inf. Syst., 2010

Topic dependent class based language model evaluation on automatic speech recognition.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Out-of-vocabulary term detection by n-gram array with distance from continuous syllable recognition results.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Evaluation of Privacy Protection Techniques for Speech Signals.
Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications, 2010

Speech recognition using long-term phase information.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Integration of cache-based model and topic dependent class model with soft clustering and soft voting.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Lecture subtopic retrieval by retrieval keyword expansion using subordinate concept.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Constructing Japanese test collections for spoken term detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Automatic evaluation of English pronunciation by Japanese speakers using various acoustic features and pattern recognition techniques.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Improving the readability of class lecture ASR results using a confusion network.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker identification by combining MFCC and phase information in noisy environments.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Effective use of pause information in language modelling for speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Topic dependent language model based on topic voting on noun history.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Estimating the position and orientation of an acoustic source with a microphone array network.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

High improvement of speaker identification and verification by combining MFCC and phase information.
Proceedings of the IEEE International Conference on Acoustics, 2009

Language Model Based on Word Order Sensitive Matrix Representation in Latent Semantic Analysis for Speech Recognition.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

Response timing generation and response type selection for a spontaneous spoken dialog system.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Analysis and Robust Extraction of Changing Named Entities.
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009

Privacy Protection for Speech Information.
Proceedings of the Fifth International Conference on Information Assurance and Security, 2009

2008
Robust Speech Recognition by Combining Short-Term and Long-Term Spectrum Based Position-Dependent CMN with Conventional CMN.
IEICE Trans. Inf. Syst., 2008

Linear Discriminant Analysis Using a Generalized Mean of Class Covariances and Its Application to Speech Recognition.
IEICE Trans. Inf. Syst., 2008

Noisy Speech Recognition Based on Integration/Selection of Multiple Noise Suppression Methods Using Noise GMMs.
IEICE Trans. Inf. Syst., 2008

Developing Corpus of Japanese Classroom Lecture Speech Contents.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Blind dereverberation based on CMN and spectral subtraction by multi-channel LMS algorithm.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A browsing system for classroom lecture speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Evaluating spoken language model based on filler prediction model in speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Analysis of relationship between impression of human-to-human conversations and prosodic change and its modeling.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speech recognition performance of CJLC: corpus of Japanese lecture contents.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Class lecture summarization taking into account consecutiveness of important sentences.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Robust Extraction of Named Entity Including Unfamiliar Word.
Proceedings of the ACL 2008, 2008

2007
Robust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM.
Speech Commun., 2007

Indonesian-Japanese Transitive Translation using English for CLIR.
Inf. Media Technol., 2007

A Machine Learning Approach for an Indonesian-English Cross Language Question Answering System.
IEICE Trans. Inf. Syst., 2007

A Spoken Dialog System for Chat-Like Conversations Considering Response Timing.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Analysis of effect of compensation parameter estimation for CMN on speech/speaker recognition.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Power linear discriminant analysis.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

One-pass LVCSR algorithm using linear lexicon search and 1-best approximation tree-structured lexicon search.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Selection of optimal dimensionality reduction method using chernoff bound for segmental unit input HMM.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Construction of spoken language model including fillers using filler prediction model.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A statistical method of evaluating pronunciation proficiency for presentation in English.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Speaker recognition by combining MFCC and phase information.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Robust Distant Speech Recognition by Combining Position-Dependent CMN with Conventional CMN.
Proceedings of the IEEE International Conference on Acoustics, 2007

Generalization of Linear Discriminant Analysis used in Segmental Unit Input HMM for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

A machine learning approach for indonesian question answering system.
Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, 2007

Expanding Indonesian-Japanese Small Translation Dictionary Using a Pivot Language.
Proceedings of the ACL 2007, 2007

2006
Response Timing Detection Using Prosodic and Linguistic Information for Human-friendly Spoken Dialog Systems.
Inf. Media Technol., 2006

Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM.
IEICE Trans. Inf. Syst., 2006

Robust Distant Speech Recognition by Combining Multiple Microphone-Array Processing with Position-Dependent CMN.
EURASIP J. Adv. Signal Process., 2006

Summarization of spoken Lectures Based on Linguistic Surface and prosodic Information.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

A spoken Dialog System with Automatic Recovery Mechanism from misrecognition.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Combining outputs of multiple LVCSR models by machine learning.
Syst. Comput. Jpn., 2005

Large-vocabulary continuous speech recognition using linear lexicon search and 1-best approximation tree-structured lexicon search.
Syst. Comput. Jpn., 2005

Detection and recognition of correction utterances on misrecognition of spoken dialog system.
Syst. Comput. Jpn., 2005

An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems.
IEICE Trans. Inf. Syst., 2005

Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task.
IEICE Trans. Inf. Syst., 2005

Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Robust distant speaker recognition based on position dependent cepstral mean normalization.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A statistical method of evaluating pronunciation proficiency for Japanese words.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Multimodal interface for organization name input based on combination of isolated word recognition and continuous base-word recognition.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Query Transitive Translation Using IR Score for Indonesian-Japanese CLIR.
Proceedings of the Information Retrieval Technology, 2005

2004
Estimating high-confidence portions based on agreement among outputs of multiple LVCSR models.
Syst. Comput. Jpn., 2004

Robust spoken document retrieval methods for misrecognition and out-of-vocabulary keywords.
Syst. Comput. Jpn., 2004

Confidence measure and rejection based on correctness probability of recognition candidates.
Syst. Comput. Jpn., 2004

A Statistical Method of Evaluating Pronunciation Proficiency for English Words Spoken by Japanese.
IEICE Trans. Inf. Syst., 2004

An Empirical Study on Multiple LVCSR Model Combination by Machine Learning.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Unsupervised speaker adaptation using high confidence portion recognition results by multiple recognition systems.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Speech interface for name input based on combination of recognition methods using syllable-based n-gram and word dictionary.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Keyword recognition and extraction by multiple-LVCSRs with 60, 000 words in speech-driven WEB retrieval task.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Robust distant speech recognition based on position dependent CMN.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Integrating Cross-Lingually Relevant News Articles and Monolingual Web Documents in Bilingual Lexicon Acquisition.
Proceedings of the COLING 2004, 2004

2003
Speaker change detection and speaker clustering using VQ distortion measure.
Syst. Comput. Jpn., 2003

Interpreter for Highly Portable Spoken Dialogue System.
Proceedings of the SIGDIAL 2003 Workshop, 2003

Generation of natural response timing using decision tree based on prosodic and linguistic information.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Text-independent speaker recognition by speaker-specific GMM and speaker adapted syllable-based HMM.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evaluating multiple LVCSR model combination in NTCIR-3 speech-driven web retrieval task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Comparison of effects of acoustic and language knowledge on spontaneous speech perception/recognition between human and automatic speech recognizer.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Detection and recognition of correction utterance in spontaneously spoken dialog.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Confidence of agreement among multiple LVCSR models and model combination by SVM.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Speech recognition under noisy environments using segmental unit input HMM.
Syst. Comput. Jpn., 2002

Differences of speech rate, interphoneme distance and likelihood caused by speaking style, their relationship, and recognition performance.
Syst. Comput. Jpn., 2002

English Speech Database Read by Japanese Learners for CALL System Development.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Syllable recognition using syllable-segment statistics and syllable-based HMM.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Speaker independent speech recognition using features based on glottal sound source.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
A Development Tool for Spoken Dialogue Systems and Its Evaluation.
Proceedings of the Text, Speech and Dialogue, 4th International Conference, 2001

Automatic construction of CALL system from TV news program with captions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciation.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A fast calculation method in LVCSRS by time-skipping and clustering of probability density distributions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Experimental evaluation on confidence of agreement among multiple Japanese LVCSR models.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001

Discriminative training of HMM using maximum normalized likelihood algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
A Semantic Interpreter and a Cooperative Response Generator for a Robust Spoken Dialogue System.
Int. J. Pattern Recognit. Artif. Intell., 2000

Relationship among speaking style, inter-phoneme's distance and speech recognition performance.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A system for retrieving broadcast news speech documents using voice input keywords and similarity between words.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Instantaneous estimation of prosodic pronunciation habits for Japanese students to learn English pronunciation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Quality improvement of PSOLA analysis-synthesis using partial zero-phase conversion.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Performance comparison among HMM, DTW, and human abilities in terms of identifying stress patterns of word utterances.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A portable development tool for spoken dialogue systems.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Overview of an intelligent system for information retrieval based on human-machine dialogue through spoken language.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Usability of Browser-Based Pen-Touch/Speech User Interfaces for Form-Based Application in Mobile Environment.
Proceedings of the Advances in Multimodal Interfaces, 2000

1999
A Retrieval System of Broadcast News Speech Documents through Keyboard and Voice.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

HMM composition of segmental unit input HMM for noisy speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Text-independent speaker recognition using non-linear frame likelihood transformation.
Speech Commun., 1998

Comparison of continuous speech recognition systems with unknown-word processing for speech disfluencies.
Syst. Comput. Jpn., 1998

Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Discriminative training of GMM using a modified EM algorithm for speaker recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Text-independent speaker recognition using multiple information sources.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Speech recognition using hidden Markov models based on segmental statistics.
Syst. Comput. Jpn., 1997

An English conversation and pronunciation CAI system using speech recognition technology.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Automatic detection of accent in English words spoken by Japanese students.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speaker verification using frame and utterance level likelihood normalization.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

A Robust Dialogue System with Spontaneous Speech Understanding and Cooperative Response.
Proceedings of the Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications@ACL/EACL 1997, 1997

1996
Prosodic manipulation system of speech material for perceptual experiments.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Automatic detection of accent nuclei at the head of words for speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Evaluation of segmental unit input HMM.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
A Comparative Study of Output Probability Functions in HMMs.
IEICE Trans. Inf. Syst., 1995

Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System.
IEICE Trans. Inf. Syst., 1995

Comparative evaluation of segmental unit input HMM and conditional density HMM.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Investigation on unknown word processing and strategies for spontaneous speech understanding.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Estimation of the probability density function and a <i>posteriori</i> probability by neural networks, and applications to vowel recognition.
Syst. Comput. Jpn., 1994

A context-free grammar-driven, one-pass HMM-based continuous speech recognition method.
Syst. Comput. Jpn., 1994

A comparison study of output probability functions in HMMs through spoken digit recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Three language identification methods based on HMMs.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Concept and grammar acquisition based on combining with visual and auditory information.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Evaluation of unknown word processing in a spoken word recognition system.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993
Spoken language identification using ergodic HMM with emphasized state transition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Evaluation of VQ-distortion based HMM.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A new speech recognition method based on VQ-distortion measure and HMM.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Speech recognition using various sequential networks.
Syst. Comput. Jpn., 1992

Speaker-independent, text-independent language identification by HMM.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Relationship among phoneme/word recognition rate, perplexity and sentence recognition and comparison of language models.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
The syntax-oriented spoken Japanese understanding system SPOJOS-SYNO II.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1990
Segmentation of continuous speech by HMM and bayesian probability.
Syst. Comput. Jpn., 1990

Diction for phoneme/syllable/word-category and identification of language using HMM.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Sentence recognition method using word cooccurrence probability and its evaluation.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Speaker adaptation of continuous parameter HMM.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Comparison among time-delay neural networks, LVQ2 discrete parameter HMM and continuous parameter HMM.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
The syntax-oriented speech understanding system - SPOJUS-SYNO.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

A lOObit/s speech coding using a speech recognition technique.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

1988
A method for continuous speech segmentation using HMM.
Proceedings of the 9th International Conference on Pattern Recognition, 1988

1987
Speaker-independent word recognition by less cost and stochastic dynamic time warping method.
Proceedings of the European Conference on Speech Technology, 1987

Spoken sentence recognition by time-synchronous parsing algorithm of context-free grammar.
Proceedings of the IEEE International Conference on Acoustics, 1987

1986
Syllable-based connected spoken word recognition by two pass O(n) DP matching and hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 1986

On quick word spotting techniques.
Proceedings of the IEEE International Conference on Acoustics, 1986

1985
A connected spoken word recognition algorithm by augmented continuous DP matching.
Syst. Comput. Jpn., 1985

1984
Connected spoken word recognition algorithms by constant time delay DP, O(n) DP and augmented continuous DP matching.
Inf. Sci., 1984

1983
A Recognition Method of Connected Spoken Words With Syntactical Constraints by Augmented Continuous DP Algorithm.
Proceedings of the 8th International Joint Conference on Artificial Intelligence. Karlsruhe, 1983

A connected spoken word recognition method by O(n) dynamic programming pattern matching algorithm.
Proceedings of the IEEE International Conference on Acoustics, 1983

1979
A Parallel Tree Search Method.
Proceedings of the Sixth International Joint Conference on Artificial Intelligence, 1979

1978
A word recognition method from a classified phoneme string in the Lithan speech understanding system.
Proceedings of the IEEE International Conference on Acoustics, 1978


  Loading...