Sadaoki Furui

According to our database1, Sadaoki Furui authored at least 248 papers between 1980 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Awards

IEEE Fellow

IEEE Fellow 1993, "For contributions to speech analysis, speech recognition, and speaker identification.".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2020
Modeling of Perceptual Speaker Embedding and Its Application to Speech and Speaker Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

2016
Improving Eye Motion Sequence Recognition Using Electrooculography Based on Context-Dependent HMM.
Comput. Intell. Neurosci., 2016

2013
A noise-robust speech recognition approach incorporating normalized speech/non-speech likelihood into hypothesis scores.
Speech Commun., 2013

A statistical approach for person verification using human behavioral patterns.
EURASIP J. Image Video Process., 2013

Statistical Person Verification Using Behavioral Patterns from Complex Human Motion.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2013, 2013

2012
Distance-based Factor Graph Linearization and Sampled Max-sum Algorithm for Efficient 3D Potential Decoding of Macromolecules.
Inf. Media Technol., 2012

Active Learning Using Phone-Error Distribution for Speech Modeling.
IEICE Trans. Inf. Syst., 2012

Selected Topics from LVCSR Research for Asian Languages at Tokyo Tech.
IEICE Trans. Inf. Syst., 2012

Robust Gait-Based Person Identification against Walking Speed Variations.
IEICE Trans. Inf. Syst., 2012

Vocabulary expansion through automatic abbreviation generation for Chinese voice search.
Comput. Speech Lang., 2012

Question answering using statistical language modelling.
Comput. Speech Lang., 2012

HMM Based Continuous EOG Recognition for Eye-input Speech Interface.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Pipeline decomposition of speech decoders and their implementation based on delayed evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Efficient model training for HMM-based person identification by gait.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Semi-synchronous speech and pen input for mobile user interfaces.
Speech Commun., 2011

Committee-Based Active Learning for Speech Recognition.
IEICE Trans. Inf. Syst., 2011

Person authentication using 3D human motion.
Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011

Data-intensive approaches for ASR.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Sentence Selection by Direct Likelihood Maximization for Language Model Adaptation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Structural Joint Factor Analysis for Speaker Recognition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speech Processing Tools - An Introduction to Interoperability.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Acoustic Forest for SMAP-Based Speaker Verification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Cross-Channel Spectral Subtraction for meeting speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Structural MAP adaptation in GMM-supervector based speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Designing text corpus using phone-error distribution for acoustic modeling.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Predicting the phonetic realizations of word-final consonants in context - A challenge for French grapheme-to-phoneme converters.
Speech Commun., 2010

Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments.
IEEE J. Sel. Top. Signal Process., 2010

Unsupervised Acoustic Model Adaptation Based on Ensemble Methods.
IEEE J. Sel. Top. Signal Process., 2010

Gaussian Mixture Optimization Based on Efficient Cross-Validation.
IEEE J. Sel. Top. Signal Process., 2010

A New Hybrid Method for Machine Transliteration.
IEICE Trans. Inf. Syst., 2010

Adaptation to Pronunciation Variations in Indonesian Spoken Query-Based Information Retrieval.
IEICE Trans. Inf. Syst., 2010

Modeling liaison in French by using decision trees.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

VAD-measure-embedded decoder with online model adaptation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

An empirical comparison of the t<sup>3</sup>, juicer, HDecode and sphinx3 decoders.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Topic and style-adapted language modeling for Thai broadcast news ASR.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Exploring web-browser based runtimes engines for creating ubiquitous speech interfaces.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

High-Level Feature Extraction Using SIFT GMMs and Audio Models.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Robust Gait Recognition Against Speed Variation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Investigations on ensemble based unsupervised adaptation methods.
Proceedings of the IEEE International Conference on Acoustics, 2010

Speech modeling based on committee-based active learning.
Proceedings of the IEEE International Conference on Acoustics, 2010

Jointly Optimizing a Two-Step Conditional Random Field Model for Machine Transliteration and Its Fast Decoding Algorithm.
Proceedings of the ACL 2010, 2010

Optimizing Question Answering Accuracy by Maximizing Log-Likelihood.
Proceedings of the ACL 2010, 2010

2009
Corrigendum to: "Thai speech processing technology: A review" [Speech Communication 49 (1) (2007) 8-27].
Speech Commun., 2009

Lexical units for Thai LVCSR.
Speech Commun., 2009

Harnessing graphics processors for the fast computation of acoustic likelihoods in speech recognition.
Comput. Speech Lang., 2009

Automatic Chinese Abbreviation Generation Using Conditional Random Field.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Robust Speech Recognition in the Car Environment.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009

Target speech GMM-based spectral compensation for noise robust speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Speaker adaptation based on two-step active learning.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Robust speech recognition using VAD-measure-embedded decoder.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Selected topics from 40 years of research on speech and speaker recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

40 Years of Progress in Automatic Speaker Recognition.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Unsupervisec cross-validation adaptation algorithms for improved adaptation performance.
Proceedings of the IEEE International Conference on Acoustics, 2009

Generalization of specialized on-the-fly composition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Independent component analysis for noisy speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Fast acoustic computations using graphics processors.
Proceedings of the IEEE International Conference on Acoustics, 2009

CLEF 2009 Question Answering Experiments at Tokyo Institute of Technology.
Proceedings of the Working Notes for CLEF 2009 Workshop co-located with the 13th European Conference on Digital Libraries (ECDL 2009) , Corfù, Greece, September 30, 2009

Generalization problem in ASR acoustic model training and adaptation.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Combining a Two-step Conditional Random Field Model and a Joint Source Channel Model for Machine Transliteration.
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009

Discriminative Lexicon Adaptation for Improved Character Accuracy - A New Direction in Chinese Language Modeling.
Proceedings of the ACL 2009, 2009

2008
Speaker recognition.
Scholarpedia, 2008

Evaluation of a Noise-Robust Multi-Stream Speaker Verification Method Using <i>F</i><sub>0</sub> Information.
IEICE Trans. Inf. Syst., 2008

Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages.
EURASIP J. Audio Speech Music. Process., 2008

Differences between acoustic characteristics of spontaneous and read speech and their effects on speech recognition performance.
Comput. Speech Lang., 2008

Tokyo Tech at TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

TAC 2008 Question Answering Experiments at Tokyo Institute of Technology.
Proceedings of the First Text Analysis Conference, 2008

Development of a speech recognition system for Icelandic using machine translated text.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Automatically estimating number of scenes for rushes summarization.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Thai Broadcast News Corpus Construction and Evaluation.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Using Singular Value Decomposition to Compute Answer Similarity in a Language Independent Approach to Question Answering.
Proceedings of the Large-Scale Knowledge Resources. Construction and Application, 2008

Automatic Score Scene Detection for Baseball Video.
Proceedings of the Large-Scale Knowledge Resources. Construction and Application, 2008

Time-lag adaptation for semi-synchronous speech and pen input.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Improvement of eigenvoice-based speaker adaptation by parameter space clustering.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Aggregated cross-validation and its efficient application to Gaussian mixture optimization.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Implementation and evaluation of fast on-the-fly WFST composition algorithms.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Robust spoken term detection using combination of phone-based and word-based recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A new mutual information measure for independent component alalysis.
Proceedings of the IEEE International Conference on Acoustics, 2008

Collecting a Why-Question Corpus for Development and Evaluation of an Automatic QA-System.
Proceedings of the ACL 2008, 2008

2007
Thai speech processing technology: A review.
Speech Commun., 2007

Robust Speech Recognition Using Factorial HMMs for Home Environments.
EURASIP J. Adv. Signal Process., 2007

Audio-Visual Speech Recognition Using Lip Information Extracted from Side-Face Images.
EURASIP J. Audio Speech Music. Process., 2007

TokyoTech's TRECVID2007 Notebook.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

TREC 2007 Question Answering Experiments at Tokyo Institute of Technology.
Proceedings of The Sixteenth Text REtrieval Conference, 2007

NTCIR-6 CLQA Question Answering Experiments at the Tokyo Institute of Technology.
Proceedings of the 6th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2007

Dynamic language model adaptation using presentation slides for lecture speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Towards better language modeling for Thai LVCSR.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Predictive minimum Bayes risk classification for robust speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Semi-Synchronous Speech and Pen Input.
Proceedings of the IEEE International Conference on Acoustics, 2007

The Effect of Spectral Space Reduction in Spontaneous Speech on Recognition Performances.
Proceedings of the IEEE International Conference on Acoustics, 2007

Combining Gaussian Mixture Model with Global Variance Term to Improve the Quality of an HMM-Based Polyglot Speech Synthesizer.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speech Recognition using FHMMS Robust Against Nonstationary Noise.
Proceedings of the IEEE International Conference on Acoustics, 2007

Home-environment adaptation of phoneme factorial hidden Markov models.
Proceedings of the 15th European Signal Processing Conference, 2007

CLEF2007 Question Answering Experiments at Tokyo Institute of Technology.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

A robust scene recognition system for baseball broadcast using data-driven approach.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

A language modeling approach to question answering on speech transcripts.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Introduction of the METI project "development of fundamental speech recognition technology".
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

The Titech large vocabulary WFST speech recognition system.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
A multi-stage approach for Thai spoken language understanding.
Speech Commun., 2006

New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer.
Speech Commun., 2006

Sentence-extractive automatic speech summarization and evaluation techniques.
Speech Commun., 2006

Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast.
IEICE Trans. Inf. Syst., 2006

TokyoTech's TRECVID2006 Notebook.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

TREC 2006 Question Answering Experiments at Tokyo Institute of Technology.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Recent Advances in Automatic speech Summarization.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Factoid Question Answering with Web, Mobile and Speech Interfaces.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Class Model Adaptation for Speech Summarisation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Robust scene recognition using language models for scene contexts.
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006

A weight estimation method using LDA for multi-band speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Perplexity based linguistic model adaptation for speech summarisation.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Automatic Sentence Segmentation of Speech for Automatic Summarization.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Towards Optimal Bayes Decision for Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Topic and Stylistic Adaptation for Speech Summarisation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Stream-Weight and Threshold Estimation Method Using Adaboost for Multi-Stream Speaker Verification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Rapid Development of Web-Based Monolingual Question Answering Systems.
Proceedings of the Advances in Information Retrieval, 2006

CLEF2006 Question Answering Experiments at Tokyo Institute of Technology.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006

2005
Toward Robust Speech Recognition and Understanding.
J. VLSI Signal Process., 2005

Predictive hidden Markov model selection for speech recognition.
IEEE Trans. Speech Audio Process., 2005

Analysis and recognition of spontaneous speech using Corpus of Spontaneous Japanese.
Speech Commun., 2005

Tree-Structured Clustering Methods for Piecewise Linear-Transformation-Based Noise Adaptation.
IEICE Trans. Inf. Syst., 2005

Recent Progress in Corpus-Based Spontaneous Speech Recognition.
IEICE Trans. Inf. Syst., 2005

Why Is the Recognition of Spontaneous Speech so Hard?
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

TREC 2005 Question Answering Experiments at Tokyo Institute of Technology.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

A Unified Approach to Japanese and English Question Answering.
Proceedings of the Fifth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2005

Analysis of spectral space reduction in spontaneous speech and its effects on speech recognition performances.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Cross-language synthesis with a polyglot synthesizer.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Language model adaptation for resource deficient languages using translated data.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Cluster-based modeling for ubiquitous speech recognition.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Stream-weight optimization by LDA and adaboost for multi-stream speaker verification.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Robust highlight extraction using multi-stream hidden Markov models for baseball video.
Proceedings of the 2005 International Conference on Image Processing, 2005

Noisy Speech Recognition Based on Robust End-point Detection and Model Adaptation.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

A Stream-Weight Optimization Method for Multi-Stream HMMS Based on Likelihood Value Normalization.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Polyglot Synthesis Using a Mixture of Monolingual Corpora.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Sentence extraction-based presentation summarization techniques and evaluation metrics.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

A Statistical Classification Approach to Question Answering using Web Data.
Proceedings of the 4th International Conference on Cyberworlds (CW 2005), 2005

Multimodal Speaker Verification Using Ear Image Features Extracted by PCA and ICA.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 2005

2004
Guest Editorial.
J. VLSI Signal Process., 2004

Multi-Modal Speech Recognition Using Optical-Flow Analysis for Lip Images.
J. VLSI Signal Process., 2004

Speech-to-text and speech-to-speech summarization of spontaneous speech.
IEEE Trans. Speech Audio Process., 2004

Introduction to the Special Issue on Spontaneous Speech Processing.
IEEE Trans. Speech Audio Process., 2004

Piecewise-linear transformation-based HMM adaptation for noisy speech.
Speech Commun., 2004

Phonology and Morphology Modeling in a Very Large Vocabulary Hungarian Dictation System.
IEICE Trans. Inf. Syst., 2004

Dynamic Bayesian Network-Based Acoustic Models Incorporating Speaking Rate Effects.
IEICE Trans. Inf. Syst., 2004

Noise Robust Speech Recognition Using <i>F</i><sub>0</sub> Contour Information.
IEICE Trans. Inf. Syst., 2004

Speech Summarization: An Approach through Word Extraction and a Method for Evaluation.
IEICE Trans. Inf. Syst., 2004

Evaluation of tree-structured piecewise linear transformation-based noise adaptation on AURORA2 database.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Belief-based nonlinear rescoring in Thai speech understanding.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Spontaneous speech recognition using a massively parallel decoder.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Unsupervised language model adaptation methods for spontaneous speech.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Noise-robust speaker verification using F0 features.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation-based noise adaptation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMs.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A new approach to automatic speech summarization.
IEEE Trans. Multim., 2003

A Statistical Approach to Automatic Speech Summarization.
EURASIP J. Adv. Signal Process., 2003

Tree-structured noise-adapted HMM modeling for piecewise linear-transformation-based adaptation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Combination of finite state automata and neural network for spoken language understanding.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evaluation of the stochastic morphosyntactic language model on a one million word hungarian dictation task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Time adjustable mixture weights for speaking rate fluctuation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evaluation method for automatic speech summarization.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Robust methods in automatic speech recognition and understanding.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Predictive hidden Markov model selection for decision tree state tying.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unsupervised class-based language model adaptation for spontaneous speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Automatic speech summarization based on sentence extraction and compaction.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Deriving disambiguous queries in a spoken interactive ODQA system.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Audio-visual speech recognition using lip movement extracted from side-face images.
Proceedings of the AVSP 2003, 2003

2002
On-line incremental speaker adaptation for broadcast news transcription.
Speech Commun., 2002

Finite-state transducer based hungarian LVCSR with explicit modeling of phonological changes.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A new lexicon optimization method for LVCSR based on linguistic and acoustic characteristics of words.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Noise robust speech recognition using F0 contour extracted by hough transform.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Parallel Computing-Based Architecture for Mixed-Initiative Spoken Dialogue.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Analysis on individual differences in automatic transcription of spontaneous presentations.
Proceedings of the IEEE International Conference on Acoustics, 2002

Automatic speech summarization applied to English broadcast news speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

Recent progress in spontaneous speech recognition and understanding.
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2001
From Read Speech Recognition to Spontaneous Speech Understanding.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

Towards automatic transcription of spontaneous presentations.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Advances in automatic speech summarization.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Ubiquitous speech processing.
Proceedings of the IEEE International Conference on Acoustics, 2001

Neural-network-based HMM adaptation for noisy speech.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Maximum likelihood estimation of K-distribution parameters via the expectation-maximization algorithm.
IEEE Trans. Signal Process., 2000

Automatic recognition and understanding of spoken language - a first step toward natural human-machine communication.
Proc. IEEE, 2000

Special issue on spoken language processing.
Proc. IEEE, 2000

Japanese Broadcast News Transcription and Information Extraction.
Commun. ACM, 2000

Spontaneous Speech Corpus of Japanese.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

An online incremental speaker adaptation method using speaker-clustered initial models.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Robust speech recognition via modeling spectral coefficients with HMM's with complex Gaussian components.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Improvements in automatic speech summarization and evaluation methods.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Toward the realization of spontaneous speech recognition - introduction of a Japanese priority program and preliminary results -.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

On-line incremental speaker adaptation with automatic speaker change detection.
Proceedings of the IEEE International Conference on Acoustics, 2000

Automatic speech summarization based on word significance and linguistic likelihood.
Proceedings of the IEEE International Conference on Acoustics, 2000

Speech recognition technology in the ubiquitous/wearable computing environment.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Japanese large-vocabulary continuous-speech recognition using a newspaper corpus and broadcast news.
Speech Commun., 1999

A study of models and a priori threshold updating in speaker verification.
Syst. Comput. Jpn., 1999

Recent advances in Japanese broadcast news transcription.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Message-driven speech recognition and topic-word extraction.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Automatic Speech Recognition and its Application to Information Extraction.
Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, 1999

1998
N-Best-based unsupervised speaker adaptation for speech recognition.
Comput. Speech Lang., 1998

Designing a multimodal dialogue system for information retrieval.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Topic extraction with multiple topic-words in broadcast-news speech.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Tribute to James L. Flanagan.
Speech Commun., 1997

Recent advances in speaker recognition.
Pattern Recognit. Lett., 1997

Toward automatic transcription of Japanese broadcast news.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Connected digit recognition in spontaneous speech.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Smoothed N-best-based speaker adaptation for speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

An efficient search method for large-vocabulary continuous-speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Recent Advances in Speaker Recognition (Invited Paper).
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 1997

Introduction to Part IV.
Proceedings of the Computing Prosody, 1997

1996
Comments on "Towards increasing speech recognition error rates" by H. Bourlard, H. Hermansky, and N. Morgan.
Speech Commun., 1996

Speaker recognition using HMM composition in noisy environments.
Comput. Speech Lang., 1996

Improved extended HMM composition by incorporating power variance.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

N-best-based instantaneous speaker adaptation method for speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Adaptation method based on HMM composition and EM algorithm.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Language model acquisition from a text corpus for speech understanding.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Robust methods of updating model and a priori threshold in speaker verification.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Interactive voice technology development for telecommunications applications.
Speech Commun., 1995

Likelihood normalization for speaker verification using a phoneme- and speaker-independent model.
Speech Commun., 1995

A study of speaker adaptation based on minimum classification error training.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Flexible speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

A maximum likelihood procedure for a universal adaptation method based on HMM composition.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMM's.
IEEE Trans. Speech Audio Process., 1994

Editorial.
Speech Communication, 1994

Large-vocabulary continuous speech recognition algorithm applied to a multi-modal telephone directory assistance system.
Speech Communication, 1994

Dictation Machine Based on Japanese Character Source Modeling.
Int. J. Pattern Recognit. Artif. Intell., 1994

A Large-Vocabulary Continuous Speech Recognition Algorithm and its Application to a Multi-Modal Telephone Directory Assistance System.
Proceedings of the Human Language Technology, 1994

Phoneme-level voice individuality used in speaker recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speaker adaptation of tied-mixture-based phoneme models for text-prompted speaker recognition.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Concatenated phoneme models for text-variable speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Recent advances in speech recognition technology at NTT laboratories.
Speech Commun., 1992

Recent Topics in Speech Recognition Research at NTT Laboratories.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Speaker recognition using concatenated phoneme models.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Speaker-dependent-feature extraction, recognition and processing techniques.
Speech Commun., 1991

Recent advances in speech recognition.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

A text-independent speaker recognition method robust against utterance variations.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Text-independent speaker recognition using vocal tract and pitch information.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Line spectrum pair frequency - based distance measures for speech recognition.
Proceedings of the First International Conference on Spoken Language Processing, 1990

A continuous speech recognition system based on a two-level grammar approach.
Proceedings of the 1990 International Conference on Acoustics, 1990

On the use of hierarchical spectral dynamics in speech recognition.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Unsupervised speaker adaptation method based on hierarchical spectral clustering.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
A VQ-based preprocessor using cepstral dynamic features for speaker-independent large vocabulary word recognition.
IEEE Trans. Acoust. Speech Signal Process., 1988

Spectral movement function and its application to speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
A VQ-based preprocessor using cepstral dynamic features for large vocabulary word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1987

1986
Speaker-independent isolated word recognition using dynamic features of speech spectrum.
IEEE Trans. Acoust. Speech Signal Process., 1986

Research of individuality features in speech waves and automatic speaker recognition techniques.
Speech Commun., 1986

Speaker-independent isolated word recognition based on emphasized spectral dynamics.
Proceedings of the IEEE International Conference on Acoustics, 1986

1983
Isolated word recognition using phoneme-like templates.
Proceedings of the IEEE International Conference on Acoustics, 1983

1980
Experimental studies in a new automatic speaker verification system using telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 1980


  Loading...