Keikichi Hirose

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Crosslinguistic comparison on the perception of Mandarin attitudinal speech.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A measure of phonetic similarity to quantify pronunciation variation by using ASR technology.

[BibT_eX]

[DOI]

Tianze Shi

Shun Kasahara

Teeraphon Pongkittiphan

Proceedings of the 18th International Congress of Phonetic Sciences, 2015

2014

Introduction to the Issue on Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2014

Speaker-basis Accent Clustering Using Invariant Structure Analysis and the Speech Accent Archive.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Visualization of pronunciation diversity of world Englishes from a speaker's self-centered viewpoint.

[BibT_eX]

[DOI]

Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Application of matrix variate Gaussian mixture model to statistical voice conversion.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

Japanese lexical accent recognition for a CALL system by deriving classification equations with perceptual experiments.

[BibT_eX]

[DOI]

Speech Commun., 2013

Context labels based on "bunsetsu" for HMM-based speech synthesis of Japanese.

[BibT_eX]

[DOI]

Hiroya Hashimoto

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Automatic recognition of vowel length in Japanese for a CALL system motivated by perceptual experiments.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

Automatic detection of the words that will become unintelligible through Japanese accented pronunciation of English.

[BibT_eX]

[DOI]

Teeraphon Pongkittiphan

Takehiko Makino

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

OJAD: a free online accent and intonation dictionary for teachers and learners of Japanese.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

Failure transitions for joint n-gram models and G2p conversion.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Development of a web framework for teaching and learning Japanese prosody: OJAD (online Japanese accent dictionary).

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Audio classification using dominant spatial patterns in time-frequency space.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Generation of fundamental frequency contours for Thai speech synthesis using tone nucleus model.

[BibT_eX]

[DOI]

Oraphan Krityakien

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A free online accent and intonation dictionary for teachers and learners of Japanese.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Artificial bandwidth extension based on regularized piecewise linear mapping with discriminative region weighting and long-Span features.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improved estimation of femininity using GMM supervectors and SVR for voice therapy of Gender Identity Disorder Clients.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Speaker-invariant and rhythm-sensitive representation of spoken words.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model.

[BibT_eX]

[DOI]

Qinghua Sun

Speech Commun., 2012

Analysis of ECG signals Using Data-Adaptive Time Domain Filtering for Cardiovascular Disease Diagnosis.

[BibT_eX]

[DOI]

Md. Rabiul Islam

Somlal Das

Adv. Data Sci. Adapt. Anal., 2012

Automatic Chinese pronunciation error detection using SVM trained with structural features.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Performance improvement of automatic pronunciation assessment in a noisy classroom.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Dynamic Grammars with Lookahead Composition for WFST-based Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

An alignment matching method to explore pseudosyllable properties across different corpora.

[BibT_eX]

[DOI]

Raymond W. M. Ng

Thomas Hain

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improved Prediction of Japanese Word Accent Sandhi Using CRF.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improved Automatic Extraction of Generation Process Model Commands and Its use for Generating Fundamental Frequency Contours for Training HMM-based Speech Synthesis.

[BibT_eX]

[DOI]

Hiroya Hashimoto

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

MFCC enhancement using joint corrupted and noise feature space for highly non-stationary noise environments.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Spectrogram based features selection using multiple kernel learning for speech/music discrimination.

[BibT_eX]

[DOI]

Sharmin Nilufar

Nilanjan Ray

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Syllable: A self-contained unit to model pronunciation variation.

[BibT_eX]

[DOI]

Raymond W. M. Ng

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Unseen noise robust speech recognition using adaptive piecewise linear transformation.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

WFST-Based Grapheme-to-Phoneme Conversion: Open Source tools for Alignment, Model-Building and Decoding.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, 2012

2011

Harmonic modification and data adaptive filtering based approach to robust pitch estimation.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2011

Regularized Maximum Likelihood Linear Regression Adaptation for Computer-Assisted Language Learning Systems.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2011

Rule-based method for pitch level classification for a Japanese pitch accent CALL system.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

Comparison of native and non-native evaluations of the naturalness of Japanesewords with prosody modified through voice morphing.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Dominant harmonic modification with data adaptive filter based algorithm for robust pitch estimation.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Adaptive thresholding approach for robust voiced/unvoiced classification.

[BibT_eX]

[DOI]

Shamim Ahmad

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Weighted noise subtraction and adaptive soft-thresholding approach to speech enhancement.

[BibT_eX]

[DOI]

Somlal Das

Md. Ekramul Hamid

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Painless WFST Cascade Construction for LVCSR - Transducersaurus.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Measurement of Objective Intelligibility of Japanese Accented English Using ERJ (English Read by Japanese) Database.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Problems Encountered by Japanese EL2 with English Short Vowels as Illustrated on a 3D Vowel Chart.

[BibT_eX]

[DOI]

Toshiko Isei-Jaakkola

Takatoshi Naka

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Open Source WFST Tools for LVCSR Cascade Development.

[BibT_eX]

[DOI]

Proceedings of the Finite-State Methods and Natural Language Processing, 2011

Structure-constrained distribution matching using quadratic programming and its application to pronunciation evaluation.

[BibT_eX]

[DOI]

Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010

Hilbert Spectrum in Time-Frequency Representation of Audio Signals Considering Disjoint Orthogonality.

[BibT_eX]

[DOI]

Adv. Data Sci. Adapt. Anal., 2010

Improved generation of prosodic features in HMM-based Mandarin speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

A method for modeling and generating Mandarin tone contour with phrase intonation based on the generation process model.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Dialect-based speaker classification using speaker-invariant dialect features.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Pitch estimation of noisy speech signals using EMD-fourier based hybrid algorithm.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Data adaptive analysis of ECG signals for cardiovascular disease diagnosis.

[BibT_eX]

[DOI]

Md. Rabiul Islam

Shamim Ahmad

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Improving Mandarin segmental duration prediction with automatically extracted syntax features.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Improved generation of fundamental frequency in HMM-based speech synthesis using generation process model.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Integration of multilayer regression analysis with structure-based pronunciation assessment.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Affective story teller: a TTS system for emotional expressivity.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Modeling of sentence-medial pauses in bangla readout speech: occurrence and duration.

[BibT_eX]

[DOI]

Shyamal Kr. Das Mandal

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Regularized-MLLR speaker adaptation for computer-assisted language learning system.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Pitch Pattern Recognition of Isolated Words for the Development of a Japanese Language Call System.

[BibT_eX]

Proceedings of the Electronic Speech Signal Processing, 2010

Using FO Contour Generation Process Model for Improved and Flexible Control of Prosodie Features in HMM-based Speech Synthesis.

[BibT_eX]

Proceedings of the Electronic Speech Signal Processing, 2010

In Honor of Hiroya Fujisaki.

[BibT_eX]

Proceedings of the Electronic Speech Signal Processing, 2010

2009

Improved structure-based automatic estimation of pronunciation proficiency.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Analysis and comparison of automatic language proficiency assessment between shadowed sentences and read sentences.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Context Awareness using Environmental Sound Cues and Commonsense Knowledge.

[BibT_eX]

Helmut Prendinger

Proceedings of the SIGMAP 2009, 2009

Optimal event search using a structural cost function - improvement of structure to speech conversion.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

How to improve TTS systems for emotional expressivity.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

On invariant structural representation for speech recognition: theoretical validation and experimental improvement.

[BibT_eX]

[DOI]

Yu Qiao

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Structural analysis of dialects, sub-dialects and sub-sub-dialects of Chinese.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Analysis and utilization of MLLR speaker adaptation technique for learners' pronunciation evaluation.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Analysis of voice fundamental frequency contours of continuing and terminating prosodic phrases in four swiss German dialects.

[BibT_eX]

[DOI]

Adrian Leemann

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Speech generation from hand gestures based on space mapping.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Control of prosodic focus in corpus-based generation of fundamental frequency contours of Japanese based on the generation process model.

[BibT_eX]

[DOI]

Keiko Ochi

Proceedings of the IEEE International Conference on Acoustics, 2009

Easy Living in the Virtual World: A Noble Approach to Integrate Real World Activities to Virtual Worlds.

[BibT_eX]

[DOI]

Helmut Prendinger

Mitsuru Ishizuka

Proceedings of the 2009 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2009

Sub-structure-based estimation of pronunciation proficiency and classification of learners.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

An automatic approach to virtual living based on environmental sound cues.

[BibT_eX]

[DOI]

M. Al Masum Shaikh

A. Nakasone

P. Helmut

Proceedings of the Affective Computing and Intelligent Interaction, 2009

Emotional speech synthesis by sensing affective information from text.

[BibT_eX]

[DOI]

M. Al Masum Shaikh

Mitsuru Ishizuka

Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008

Filled pauses as cues to the complexity of upcoming phrases for native and non-native listeners.

[BibT_eX]

[DOI]

Speech Commun., 2008

Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Kolmogorov-Smirnov Test in Text-Dependent Automatic Speaker Identification.

[BibT_eX]

[DOI]

Sangeeta Biswas

Shamim Ahmad

Mohammed Nasser

Eng. Lett., 2008

Corpus-based synthesis of Mandarin speech with F0 contours generated by superposing tone components on rule-generated phrase components.

[BibT_eX]

[DOI]

Qinghua Sun

Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Automatic Assessment of Language Proficiency through Shadowing.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Decomposition of rotational distortion caused by VTL difference using eigenvalues of its transformation matrix.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Structure to speech conversion - speech generation based on infant-like vocal imitation.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Control of prosodic focus in corpus-based generation of fundamental frequency based on the generation process model.

[BibT_eX]

[DOI]

Keiko Ochi

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Assigning suitable phrasal tones and pitch accents by sensing affective information from text to synthesize human-like speech.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Automatic pronunciation evaluation of language learners' utterances generated through shadowing.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Directional dependency of cepstrum on vocal tract length.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Multi-stream parameterization for structural speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Single-Mixture Audio Source Separation by Subspace Decomposition of Hilbert Spectrum.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Energy constrained frequency-domain normalized LMS algorithm for blind channel identification.

[BibT_eX]

[DOI]

Mohammed Ariful Haque

Signal Image Video Process., 2007

Analysis of Tones in Cantonese Speech Based on the Command-Response Model.

[BibT_eX]

[DOI]

Phonetica, 2007

Two-step generation of Mandarin F0 contours based on tone nucleus and superpositional models.

[BibT_eX]

[DOI]

Qinghua Sun

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

CRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems.

[BibT_eX]

[DOI]

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Structural representation of pronunciation and its application for classifying Japanese learners of English.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Speech and Language Technology in Education, 2007

Features of pauses and conjunctions at syntactic and discourse boundaries in Japanese monologues.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Pitch estimation of noisy speech signals using empirical mode decomposition.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Structural assessment of language learners' pronunciation.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Corpus-based generation of prosodic features from text based on generation process model.

[BibT_eX]

[DOI]

Keiko Ochi

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

F0 models show Chinese speakers of Japanese insert intonational boundaries and drop pitch.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

EMD based soft-thresholding for speech enhancement.

[BibT_eX]

[DOI]

Erhan Deger

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Development of a Femininity Estimator using Speaker Recognition Techniques for Voice Therapy of Gender Identity Disorder Clients.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Latent Prosody Model of Continuous Mandarin Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Speech enhancement using soft thresholding with DCT-EMD based hybrid algorithm.

[BibT_eX]

[DOI]

Erhan Deger

Proceedings of the 15th European Signal Processing Conference, 2007

2006

Modeling the effects of emphasis and question on fundamental frequency contours of Cantonese utterances.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2006

Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin.

[BibT_eX]

[DOI]

Speech Commun., 2006

Separation of Mixed Audio Signals by Decomposing Hilbert Spectrum with Modified EMD.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2006

Structural Representation of the pronunciation and its Use for Call.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech.

[BibT_eX]

[DOI]

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Localization based audio source separation by sub-band beamforming.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Factors affecting speakers² choice of fillers in Japanese presentations.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Development of a program for self assessment of Japanese pronunciation by English learners.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses.

[BibT_eX]

[DOI]

Yasufumi Asano

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Unfilled pauses in Japanese sentences read aloud by non-native learners.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Localization Based Separation of Mixed Audio Signals with Binary Masking of Hilbert Spectrum.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Para-Linguistic Information Represented as Distortion of the Acoustic Universal Structure In Speech.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Separation of Mixed Audio Signals by Source Localization and Binary Masking with Hilbert Spectrum.

[BibT_eX]

[DOI]

Proceedings of the Independent Component Analysis and Blind Signal Separation, 2006

Factors influencing ratios of filled pauses at clause boundaries in Japanese.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2006

2005

Tone nucleus-based multi-level robust acoustic tonal modeling of sentential F0 variations for Chinese continuous speech tone recognition.

[BibT_eX]

[DOI]

Speech Commun., 2005

Synthesis of F0 contours using generation process model parameters predicted from unlabeled corpora: application to emotional speech synthesis.

[BibT_eX]

[DOI]

Speech Commun., 2005

Editorial.

[BibT_eX]

[DOI]

Daniel Hirst

Yoshinori Sagisaka

Speech Commun., 2005

Audio source separation by source localization with Hilbert spectrum.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Filled pauses as cues to the complexity of following phrases.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Generation of fundamental frequency contours for Mandarin speech synthesis based on tone nucleus model.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Estimation of intonation variation with constrained tone transformations.

[BibT_eX]

[DOI]

Hisashi Kawai

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Japanese vowel recognition based on structural representation of speech.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Multi-band approach of audio source discrimination with empirical mode decomposition.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Corpus-based extraction of F0 contour generation process model parameters.

[BibT_eX]

[DOI]

Yusuke Furuyama

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Analysis of the effects of word emphasis and echo question on F0 contours of Cantonese utterances.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Structural representation of the non-native pronunciations.

[BibT_eX]

[DOI]

Toshiko Isei-Jaakkola

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Identification and Synthesis of Cantonese Tones Based on the Command-Response Model for F0 Contour Generation.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

The effects of filled pauses on native and non-native listeners2 speech processing.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop (ITRW) on Disfluency in Spontaneous Speech, 2005

Improved concept-to-speech generation in a dialogue system on road guidance.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Cyberworlds (CW 2005), 2005

2004

Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents.

[BibT_eX]

Proceedings of the Life-like characters - tools, affective functions, and applications., 2004

Tone nucleus modeling for Chinese lexical tone recognition.

[BibT_eX]

[DOI]

Speech Commun., 2004

A spoken dialogue system for document information retrieval utilizing topic knowledge.

[BibT_eX]

[DOI]

Shinya Kiriyama

Syst. Comput. Jpn., 2004

Prosodic Analysis and Modeling of Nagauta Singing to Generate Prosodic Contours from Standard Scores.

[BibT_eX]

[DOI]

Bungo Matsuoka

IEICE Trans. Inf. Syst., 2004

Foreword.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2004

Automatic Extraction of Tone Command Parameters for the Model of F0 Contour Generation for Standard Chinese.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2004

On the effectiveness of MFCCs and their statistical distribution properties in speaker identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Virtual Environments, 2004

Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model.

[BibT_eX]

[DOI]

Kentaro Sato

Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Analysis of fundamental frequency contours of Cantonese based on a command-response model.

[BibT_eX]

[DOI]

Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Analysis and synthesis of Cantonese F0 contours based on the command-response model.

[BibT_eX]

[DOI]

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Analysis of Shanghainese F0 contours based on the command-response model.

[BibT_eX]

[DOI]

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Efficient tone classification of speaker independent continuous Chinese speech using anchoring based discriminating features.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Clause types and filed pauses in Japanese spontaneous monologues.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Formulating contextual tonal variations in Mandarin.

[BibT_eX]

[DOI]

Hisashi Kawai

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Audio source separation from the mixture using empirical mode decomposition with independent subspace analysis.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Use of prosodic features for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Improvement in corpus-based generation of F0 contours using generation process model for emotional speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Analysis of F0 contours of Cantonese utterances based on the command-response model.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

N-gram language modeling of Japanese using bunsetsu boundaries.

[BibT_eX]

[DOI]

Sungyup Chung

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A study on robust segmentation and location of tone nuclei in Chinese continuous speech.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Data-driven generation of F0 contours using a superpositional model.

[BibT_eX]

[DOI]

Speech Commun., 2003

Mora F0 representation for accent type identification in continuous speech and considerations on its relation with perceived pitch values.

[BibT_eX]

[DOI]

Carlos Toshinori Ishi

Speech Commun., 2003

Estimation of resonant characteristics based on AR-HMM modeling and spectral envelope conversion of vowel sounds.

[BibT_eX]

[DOI]

Nobuyuki Nishizawa

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Considerations on vowel durations for Japanese CALL system.

[BibT_eX]

[DOI]

Taro Mouri

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic estimation of perceptual age using speaker modeling techniques.

[BibT_eX]

[DOI]

Keita Yamauchi

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Improvement of non-native speech recognition by effectively modeling frequently observed pronunciation habits.

[BibT_eX]

[DOI]

Koichi Osaki

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Prosodic analysis and modeling of the NAGAUTA singing to synthesize its prosodic patterns from the standard notation.

[BibT_eX]

[DOI]

Bungo Matsuoka

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

CART-based factor analysis of intelligibility reduction in Japanese English.

[BibT_eX]

[DOI]

Changchen Guo

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech generation from concept for realizing conversation with an agent in a virtual room.

[BibT_eX]

[DOI]

Junji Tago

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Corpus-based synthesis of fundamental frequency contours of Japanese using automatically-generated prosodic corpus and generation process model.

[BibT_eX]

[DOI]

Takayuki Ono

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A pronunciation training system for Japanese lexical accents with corrective feedback in learner's voice.

[BibT_eX]

[DOI]

Frédéric Gendrin

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Use of linguistic information for automatic extraction of f_0 contour generation process model parameters.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Acoustic model selection and voice quality assessment for HMM-based Mandarin speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A multilevel framework to model the inherently confounding nature of sentential F0sentential F0 contours contours for recognizing Chinese lexical tones.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Temporal rate change of dialogue speech in prosodic units as compared to read speech.

[BibT_eX]

[DOI]

Speech Commun., 2002

Development and evaluation of a spoken dialogue system for academic document retrieval with a focus on reply generation.

[BibT_eX]

[DOI]

Shinya Kiriyama

Syst. Comput. Jpn., 2002

A New Korean Corpus-Based Text-to-Speech System.

[BibT_eX]

[DOI]

Sanghun Kim

Youngjik Lee

Int. J. Speech Technol., 2002

Separation of voiced source characteristics and vocal tract transfer function characteristics for speech sounds by iterative analysis based on AR-HMM model.

[BibT_eX]

[DOI]

Nobuyuki Nishizawa

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Automatic extraction of model parameters from fundamental frequency contours of English utterances.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Acoustic modeling of sentence stress using differential features between syllables for English rhythm learning system development.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English.

[BibT_eX]

[DOI]

Gakuto Kurata

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition.

[BibT_eX]

[DOI]

Gakuto Kurata

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Robust speech recognition using inter-speaker and intra-speaker adaptation.

[BibT_eX]

[DOI]

Baojie Li

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Statistical language modeling with prosodic boundaries and its use for continuous speech recognition.

[BibT_eX]

[DOI]

Makoto Terao

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Improved corpus-based synthesis of fundamental frequency contours using generation process model.

[BibT_eX]

[DOI]

Masaya Eto

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A method for automatic extraction of model parameters from fundamental frequency contours of speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers.

[BibT_eX]

[DOI]

Mariko Sekiguchi

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Use of topic knowledge in spoken dialogue information retrieval system for academic documents.

[BibT_eX]

[DOI]

Shinya Kiriyama

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Pruning of redundant synthesis instances based on weighted vector quantization.

[BibT_eX]

[DOI]

Sanghun Kim

Youngjik Lee

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Identification of accent and intonation in sentences for CALL systems.

[BibT_eX]

[DOI]

Carlos Toshinori Ishi

Ryuji Nishide

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Corpus-based synthesis of fundamental frequency contours based on a generation process model.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Generation of F0 contours using a model-constrained data-driven method.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

A minimax search algorithm for robust continuous speech recognition.

[BibT_eX]

[DOI]

Qiang Hue

IEEE Trans. Speech Audio Process., 2000

Teaching the pronunciation of Japanese double-mora phonemes using speech recognition technology.

[BibT_eX]

[DOI]

Speech Commun., 2000

Experimental Evaluation of A Functional Modeling of Fundamental Frequency Contours of Standard Chinese Sentences.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Discriminating Chinese lexical tones by anchoring F0 features.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Rapid adaptation of n-gram language models using inter-word correlation for speech recognition.

[BibT_eX]

[DOI]

Koki Sasaki

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Data-driven intonation modeling using a neural network and a command response model.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Modeling and generation of accentual phrase F0 contours based on discrete HMMs synchronized at mora-unit transitions.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Development of a formant-based analysis-synthesis system and generation of high quality liquid sounds of Japanese.

[BibT_eX]

[DOI]

Nobuyuki Nishizawa

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Synthesis of fundamental FDrequency contours of standard Chinese sentences from tone sandhi and focus conditions.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Robust recognition using multiple utterances.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Modeling phone correlation for speaker adaptive speech recognition.

[BibT_eX]

[DOI]

Baojie Li

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Efficient search strategy in large vocabulary continuous speech recognition using prosodic boundary information.

[BibT_eX]

[DOI]

Shi-wook Lee

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Identification of Japanese double-mora phonemes considering speaking rate for the use in CALL systems.

[BibT_eX]

[DOI]

Carlos Toshinori Ishi

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Analytical and perceptual study on the role of acoustic features in realizing emotional speech.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Overview of an intelligent system for information retrieval based on human-machine dialogue through spoken language.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Anchoring hypothesis and its application to tone recognition of Chinese continuous speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

Synthesis of vibrato singing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Robust speech recognition based on a Bayesian prediction approach.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1999

Improving Viterbi Bayesian predictive classification via sequential bayesian learning in robust speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 1999

Detecting accent sandhi in Japanese using a superpositional F0 model.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Efficient weight training for selection based synthesis.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Tone recognition of Chinese continuous speech using tone critical segments.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Prosodic word boundary detection using statistical modeling of moraic fundamental frequency contours and its use for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

A robust tone recognition method of Chinese based on sub-syllabic F0 contours.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A linguistic and prosodic database for data-driven Japanese TTS synthesis.

[BibT_eX]

[DOI]

Takashi Natsume

A synthesis-oriented model of phrasal pitch movements in standard Chinese.

[BibT_eX]

[DOI]

Separation of singing and piano sounds.

[BibT_eX]

[DOI]

A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation training.

[BibT_eX]

[DOI]

A minimax search algorithm for CDHMM based robust continuous speech recognition.

[BibT_eX]

[DOI]

Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of Japanese.

[BibT_eX]

[DOI]

On the relationship of speech rates with prosodic units in dialogue speech.

[BibT_eX]

[DOI]

Accent type recognition and syntactic boundary detection of Japanese using statistical modeling of moraic transitions of fundamental frequency contours.

[BibT_eX]

[DOI]

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

Quantitative analysis and formulation of tone concatenation in Chinese F0 contours.

[BibT_eX]

[DOI]

Ren-Hua Wang

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Use of recurrent network for unknown language rejection in language identification system.

[BibT_eX]

[DOI]

HingKeung Kwan

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

A CALL system using speech recognition to train the pronunciation of Japanese long vowels, the mora nasal and mora obstruents.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

A method of representing fundamental frequency contours of Japanese using statistical models of moraic transition.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Robust speech recognition based on Viterbi Bayesian predictive classification.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Adaptive recognition method based on posterior use of distribution pattern of output probabilities.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Prosodic manipulation system of speech material for perceptual experiments.

[BibT_eX]

[DOI]

Seiichi Nakagawa

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Language training system utilizing speech modification.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Unknown language rejection in language identification system.

[BibT_eX]

[DOI]

HingKeung Kwan

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features.

[BibT_eX]

[DOI]

Mayumi Sakata

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Detection of syntactic boundaries by partial analysis-by-synthesis of fundamental frequency contours.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

A Scheme for Word Detection in Continuous Speech Using Likelihood Scores of Segments Modified by Their Context Within a Word.

[BibT_eX]

[DOI]

Sumio Ohno

IEICE Trans. Inf. Syst., 1995

Duration Modeling with Decreased Intra-Group Temporal Variation for HMM-Based Phoneme Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 1995

Tone Recognition of Chinese Dissyllables Using Hidden Markov Models.

[BibT_eX]

[DOI]

Xinhui Hu

IEICE Trans. Inf. Syst., 1995

Analysis and synthesis of prosodic features in spoken dialogue of Japanese.

[BibT_eX]

[DOI]

Mayumi Sakata

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Recognized phoneme-based N-gram modeling in automatic language identification.

[BibT_eX]

[DOI]

HingKeung Kwan

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

HMM-based tone recognition of Chinese trisyllables using double codebooks on fundamental frequency and waveform power.

[BibT_eX]

[DOI]

Xinhui Hu

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994

Analysis and synthesis of fundamental frequency contours for the spoken dialogue in Japanese.

[BibT_eX]

[DOI]

Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

A scheme for Chinese speech synthesis by rule based on pitch-synchronous multi-pulse excitation LP method.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

A method for word spotting in continuous speech using both segmental and contextual likelihood scores.

[BibT_eX]

[DOI]

Sumio Ohno

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Role of prosodic features in the human process of speech perception.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speech recognition using HMM with decreased intra-group variation in the temporal structure.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Recognition of Chinese tones in monosyllabic and disyllabic speech using HMM.

[BibT_eX]

[DOI]

Xinhui Hu

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Use of prosodic features in the recognition of continuous speech.

[BibT_eX]

[DOI]

Hiroyuki Konno

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Prosodic characteristics of a spoken dialogue for information query.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993

Speech signal processing using optical method.

[BibT_eX]

[DOI]

Speech Commun., 1993

Generation of speech reply in the speech response system.

[BibT_eX]

[DOI]

Yasuharu Asano

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Proposal and implementation of a spoken word recognizer using utterance normalization and multiple templates on a single VLSI chip.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Utterance normalization using vowel features in a spoken word recognition system for multiple speakers.

[BibT_eX]

[DOI]

Sumio Ohno

Proceedings of the IEEE International Conference on Acoustics, 1993

1992

The influence of semantic and syntactic information on spoken sentence recognition.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Prosody and syntax in spoken sentences of standard Chinese.

[BibT_eX]

[DOI]

Haitao Lei

Proceedings of the Second International Conference on Spoken Language Processing, 1992

A method of dialogue management for the speech response system.

[BibT_eX]

[DOI]

Yasuharu Asano

Proceedings of the Second International Conference on Spoken Language Processing, 1992

A scheme for pitch extraction of speech using autocorrelation function with frame length proportional to the time lag.

[BibT_eX]

[DOI]

Shigenobu Seto

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1990

Chinese four tone recognition based on the model for process of generating F0 contours of sentences.

[BibT_eX]

[DOI]

Changfu Wang

Proceedings of the First International Conference on Spoken Language Processing, 1990

Manifestation of linguistic and para-linguistic information in the voice fundamental frequency contours of spoken Japanese.

[BibT_eX]

[DOI]

Noboru Takahashi

Proceedings of the First International Conference on Spoken Language Processing, 1990

Proposal and evaluation of a new scheme for reliable pitch extraction of speech.

[BibT_eX]

[DOI]

Shigenobu Seto

Proceedings of the First International Conference on Spoken Language Processing, 1990

Influence of context and knowledge on the perception of continuous speech.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Analysis and modeling of tonal features in polysyllabic words and sentences of the standard Chinese.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Proposal and evaluation of a new type of terminal analog speech synthesizer.

[BibT_eX]

[DOI]

Yasuharu Asano

Proceedings of the First International Conference on Spoken Language Processing, 1990

Spoken word recognition for multiple speakers based on path-limited DP matching and a method for speaker normalization.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

A system for synthesizing Japanese speech from orthographic text.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1987

A new system for reliable pitch extraction of speech.

[BibT_eX]

[DOI]

Keisuke Shimizu

Proceedings of the IEEE International Conference on Acoustics, 1987

1986

Generation of prosodic symbols for rule-synthesis of connected speech of Japanese.

[BibT_eX]

[DOI]

Hisashi Kawai

Proceedings of the IEEE International Conference on Acoustics, 1986

Use of optical signal processing techniques to spectrum analysis of speech.

[BibT_eX]

[DOI]

Yasuhiro Kosugi

Proceedings of the IEEE International Conference on Acoustics, 1986

A new approach to continuous speech recognition based on considerations on human processes of speech perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1986

Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and television announcers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1986

1984

Synthesis by rule of voice fundamental frequency contours of spoken Japanese from linguistic information.

[BibT_eX]

[DOI]

Mikio Yamaguchi

Proceedings of the IEEE International Conference on Acoustics, 1984

Automatic recognition of spoken words from a large vocabulary using syllable templates.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1984

1982

Analysis and synthesis of voice fundamental frequency contours of spoken sentences.

[BibT_eX]

[DOI]