Yoshinori Sagisaka

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Comparison of Grapheme-to-Phoneme Conversion Methods on a Myanmar Pronunciation Dictionary.

[BibT_eX]

[DOI]

Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Visualization of Mandarin Chinese Tone Production of Japanese L2 Learners for evaluation.

[BibT_eX]

[DOI]

Proceedings of the Language Teaching, Learning and Technology 2016, 2016

Analysis of Chinese Syllable Durations in Running Speech of Japanese L2 Learners.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Analysis on L2 learners' perception errors between geminate and singleton of Japanese consonants using loudness related parameters.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

A study of the production of unstressed vowels by Japanese speakers of English using the J-AESOP corpus.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Cross-modal description of sentiment information embedded in speech.

[BibT_eX]

[DOI]

Kanako Watanabe

Proceedings of the 18th International Congress of Phonetic Sciences, 2015

2014

Integrating Dictionaries into an Unsupervised Model for Myanmar Word Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing, 2014

Communicative F0 generation based on impressions.

[BibT_eX]

[DOI]

Lu Shao

Proceedings of the 5th IEEE Conference on Cognitive Infocommunications, 2014

Sentiment analysis of color attributes derived from vowel sound impression for multimodal expression.

[BibT_eX]

[DOI]

Kanako Watanabe

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

Global F0 control parameter prediction based on impressions for communicative prosody generation.

[BibT_eX]

[DOI]

Lu Shao

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

A Purely Monotonic Approach to Machine Translation for Similar Languages.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference on Asian Language Processing, 2013

Density Maximization in Context-Sense Metric Space for All-words WSD.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012

Trans-disciplinary spoken language processing studies for scientific understanding of second language learner's characteristics.

[BibT_eX]

[DOI]

Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

2011

Perceptual Training of Vowel Length Contrast of Japanese by L2 Listeners: Effects of an Isolated Word versus a Word Embedded in Sentences.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Perceptual Studies of Japanese Geminate Insertion Phenomena Based on Timing Control Characteristics.

[BibT_eX]

[DOI]

Proceedings of the 17th International Congress of Phonetic Sciences, 2011

A Requirement of Texts for Evaluation of Rhythm in English Speech by Learners.

[BibT_eX]

[DOI]

Shizuka Nakamura

Proceedings of the 17th International Congress of Phonetic Sciences, 2011

2010

The effect of a word embedded in a sentence and speaking rate variation on the perceptual training of geminate and singleton consonant distinction.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Computational Modeling of Timing Control and its Application to Objective Evaluation of the Second Language Proficiency.

[BibT_eX]

Proceedings of the Electronic Speech Signal Processing, 2010

2009

Analysis on paralinguistic prosody control in perceptual impression space using multiple dimensional scaling.

[BibT_eX]

[DOI]

Speech Commun., 2009

Perceptual training of singleton and geminate stops in Japanese language by Korean learners.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Effects of mora-timing in English rhythm control by Japanese learners.

[BibT_eX]

[DOI]

Shizuka Nakamura

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Model-based automatic evaluation of L2 learner's English timing.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Objective evaluation of English learners' timing control based on a measure reflecting perceptual characteristics.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Corpus-based speech synthesis from reading speech to communicative speech.

[BibT_eX]

[DOI]

Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Three-sectional-staff characterization of Cantonese level tones.

[BibT_eX]

[DOI]

Rerrario Shui-Ching Ho

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Objective evaluation of second language learner2s translation proficiency using statistical translation measures.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2008

Model-based duration analysis on English natives and Thai learners.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2008

2007

Syllable-based Thai duration model using multi-level linear regression and syllable accommodation.

[BibT_eX]

[DOI]

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Inter-language prosodic style modification experiment using word impression vector for communicative speech generation.

[BibT_eX]

[DOI]

Ke Li

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

F0 analysis of perceptual distance among Cantonese level tones.

[BibT_eX]

[DOI]

Rerrario Shui-Ching Ho

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Speech recognition of foreign out-of-vocabulary words using a hierarchical language model.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

Generation and perception of F0 markedness for communicative speech synthesis.

[BibT_eX]

[DOI]

Takumi Yamashita

Yoko Kokenawa

Speech Commun., 2005

Effect of speaking rate on the acceptability of change in segment duration.

[BibT_eX]

[DOI]

Speech Commun., 2005

Effect of intra-phrase position on acceptability of change in segment duration in sentence speech.

[BibT_eX]

[DOI]

Speech Commun., 2005

Editorial.

[BibT_eX]

[DOI]

Keikichi Hirose

Daniel Hirst

Speech Commun., 2005

Application of auditory image model for speech event detection.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Analysis on command sequences of a F0 generation model for Mandarin speech and its application to their automatic extraction.

[BibT_eX]

[DOI]

Ke Li

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Communicative speech synthesis using constituent word attributes.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Improved speech recognition word lattice translation by confidence measure.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Speech recognition of a named entity.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

F0 control characterization by perceptual impressions on speaking attitudes using Multiple Dimensional Scaling analysis.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Mis-recognized utterance detection using hierarchical language model.

[BibT_eX]

[DOI]

Gen-ichiro Kikui

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Analysis of the phone level contributions to objective evaluation of English speech by non-natives.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Multi-class composite N-gram language model.

[BibT_eX]

[DOI]

Shuntaro Isogai

Speech Commun., 2003

Multiclass composite N-gram language model based on connection direction.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2003

Generation and perception of f_0 markedness in conversational speech with adverbs expressing degrees.

[BibT_eX]

[DOI]

Takumi Yamashita

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Word class modeling for speech recognition with out-of-task words using a hierarchical language model.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Corpus-based modeling of naturalness estimation in timing control for non-native speech.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Analysis and modeling of syllable duration for Thai speech synthesis.

[BibT_eX]

[DOI]

Virongrong Tesprasit

Rungkarn Siricharoenchai

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Speaker clustering for speech recognition using vocal tract parameters.

[BibT_eX]

[DOI]

Masaki Naito

Li Deng

Speech Commun., 2002

A stochastic speech understanding method to generate interlingual representations.

[BibT_eX]

[DOI]

Koichi Tanigaki

Syst. Comput. Jpn., 2002

Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Unit selection synthesis.

[BibT_eX]

[DOI]

Proceedings of the 4th ITRW on Speech Synthesis, 2001

A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Structured language model for class identification of out-of-vocabulary words arising from multiple wordclasses.

[BibT_eX]

[DOI]

Shigehiko Onishi

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Pronunciation variant analysis using speaking style parallel corpus.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

New language models using phrase structures extracted from parse trees.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Multi-class composite n-gram language model using multiple word clusters and word successions.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters.

[BibT_eX]

[DOI]

Shuntaro Isogai

Proceedings of the Association for Computational Linguistic, 2001

2000

Statistical language modeling with a class-basedn-multigram model.

[BibT_eX]

[DOI]

Sabine Deligne

Comput. Speech Lang., 2000

An embedded knowledge integration for hybrid language modelling.

[BibT_eX]

[DOI]

Shuwu Zhang

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A tagger-aided language model with a stack decoder.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A language model for conversational speech recognition using information designed for speech translation.

[BibT_eX]

[DOI]

Kouichi Tanigaki

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Fine keyword clustering using a thesaurus and example sentences for speech translation.

[BibT_eX]

[DOI]

Yumi Wakita

Kenji Matsui

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A hierarchical language model incorporating class-dependent word models for OOV words recognition.

[BibT_eX]

[DOI]

Koichi Tanigaki

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Evaluation of the ATR-matrix speech translation system with a pair comparison method between the system and humans.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Rules, but what for? - rule description as efficient and robust abstraction of corpora and optimal fitting to applications -.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Pronunciation variants description using recognition error modeling with phonetic derivation hypotheses.

[BibT_eX]

[DOI]

Hideharu Nakajima

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Analysis of acoustic models trained on a large-scale Japanese speech database.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Cellular-phone based speech-to-speech translation system ATR-MATRIX.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Integrating detailed information into a language model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Automatic generation of multiple pronunciations based on neural networks.

[BibT_eX]

[DOI]

Takayoshi Yoshimura

Speech Commun., 1999

Phoneme boundary estimation using bidirectional recurrent neural networks and its applications.

[BibT_eX]

[DOI]

Mike Schuster

Syst. Comput. Jpn., 1999

Multiple pronunciation dictionary using HMM-state confusion characteristics.

[BibT_eX]

[DOI]

Yumi Wakita

Harald Singer

Comput. Speech Lang., 1999

Improving n-gram modeling using distance-related unit association maximum entropy language modeling.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Part-of-speech n-gram and word n-gram fused language model.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Model-based speaker normalization methods for speech recognition.

[BibT_eX]

[DOI]

Masaki Naito

Li Deng

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Japanese spontaneous speech database with wide regional and age distribution.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Multi-class composite N-gram based on connection direction.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Reliable utterance segment recognition by integrating a grammar with statistical language constraints.

[BibT_eX]

[DOI]

Speech Commun., 1998

Model parameter estimation for mixture density polynomial segment models.

[BibT_eX]

[DOI]

Kuldip K. Paliwal

Comput. Speech Lang., 1998

Grammatical word graph re-generation for spontaneous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A Japanese-to-English speech translation system: ATR-MATRIX.

[BibT_eX]

[DOI]

Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speech.

[BibT_eX]

[DOI]

Neural network based pronunciation modeling with applications to speech recognition.

[BibT_eX]

[DOI]

Takayoshi Yoshimura

Speaker clustering for speech recognition using the parameters characterizing vocal-tract dimensions.

[BibT_eX]

[DOI]

Masaki Naito

Li Deng

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Speaker normalized acoustic modeling based on 3-D Viterbi decoding.

[BibT_eX]

[DOI]

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Learning a Syntagmatic and Paradigmatic Structure from Language Data with a Bi-Multigram Model.

[BibT_eX]

[DOI]

Sabine Deligne

Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997

Automatic extraction of fundamental frequency control rules by statistical analysis.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1997

ATR Speech Translation Research Project in Japan.

[BibT_eX]

Künstliche Intell., 1997

Speech recognition using HMM-state confusion characteristics.

[BibT_eX]

[DOI]

Yumi Wakita

Harald Singer

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Integration of grammar and statistical language constraints for partial word-sequence recognition.

[BibT_eX]

[DOI]

Hajime Tsukada

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Cyclic autocorrelation-based linear prediction analysis of speech.

[BibT_eX]

[DOI]

Kuldip K. Paliwal

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Automatic generation of a pronunciation dictionary based on a pronunciation network.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Segment boundary estimation using recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Fast word-graph generation for spontaneous conversational speech translation.

[BibT_eX]

[DOI]

Tohru Shimizu

Harald Singer

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Task adaptation using MAP estimation in N-gram language modeling.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Accent Phrase Segmentation by F0 Clustering Using Superpositional Modelling.

[BibT_eX]

[DOI]

Proceedings of the Computing Prosody, 1997

Measuring temporal compensation effect in speech perception.

[BibT_eX]

[DOI]

Proceedings of the Computing Prosody, 1997

Comparison of F0 Control Rules Derived from Multiple Speech Databases.

[BibT_eX]

[DOI]

Toshio Hirai

Proceedings of the Computing Prosody, 1997

Prediction of Major Phrase Boundary Location and Pause Insertion Using a Stochastic Context-free Grammar.

[BibT_eX]

[DOI]

Shigeru Fujio

Proceedings of the Computing Prosody, 1997

1996

Japanese speech databases for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Speech recognition based on acoustically derived segment units.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Spontaneous dialogue speech recognition using cross-word context constrained word graphs.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Variable-order N-gram generation by word-class splitting and consecutive word grouping.

[BibT_eX]

[DOI]

Hirokazu Masataki

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Design of a speech recognition system based on acoustically derived segmental units.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Editorial.

[BibT_eX]

[DOI]

Eric Moulines

Speech Commun., 1995

Acoustic characteristics of speaker individuality: Control and conversion.

[BibT_eX]

[DOI]

Hisao Kuwabara

Speech Commun., 1995

Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks.

[BibT_eX]

[DOI]

Speech Commun., 1995

Speech segment network approach for optimization of synthesis unit set.

[BibT_eX]

[DOI]

Comput. Speech Lang., 1995

Effect of rasta-type processing for speech recognition with speaking-rate mismatches.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Minimum classification error training algorithm for feature extractor and pattern classifier in speech recognition.

[BibT_eX]

[DOI]

Kuldip K. Paliwal

Michiel Bacchiani

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Automatic detection of major phrase boundaries using statistical properties of superpositional F0 control model parameters.

[BibT_eX]

[DOI]

Toshio Hirai

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Automatic prosodic segmentation by F0 clustering using superpositional modeling.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

Stochastic modeling of pause insertion using context-free grammar.

[BibT_eX]

[DOI]

Shigeru Fujio

Proceedings of the 1995 International Conference on Acoustics, 1995

1994

Automatic extraction of FO control parameters using statistical analysis.

[BibT_eX]

[DOI]

Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Effect of speaking style on parameters of fundamental frequency contour.

[BibT_eX]

[DOI]

Toshio Hirai

Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

A speech and language database for speech translation research.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Acceptability of temporal modification in consonant and vowel onsets.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Voice adaptation using multi-functional transformation with weighting by radial basis function networks.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Prediction of prosodic phrase boundaries using stochastic context-free grammar.

[BibT_eX]

[DOI]

Shigeru Fujio

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speech spectrum transformation by speaker interpolation.

[BibT_eX]

[DOI]

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993

Duration modelling with multiple split regression.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Tree-based unit selection for English speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1993

1992

ATR μ-talk speech synthesis system.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Acceptability and discrimination threshold for distortion of segmental duration in Japanese words.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Pause characteristics and local phrase-dependency structure in Japanese.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Speech segment network approach for an optimal synthesis unit set.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Optimization of intonation control using statistical F0 resetting characteristics.

[BibT_eX]

[DOI]

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Concatenative speech synthesis by minimum distortion criteria.

[BibT_eX]

[DOI]

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

Statistical modeling of segmental duration and power control for Japanese.

[BibT_eX]

[DOI]

Katsuhiko Mimura

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1990

ATR Japanese speech database as a tool of speech recognition and synthesis.

[BibT_eX]

[DOI]

Speech Commun., 1990

Speech synthesis from text.

[BibT_eX]

[DOI]

IEEE Commun. Mag., 1990

On unit selection algorithms and their evaluation in non-uniform unit speech synthesis.

[BibT_eX]

[DOI]

Katsuo Abe

Proceedings of the ESCA Workshop on Speech Synthesis, 1990

The control of segmental duration in speech synthesis using linguistic properties.

[BibT_eX]

[DOI]

Proceedings of the ESCA Workshop on Speech Synthesis, 1990

On the unit search criteria and algorithms for speech synthesis using non-uniform units.

[BibT_eX]

[DOI]

Katsuo Abe

Proceedings of the First International Conference on Spoken Language Processing, 1990

A large-scale Japanese speech database.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Statistical analysis for segmental duration rules in Japanese speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

On the prediction of global F0 shape for Japanese text-to-speech.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

Adaptive manipulation of non-uniform synthesis units using multi-level unit transcription.

[BibT_eX]

[DOI]

Proceedings of the First European Conference on Speech Communication and Technology, 1989

Construction of a large-scale Japanese speech database and its management system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1989

1988

Speech synthesis by rule using an optimal selection of non-uniform synthesis units.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1988

1987

Acoustic-phonetic labels in a Japanese speech database.

[BibT_eX]

[DOI]

Shigeru Katagiri

Proceedings of the European Conference on Speech Technology, 1987

1986

Composite phoneme units for the speech synthesis of Japanese.

[BibT_eX]

[DOI]

Hirokazu Sato

Speech Commun., 1986

Word identification method for Japanese text-to-speech conversion system.

[BibT_eX]

[DOI]