Katsuhiko Shirai

Affiliations:
  • Waseda University, Tokyo, Japan


According to our database1, Katsuhiko Shirai authored at least 113 papers between 1980 and 2012.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2012
Method for Collection of Acted Speech Using Various Situation Scripts.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Temporal AM-FM combination for robust speech recognition.
Speech Commun., 2011

Collection and Analysis of Emotional Speech Focused on the Psychological and Acoustical Diversity.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

2010
Construction of Decision Model for a System to Start Communicating with a Human Using Hidden Markov Model.
Proceedings of the 9th IEEE/ACIS International Conference on Computer and Information Science, 2010

2009
Decision Model for a System to Start Communicating with a Human Using HMM.
Proceedings of the NBiS 2009, 2009

Decision Model for a Robot to Start Communicating with a Human.
Proceedings of the 2009 International Conference on Complex, 2009

2008
Detection of speech and music based on spectral tracking.
Speech Commun., 2008

Recognizing Reverberant Speech Based on Amplitude and Frequency Modulation.
IEICE Trans. Inf. Syst., 2008

A comparative study on AM and FM features.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Noisy speech recognition using temporal AM-FM combination.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
A study on temporal features derived by analytic signal.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Robust speech recognition based on dynamical selections.
Proceedings of the Second IASTED International Conference on Computational Intelligence, 2006

Feature parameters and confident weights for robust speech recognition under noisy environment.
Proceedings of the Second IASTED International Conference on Computational Intelligence, 2006

Enhancing Robustness of Speech Recognition by Approach of Feature with Confident Weight.
Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, 2006

2005
Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Construction of the Walking Motion Model for Animated Characters' Motion.
Proceedings of the Internet and Multimedia Systems and Applications, 2005

2004
Sounds of Speech Based Spoken Document Categorization: A Subword Representation Method.
IEICE Trans. Inf. Syst., 2004

Approach of feature with confident weight for robust speech recognition.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Analysis of the phone level contributions to objective evaluation of English speech by non-natives.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Probabilistic Multi-Lateral Security Model for Ubiquitous Multimedia Services.
Proceedings of the 24th International Conference on Distributed Computing Systems Workshops (ICDCS 2004 Workshops), 2004

2003
Automatic closed-caption production system on TV programs for hearing-impaired people.
Syst. Comput. Jpn., 2003

Statistical estimation of phoneme's most stable point based on universal constraint.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Corpus-based modeling of naturalness estimation in timing control for non-native speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
An Efficient Lip-Reading Method Robust to Illumination Variations.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2002

Humanoid Robots in Waseda University-Hadaly-2 and WABIAN.
Auton. Robots, 2002

Accurate Human Face Extraction using Genetic Algorithm and Subspace Method.
Proceedings of the Soft Computing Systems - Design, Management and Applications, 2002

2001
The multi-lateral security framework for the ubiquitous audiovisual services.
Proceedings of the IEEE International Conference on Systems, 2001

AR-ARMA HMM and its application to speech enhancement.
Proceedings of the Signal and Image Processing (SIP 2001), 2001

Information Retrieval using Relevance Feedback.
Proceedings of the Third Second Workshop Meeting on Evaluation of Chinese & Japanese Text Retrieval and Text Summarization, 2001

Pronunciation variant analysis using speaking style parallel corpus.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speech enhancement based on IMM with NPHMM.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Multi-class composite n-gram language model using multiple word clusters and word successions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

The Multi-lateral Security Framework for the Ubiquitous Internet Multimedia.
Proceedings of the Fifth IASTED International Conference Internet and Multimedia Systems and Applications (IMSA 2001), 2001

2000
Experiments on the TREC-9 Filtering Track.
Proceedings of The Ninth Text REtrieval Conference, 2000

Modeling of spoken dialogue control for improvement of dialogue efficiency.
Proceedings of the IEEE International Conference on Systems, 2000

Controlling non-verbal information in speaker-change for spoken dialogue.
Proceedings of the IEEE International Conference on Systems, 2000

Re-estimation of LPC coefficients in the sense of l∞ criterion.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Using machine learning method and subword unit representations for spoken document categorization.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An automatic timing detection method for superimposing closed captions of TV programs.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Improvement of dialogue efficiency by dialogue control model according to performance of processes.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Overview of an intelligent system for information retrieval based on human-machine dialogue through spoken language.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Designing a domain independent platform of spoken dialogue system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Accurate Extraction of Human Face Area using Subspace Method and Genetic Algorithm.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Visual approach for automatic pitch period estimation.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Japanese large-vocabulary continuous-speech recognition using a newspaper corpus and broadcast news.
Speech Commun., 1999

Proposal and Evaluation of Significant Words Selection Method Based on AIC.
Proceedings of the First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, 1999

Improving recognition correct rate of important words in large vocabulary speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A recombination strategy for multi-band speech recognition based on mutual information criterion.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Cognitive experiments on timing lag for superimposing closed captions.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A post-processing of speech for hearing impaired integrate into standard digital audio decoders.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Speech enhancement based on neural predictive hidden Markov model.
Signal Process., 1998

J-MUSE; The Development of Pronunciation CAI System Based on Japanese Speech Recognition Intensified to Detect Errors.
Proceedings of WebNet 98, 1998

Controlling gaze of humanoid in communication with human.
Proceedings of the Proceedings 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, 1998

Use of non-verbal information in communication between human and robot.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Word sequence pair spotting for synchronization of speech and text in production of closed-caption TV programs for the hearing impaired.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Project for Production of Closed-Caption TV Programs for the Hearing Impaired.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997
Toward automatic transcription of Japanese broadcast news.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Facial Expressions Recognition Using Discrete Hopfield Neural Network.
Proceedings of the Proceedings 1997 International Conference on Image Processing, 1997

Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Difference in visual information between face to face and telephone dialogues.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Efficient recursive estimation for speech enhancement in colored noise.
IEEE Signal Process. Lett., 1996

Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction.
Syst. Comput. Jpn., 1996

Spoken dialogue interface in a dual task situation.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Modeling of spoken dialogue with and without visual information.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Estimation of statistical phoneme center considering phonemic environments.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Analysis of head movements and its role in spoken dialogue.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Handling of user interruption to achieve timing-free utterances for spoken dialogue interface.
Syst. Comput. Jpn., 1995

Estimation of statistical phoneme center and its application to accurate phoneme modelling.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

A design system for special purpose processors based on architectures for distributed processing.
Proceedings of the Proceedings EURO-DAC'95, 1995

1994
Editorial.
Speech Communication, 1994

Generation of prosody in speech synthesis using large speech data-base.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Evaluation of phonetic feature recognition with a time-delay neural network.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Phoneme recognition in various styles of utterance based on mutual information criterion.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Multimodal drawing tool using speech, mouse and key-board.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Effects on utterances caused by knowledge on the hearer.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Automatic training of phoneme dictionary based on mutual information criterion.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Markov model based noise modeling and its application to noisy speech recognition using dynamical features of speech.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Estimation and generation of articulatory motion using neural networks.
Speech Commun., 1993

Word spotting in conversational speech based on phonemic unit likelihood by mutual information criterion.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speech recognition under the unstationary noise based on the noise Markov model and spectral-subtraction.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Phoneme recognition in continuous speech based on mutual information considering phonemic duration and connectivity.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Spectral mapping onto probabilistic domain using neural networks and its application to speaker adaptive phoneme recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Speaker adaptive phoneme recognition based on feature mapping from spectral domain to probabilistic domain.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Optimal construction of context sensitive quantizer for phoneme recognition in continuous speech.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Text-to-speech synthesizer using superposition of sinusoidal waves generated by synchronized oscillators.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Application of neural networks to articulatory motion estimation.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Speech synthesis using superposition of sinusoidal waves generated by synchronized oscillators.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Speaker adaptable phoneme recognition selecting reliable acoustic features based on mutual information.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Interactive design environment of VLSI architecture for digital signal processing.
Proceedings of the 1990 International Conference on Acoustics, 1990

Speaker adaptive phoneme recognition by multi-level clustering based on mutual information criterion.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Phoneme recognition in continuous speech using feature selection based on mutual information.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

Multi-level clustering of acoustic features for phoneme recognition based on mutual information.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Speaker identification based on frequency distribution of vector-quantized spectra.
Syst. Comput. Jpn., 1988

Expert system for designing digital signal processor architectures.
Microprocess. Microsystems, 1988

1987
The robot musician 'wabot-2' (waseda robot-2).
Robotics, 1987

Speaker adaptive phoneme recognition in continuous speech based on vector quantization.
Proceedings of the European Conference on Speech Technology, 1987

Description of task dependent knowledge for speech understanding system.
Proceedings of the European Conference on Speech Technology, 1987

1986
Estimating articulatory motion from speech wave.
Speech Commun., 1986

Estimation of articulatory parameters by table look-up method and its application for speaker independent phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 1986

Pitch contour control in Japanese conversational speech.
Proceedings of the IEEE International Conference on Acoustics, 1986

Effects of tempo and context on jaw openings for vowels in vowel sequence words.
Proceedings of the IEEE International Conference on Acoustics, 1986

Phoneme recognition in connected speech using both static and dynamic properties of spectrum described by vector quantization.
Proceedings of the IEEE International Conference on Acoustics, 1986

A network model dealing with focus of conversation for speech understanding system.
Proceedings of the IEEE International Conference on Acoustics, 1986

Linguistic Knowledge Extraction from Real Language Behavior.
Proceedings of the 11th International Conference on Computational Linguistics, 1986

1984
Phrase speech recognition of large vocabulary using feature in articulatory domain.
Proceedings of the IEEE International Conference on Acoustics, 1984

1983
The non-stationary analysis of speech waves by the Hierarchical method.
Speech Commun., 1983

An estimation of the production process for fricative consonants.
Speech Commun., 1983

Considerations on articulatory dynamics for continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1983

1982
Recognition of semivowels and consonants in continuous speech using articulatory parameters.
Proceedings of the IEEE International Conference on Acoustics, 1982

Japanese Sentence Analysis System Essay - Evaluation Of Dictionary Derived From Real Text Data.
Proceedings of the 9th International Conference on Computational Linguistics, 1982

1981
Vowel identification in continuous speech using articulatory parameters.
Proceedings of the IEEE International Conference on Acoustics, 1981

1980
A Trial Of Japanese Text Input System Using Speech Recognition.
Proceedings of the 8th International Conference on Computational Linguistics, 1980


  Loading...