Akinobu Lee

Orcid: 0009-0005-5323-9191

According to our database, Akinobu Lee authored at least 62 papers between 1998 and 2024.

Refining Synthesized Speech Using Speaker Information and Phone Masking for Data Augmentation of Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Accent-Preserving Voice Conversion between Native-Nonnative Speakers for Second Language Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Context and knowledge aware conversational model and system combination for grounded response generation.
Comput. Speech Lang., 2020

Fact-based Dialogue Generation with Convergent and Divergent Decoding.
CoRR, 2020

An Ensemble Dialogue System for Facts-Based Sentence Generation.
CoRR, 2019

User Generated Dialogue Systems: uDialogue.
Proceedings of the Human-Harmonized Information Technology, Volume 2, 2017

Prosodically-enhanced recurrent neural network language models.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Voice interaction system with 3D-CG virtual agent for stand-alone smartphones.
Proceedings of the second international conference on Human-agent interaction, 2014

MMDAgent - A fully open-source toolkit for voice interaction systems.
Proceedings of the IEEE International Conference on Acoustics, 2013

Bayesian Context Clustering Using Cross Validation for Speech Recognition.
IEICE Trans. Inf. Syst., 2011

Evaluation of Tree-Trellis Based Decoding in Over-Million LVCSR.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Covariance-Tying Technique for HMM-Based Speech Synthesis.
IEICE Trans. Inf. Syst., 2010

Voice activity detection based on conditional random fields using multiple features.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker adaptation based on nonlinear spectral transform for speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Voice conversion based on simultaneous modelling of spectrum and F0.
Proceedings of the IEEE International Conference on Acoustics, 2009

A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System.
IEICE Trans. Inf. Syst., 2008

Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System.
IEICE Trans. Inf. Syst., 2008

Probabilistic answer selection based on conditional random fields for spoken dialog system.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Acoustic modeling based on model structure annealing for speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker recognition based on variational Bayesian method.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Voice activity detection applied to hands-free spoken dialogue robot based on decoding using acoustic and language model.
Proceedings of the 1st International Conference on Robot Communication and Coordination, 2007

Real-Time Continuous Speech Recognition System on SH-4A Microprocessor.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Insights Gained from Development and Long-Term Operation of a Real-Environment Speech-Oriented Guidance System.
Proceedings of the IEEE International Conference on Acoustics, 2007

Blind source separation based on a fast-convergence algorithm combining ICA and beamforming.
IEEE Trans. Speech Audio Process., 2006

Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models.
IEICE Trans. Inf. Syst., 2006

Embedded Julius: Continuous Speech Recognition Software for Microprocessor.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Voice conversion based on mixtures of factor analyzers.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

An HMM-based singing voice synthesis system.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Reducing computation on parallel decoding using frame-wise confidence scores.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Speech extraction in a car interior using frequency-domain ICA with rapid filter adaptations.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Operating a public spoken guidance system in real environment.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Speech Enhancement Based on Blind Source Separation in Car Environments.
Proceedings of the 21st International Conference on Data Engineering Workshops, 2005

Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents.
Proceedings of the Life-like characters - tools, affective functions, and applications, 2004

Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Recent progress of open-source LVCSR engine Julius and Japanese model repository.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Robust speech recognition with spectral subtraction in low SNR.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Public speech-oriented guidance system with adult and child discrimination capability.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Real-time word confidence scoring using local posterior probabilities on tree trellis search.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Audible (normal) speech and inaudible murmur recognition using NAM microphone.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Continuous Speech Recognition Consortium: an Open Repository for CSR Tools and Models.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

ASKA: receptionist robot with speech dialogue system.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Speech enhancement in car environment using blind source separation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Selective multi-path acoustic model based on database likelihoods.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation on unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Automatic n-gram language model creation from web resources.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Julius - an open source real-time large vocabulary recognition engine.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Elderly acoustic model for large vocabulary continuous speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Gaussian mixture selection using context-independent HMM.
Proceedings of the IEEE International Conference on Acoustics, 2001

IPA Japanese Dictation Free Software Project.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Automatic diagnosis of recognition errors in large vocabulary continuous speech recognition systems.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Free software toolkit for Japanese large vocabulary continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A new phonetic tied-mixture model for efficient decoding.
Proceedings of the IEEE International Conference on Acoustics, 2000

An efficient two-pass search algorithm using word trellis index.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
