Satoshi Nakamura
ORCID: 0000-0001-6956-3803
Affiliations:
- Nara Institute of Science and Technology, Ikoma, Japan
- ATR Spoken Language Communication Labs, Kyoto, Japan
- National Institute of Information and Communications Technology (NICT), Spoken Language Communication Group, Keihanna Science City, Japan
- Sharp Corporation, Nara, Japan
- Kyoto University, Japan (PhD 1992)
According to our database, Satoshi Nakamura authored at least 719 papers between 1988 and 2024.
Bibliography
2024
Comput. Vis. Media, August, 2024
Improving Speech Translation Accuracy and Time Efficiency With Fine-Tuned wav2vec 2.0-Based Speech Segmentation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Int. J. Hum. Comput. Stud., 2024
IEICE Trans. Inf. Syst., 2024
A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization.
CoRR, 2024
A Word Order Synchronization Metric for Evaluating Simultaneous Interpretation and Translation.
CoRR, 2024
Word Order in English-Japanese Simultaneous Interpretation: Analyses and Evaluation using Chunk-wise Monotonic Translation.
CoRR, 2024
Response Generation for Cognitive Behavioral Therapy with Large Language Models: Comparative Study with Socratic Questioning.
CoRR, 2024
Do as I Demand, Not as I Say: A Dataset for Developing a Reflective Life-Support Robot.
IEEE Access, 2024
Applying Syntax-Prosody Mapping Hypothesis and Boundary-Driven Theory to Neural Sequence-to-Sequence Speech Synthesis.
IEEE Access, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Automated Essay Scoring Using Grammatical Variety and Errors with Multi-Task Learning and Item Response Theory.
Proceedings of the 19th Workshop on Innovative Use of NLP for Building Educational Applications, 2024
2023
Eye-movement analysis on facial expression for identifying children and adults with neurodevelopmental disorders.
Frontiers Digit. Health, March, 2023
End-to-end dialogue structure parsing on multi-floor dialogue based on multi-task learning.
Frontiers Robotics AI, February, 2023
Reflective action selection based on positive-unlabeled learning and causality detection model.
Comput. Speech Lang., 2023
CoRR, 2023
NAIST-SIC-Aligned: Automatically-Aligned English-Japanese Simultaneous Interpretation Corpus.
CoRR, 2023
Modeling Multiple User Interests using Hierarchical Knowledge for Conversational Recommender System.
CoRR, 2023
Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input.
IEEE Access, 2023
Proceedings of the ACM SIGGRAPH 2023 Posters, 2023
Emotion Prediction Using Multi-source Biosignals During Cognitive Behavior Therapy with Conversational Virtual Agents.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023
Tagged End-to-End Simultaneous Speech Translation Training Using Simultaneous Interpretation Data.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
Inter-connection: Effective Connection between Pre-trained Encoder and Decoder for Speech Translation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 25th International Conference on Multimodal Interaction, 2023
Proceedings of the International Conference on Multimodal Interaction, 2023
Computational analyses of linguistic features with schizophrenic and autistic traits along with formal thought disorders.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023
Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Multimodal Voice Activity Prediction: Turn-taking Events Detection in Expert-Novice Conversation.
Proceedings of the International Conference on Human-Agent Interaction, 2023
Acceptability and Trustworthiness of Virtual Agents by Effects of Theory of Mind and Social Skills Training.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023
Predicting Autistic Traits Using Eye Movement during Visual Perspective Taking and Facial Emotion Identification.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Social Performance Rating During Social Skills Training in Adults with Autism Spectrum Disorder and Schizophrenia.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, 2023
2022
Modeling Unsupervised Empirical Adaptation by DPGMM and DPGMM-RNN Hybrid Model to Extract Perceptual Features for Low-Resource ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
A Machine Speech Chain Approach for Dynamically Adaptive Lombard TTS in Static and Dynamic Noise Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Tackling multiple object tracking with complicated motions - Re-designing the integration of motion and appearance.
Image Vis. Comput., 2022
IEICE Trans. Inf. Syst., 2022
Online EEG-Based Emotion Prediction and Music Generation for Inducing Affective States.
IEICE Trans. Inf. Syst., 2022
Applying Meta-Learning and Iso Principle for Development of EEG-Based Emotion Induction System.
Frontiers Digit. Health, 2022
Automatic Thoughts and Facial Expressions in Cognitive Restructuring With Virtual Agents.
Frontiers Comput. Sci., 2022
Actor-identified Spatiotemporal Action Detection - Detecting Who Is Doing What in Videos.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Applying Syntax-Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the International Conference on Multimodal Interaction, 2022
Linguistic Features of Clients and Counselors for Early Detection of Mental Health Issues in Online Text-based Counseling.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Analysis of Feedback Contents and Estimation of Subjective Scores in Social Skills Training.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Pseudo Ambiguous and Clarifying Questions Based on Sentence Structures Toward Clarifying Question Answering System.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022
2021
Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval.
IEEE Trans. Multim., 2021
Tackling Perception Bias in Unsupervised Phoneme Discovery Using DPGMM-RNN Hybrid Model and Functional Load.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021
Image Vis. Comput., 2021
IEICE Trans. Inf. Syst., 2021
IEICE Trans. Inf. Syst., 2021
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation.
CoRR, 2021
Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing.
IEEE Access, 2021
IEEE Access, 2021
Multilingual Machine Translation Evaluation Metrics Fine-tuned on Pseudo-Negative Examples for WMT 2021 Metrics Task.
Proceedings of the Sixth Conference on Machine Translation, 2021
Proceedings of the Sixth Conference on Machine Translation, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the SA '21: SIGGRAPH Asia 2021 Technical Communications, Tokyo, Japan, December 14, 2021
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021
Multi-Encoder Sequential Attention Network for Context-Aware Speech Recognition in Japanese Dialog Conversation.
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021
Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech Synthesis.
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021
Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTS.
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021
Proceedings of the 18th International Conference on Spoken Language Translation, 2021
NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021
Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021
Proceedings of the 18th International Conference on Spoken Language Translation, 2021
Proceedings of the Conversational AI for Natural Human-Centric Interaction, 2021
Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-Based Speech-to-Text Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021
Clustering of Human Movement Trajectories based on Distributional Representations Derived from Bi-directional LSTM Network with Geographical Coordinates.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021
Proceedings of the 9th International Winter Conference on Brain-Computer Interface, 2021
Relationship between Mood Improvement and Questioning to Evaluate Automatic Thoughts in Cognitive Restructuring with a Virtual Agent.
Proceedings of the 2021 9th International Conference on Affective Computing and Intelligent Interaction, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Mach. Transl., 2020
Analysis of conversational listening skills toward agent-based social skills training.
J. Multimodal User Interfaces, 2020
IEICE Trans. Inf. Syst., 2020
Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation.
IEICE Trans. Inf. Syst., 2020
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS.
CoRR, 2020
Image Captioning with Visual Object Representations Grounded in the Textual Modality.
CoRR, 2020
An Interactive Image Editing System Using an Uncertainty-Based Confirmation Strategy.
IEEE Access, 2020
IEEE Access, 2020
IEEE Access, 2020
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020
Proceedings of the SIGGRAPH '20: Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2020
Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation.
Proceedings of the 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
NAIST's Machine Translation Systems for IWSLT 2020 Conversational Speech Translation Task.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020
Caption Generation of Robot Behaviors Based on Unsupervised Learning of Action Segments.
Proceedings of the Conversational Dialogue Systems for the Next Decade, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Incremental Machine Speech Chain Towards Enabling Listening While Speaking in Real-Time.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
Analysis of Mood Changes and Facial Expressions during Cognitive Behavior Therapy through a Virtual Agent.
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020
Objective Prediction of Social Skills Level for Automated Social Skills Training Using Audio and Text Information.
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020
Music Generation and Emotion Estimation from EEG Signals for Inducing Affective States.
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Panoramic Coordinate.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Analysis of selective attention processing on experienced simultaneous interpreters using EEG phase synchronization.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
Sequential Attention-based Detection of Semantic Incongruities from EEG While Listening to Speech.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Incorporating Noisy Length Constraints into Transformer with Length-aware Positional Encodings.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Additional Operations of Simple HITs on Microtask Crowdsourcing for Worker Quality Prediction.
J. Inf. Process., 2019
Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception.
IEICE Trans. Inf. Syst., 2019
Electroencephalogram-Based Single-Trial Detection of Language Expectation Violations in Listening to Speech.
Frontiers Comput. Neurosci., 2019
Associative knowledge feature vector inferred on external knowledge base for dialog state tracking.
Comput. Speech Lang., 2019
CoRR, 2019
Conversational Response Re-ranking Based on Event Causality and Role Factored Tensor Event Embedding.
CoRR, 2019
From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning.
CoRR, 2019
Classification of alkaloids according to the starting substances of their biosynthetic pathways using graph convolutional neural networks.
BMC Bioinform., 2019
IEEE Access, 2019
IEEE Access, 2019
Neural iTTS: Toward Synthesizing Speech in Real-time with End-to-end Neural Text-to-Speech Framework.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2019
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Neural Conversation Model Controllable by Given Dialogue Act Based on Adversarial Learning and Label-aware Objective.
Proceedings of the 12th International Conference on Natural Language Generation, 2019
Proceedings of the Adjunct of the 2019 International Conference on Multimodal Interaction, 2019
Detecting Syntactic Violations from Single-trial EEG using Recurrent Neural Networks.
Proceedings of the Adjunct of the 2019 International Conference on Multimodal Interaction, 2019
Proceedings of the Adjunct of the 2019 International Conference on Multimodal Interaction, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Intra-gender statistical singing voice conversion with direct waveform modification using log-spectral differential.
Speech Commun., 2018
Mach. Transl., 2018
Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition.
IEICE Trans. Inf. Syst., 2018
Semantically Readable Distributed Representation Learning and Its Expandability Using a Word Semantic Vector Dictionary.
IEICE Trans. Inf. Syst., 2018
Learning Supervised Feature Transformations on Zero Resources for Improved Acoustic Unit Discovery.
IEICE Trans. Inf. Syst., 2018
Optimization of Information-Seeking Dialogue Strategy for Argumentation-Based Dialogue System.
CoRR, 2018
CoRR, 2018
CoRR, 2018
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018
Multi-Scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Pre- and post-processes for automatic colorization using a fully convolutional network.
Proceedings of the SIGGRAPH Asia 2018 Posters, Tokyo, Japan, December 04-07, 2018
Unsupervised Counselor Dialogue Clustering for Positive Emotion Elicitation in Neural Dialogue System.
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018
Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data.
Proceedings of the 2018 Oriental COCOSDA, 2018
Proceedings of the 2018 Oriental COCOSDA, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Proceedings of the 15th International Conference on Spoken Language Translation, 2018
Proceedings of the 15th International Conference on Spoken Language Translation, 2018
Proceedings of the 15th International Conference on Spoken Language Translation, 2018
Impact of Deception Information on Negotiation Dialog Management: A Case Study on Doctor-Patient Conversations.
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018
Dialogue Act Classification in Reference Interview Using Convolutional Neural Network with Byte Pair Encoding.
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Detection of Dementia from Responses to Atypical Questions Asked by Embodied Conversational Agents.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Single-Trial Detection of Semantic Anomalies From EEG During Listening to Spoken Sentences.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018
Information Filtering Method for Twitter Streaming Data Using Human-in-the-Loop Machine Learning.
Proceedings of the Database and Expert Systems Applications, 2018
TRANS-AM: Discovery Method of Optimal Input Vectors Corresponding to Objective Variables.
Proceedings of the Big Data Analytics and Knowledge Discovery, 2018
Detecting suppression of negative emotion by time series change of cerebral blood flow using fNIRS.
Proceedings of the 2018 IEEE EMBS International Conference on Biomedical & Health Informatics, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Eliciting Positive Emotion through Affect-Sensitive Dialogue Response Generation: A Neural Network Approach.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
A Vibration Control Method of an Electrolarynx Based on Statistical F0 Pattern Prediction.
IEICE Trans. Inf. Syst., 2017
IEICE Trans. Inf. Syst., 2017
Analysis of the Effect of Dependency Information on Predicate-Argument Structure Analysis and Zero Anaphora Resolution.
CoRR, 2017
Proceedings of the Second Conference on Machine Translation, 2017
Proceedings of the Second Conference on Machine Translation, 2017
Proceedings of the International Conference on Web Intelligence, 2017
Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck Features.
Proceedings of the Speech and Computer - 19th International Conference, 2017
Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue, 2017
Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017
Proceedings of the Advances in Network-Based Information Systems, 2017
Proceedings of the Fifteenth IAPR International Conference on Machine Vision Applications, 2017
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Proceedings of the Advanced Social Interaction with Agents, 2017
Subject-Independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response During Speech Perception.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Physically Constrained Statistical F0 Prediction for Electrolaryngeal Speech Enhancement.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Acquisition and Assessment of Semantic Content for the Generation of Elaborateness and Indirectness in Spoken Dialogue Systems.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Feature optimized DPGMM clustering for unsupervised subword modeling: A contribution to zerospeech 2017.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
An investigation of how to design control parameters for statistical voice timbre control.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
A Simple and Strong Baseline: NAIST-NICT Neural Machine Translation System for WAT2017 English-Japanese Translation Task.
Proceedings of the 4th Workshop on Asian Translation, 2017
Proceedings of the First Workshop on Neural Machine Translation, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
Processing negative emotions through social communication: Multimodal database construction and analysis.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017
2016
ACM Trans. Interact. Intell. Syst., 2016
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Speech Commun., 2016
Learning local word reorderings for hierarchical phrase-based statistical machine translation.
Mach. Transl., 2016
Inf. Media Technol., 2016
A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models.
IEICE Trans. Inf. Syst., 2016
Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics.
IEICE Trans. Inf. Syst., 2016
IEICE Trans. Inf. Syst., 2016
Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior.
IEICE Trans. Inf. Syst., 2016
Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion.
IEICE Trans. Inf. Syst., 2016
Proceedings of the 25th International Conference on World Wide Web, 2016
Unsupervised Linear Discriminant Analysis for Supporting DPGMM Clustering in the Zero Resource Scenario.
Proceedings of the SLTU-2016, 2016
Deep bottleneck features and sound-dependent i-vectors for simultaneous recognition of speech and environmental sounds.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
F0 transformation techniques for statistical voice conversion with direct waveform modification with spectral differential.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Iterative training of a DPGMM-HMM acoustic unit recognizer in a zero resource scenario.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the SIGDIAL 2016 Conference, 2016
Proceedings of the SIGDIAL 2016 Conference, 2016
Selecting Syntactic, Non-redundant Segments in Active Learning for Machine Translation.
Proceedings of the NAACL HLT 2016, 2016
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the Dialogues with Social Robots, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Unsupervised Joint Estimation of Grapheme-to-Phoneme Conversion Systems and Acoustic Model Adaptation for Non-Native Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Acoustic-to-Articulatory Inversion Mapping Based on Latent Trajectory Gaussian Mixture Model.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Supervised Learning of Acoustic Models in a Zero Resource Setting to Improve DPGMM Clustering.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016
Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services, 2016
Automatic detection of very early stage of dementia through multimodal interaction with computer avatars.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Statistical F0 prediction for electrolaryngeal speech enhancement considering generative process of F0 contours within product of experts framework.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Noise suppression method for body-conducted soft speech enhancement based on external noise monitoring.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Implementation of F0 transformation for statistical singing voice conversion based on direct waveform modification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Real-time vibration control of an electrolarynx based on statistical F0 contour prediction.
Proceedings of the 24th European Signal Processing Conference, 2016
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016
Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016
A Continuous Space Rule Selection Model for Syntax-based Statistical Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Multichannel Signal Separation Combining Directional Clustering and Nonnegative Matrix Factorization with Spectrogram Restoration.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Trans. Assoc. Comput. Linguistics, 2015
IEICE Trans. Inf. Syst., 2015
An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015
Proceedings of the SIGDIAL 2015 Conference, 2015
Keynote speech 3: Toward simultaneous, natural and multimodal speech-to-speech translation.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
Construction and analysis of social-affective interaction corpus in English and Indonesian.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
Learning to Generate Pseudo-Code from Source Code Using Statistical Machine Translation (T).
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015
Improving translation of emphasis with pause prediction in speech-to-speech translation systems.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015
Proceedings of the 20th International Conference on Intelligent User Interfaces, 2015
Context awareness and priority control for ITS based on automatic speech recognition.
Proceedings of the 14th International Conference on ITS Telecommunications, 2015
Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Statistical singing voice conversion based on direct waveform modification with global variance.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Statistical modeling of binaural signal and its application to binaural source separation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
EEG signal enhancement using multi-channel wiener filter with a spatial correlation prior.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
WFST-based structural classification integrating DNN acoustic features and RNN language features for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
An evaluation of EEG ocular artifact removal with a multi-channel wiener filter based on probabilistic generative model.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015
Proceedings of the Blizzard Challenge 2015, 2015
An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction.
Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT2015.
Proceedings of the 2nd Workshop on Asian Translation, 2015
Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015
2014
Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff.
Trans. Assoc. Comput. Linguistics, 2014
Musical-noise-free blind speech extraction integrating microphone array and iterative spectral subtraction.
Signal Process., 2014
Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis.
IEEE J. Sel. Top. Signal Process., 2014
IEICE Trans. Inf. Syst., 2014
A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation.
IEICE Trans. Inf. Syst., 2014
Utilizing Human-to-Human Conversation Examples for a Multi Domain Chat-Oriented Dialog System.
IEICE Trans. Inf. Syst., 2014
Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model.
IEICE Trans. Inf. Syst., 2014
IEICE Trans. Inf. Syst., 2014
Proceedings of SSST@EMNLP 2014, 2014
Recent progress in developing grapheme-based speech recognition for Indonesian ethnic languages: Javanese, Sundanese, Balinese and Bataks.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the Situated Dialog in Speech-Based Human-Computer Interaction, 2014
Proceedings of the Situated Dialog in Speech-Based Human-Computer Interaction, 2014
Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Direct F0 control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Data-driven generation of text balloons based on linguistic and acoustic features of a comics-anime corpus.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Structured soft margin confidence weighted learning for grapheme-to-phoneme conversion.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Statistical singing voice conversion with direct waveform modification based on the spectrum differential.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A hearing impairment simulation method using audiogram-based approximation of auditory characteristics.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Music signal separation based on Bayesian spectral amplitude estimator with automatic target prior adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Theoretical analysis of biased MMSE short-time spectral amplitude estimator and its extension to musical-noise-free speech enhancement.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
Divergence optimization in nonnegative matrix factorization with spectrogram restoration for multichannel signal separation.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
Optimized joint noise suppression and dereverberation based on blind signal extraction for hands-free speech recognition system.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014
Proceedings of the COLING 2014, 2014
Proceedings of the COLING 2014, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
An inter-speaker evaluation through simulation of electrolarynx control based on statistical F0 prediction.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Recursive neural network paraphrase identification for example-based dialog retrieval.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Hybrid multichannel signal separation using supervised nonnegative matrix factorization with spectrogram restoration.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Linguistic and Acoustic Features for Automatic Identification of Autism Spectrum Disorders in Children's Narrative.
Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2014
2013
Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Towards language preservation: Design and collection of graphemically balanced and parallel speech corpora of Indonesian ethnic languages.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013
A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Improvements to HMM-based speech synthesis based on parameter generation with rich context models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
An investigation of acoustic features for singing voice conversion based on perceptual age.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Simple, lexicalized choice of translation timing for simultaneous speech translation.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Modality and contextual differences in computer based non-verbal communication training.
Proceedings of the IEEE 4th International Conference on Cognitive Infocommunications, 2013
Proceedings of the Working Notes for CLEF 2013 Conference, 2013
Inter-Sentence Features and Thresholded Minimum Error Rate Training: NAIST at CLEF 2013 QA4MRE.
Proceedings of the Working Notes for CLEF 2013 Conference, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Semi-blind algorithm for joint noise suppression and dereverberation based on higher-order statistics and acoustic model likelihood.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the First Workshop on Natural Language Processing for Medical and Healthcare Fields@IJCNLP 2013, 2013
2012
Distributed speech translation technologies for multiparty multilingual communication.
ACM Trans. Speech Lang. Process., 2012
Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach.
IEICE Trans. Inf. Syst., 2012
Minimum Bayes-Risk decoding extended with similar examples: NAIST-NICT at IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012
Proceedings of the Natural Interaction with Robots, 2012
An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
A bootstrapping approach for SLU portability to a new language by inducting unannotated user queries.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the IEEE 3rd International Conference on Cognitive Infocommunications, 2012
Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
ACM Trans. Speech Lang. Process., 2011
Temporal modulation normalization for robust speech feature extraction and recognition.
Multim. Tools Appl., 2011
A Bayesian Model of Transliteration and Its Human Evaluation When Integrated into a Machine Translation System.
IEICE Trans. Inf. Syst., 2011
Sub-band temporal modulation envelopes and their normalization for automatic speech recognition in reverberant environments.
Comput. Speech Lang., 2011
Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models.
Adv. Robotics, 2011
Toward Construction of Spoken Dialogue System that Evokes Users' Spontaneous Backchannels.
Proceedings of the SIGDIAL 2011 Conference, 2011
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011
Analysis on Effects of Text-to-Speech and Avatar Agent in Evoking Users' Spontaneous Listener's Reactions.
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
A sampling-based environment population projection approach for rapid acoustic model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2011
Increasing discriminative capability on MAP-based mapping function estimation for acoustic model adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2011
Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model.
Proceedings of the IEEE International Conference on Acoustics, 2011
Providing Immersive Virtual Experience with First-Person Perspective Omnidirectional Movies and Three Dimensional Sound Field.
Proceedings of the Virtual and Mixed Reality - New Trends, 2011
3-D Sound Reproduction System for Immersive Environments Based on the Boundary Surface Control Principle.
Proceedings of the Virtual and Mixed Reality - New Trends, 2011
Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
Proceedings of the Spoken Dialogue Systems Technology and Design, 2011
Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface.
Proceedings of the Spoken Dialogue Systems Technology and Design, 2011
2010
Temporal contrast normalization and edge-preserved smoothing of temporal modulation structures of speech for robust speech recognition.
Speech Commun., 2010
IEICE Trans. Inf. Syst., 2010
Dialogue strategy optimization to assist user's decision for spoken consulting dialogue systems.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Integrating lip-synch into game production workflow: "Sengoku BASARA 3".
Proceedings of the ACM SIGGRAPH ASIA 2010 Sketches, 2010
Proceedings of the SIGDIAL 2010 Conference, 2010
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2010
A Study Toward an Evaluation Method for Spoken Dialogue Systems Considering User Criteria.
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Evaluation of Facial Direction Estimation from Cameras for Multi-modal Spoken Dialog System.
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010
Proceedings of the 4th International Universal Communication Symposium, 2010
Web text classification for response generation in spoken decision support dialogue systems.
Proceedings of the 4th International Universal Communication Symposium, 2010
Proceedings of the 4th International Universal Communication Symposium, 2010
An environment structuring framework to facilitating suitable prior density estimation for MAPLR on robust speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Active learning of confidence measure function in robot language acquisition framework.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Construction and evaluations of an annotated Chinese conversational corpus in travel domain for the language model of speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Cluster-based language model for spoken document retrieval using NMF-based document clustering.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Brazilian Portuguese acoustic model training based on data borrowing from other language.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Spoken Dialog System on Plasma Display Panel Estimating Users' Interest by Image Processing.
Proceedings of the Workshops Proceedings of the 6th International Conference on Intelligent Environments, 2010
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010
Proceedings of the Auditory-Visual Speech Processing, 2010
Active Learning for Generating Motion and Utterances in Object Manipulation Dialogue Tasks.
Proceedings of the Dialog with Robots, 2010
2009
Lecture Notes in Electrical Engineering 42, Springer, ISBN: 978-0-387-85829-6, 2009
IEICE Trans. Inf. Syst., 2009
Automatic pronunciation scoring of words and sentences independent from the non-native's first language.
Comput. Speech Lang., 2009
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009
Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling.
Proceedings of the 3rd International Universal Communication Symposium, 2009
Proceedings of the 3rd International Universal Communication Symposium, 2009
Proceedings of the 3rd International Universal Communication Symposium, 2009
Normalization on the modulation spectrum of the subband temporal envelopes for automatic speech recognition in reverberant environments.
Proceedings of the 3rd International Universal Communication Symposium, 2009
Proceedings of the 3rd International Universal Communication Symposium, 2009
Proceedings of the 3rd International Universal Communication Symposium, 2009
Bayesian learning of confidence measure function for generation of utterances and motions in object manipulation dialogue task.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
A close look into the probabilistic concatenation model for corpus-based speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
A study on soft margin estimation of linear regression parameters for speaker adaptation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Subband temporal modulation spectrum normalization for automatic speech recognition in reverberant environments.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories.
Proceedings of the IEEE International Conference on Acoustics, 2009
Temporal contrast normalization and edge-preserved smoothing on temporal modulation structure for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
The NICT Entry for the Blizzard Challenge 2009: an Enhanced HMM-based Speech Synthesis System with Trajectory Training considering Global Variance and State-Dependent Mixed Excitation.
Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009
MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Proceedings of the Information Retrieval Technology, 2009
Proceedings of the 7th Workshop on Asian Language Resources, 2009
Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions.
Proceedings of the 7th Workshop on Asian Language Resources, 2009
2008
IEEE Trans. Robotics, 2008
Comput. Animat. Virtual Worlds, 2008
An Improved Greedy Search Algorithm for the Development of a Phonetically Rich Speech Corpus.
IEICE Trans. Inf. Syst., 2008
Using Mutual Information Criterion to Design an Efficient Phoneme Set for Chinese Speech Recognition.
IEICE Trans. Inf. Syst., 2008
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Proceedings of the 9th International Conference on Mobile Data Management (MDM 2008), 2008
Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -.
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Probabilistic Pronunciation Variation Model Based on Bayesian Network for Conversational Speech Recognition.
Proceedings of the ISUC 2008, 2008
Proceedings of the ISUC 2008, 2008
Proceedings of the ISUC 2008, 2008
Normalization on Temporal Modulation Transfer Function for Robust Speech Recognition.
Proceedings of the ISUC 2008, 2008
Proceedings of the ISUC 2008, 2008
Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTS Conversion Systems.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008
Proceedings of the COLING 2008, 2008
Proceedings of the Blizzard Challenge 2008, 2008
2007
Incorporating Knowledge Sources Into a Statistical Acoustic Model for Spoken Language Communication Systems.
IEEE Trans. Computers, 2007
Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics.
IEEE Trans. Speech Audio Process., 2007
Multichannel Bin-Wise Robust Frequency-Domain Adaptive Filtering and Its Application to Adaptive Beamforming.
IEEE Trans. Speech Audio Process., 2007
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2007
Proceedings of the Advances in Multimedia Information Processing, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
A method to integrate additional knowledge sources into HMM based on junction tree decomposition.
Proceedings of the 15th European Signal Processing Conference, 2007
Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
IEEE Trans. Speech Audio Process., 2006
Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework.
Speech Commun., 2006
Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework.
IEICE Trans. Inf. Syst., 2006
IEICE Trans. Inf. Syst., 2006
IEICE Trans. Inf. Syst., 2006
ATR Parallel Decoding Based Speech Recognition System Robust to Noise and Speaking Styles.
IEICE Trans. Inf. Syst., 2006
IEICE Trans. Inf. Syst., 2006
CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments.
IEICE Trans. Inf. Syst., 2006
A Non-stationary Noise Suppression Method Based on Particle Filtering and Polyak Averaging.
IEICE Trans. Inf. Syst., 2006
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2006
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2006
Proceedings of the 7th International Conference on Mobile Data Management (MDM 2006), 2006
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006
Development of client-server speech translation system on a multi-lingual speech communication platform.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006
Speech recognition of foreign out-of-vocabulary words using a hierarchical language model.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
The use of Bayesian network for incorporating accent, gender and wide-context dependency information.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Automatic Derivation of a Phoneme Set with Tone Information for Chinese Speech Recognition Based on Mutual Information Criterion.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Incorporation of Pentaphone-Context Dependency Based on Hybrid HMM/BN Acoustic Modeling Framework.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Sequential Non-Stationary Noise Tracking Using Particle Filtering with Switching Dynamical System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 2006 6th IEEE-RAS International Conference on Humanoid Robots, 2006
Developing a Test Bed of English Text-to-Speech System XIMERA for the Blizzard Challenge 2006.
Proceedings of the Blizzard Challenge 2006, Pittsburgh, PA, USA, September 16, 2006, 2006
2005
Speech Commun., 2005
Tone nucleus-based multi-level robust acoustic tonal modeling of sentential F0 variations for Chinese continuous speech tone recognition.
Speech Commun., 2005
Construction of Audio-Visual Speech Corpus Using Motion-Capture System and Corpus Based Facial Animation.
IEICE Trans. Inf. Syst., 2005
IEICE Trans. Inf. Syst., 2005
Dialogue Speech Recognition by Combining Hierarchical Topic Classification and Language Model Switching.
IEICE Trans. Inf. Syst., 2005
Automatic Generation of Non-uniform and Context-Dependent HMMs Based on the Variational Bayesian Approach.
IEICE Trans. Inf. Syst., 2005
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2005
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2005
Proceedings of the Ninth IEEE International Symposium on Wearable Computers (ISWC 2005), 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Spoken dialog system and its evaluation of geographic information system for elderly persons' mobility support.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
CENSREC-3: Data Collection for In-Car Speech Recognition and Its Common Evaluation Framework.
Proceedings of the 21st International Conference on Data Engineering Workshops, 2005
Online cepstral filtering using a sequential EM approach with Polyak averaging and feedback.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Joint optimization of LCMV beamforming and acoustic echo cancellation for automatic speech recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents.
Proceedings of the Life-like characters - tools, affective functions, and applications, 2004
Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method.
J. VLSI Signal Process., 2004
IEEE Trans. Speech Audio Process., 2004
Speech Commun., 2004
IEICE Trans. Inf. Syst., 2004
IEICE Trans. Inf. Syst., 2004
Multimodal Translation System Using Texture-Mapped Lip-Sync Images for Video Mail and Automatic Dubbing Applications.
EURASIP J. Adv. Signal Process., 2004
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2004
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004
Generalized posterior probability for minimizing verification errors at subword, word and sentence levels.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Efficient tone classification of speaker independent continuous Chinese speech using anchoring based discriminating features.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Online minimum mean square error filtering of noisy cepstral coefficients using a sequential EM algorithm.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Integration of articulatory dynamic parameters in HMM/BN based speech recognition system.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Topic classification and verification modeling for out-of-domain utterance detection.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Increasing the mixture components of non-uniform HMM structures based on a variational Bayesian approach.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Speech recognition for multiple non-native accent groups with speaker-group-dependent acoustic models.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Minimum mean square error filtering of noisy cepstral coefficients with applications to ASR.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Out-of-domain detection based on confidence measures from multiple topic classification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Automatic generation of non-uniform HMM structures based on variational Bayesian approach.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Proceedings of the 2004 12th European Signal Processing Conference, 2004
2003
Speech Commun., 2003
Syst. Comput. Jpn., 2003
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Model based noisy speech recognition with environment parameters estimated by noise adaptive speech recognition with prior.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Environmental sound source identification based on hidden Markov model for robust speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Noise reduction using paired-microphones on non-equally-spaced microphone arrangement.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Hierarchical topic classification for dialog speech recognition based on language model switching.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Automatic generation of non-uniform context-dependent HMM topologies based on the MDL criterion.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
A semi-blind source separation method for hands-free speech recognition of multiple talkers.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
A multilevel framework to model the inherently confounding nature of sentential F0 contours for recognizing Chinese lexical tones.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
IEEE Trans. Neural Networks, 2002
Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array.
IEEE Trans. Speech Audio Process., 2002
The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002
Modeling varying pauses to develop robust acoustic models for recognizing noisy conversational speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Speaking rate compensation based on likelihood criterion in acoustic model training and decoding.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
HMM composition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002
3-D N-Best Search for Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002
An evaluation of sound source identification with RWCP sound scene database in real acoustic environments.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Design and collection of acoustic sound data for hands-free speech recognition and sound scene understanding.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Noise adaptive speech recognition in time-varying noise based on sequential Kullback proximal algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2002
Talker localization in a real acoustic environment based on DOA estimation and statistical sound source identification.
Proceedings of the IEEE International Conference on Acoustics, 2002
Robust bi-modal speech recognition based on state synchronous modeling and stream weight optimization.
Proceedings of the IEEE International Conference on Acoustics, 2002
Audio-visual speech translation with automatic lip synchronization and face tracking based on 3-D head model.
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
Speech-to-Lip Movement Synthesis by Maximizing Audio-Visual Joint Probability Based on the EM Algorithm.
J. VLSI Signal Process., 2001
IEEE Trans. Speech Audio Process., 2001
Proceedings of the 5th International Symposium on Wearable Computers (ISWC 2001), 2001
A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Noise reduction using paired-microphones for both far-field and near-field sound sources.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Model-Based Lip Synchronization With Automatically Translated Synthetic Voice Toward A Multi-Modal Translation System.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Automatic Steering Of Microphone Array And Video Camera Toward Multi-Lingual Tele-Conference Through Speech-To-Speech Translation.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
A microphone array-based 3-D N-best search algorithm for the simultaneous recognition of multiple sound sources in real environments.
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 2001
2000
IEEE Trans. Speech Audio Process., 2000
Model adaptation by HMM decomposition and composition in noisy reverberant environments.
Syst. Comput. Jpn., 2000
Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Residual noise compensation by a sequential EM algorithm for robust speech recognition in nonstationary noise.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Stream weight optimization of speech and lip image sequence for audio-visual speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Robust fundamental frequency estimation using instantaneous frequencies of harmonic components.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000
Speech recognition for a distant moving speaker based on HMM composition and separation.
Proceedings of the IEEE International Conference on Acoustics, 2000
Localization of multiple sound sources based on a CSP analysis with a microphone array.
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
Data collection in real acoustical environments for sound scene understanding and hands-free speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Simultaneous recognition of multiple sound sources based on 3-D N-best search using microphone array.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Speech Commun., 1998
Speech-to-lip movement synthesis maximizing audio-visual joint probability based on EM algorithm.
Proceedings of the Second IEEE Workshop on Multimedia Signal Processing, 1998
Compression algorithm of trigram language models based on maximum likelihood estimation.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
An effect of adaptive beamforming on hands-free speech recognition based on 3-D Viterbi search.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Proceedings of the Auditory-Visual Speech Processing, 1998
1997
A non-iterative model-adaptive e-CMN/PMC approach for speech recognition in car environments.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Improved bimodal speech recognition using tied-mixture HMMs and 5000 word audio-visual synchronous database.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, 1997
1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
1991
Proceedings of the 1991 International Conference on Acoustics, 1991
1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
1989
Proceedings of the IEEE International Conference on Acoustics, 1989
1988
Proceedings of the IEEE International Conference on Acoustics, 1988