Hiromitsu Nishizaki

Orcid: 0000-0002-7717-8312

According to our database, Hiromitsu Nishizaki authored at least 119 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Special issue on multimodal processing and robotics for dialogue systems (Part II).
Adv. Robotics, February, 2024

Enhancing Anti-spoofing Countermeasures Robustness through Joint Optimization and Transfer Learning.
CoRR, 2024

Overview of Dialogue Robot Competition 2023.
CoRR, 2024

Analysis of Classroom Processes Based on Deep Learning With Video and Audio Features.
IEEE Access, 2024

Text Detection and Style Classification from Images Using Vision Transformer and Transformer Decoder.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Development of GUI application for multimodal analysis for evaluation of swallowing function.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Evaluation of LoRa-based Long-Range Communication in a Fruit Theft Prevention Device.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Evaluation of Speech Translation Subtitles Generated by ASR with Unnecessary Word Detection.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

2023
Single-Line Text Detection in Multi-Line Text with Narrow Spacing for Line-Based Character Recognition.
IEICE Trans. Inf. Syst., December, 2023

3D grape bunch model reconstruction from 2D images.
Comput. Electron. Agric., December, 2023

Special Issue on Multimodal processing and robotics for dialogue systems (Part 1).
Adv. Robotics, November, 2023

Design of a competition specifically for spoken dialogue with a humanoid robot.
Adv. Robotics, November, 2023

End-to-end lightweight berry number prediction for supporting table grape cultivation.
Comput. Electron. Agric., October, 2023

Supporting table grape berry thinning with deep neural network and augmented reality technologies.
Comput. Electron. Agric., October, 2023

A Lightweight End-to-End Speech Recognition System on Embedded Devices.
IEICE Trans. Inf. Syst., July, 2023

Comparative Evaluation of Diverse Features in Fluency Evaluation of Spontaneous Speech.
IEICE Trans. Inf. Syst., 2023

A new speech corpus of super-elderly Japanese for acoustic modeling.
Comput. Speech Lang., 2023

Proceedings of the Dialogue Robot Competition 2023.
CoRR, 2023

Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure.
CoRR, 2023

Automatic Exploration of Optimal Data Processing Operations for Sound Data Augmentation Using Improved Differentiable Automatic Data Augmentation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Proposal of a method for evaluating biological responses during swallowing using the LF/HF change rate.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Data Augmentation with Automatically Generated Images for Character Classifier Model Training.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Image Remapping Data Augmentation Approach for Improving Fisheye Face Recognition.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Metric Learning Approach for End-to-End Multilingual Automatic Speech Recognition Model.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

2022
Appropriate grape color estimation based on metric learning for judging harvest timing.
Vis. Comput., 2022

Non-Contact Breathing Monitoring Using Sleep Breathing Detection Algorithm (SBDA) Based on UWB Radar Sensors.
Sensors, 2022

Overview of Dialogue Robot Competition 2022.
CoRR, 2022

Proceedings of the Dialogue Robot Competition 2022.
CoRR, 2022

Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification.
CoRR, 2022

Handwritten Character Generation using Y-Autoencoder for Character Recognition Model Training.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Usability of Instrument for a Farm Product in a Real Farm.
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

End-to-End Speech to Braille Translation in Japanese.
Proceedings of the IEEE International Conference on Consumer Electronics, 2022

Peer Collaborative Learning for Polyphonic Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Automatic Selection of Appropriate Data Augmentation Operation for Acoustic Scene Classification Model Training.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

Dialogue Robot Competition for the Development of an Android Robot with Hospitality.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021
Real-Time In-Vehicle Air Quality Monitoring System Using Machine Learning Prediction Algorithm.
Sensors, 2021

Comparison of Static and Time-Sequential Features in Automatic Fluency Detection of Spontaneous Speech.
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

Evaluation of a Gait Analysis Tool Using Posture Estimation Technology in Clinical Rehabilitation.
Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021

Voice Activity Detection for Live Speech of Baseball Game Based on Tandem Connection with Speech/Noise Separation Model.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Language and Speaker-Independent Feature Transformation for End-to-End Multilingual Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

Audio Synthesis-based Data Augmentation Considering Audio Event Class.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

Corpus Design and Automatic Speech Recognition for Deaf and Hard-of-Hearing People.
Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

Semi-Supervised Learning for Aspect-Based Sentiment Analysis.
Proceedings of the International Conference on Cyberworlds, 2021

Supporting Vine Vegetation Status Observation Using AR.
Proceedings of the International Conference on Cyberworlds, 2021

End-to-End Inflorescence Measurement for Supporting Table Grape Trimming with Augmented Reality.
Proceedings of the International Conference on Cyberworlds, 2021

Development of a Support System for Judging the Appropriate Timing for Grape Harvesting.
Proceedings of the International Conference on Cyberworlds, 2021

2020
Classification of a Pincer Nail Using a Recurrent-based Neural Network.
Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, 2020

Semi-Automatic Construction and Refinement of an Annotated Corpus for a Deep Learning Framework for Emotion Classification.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Improving Speech Recognition for the Elderly: A New Corpus of Elderly Japanese Speech and Investigation of Acoustic Modeling for Speech Recognition.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Integrating Disfluency-based and Prosodic Features with Acoustics in Automatic Fluency Evaluation of Spontaneous Speech.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Automatic Fluency Evaluation of Spontaneous Speech Using Disfluency-Based Features.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ExKaldi: A Python-based Extension Tool of Kaldi.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

Development of a Low-Latency and Real-Time Automatic Speech Recognition System.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

Sentiment analysis using semi-supervised learning with few labeled data.
Proceedings of the International Conference on Cyberworlds, 2020

Analysis of Bit Sequence Representation for Sound Classification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Spoken Dialog Training System for Customer Service Improvement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
A hybrid approach of knowledge-driven and data-driven reasoning for activity recognition in smart homes.
J. Intell. Fuzzy Syst., 2019

BERT based Web Mining of Concerns and Reviews for TV Drama Audience.
Proceedings of the 2019 IEEE/WIC/ACM International Conference on Web Intelligence, 2019

A New Corpus of Elderly Japanese Speech for Acoustic Modeling, and a Preliminary Investigation of Dialect-Dependent Speech Recognition.
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

Application of Sequence Input and Output Long Short-Term Memory Neural Networks for Autonomous Gas Source Localization in an Outdoor Environment.
Proceedings of the IEEE International Symposium on Olfaction and Electronic Nose, 2019

Audio Classification of Bit-Representation Waveform.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Abnormality Detection Approach using Deep Learning Models in Smart Home Environments.
Proceedings of the 7th International Conference on Communications and Broadband Networking, 2019

Scenario-based Customer Service VR Training System with Honorific Exercise.
Proceedings of the ICBBE '19: 2019 6th International Conference on Biomedical and Bioinformatics Engineering, 2019

Classification of Swing Motion of Tennis using a Recurrent-based Neural Network.
Proceedings of the 12th International Conference on Human System Interaction, 2019

2018
A Task Manual Creation Support System Using Automatic Speech Recognition.
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

Construction of a Corpus for Elderly Japanese Speech Recognition.
Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

2017
Parallel Hierarchical Attention Networks with Shared Memory Reader for Multi-Stream Conversational Document Classification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Emotion classification of spontaneous speech using spoken term detection.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Data augmentation and feature extraction using variational autoencoder for acoustic modeling.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Acoustic modeling with a shared phoneme set for multilingual speech recognition without code-switching.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Re-Ranking Approach of Spoken Term Detection Using Conditional Random Fields-Based Triphone Detection.
IEICE Trans. Inf. Syst., 2016

Spoken Term Detection Using SVM-Based Classifier Trained with Pre-Indexed Keywords.
IEICE Trans. Inf. Syst., 2016

Evaluation of DNN-based Phoneme Estimation Approach on the NTCIR-12 SpokenQuery&Doc-2 SQ-STD Subtask.
Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016

Overview of the NTCIR-12 SpokenQuery&Doc-2 Task.
Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016

Recurrent Neural Network-Based Phoneme Sequence Estimation Using Multiple ASR Systems' Outputs for Spoken Term Detection.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Two-step spoken term detection using SVM classifier trained with pre-indexed keywords based on ASR result.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Score normalization using phoneme-based entropy for spoken term detection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Combination of DTW-based and CRF-based Spoken Term Detection on the NTCIR-11 SpokenQuery&Doc SQ-STD Subtask.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

Overview of the NTCIR-11 SpokenQuery&Doc Task.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

Re-ranking of spoken term detections using CRF-based triphone detection models.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Selection of best match keyword using spoken term detection for spoken document indexing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Spoken Term Detection Using Phoneme Transition Network from Multiple Speech Recognizers' Outputs.
J. Inf. Process., 2013

Evaluation Framework Design of Spoken Term Detection Study at the NTCIR-9 IR for Spoken Documents Task.
Inf. Media Technol., 2013

STD and SCR Techniques and Their Evaluations on the NTCIR-10 SpokenDoc-2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Overview of the NTCIR-10 SpokenDoc-2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Evaluation of the usefulness of spoken term detection in an electronic note-taking support system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Entropy-based false detection filtering in spoken term detection tasks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Development of note-taking support system with speech interface.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Introduction of false detection control parameters in spoken term detection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Spoken Term Detection Using Multiple Speech Recognizers' Outputs at NTCIR-9 SpokenDoc STD subtask.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Overview of the IR for Spoken Documents Task in NTCIR-9 Workshop.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Utterance verification using garbage words for a hospital appointment system with speech interface.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Japanese spoken term detection using syllable transition network derived from multiple speech recognizers' outputs.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Constructing Japanese test collections for spoken term detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Construction of a Test Collection for Spoken Document Retrieval from Lecture Audio Data.
J. Inf. Process., 2009

2008
Developing Corpus of Japanese Classroom Lecture Speech Contents.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Test Collections for Spoken Document Retrieval from Lecture Audio Data.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Speech recognition performance of CJLC: corpus of Japanese lecture contents.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Is a speech recognizer useful for characteristic analysis of classroom lecture speech?
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
The effect of filled pauses in a lecture speech on impressive evaluation of listeners.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Word Error Correction of Continuous Speech Recognition Using WEB Documents for Spoken Document Indexing.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006

2005
Combining outputs of multiple LVCSR models by machine learning.
Syst. Comput. Jpn., 2005

An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems.
IEICE Trans. Inf. Syst., 2005

Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task.
IEICE Trans. Inf. Syst., 2005

2004
Estimating high-confidence portions based on agreement among outputs of multiple LVCSR models.
Syst. Comput. Jpn., 2004

Robust spoken document retrieval methods for misrecognition and out-of-vocabulary keywords.
Syst. Comput. Jpn., 2004

An Empirical Study on Multiple LVCSR Model Combination by Machine Learning.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004

Unsupervised speaker adaptation using high confidence portion recognition results by multiple recognition systems.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Keyword recognition and extraction by multiple-LVCSRs with 60,000 words in speech-driven WEB retrieval task.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003
Evaluating multiple LVCSR model combination in NTCIR-3 speech-driven web retrieval task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Confidence of agreement among multiple LVCSR models and model combination by SVM.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Experimental evaluation on confidence of agreement among multiple Japanese LVCSR models.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
A system for retrieving broadcast news speech documents using voice input keywords and similarity between words.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
A Retrieval System of Broadcast News Speech Documents through Keyboard and Voice.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

