We stand with Ukraine

We stand with Ukraine

Masafumi Nishimura

Orcid: 0000-0001-7633-9340

Affiliations:

IBM Research

According to our database¹, Masafumi Nishimura authored at least 85 papers between 1984 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on researcher.ibm.com

On csauthors.net:

Bibliography

2024

Simultaneous Speech and Eating Behavior Recognition Using Multitask Learning.

[BibT_eX]

[DOI]

Toshihiro Tsukagoshi

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Chewing and Swallowing Pattern Recognition Using Sound Information.

[BibT_eX]

[DOI]

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Food Recognition Using Large-scale Pre-trained Speech Models.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

Masafumi Nishida

,

,

Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Mora-Based Evaluation of Naturalness for Japanese Dysarthric Speech.

[BibT_eX]

[DOI]

Megumi Hatakeyama

,

,

Motomichi Shirahase

,

,

Naoshi Yamamoto

,

Masafumi Nishimura

Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

SSL-based Chewing and Swallowing Detection Using Multiple Skin-contact Microphones.

[BibT_eX]

[DOI]

Toshihiro Tsukagoshi

,

Kazuhiro Koiwai

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024

2023

Utterance-style-dependent Speaker Verification by Utilizing Emotions.

[BibT_eX]

[DOI]

Hibiki Takayama

,

Masafumi Nishida

,

,

,

Masafumi Nishimura

Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Eating and Drinking Behavior Recognition Using Multimodal Fusion.

[BibT_eX]

[DOI]

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Multi-Self-Supervised Learning Model-Based Throat Microphone Speech Recognition.

[BibT_eX]

[DOI]

,

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

Automatic Detection of Crushing Completion Timing of Food.

[BibT_eX]

[DOI]

Akihiro Nakamura

,

,

,

,

,

Masafumi Nishimura

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Identification of vocal tract state before and after swallowing using acoustic features.

[BibT_eX]

[DOI]

,

Masafumi Nishida

,

Masafumi Nishimura

,

,

Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

Throat microphone speech recognition using wav2vec 2.0 and feature mapping.

[BibT_eX]

[DOI]

,

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021

Automatic Detection of Chewing and Swallowing.

[BibT_eX]

[DOI]

Akihiro Nakamura

,

,

,

,

,

Masafumi Nishimura

Sensors, 2021

Tablet-Based Automatic Assessment for Early Detection of Alzheimer's Disease Using Speech Responses to Daily Life Questions.

[BibT_eX]

[DOI]

Yasunori Yamada

,

,

Masatomo Kobayashi

,

Masafumi Nishimura

,

,

,

,

Kiyotaka Nemoto

,

Frontiers Digit. Health, 2021

A Study for Detecting Mild Cognitive Impairment by Analyzing Conversations with Humanoid Robots.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

,

,

,

,

Masatomo Kobayashi

,

Yasunori Yamada

,

,

Ryohei Watanabe

,

,

,

Kiyotaka Nemoto

,

,

Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021

Automatic Detection of Chewing and Swallowing Using Multichannel Sound Information.

[BibT_eX]

[DOI]

Akihiro Nakamura

,

,

,

,

,

Masafumi Nishimura

Proceedings of the 3rd IEEE Global Conference on Life Sciences and Technologies, 2021

Automatic Detection of Chewing and Swallowing Using Attention-Based Fusion.

[BibT_eX]

[DOI]

Akihiro Nakamura

,

,

,

,

,

Masafumi Nishimura

Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

Question Generation using Knowledge Graphs with the T5 Language Model and Masked Self-Attention.

[BibT_eX]

[DOI]

,

Takashi Tsunakawa

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 10th IEEE Global Conference on Consumer Electronics, 2021

2020

Automatic Detection of the Chewing Side Using Two-channel Recordings under the Ear.

[BibT_eX]

[DOI]

Akihiro Nakamura

,

,

Masafumi Nishimura

,

,

,

Proceedings of the 2nd IEEE Global Conference on Life Sciences and Technologies, 2020

A data augmentation-based technique to classify chewing and swallowing using LSTM.

[BibT_eX]

[DOI]

Muhammad Mehedi Billah

,

Masafumi Nishimura

Proceedings of the 2nd IEEE Global Conference on Life Sciences and Technologies, 2020

Automatic Detection of Chewing and Swallowing Using Hybrid CTC/Attention.

[BibT_eX]

[DOI]

Akihiro Nakamura

,

,

,

,

,

Masafumi Nishimura

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

BERT-based Automatic Text Scoring for Collaborative Learning.

[BibT_eX]

[DOI]

,

Takashi Tsunakawa

,

,

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

A Data Augmentation Technique for Automatic Detection of Chewing Side and Swallowing.

[BibT_eX]

[DOI]

Akihiro Nakamura

,

,

,

,

,

Masafumi Nishimura

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Multimodal Behavior Analysis Towards Detecting Mild Cognitive Impairment: Preliminary Results on Gait and Speech.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

,

,

Kiyotaka Nemoto

,

Tomoko Takeuchi

,

,

Ryohei Watanabe

,

,

,

,

,

Yasunori Yamada

Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

Knowledge Distillation for Throat Microphone Speech Recognition.

[BibT_eX]

[DOI]

Takahito Suzuki

,

,

Takashi Tsunakawa

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Effects of Mounting Position on Throat Microphone Speech Recognition.

[BibT_eX]

[DOI]

Takahito Suzuki

,

Takashi Tsunakawa

,

Masafumi Nishida

,

Masafumi Nishimura

,

Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

Estimation of Number of Chewing Strokes and Swallowing Events by Using LSTM-CTC and Throat Microphone.

[BibT_eX]

[DOI]

Muhammad Mehedi Billah

,

,

Akihiro Nakamura

,

,

Masafumi Nishimura

,

,

Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

2018

Detecting breathing sounds in realistic Japanese telephone conversations and its application to automatic speech recognition.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Speech Commun., 2018

Dialogue Breakdown Detection Based on Nonlinguistic Acoustic Information.

[BibT_eX]

[DOI]

,

Takashi Tsunakawa

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the IEEE 7th Global Conference on Consumer Electronics, 2018

Bottleneck feature-mediated DNN-based feature mapping for throat microphone speech recognition.

[BibT_eX]

[DOI]

Takahito Suzuki

,

,

Takashi Tsunakawa

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Deep learning-based water-intake estimation method using second half of swallowing sound.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

,

,

Satoshi Kawasaki

,

,

Masaji Katagiri

Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

A Deep-Learning-Based Method of Estimating Water Intake.

[BibT_eX]

[DOI]

,

,

Satoshi Kawasaki

,

,

Masaji Katagiri

,

Masafumi Nishimura

,

Proceedings of the 41st IEEE Annual Computer Software and Applications Conference, 2017

DNN-based feature transformation for speech recognition using throat microphone.

[BibT_eX]

[DOI]

,

Takashi Tsunakawa

,

Masafumi Nishida

,

Masafumi Nishimura

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2015

Discriminative re-ranking for automatic speech recognition by leveraging invariant structures.

[BibT_eX]

[DOI]

Masayuki Suzuki

,

,

Masafumi Nishimura

,

Nobuaki Minematsu

Speech Commun., 2015

A metric for evaluating speech recognizer output based on human-perception model.

[BibT_eX]

[DOI]

,

,

Ryuki Tachibana

,

Masafumi Nishimura

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Regularized feature-space discriminative adaptation for robust ASR.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

,

Steven J. Rennie

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Leveraging phonetic context dependent invariant structure for continuous speech recognition.

[BibT_eX]

[DOI]

,

Masayuki Suzuki

,

,

Masafumi Nishimura

,

Nobuaki Minematsu

Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

2013

Channel-mapping for speech corpus recycling.

[BibT_eX]

[DOI]

,

Steven J. Rennie

,

,

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Acoustically discriminative language model training with pseudo-hypothesis.

[BibT_eX]

[DOI]

,

,

Bhuvana Ramabhadran

,

,

,

Masafumi Nishimura

Speech Commun., 2012

Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

,

,

Bhuvana Ramabhadran

Speech Commun., 2012

Discriminative Reranking for LVCSR Leveraging Invariant Structure.

[BibT_eX]

[DOI]

Masayuki Suzuki

,

,

Masafumi Nishimura

,

Nobuaki Minematsu

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Model-based noise reduction leveraging frequency-wise confidence metric for in-car speech recognition.

[BibT_eX]

[DOI]

,

Steven J. Rennie

,

,

Masafumi Nishimura

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity.

[BibT_eX]

[DOI]

Ryoichi Takashima

,

,

Ryuki Tachibana

,

Masafumi Nishimura

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Continuous Digits Recognition Leveraging Invariant Structure.

[BibT_eX]

[DOI]

Masayuki Suzuki

,

,

Masafumi Nishimura

,

Nobuaki Minematsu

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Acoustic Model Training with Detecting Transcription Errors in the Training Data.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Breath-Detection-Based Telephony Speech Phrasing.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Combining Feature Space Discriminative Training with Long-Term Spectro-Temporal Features for Noise-Robust Speech Recognition.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

,

,

Bhuvana Ramabhadran

Proceedings of the IEEE International Conference on Acoustics, 2011

Training of error-corrective model for ASR without using audio data.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Dynamic Features in the Linear-Logarithmic Hybrid Domain for Automatic Speech Recognition in a Reverberant Environment.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

IEEE J. Sel. Top. Signal Process., 2010

Long-Term Spectro-Temporal and Static Harmonic Features for Voice Activity Detection.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

IEEE J. Sel. Top. Signal Process., 2010

DOA Estimation with Local-Peak-Weighted CSP.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

EURASIP J. Adv. Signal Process., 2010

Speech synthesis by modeling harmonics structure with multiple function.

[BibT_eX]

[DOI]

,

Ryuki Tachibana

,

Masafumi Nishimura

,

Tetsuya Takiguchi

,

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Improved voice activity detection using static harmonic features.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Japanese pitch conversion for voice morphing based on differential modeling.

[BibT_eX]

[DOI]

Ryuki Tachibana

,

,

Masafumi Nishimura

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Dynamic features in the linear domain for robust automatic speech recognition in a reverberant environment.

[BibT_eX]

[DOI]

,

,

Ryuki Tachibana

,

Masafumi Nishimura

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Acoustically discriminative training for language models.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Local Peak Enhancement for In-Car Speech Recognition in Noisy Environment.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

IEICE Trans. Inf. Syst., 2008

Short- and long-term dynamic features for robust speech recognition.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Improving phoneme and accent estimation by leveraging a dictionary for a stochastic TTS front-end.

[BibT_eX]

[DOI]

,

Ryuki Tachibana

,

,

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 2008

Local peak enhancement combined with noise reduction algorithms for robust automatic speech recognition in automobiles.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Automatic Prosody Labeling Using Multiple Models for Japanese.

[BibT_eX]

[DOI]

Ryuki Tachibana

,

,

,

Masafumi Nishimura

,

Noboru Babaguchi

IEICE Trans. Inf. Syst., 2007

Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone.

[BibT_eX]

[DOI]

Ryuki Tachibana

,

,

,

Masafumi Nishimura

,

Noboru Babaguchi

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Determining Recording Location Based on Synchronization Positions of Audiowatermarking.

[BibT_eX]

[DOI]

,

Ryuki Tachibana

,

Masafumi Nishimura

,

Noboru Babaguchi

Proceedings of the IEEE International Conference on Acoustics, 2007

Unsupervised Lexicon Acquisition from Speech and Text.

[BibT_eX]

[DOI]

,

,

,

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Acoustic Model Adaptation Using First-Order Linear Prediction for Reverberant Speech.

[BibT_eX]

[DOI]

Tetsuya Takiguchi

,

Masafumi Nishimura

,

IEICE Trans. Inf. Syst., 2006

Estimation of recording location using audio watermarking.

[BibT_eX]

[DOI]

,

Ryuki Tachibana

,

Masafumi Nishimura

,

Noboru Babaguchi

Proceedings of the 8th workshop on Multimedia & Security, 2006

Unsupervised Adaptation of a Stochastic Language Model Using a Japanese Raw Corpus.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Simultaneous Adaptation of Echo Cancellation and Spectral Subtraction for In-Car Speech Recognition.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005

A stochastic approach to phoneme and accent estimation.

[BibT_eX]

[DOI]

,

,

Masafumi Nishimura

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Improved HMM Separation for Distant-Talking Speech Recognition.

[BibT_eX]

[DOI]

Tetsuya Takiguchi

,

Masafumi Nishimura

IEICE Trans. Inf. Syst., 2004

Sound Source Localization Using a Profile Fitting Method with Sound Reflectors.

[BibT_eX]

[DOI]

,

Tetsuya Takiguchi

,

Masafumi Nishimura

IEICE Trans. Inf. Syst., 2004

Acoustic model adaptation using first order prediction for reverberant speech.

[BibT_eX]

[DOI]

Tetsuya Takiguchi

,

Masafumi Nishimura

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Language model adaptation using word clustering.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2001

Improvement of a structured language model: arbori-context tree.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

A method for style adaptation to spontaneous speech by using a semi-linear interpolation technique.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A Stochastic Parser Based on a Structural Word Prediction Model.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

,

,

Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

1998

Word clustering for a word bi-gram model.

[BibT_eX]

[DOI]

,

Masafumi Nishimura

,

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1991

Speaker adaptation method for fenonic markov model-based speech recognition.

[BibT_eX]

[DOI]

Masafumi Nishimura

Syst. Comput. Jpn., 1991

1989

HMM-based speech recognition using dynamic spectral feature.

[BibT_eX]

[DOI]

Masafumi Nishimura

Proceedings of the IEEE International Conference on Acoustics, 1989

1988

Speaker adaptation method for HMM-based speech recognition.

[BibT_eX]

[DOI]

Masafumi Nishimura

,

Kazuhide Sugawara

Proceedings of the IEEE International Conference on Acoustics, 1988

1987

HMM-Based speech recognition using multi-dimensional multi-labeling.

[BibT_eX]

[DOI]

Masafumi Nishimura

,

Koichi Toshioka

Proceedings of the IEEE International Conference on Acoustics, 1987

1986

Speaker adaptation for a hidden Markov model.

[BibT_eX]

[DOI]

Kazuhide Sugawara

,

Masafumi Nishimura

,

Proceedings of the IEEE International Conference on Acoustics, 1986

1985

Isolated word recognition using hidden Markov models.

[BibT_eX]

[DOI]

Kazuhide Sugawara

,

Masafumi Nishimura

,

Koichi Toshioka

,

,

Toyohisa Kaneko

Proceedings of the IEEE International Conference on Acoustics, 1985

1984

A method for recognizing Japanese monosyllables by using intermediate cumulative distance.

[BibT_eX]

[DOI]

Yasuhiro Matsuda

,

,

Mitsuhiko Kanoh

,

Masafumi Nishimura

,

Toyohisa Kaneko

Proceedings of the IEEE International Conference on Acoustics, 1984

Loading...