Naohiro Tawara

According to our database1, Naohiro Tawara authored at least 40 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Guided Speaker Embedding.
CoRR, 2024

Mamba-based Segmentation Model for Speaker Diarization.
CoRR, 2024

SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model.
CoRR, 2024

Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over.
CoRR, 2024

Recursive Attentive Pooling For Extracting Speaker Embeddings From Multi-Speaker Recordings.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Interaural Time Difference Loss for Binaural Target Sound Extraction.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

NTT Speaker Diarization System for Chime-7: Multi-Domain, Multi-Microphone end-to-end and Vector Clustering Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Discriminative Training of VBx Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Influence of Personal Traits on Impressions of One's Own Voice.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

What are differences? Comparing DNN and Human by Their Performance and Characteristics in Speaker Age Estimation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Iterative Shallow Fusion of Backward Language Model for End-To-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Voice or Content? - Exploring Impact of Speech Content on Age Estimation from Voice.
Proceedings of the 31st European Signal Processing Conference, 2023

Coarse-Age Loss: A New Training Method Using Coarse-Age Labeled Data for Speaker Age Estimation.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Multi-Source Domain Generalization Using Domain Attributes for Recurrent Neural Network Language Models.
IEICE Trans. Inf. Syst., 2022

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2021

BLSTM-Based Confidence Estimation for End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Integrating End-to-End Neural and Clustering-Based Diarization: Getting the Best of Both Worlds.
Proceedings of the IEEE International Conference on Acoustics, 2021

Robust Speech-Age Estimation Using Local Maximum Mean Discrepancy Under Mismatched Recording Conditions.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Language Model Data Augmentation Based on Text Domain Transfer.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Frame-Level Phoneme-Invariant Speaker Embedding for Text-Independent Speaker Recognition on Extremely Short Utterances.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Speaker-Attribute Estimation by Voting Based on Speaker Cluster Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Noise-robust Attention Learning for End-to-End Speech Recognition.
Proceedings of the 28th European Signal Processing Conference, 2020

Speaker Age Estimation Using Age-Dependent Insensitive Loss.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Postfiltering Using an Adversarial Denoising Autoencoder with Noise-aware Training.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Sequential Fish Catch Forecasting Using Bayesian State Space Models.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Speaker Invariant Feature Extraction for Zero-Resource Languages with Adversarial Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Language Model Domain Adaptation Via Recurrent Neural Networks with Domain-Shared and Domain-Specific Representations.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Adversarial autoencoder for reducing nonlinear distortion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2015
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013
Blocked Gibbs sampling based multi-scale mixture model for speaker clustering on noisy data.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

2012
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011


  Loading...