Kentaro Mitsui

Orcid: 0000-0002-3551-9932

According to our database1, Kentaro Mitsui authored at least 12 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Release of Pre-Trained Models for the Japanese Language.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
CoRR, 2023

Towards human-like spoken dialogue generation between AI agents from written dialogue.
CoRR, 2023

Text-Guided Scene Sketch-to-Photo Synthesis.
CoRR, 2023

UniFLG: Unified Facial Landmark Generator from Text or Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

MSR-NV: Neural Vocoder Using Multiple Sampling Rates.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Deep Gaussian process based multi-speaker speech synthesis with latent speaker representation.
Speech Commun., 2021

2020
Multi-Speaker Text-to-Speech Synthesis Using Deep Gaussian Processes.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
JVS corpus: free Japanese multi-speaker voice corpus.
CoRR, 2019


  Loading...