Kentaro Mitsui

Orcid: 0000-0002-3551-9932

According to our database¹, Kentaro Mitsui authored at least 12 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

2019

2020

2021

2022

2023

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Release of Pre-Trained Models for the Japanese Language.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

Towards human-like spoken dialogue generation between AI agents from written dialogue.

[BibT_eX]

[DOI]

Kentaro Mitsui

Yukiya Hono

Kei Sawada

CoRR, 2023

Text-Guided Scene Sketch-to-Photo Synthesis.

[BibT_eX]

[DOI]

AprilPyone MaungMaung

CoRR, 2023

UniFLG: Unified Facial Landmark Generator from Text or Speech.

[BibT_eX]

[DOI]

Kentaro Mitsui

Yukiya Hono

Kei Sawada

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

MSR-NV: Neural Vocoder Using Multiple Sampling Rates.

[BibT_eX]

[DOI]

Kentaro Mitsui

Kei Sawada

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Deep Gaussian process based multi-speaker speech synthesis with latent speaker representation.

[BibT_eX]

[DOI]

Kentaro Mitsui

Tomoki Koriyama

Hiroshi Saruwatari

Speech Commun., 2021

2020

Multi-Speaker Text-to-Speech Synthesis Using Deep Gaussian Processes.

[BibT_eX]

[DOI]

Kentaro Mitsui

Tomoki Koriyama

Hiroshi Saruwatari

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

JVS corpus: free Japanese multi-speaker voice corpus.

[BibT_eX]

[DOI]

CoRR, 2019

Kentaro Mitsui

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...