Kentaro Tachibana

According to our database¹, Kentaro Tachibana authored at least 26 papers between 2007 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control.

[BibT_eX]

[DOI]

CoRR, 2024

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning.

[BibT_eX]

[DOI]

CoRR, 2024

Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment.

[BibT_eX]

[DOI]

CoRR, 2024

SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs.

[BibT_eX]

[DOI]

Reo Yoneyama

Ryuichi Yamamoto

Kentaro Tachibana

Proceedings of the IEEE International Conference on Acoustics, 2023

Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning.

[BibT_eX]

[DOI]

Takaaki Saeki

Kentaro Tachibana

Ryuichi Yamamoto

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech.

[BibT_eX]

[DOI]

Byeongseon Park

Ryuichi Yamamoto

Kentaro Tachibana

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Speech-to-Speech Translation, 2020

Joint Adversarial Training of Speech Recognition and Synthesis Models for Many-to-One Voice Conversion Using Phonetic Posteriorgrams.

[BibT_eX]

[DOI]

Yuki Saito

Kei Akuzawa

Kentaro Tachibana

IEICE Trans. Inf. Syst., 2020

Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018

An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Full-Body High-Resolution Anime Generation with Progressive Structure-Conditional Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

2017

Subband wavenet with overlapped single-sideband filterbanks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2009

Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2007

Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Kentaro Tachibana

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...