Kentaro Tachibana

According to our database, Kentaro Tachibana authored at least 26 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control.
CoRR, 2024

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning.
CoRR, 2024

Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment.
CoRR, 2024

SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark.
CoRR, 2024

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs.
Proceedings of the IEEE International Conference on Acoustics, 2023

Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Text-to-Speech Synthesis.
Proceedings of the Speech-to-Speech Translation, 2020

Joint Adversarial Training of Speech Recognition and Synthesis Models for Many-to-One Voice Conversion Using Phonetic Posteriorgrams.
IEICE Trans. Inf. Syst., 2020

Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018
An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Full-Body High-Resolution Anime Generation with Progressive Structure-Conditional Generative Adversarial Networks.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

2017
Subband wavenet with overlapped single-sideband filterbanks.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2009
Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system.
Proceedings of the IEEE International Conference on Acoustics, 2009

2007
Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA.
Proceedings of the IEEE International Conference on Acoustics, 2007

