Yukiya Hono

Orcid: 0000-0003-1245-8791

According to our database, Yukiya Hono authored at least 18 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Release of Pre-Trained Models for the Japanese Language.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
CoRR, 2023

Towards human-like spoken dialogue generation between AI agents from written dialogue.
CoRR, 2023

Singing voice synthesis based on frame-level sequence-to-sequence models considering vocal timing deviation.
CoRR, 2023

UniFLG: Unified Facial Landmark Generator from Text or Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System.
Proceedings of the IEEE International Conference on Acoustics, 2023

Singing Voice Synthesis Based on a Musical Note Position-Aware Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components.
IEEE Access, 2021

Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Singing Voice Synthesis Based on Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Singing Voice Conversion Using Posted Waveform Data on Music Social Media.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Recent Development of the DNN-based Singing Voice Synthesis System - Sinsy.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

