Yi-Chiao Wu

Orcid: 0000-0003-4390-1354

According to our database1, Yi-Chiao Wu authored at least 66 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Contact-Free Atrial Fibrillation Screening With Attention Network.
IEEE J. Biomed. Health Informatics, September, 2024

Contactless Blood Pressure Measurement Via Remote Photoplethysmography With Synthetic Data Generation Using Generative Adversarial Networks.
IEEE J. Biomed. Health Informatics, February, 2024

Video-Based Contactless Detection of Sleep Apnea With Deep-Learning Model.
IEEE Trans. Instrum. Meas., 2024

Multi-Speaker Text-to-Speech Training With Speaker Anonymized Data.
IEEE Signal Process. Lett., 2024

Movie Gen: A Cast of Media Foundation Models.
CoRR, 2024

Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models.
CoRR, 2024

EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations.
CoRR, 2024

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation.
CoRR, 2024

ScoreDec: A Phase-Preserving High-Fidelity Audio Codec with a Generalized Score-Based Diffusion Post-Filter.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Motion Robust Remote Photoplethysmography Measurement During Exercise for Contactless Physical Activity Intensity Detection.
IEEE Trans. Instrum. Meas., 2023

Deep-Learning-Based Remote Photoplethysmography Measurement in Driving Scenarios With Color and Near-Infrared Images.
IEEE Trans. Instrum. Meas., 2023

High-Fidelity and Pitch-Controllable Neural Vocoder Based on Unified Source-Filter Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Recognizing, Fast and Slow: Complex Emotion Recognition With Facial Expression Detection and Remote Physiological Measurement.
IEEE Trans. Affect. Comput., 2023

Audiobox: Unified Audio Generation with Natural Language Prompts.
CoRR, 2023

Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2023

Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
A Compensation Network With Error Mapping for Robust Remote Photoplethysmography in Noise-Heavy Conditions.
IEEE Trans. Instrum. Meas., 2022

A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System.
CoRR, 2022

Soft Label With Channel Encoding for Dependent Facial Image Classification.
IEEE Access, 2022

Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Direct Noisy Speech Modeling for Noisy-To-Noisy Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2022

Contactless Blood Pressure Measurement via Remote Photoplethysmography with Synthetic Data Generation Using Generative Adversarial Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Incorporating Prior Knowledge on Speech Production Mechanism into Neural Speech Waveform Generation.
PhD thesis, 2021

Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Quasi-Periodic Parallel WaveGAN: A Non-Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Pretraining Techniques for Sequence-to-Sequence Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

The AS-NU System for the M2VoC Challenge.
CoRR, 2021

Unified Source-Filter GAN: Unified Source-Filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2021

Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2021

HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Noisy-to-Noisy Voice Conversion Framework with Denoising Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations.
CoRR, 2020

Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression.
IEEE Access, 2020

Masked Neural Sparse Encoder for Face Occlusion Detection.
Proceedings of the 2020 IEEE International Conference on Systems, Man, and Cybernetics, 2020

A Cyclical Post-Filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-Speech Systems.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Quasi-Periodic Parallel WaveGAN Vocoder: A Non-Autoregressive Pitch-Dependent Dilated Convolution Model for Parametric Speech Generation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Cyclic Spectral Modeling for Unsupervised Unit Discovery into Voice Conversion with Excitation and Waveform Modeling.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Efficient Shallow Wavenet Vocoder Using Multiple Samples Output Based on Laplacian Distribution and Linear Prediction.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
The ASVspoof 2019 database.
CoRR, 2019

Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder.
IEEE Access, 2019

Statistical Voice Conversion with Quasi-periodic WaveNet Vocoder.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Non-Parallel Voice Conversion with Cyclic Variational Autoencoder.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Locally Linear Embedding Based Post-Filtering for Speech Enhancement.
J. Inf. Sci. Eng., 2018

Voice Conversion Based on Locally Linear Embedding.
J. Inf. Sci. Eng., 2018

An Evaluation of Deep Spectral Mappings and WaveNet Vocoder for Voice Conversion.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

NU Voice Conversion System for the Voice Conversion Challenge 2018.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Collapsed Speech Segment Detection and Suppression for WaveNet Vocoder.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Exemplar-Based Spectral Detail Compensation for Voice Conversion.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A locally linear embbeding based postfiltering approach for speech enhancement.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Fast locally linear embedding algorithm for exemplar-based voice conversion.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Dictionary update for NMF-based voice conversion using an encoder-decoder network.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Locally Linear Embedding for Exemplar-Based Spectral Conversion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Voice conversion from non-parallel corpora using variational auto-encoder.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016


  Loading...