Patrick Lumban Tobing

Orcid: 0000-0003-2792-8418

According to our database1, Patrick Lumban Tobing authored at least 36 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Mapache: Masked Parallel Transformer for Advanced Speech Editing and Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Cross-lingual Prosody Transfer for Expressive Machine Dubbing.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Expressive Machine Dubbing Through Phrase-level Cross-lingual Prosody Transfer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System.
CoRR, 2022

Direct Noisy Speech Modeling for Noisy-To-Noisy Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Low-latency real-time non-parallel voice conversion based on cyclic variational autoencoder and multiband WaveRNN with data-driven linear prediction.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

High-Fidelity and Low-Latency Universal Neural Vocoder Based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2021

Noisy-to-Noisy Voice Conversion Framework with Denoising Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression.
IEEE Access, 2020

A Cyclical Post-Filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-Speech Systems.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Cyclic Spectral Modeling for Unsupervised Unit Discovery into Voice Conversion with Excitation and Waveform Modeling.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Efficient Shallow Wavenet Vocoder Using Multiple Samples Output Based on Laplacian Distribution and Linear Prediction.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals.
Proceedings of the 28th European Signal Processing Conference, 2020

Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Cross-Lingual Voice Conversion using a Cyclic Variational Auto-encoder and a WaveNet Vocoder.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder.
IEEE Access, 2019

Statistical Voice Conversion with Quasi-periodic WaveNet Vocoder.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Non-Parallel Voice Conversion with Cyclic Variational Autoencoder.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion.
Proceedings of the 27th European Signal Processing Conference, 2019

Investigation of Shallow Wavenet Vocoder with Laplacian Distribution Output.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
An Evaluation of Deep Spectral Mappings and WaveNet Vocoder for Voice Conversion.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

NU Voice Conversion System for the Voice Conversion Challenge 2018.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Collapsed Speech Segment Detection and Suppression for WaveNet Vocoder.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Articulatory Controllable Speech Modification Based on Statistical Inversion and Production Mappings.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Acoustic-to-Articulatory Inversion Mapping Based on Latent Trajectory Gaussian Mixture Model.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014


  Loading...