Lauri Juvela

Orcid: 0000-0002-2201-103X

According to our database1, Lauri Juvela authored at least 45 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
KLANN: Linearising Long-Term Dynamics in Nonlinear Audio Effects Using Koopman Networks.
IEEE Signal Process. Lett., 2024

HiFi-Glot: Neural Formant Synthesis with Differentiable Resonant Filters.
CoRR, 2024

Audio Codec Augmentation for Robust Collaborative Watermarking of Speech Synthesis.
CoRR, 2024

End-to-End Amp Modeling: From Data to Controllable Guitar Amplifier Models.
CoRR, 2024

Collaborative Watermarking for Adversarial Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input.
CoRR, 2023

Speaker-independent neural formant synthesis.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adversarial Guitar Amplifier Modelling with Unpaired Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

End-to-End Amp Modeling: from Data to Controllable Guitar Amplifier Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

2021
Exposure Bias and State Matching in Recurrent Neural Network Virtual Analog Models.
Proceedings of the 24th International Conference on Digital Audio Effects, 2021

2020
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech.
Comput. Speech Lang., 2020

Conditional Spoken Digit Generation with StyleGAN.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transferring Neural Speech Waveform Synthesizers to Musical Instrument Sounds Generation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
GlotNet - A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Normal-to-Lombard adaptation of speech synthesis using long short-term memory recurrent neural networks.
Speech Commun., 2019

The ASVspoof 2019 database.
CoRR, 2019

Vocal Effort Based Speaking Style Conversion Using Vocoder Features and Parallel Learning.
IEEE Access, 2019

Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Cycle-consistent Adversarial Networks for Non-parallel Vocal Effort Based Speaking Style Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2019

Waveform Generation for Text-to-speech Synthesis Using Pitch-synchronous Multi-scale Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Learning for Tube Amplifier Emulation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Data Augmentation Strategies for Neural Network F0 Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention.
CoRR, 2018

Speaker-independent Raw Waveform Model for Glottal Excitation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Comparison of Recent Waveform Generation and Acoustic Modeling Methods for Neural-Network-Based Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speech Waveform Synthesis from MFCC Sequences with Generative Adversarial Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Normal-to-shouted speech spectral mapping for speaker recognition under vocal effort mismatch.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Non-parallel voice conversion using i-vector PLDA: towards unifying speaker verification and transformation.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Phase perception of the glottal excitation and its relevance in statistical parametric speech synthesis.
Speech Commun., 2016

Comparing human and automatic speech recognition in a perceptual restoration experiment.
Comput. Speech Lang., 2016

Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

High-pitched excitation generation for glottal vocoding in statistical parametric speech synthesis using a deep neural network.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

The NII speech synthesis entry for Blizzard Challenge 2016.
Proceedings of the Blizzard Challenge 2016, Cuppertino, CA, USA, September 16, 2016, 2016

2015
Phase perception of the glottal excitation of vocoded speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014


  Loading...