Alexis Moinet

According to our database¹, Alexis Moinet authored at least 43 papers between 2007 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data.

[BibT_eX]

[DOI]

Álvaro Martín-Cortinas

Soledad López Gambino

Kayeon Yoo

Elena Sokolova

Thomas Drugman

CoRR, 2024

2023

A Comparative Analysis of Pretrained Language Models for Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Controllable Emphasis with zero data for text-to-speech.

[BibT_eX]

[DOI]

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.

[BibT_eX]

[DOI]

CoRR, 2022

Expressive, Variable, and Controllable Duration Modelling in TTS.

[BibT_eX]

[DOI]

CoRR, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.

[BibT_eX]

[DOI]

CoRR, 2022

Cross-lingual Style Transfer with Conditional Prior VAE and Style Loss.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Expressive, Variable, and Controllable Duration Modelling in TTS.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Distribution Augmentation for Low-Resource Expressive Text-To-Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

A Learned Conditional Prior for the VAE Acoustic Space of a TTS System.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Camp: A Two-Stage Approach to Modelling Prosody in Context.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Voice Conversion for Whispered Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2020

Parallel WaveNet conditioned on VAE latent vectors.

[BibT_eX]

[DOI]

Roberto Barra-Chicote

CoRR, 2020

CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech.

[BibT_eX]

[DOI]

Daniel Sáez-Trigueros

Thomas Drugman

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Singing Synthesis: With a Little Help from my Attention.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Towards Achieving Robust Universal Neural Vocoding.

[BibT_eX]

[DOI]

Roberto Barra-Chicote

Alexis Moinet

Vatsal Aggarwal

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Traditional Machine Learning for Pitch Detection.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2018

Comprehensive Evaluation of Statistical Speech Waveform Synthesis.

[BibT_eX]

[DOI]

Roberto Barra-Chicote

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Parameter Generation Algorithms for Text-To-Speech Synthesis with Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

2017

Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information.

[BibT_eX]

[DOI]

Roberto Barra-Chicote

Thomas Merritt

Thomas Drugman

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

A Semantic and Content-Based Search User Interface for Browsing Large Collections of Foley Sounds.

[BibT_eX]

[DOI]

Proceedings of the Audio Mostly 2016, Norrköping, Sweden, October 4-6, 2016, 2016

2015

An HMM approach for synthesizing amused speech with a controllable intensity of smile.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

2013

Mage - HMM-based speech synthesis reactively controlled by the articulators.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Mage - reactive articulatory feature control of HMM-based parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

MAGE 2.0: New Features and its Application in the Development of a Talking Guitar.

[BibT_eX]

[DOI]

Proceedings of the 13th International Conference on New Interfaces for Musical Expression, 2013

VideoCycle: User-Friendly Navigation by Similarity in Video Databases.

[BibT_eX]

[DOI]

Christian Frisson

Stéphane Dupont

Alexis Moinet

Cécile Picard-Limpens

Thierry Ravet

Xavier Siebert

Thierry Dutoit

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Reactive Statistical Mapping: Towards the Sketching of Performative Control with Data.

[BibT_eX]

[DOI]

Emine Sümeyye Kalayci

Qiong Hu

Proceedings of the Innovative and Creative Developments in Multimodal Interaction Systems, 2013

2012

Stylistic gait synthesis based on hidden Markov models.

[BibT_eX]

[DOI]

Joëlle Tilmanne

Alexis Moinet

Thierry Dutoit

EURASIP J. Adv. Signal Process., 2012

LoopJam: turning the dance floor into a collaborative instrumental map.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012

2010

AVLaughterCycle.

[BibT_eX]

[DOI]

Jérôme Urbain

Radoslaw Niewiadomski

J. Multimodal User Interfaces, 2010

The AVLaughterCycle Database.

[BibT_eX]

[DOI]

Radoslaw Niewiadomski

Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009

Cross-language voice conversion based on eigenvoices.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

RAMCESS 2.X framework - expressive voice analysis for realtime and accurate synthesis of singing.

[BibT_eX]

[DOI]

J. Multimodal User Interfaces, 2008

Glottal Source Estimation Robustness - A Comparison of Sensitivity of Voice Source Estimation Techniques.

[BibT_eX]

Proceedings of the SIGMAP 2008, 2008

Voice source parameters estimation by fitting the glottal formant and the inverse filtering open phase.

[BibT_eX]

[DOI]

Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007

Causal/anticausal Decomposition for mixed-phase Description of brass and Bowed String sounds.

[BibT_eX]

[DOI]

Proceedings of the 2007 International Computer Music Conference, 2007

Towards a Voice Conversion System Based on Frame Selection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Alexis Moinet

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...