Rafael Valle

ORCID: 0009-0000-4262-5939

According to our database, Rafael Valle authored at least 39 papers between 1993 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Storage Standards and Solutions, Data Storage, Sharing, and Structuring in Digital Health: A Brazilian Case Study.
Inf., 2024

OMCAT: Omni Context Aware Transformer.
CoRR, 2024

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data.
CoRR, 2024

Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment.
CoRR, 2024

Improving Text-To-Audio Models with Synthetic Captions.
CoRR, 2024

Audio Dialogues: Dialogues dataset for audio and music understanding.
CoRR, 2024

ExpressiveSinger: Multilingual and Multi-Style Score-based Singing Voice Synthesis with Expressive Performance Control.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SelfVC: Voice Conversion With Iterative Refinement using Self Transformations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scaling Nvidia's Multi-Speaker Multi-Lingual TTS Systems With Zero-Shot TTS to Indic Languages.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Multilingual Multiaccented Multispeaker TTS with RADTTS.
CoRR, 2023

P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SPACE: Speech-driven Portrait Animation with Controllable Expression.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

High-Acoustic Fidelity Text To Speech Synthesis With Fine-Grained Control Of Speech Attributes.
Proceedings of the IEEE International Conference on Acoustics, 2023

Any-to-Any Voice Conversion with F0 and Timbre Disentanglement and Novel Timbre Conditioning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Vani: Very-Lightweight Accent-Controllable TTS for Native And Non-Native Speakers With Identity Preservation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
SPACEx: Speech-driven Portrait Animation with Controllable Expression.
CoRR, 2022

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows.
CoRR, 2022

One TTS Alignment to Rule Them All.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Character-based handwritten text transcription with attention networks.
Neural Comput. Appl., 2021

Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Neural ODEs for Image Segmentation with Level Sets.
CoRR, 2019

Waveglow: A Flow-based Generative Network for Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Visual Display and Retrieval of Music Information.
CoRR, 2018

TequilaGAN: How to easily identify GAN samples.
CoRR, 2018

Attacking Speaker Recognition With Deep Generative Models.
CoRR, 2018

Missing Data Imputation for Supervised Learning.
Appl. Artif. Intell., 2018

2017
Attention networks for image-to-text.
CoRR, 2017

2016
Specification Mining for Machine Improvisation with Formal Specifications.
Comput. Entertain., 2016

Learning and Visualizing Music Specifications Using Pattern Graphs.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016

Control Improvisation with Probabilistic Temporal Specifications.
Proceedings of the First IEEE International Conference on Internet-of-Things Design and Implementation, 2016

ABROA: Audio-Based Room-Occupancy Analysis Using Gaussian Mixtures and Hidden Markov Models.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2015
Symbolic Music Similarity Using Neuronal Periodicity and Dynamic Programming.
Proceedings of the Mathematics and Computation in Music - 5th International Conference, 2015

2014
Machine Improvisation with Formal Specifications.
Proceedings of the Music Technology meets Philosophy, 2014

2013
Gradual control of harmonicity in the Context of frequency modulation.
Proceedings of the 39th International Computer Music Conference, 2013

Towards a Dynamic, Inclusive and Equalitarian Augmented Activity Space.
Proceedings of the 39th International Computer Music Conference, 2013

1993
Multi-protocol communications controller.
Microprocess. Microprogramming, 1993

