Nicolas Obin

ORCID: 0000-0002-5236-5306

According to our database, Nicolas Obin authored at least 58 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis.
CoRR, 2024

2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?
CoRR, 2024

Investigating the impact of 2D gesture representation on co-speech gesture generation.
CoRR, 2024

Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis.
CoRR, 2024

BWSNET: Automatic Perceptual Assessment of Audio Signals.
Proceedings of the IEEE International Conference on Acoustics, 2024

Auditory Cortex-Inspired Spectral Attention Modulation for Binaural Sound Localization in HRTF Mismatch.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Zero-shot style transfer for gesture animation driven by text and speech using adversarial disentanglement of multimodal style encoding.
Frontiers Artif. Intell., February, 2023

Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations.
Entropy, February, 2023

META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation.
CoRR, 2023

TranSTYLer: Multimodal Behavioral Style Transfer for Facial and Body Gestures Generation.
CoRR, 2023

ZS-MSTM: Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding.
CoRR, 2023

Binaural Sound Localization in Noisy Environments Using Frequency-Based Audio Vision Transformer (FAViT).
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

I-Brow: Hierarchical and Multimodal Transformer Model for Eyebrows Animation Synthesis.
Proceedings of the Artificial Intelligence in HCI, 2023

Zero-Shot Style Transfer for Multimodal Data-Driven Gesture Synthesis.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023

From signal representation to representation learning: structured modeling of speech signals. (De la représentation du signal à l'apprentissage de représentation : modélisation structurée de signaux de parole).
2023

2022
Rookognise: Acoustic detection and identification of individual rooks in field recordings using multi-task neural networks.
Ecol. Informatics, 2022

Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding.
CoRR, 2022

Production Strategies of Vocal Attitudes.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Transformer Network for Semantically-Aware and Speech-Driven Upper-Face Generation.
Proceedings of the 30th European Signal Processing Conference, 2022

Voice Reenactment with F0 and timing constraints and adversarial learning of conversions.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Sequence-To-Sequence Voice Conversion using F0 and Time Conditioning and Adversarial Learning.
CoRR, 2021

Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations.
CoRR, 2021

Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels.
CoRR, 2021

Speaker Attentive Speech Emotion Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
La voix actée : pratiques, enjeux, applications (Acted voice: practices, challenges, applications).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

CycleGAN Voice Conversion of Spectral Envelopes using Adversarial Weights.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
SoftGAN: Learning generative models efficiently with application to CycleGAN Voice Conversion.
CoRR, 2019

Sequence-to-sequence Modelling of F0 for Speech Emotion Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Binaural Localization of Multiple Sound Sources by Non-Negative Tensor Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

2016
Similarity Search of Acted Voices for Automatic Voice Casting.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

A source/filter model with adaptive constraints for NMF-based speech separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Symbolic Modeling of Prosody: From Linguistics to Statistics.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Real-time audio-to-score alignment of singing voice based on melody and lyric information.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The role of glottal source parameters for high-quality transformation of perceptual age.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Phase distortion statistics as a representation of the glottal source: application to the classification of voice qualities.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

On automatic voice casting for expressive speech: Speaker recognition vs. speech classification.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Syll-O-Matic: An adaptive time-frequency representation for the automatic segmentation of speech into syllables.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A la recherche des temps perdus : Variations sur le rythme en français (Regional Variations of Speech Rhythm in French: In Search of Lost Times) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

La variation prosodique dialectale en français. Données et hypothèses (Speech Prosody of Dialectal French: Data and Hypotheses) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

On the generalization of Shannon entropy for speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Cries and Whispers - Classification of Vocal Effort in Expressive Speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Towards Glottal Source Controllability in Expressive Speech Synthesis.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Accentual Transfer from Swiss-German to French. A Study of "Français Fédéral".
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
MeLos: Analysis and Modelling of Speech Prosody and Speaking Style.
PhD thesis, 2011

Stylization and Trajectory Modelling of Short and Long Term Speech Prosody Variations.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Discrete/Continuous Modelling of Speaking Style in HMM-Based Speech Synthesis: Design and Evaluation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Reformulating Prosodic Break Model into Segmental HMMs and Information Fusion.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Toward a Continuous Modeling of French Prosodic Structure: Using Acoustic Features to Predict Prominence Location and Prominence Degree.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Typological Variations in the Realization of the French Accentual Phrase.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

2010
HMM-based prosodic structure model using rich linguistic context.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Expectations for discourse genre identification: a prosodic study.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Design and Evaluation of Shared Prosodic Annotation for Spontaneous French Speech: From Expert Knowledge to Non-Expert Annotation.
Proceedings of the Fourth Linguistic Annotation Workshop, 2010

2009
A multi-level context-dependent prosodic model applied to durational modeling.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
IRCAM Corpus Tools: Managing speech corpora.
Trait. Autom. des Langues, 2008

A method for automatic and dynamic estimation of discourse genre typology with prosodic features.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

French prominence: A probabilistic framework.
Proceedings of the IEEE International Conference on Acoustics, 2008

