Nicolas Obin
Orcid: 0000-0002-5236-5306
According to our database1,
Nicolas Obin
authored at least 58 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis.
CoRR, 2024
2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?
CoRR, 2024
Investigating the impact of 2D gesture representation on co-speech gesture generation.
CoRR, 2024
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Auditory Cortex-Inspired Spectral Attention Modulation for Binaural Sound Localization in HRTF Mismatch.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Zero-shot style transfer for gesture animation driven by text and speech using adversarial disentanglement of multimodal style encoding.
Frontiers Artif. Intell., February, 2023
Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations.
Entropy, February, 2023
META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation.
CoRR, 2023
TranSTYLer: Multimodal Behavioral Style Transfer for Facial and Body Gestures Generation.
CoRR, 2023
ZS-MSTM: Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding.
CoRR, 2023
Binaural Sound Localization in Noisy Environments Using Frequency-Based Audio Vision Transformer (FAViT).
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
I-Brow: Hierarchical and Multimodal Transformer Model for Eyebrows Animation Synthesis.
Proceedings of the Artificial Intelligence in HCI, 2023
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023
From signal representation to representation learning: structured modeling of speech signals. (De la représentation du signal à l'apprentissage de représentation : modélisation structurée de signaux de parole).
, 2023
2022
Rookognise: Acoustic detection and identification of individual rooks in field recordings using multi-task neural networks.
Ecol. Informatics, 2022
Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding.
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 30th European Signal Processing Conference, 2022
Voice Reenactment with F0 and timing constraints and adversarial learning of conversions.
Proceedings of the 30th European Signal Processing Conference, 2022
2021
Sequence-To-Sequence Voice Conversion using F0 and Time Conditioning and Adversarial Learning.
CoRR, 2021
Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations.
CoRR, 2021
Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels.
CoRR, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels.
Proceedings of the 29th European Signal Processing Conference, 2021
2020
La voix actée : pratiques, enjeux, applications (Acted voice : practices, challenges, applications).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020
Proceedings of the 28th European Signal Processing Conference, 2020
2019
SoftGAN: Learning generative models efficiently with application to CycleGAN Voice Conversion.
CoRR, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Binaural Localization of Multiple Sound Sources by Non-Negative Tensor Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Real-time audio-to-score alignment of singing voice based on melody and lyric information.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
The role of glottal source parameters for high-quality transformation of perceptual age.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Phase distortion statistics as a representation of the glottal source: application to the classification of voice qualities.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
On automatic voice casting for expressive speech: Speaker recognition vs. speech classification.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Syll-O-Matic: An adaptive time-frequency representation for the automatic segmentation of speech into syllables.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
A la recherche des temps perdus : Variations sur le rythme en français (Regional Variations of Speech Rhythm in French: In Search of Lost Times) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
La variation prosodique dialectale en français. Données et hypothèses (Speech Prosody of Dialectal French: Data and Hypotheses) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Stylization and Trajectory Modelling of Short and Long Term Speech Prosody Variations.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Discrete/Continuous Modelling of Speaking Style in HMM-Based Speech Synthesis: Design and Evaluation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Toward a Continuous Modeling of French Prosodic Structure: Using Acoustic Features to Predict Prominence Location and Prominence Degree.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 17th International Congress of Phonetic Sciences, 2011
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Design and Evaluation of Shared Prosodic Annotation for Spontaneous French Speech: From Expert Knowledge to Non-Expert Annotation.
Proceedings of the Fourth Linguistic Annotation Workshop, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
A method for automatic and dynamic estimation of discourse genre typology with prosodic features.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008