Xavier Alameda-Pineda
Orcid: 0000-0002-5354-1084
According to our database1,
Xavier Alameda-Pineda
authored at least 125 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Unsupervised performance analysis of 3D face alignment with a statistically robust confidence test.
Neurocomputing, January, 2024
Robust Audio-Visual Contrastive Learning for Proposal-Based Self-Supervised Sound Source Localization in Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2024
A multimodal dynamical variational autoencoder for audiovisual speech representation learning.
Neural Networks, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
Appl. Intell., November, 2023
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing.
Int. J. Comput. Vis., May, 2023
Learning and controlling the source-filter representation of speech with a variational autoencoder.
Speech Commun., March, 2023
IEEE Trans. Multim., 2023
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation.
Trans. Mach. Learn. Res., 2023
IEEE Trans. Affect. Comput., 2023
IEEE Trans. Pattern Anal. Mach. Intell., 2023
Univariate Radial Basis Function Layers: Brain-inspired Deep Neural Layers for Low-Dimensional Inputs.
CoRR, 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation.
CoRR, 2023
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games, 2023
Unsupervised speech enhancement with deep dynamical generative speech and noise models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
IEEE Trans. Neural Networks Learn. Syst., 2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Probabilistic Graph Attention Network With Conditional Kernels for Pixel-Wise Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
CoRR, 2022
CoRR, 2022
CoRR, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
IEEE Trans. Signal Process., 2021
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Found. Trends Mach. Learn., 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021
A Benchmark of Dynamical Variational Autoencoders Applied to Speech Spectrogram Modeling.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Switching Variational Auto-Encoders for Noise-Agnostic Audio-Visual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Learning How to Smile: Expression Video Generation With Conditional Adversarial Recurrent Nets.
IEEE Trans. Multim., 2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
IEEE Trans. Pattern Anal. Mach. Intell., 2020
Int. J. Comput. Vis., 2020
GMM-UNIT: Unsupervised Multi-Domain and Multi-Modal Image-to-Image Translation via Attribute Gaussian Mixture Modeling.
CoRR, 2020
Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
FATE/MM 20: 2nd International Workshop on Fairness, Accountability, Transparency and Ethics in MultiMedia.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
ODANet: Online Deep Appearance Network for Identity-Consistent Multi-person Tracking.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020
Robust Unsupervised Audio-Visual Speech Enhancement Using a Mixture of Variational Autoencoders.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
ACM Trans. Multim. Comput. Commun. Appl., 2019
Special Section on Multimodal Understanding of Social, Affective, and Subjective Attributes.
ACM Trans. Multim. Comput. Commun. Appl., 2019
IEEE Signal Process. Lett., 2019
Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments.
IEEE J. Sel. Top. Signal Process., 2019
CoRR, 2019
CoRR, 2019
FAT/MM'19: 1st International Workshop on Fairness, Accountability, and Transparency in MultiMedia.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019
2018
Cross-Paced Representation Learning With Partial Curricula for Sketch-Based Image Retrieval.
IEEE Trans. Image Process., 2018
EE-USAD: ACM MM 2018Workshop on UnderstandingSubjective Attributes of Data focus on Evoked Emotions.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Proceedings of the Frontiers of Multimedia Research, 2018
2017
Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract.
Speech Commun., 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
MUSA2: First ACM Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes.
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017
An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework.
Proceedings of the Latent Variable Analysis and Signal Separation, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
SALSA: A Multimodal Dataset for the Automated Analysis of Free-Standing Social Interactions.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017
2016
A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2016
IEEE Trans. Pattern Anal. Mach. Intell., 2016
An on-line variational Bayesian model for multi-person tracking from cluttered scenes.
Comput. Vis. Image Underst., 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
An inverse-gamma source variance prior with factorized parameterization for audio source separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016
Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the British Machine Vision Conference 2016, 2016
2015
Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014
2013
Egocentric Audio-Visual Scene Analysis. A Machine Learning and Signal Processing Approach. (Analyse Égocentrique de Scènes Audio-Visuelles. Une approche par Apprentissage Automatique et Traitement du Signal).
PhD thesis, 2013
J. Multimodal User Interfaces, 2013
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the International Conference on Multimodal Interaction, 2012
Proceedings of the 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), Osaka, Japan, November 29, 2012
Proceedings of the 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), Osaka, Japan, November 29, 2012
Geometrically-constrained robust time delay estimation using non-coplanar microphone arrays.
Proceedings of the 20th European Signal Processing Conference, 2012
2011
Proceedings of the 13th International Conference on Multimodal Interfaces, 2011
2008
Proceedings of the International Conference on Image Processing, 2008