Giampiero Salvi
Orcid: 0000-0002-3323-5311
According to our database1,
Giampiero Salvi
authored at least 71 papers
between 1999 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams.
EURASIP J. Audio Speech Music. Process., December, 2024
Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper.
Proceedings of the 34th IEEE International Workshop on Machine Learning for Signal Processing, 2024
Collecting Linguistic Resources for Assessing Children's Pronunciation of Nordic Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity.
Neurocomputing, June, 2023
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction.
CoRR, 2023
IEEE Access, 2023
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Dataset, August, 2022
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
2021
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2021
STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially Aware Language Acquisition.
IEEE Trans. Cogn. Dev. Syst., 2020
Beyond the Self: Using Grounded Affordances to Interpret and Describe Others' Actions.
IEEE Trans. Cogn. Dev. Syst., 2020
STEP-GAN: A Step-by-Step Training for Multi Generator GANs with application to Cyber Security in Power Systems.
CoRR, 2020
Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
2019
ACM Trans. Hum. Robot Interact., 2019
Biomed. Signal Process. Control., 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
CoRR, 2018
2017
Self-Supervised Vision-Based Detection of the Active Speaker as a Prerequisite for Socially-Aware Language Acquisition.
CoRR, 2017
Cepstral and Entropy Analyses in Vowels Excerpted from Continuous Speech of Dysphonic and Control Speakers.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
An Analysis of Shallow and Deep Representations of Speech Based on Unsupervised Classification of Isolated Words.
Proceedings of the Recent Advances in Nonlinear Speech Processing, 2016
CoRR, 2016
Optimising The Input Window Alignment in CD-DNN Based Phoneme Recognition for Low Latency Processing.
CoRR, 2016
2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Speech Commun., 2013
On mispronunciation analysis of individual foreign speakers using auditory periphery models.
Speech Commun., 2013
A gaze-based method for relating group involvement to individual engagement in multimodal multiparty dialogue.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013
Proceedings of the 2013 International Conference on Collaboration Technologies and Systems, 2013
2012
IEEE Trans. Syst. Man Cybern. Part B, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the Biologically Inspired Cognitive Architectures 2012 - Proceedings of the Third Annual Meeting of the BICA Society, Palermo, Sicily, Italy, October 31, 2012
2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
EURASIP J. Audio Speech Music. Process., 2009
Virtual speech reading support for hard of hearing in a domestic multi-media setting.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009
Proceedings of the Auditory-Visual Speech Processing, 2009
2008
Hearing at home - communication support in home environments for hearing impaired persons.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2006
Mining Speech Sounds: Machine Learning Methods for Automatic Speech Recognition and Analysis.
PhD thesis, 2006
Segment boundary detection via class entropy measurements in connectionist phoneme recognition.
Speech Commun., 2006
Dynamic behaviour of connectionist speech recognition with strong latency constraints.
Speech Commun., 2006
Proceedings of the Computers Helping People with Special Needs, 2006
2005
Proceedings of the Nonlinear Analyses and Algorithms for Speech Processing, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
Proceedings of the Computers Helping People with Special Needs, 2004
2003
Proceedings of the ITRW on Non-Linear Speech Processing, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2000
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Proceedings of the Auditory-Visual Speech Processing, 1999