Themos Stafylakis
Orcid: 0000-0002-9227-3588
According to our database, Themos Stafylakis authored at least 81 papers between 2007 and 2024.
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification.
CoRR, 2024
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Image Vis. Comput., December, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2023
Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2022
Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Analyzing Speaker Verification Embedding Extractors and Back-Ends Under Language and Channel Mismatch.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
Training speaker embedding extractors using multi-speaker audio with unknown speaker boundaries.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 31st British Machine Vision Conference 2020, 2020
2019
Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Pushing the boundaries of audiovisual word recognition using Residual Networks and LSTMs.
Comput. Vis. Image Underst., 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
DeepMine Speech Processing Database: Text-Dependent and Independent Speaker Verification and Speech Recognition in Persian and English.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Towards PLDA-RBM based speaker recognition in mobile environment: Designing stacked/deep PLDA-RBM systems.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Combining amplitude and phase-based features for speaker verification with short duration utterances.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Development of CRIM system for the automatic speaker verification spoofing and countermeasures challenge 2015.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Supervised/Unsupervised Voice Activity Detectors for Text-dependent Speaker Recognition on the RSR2015 Corpus.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Mean shift algorithm for exponential families with applications to speaker clustering.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Proceedings of the 11th International Conference on Information Science, 2012
PLDA using Gaussian Restricted Boltzmann Machines with application to Speaker Verification.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Music tempo estimation and beat tracking by applying source separation and metrical relations.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Developing a Scoring Algorithm for Automatic Pronunciation Assessment of Modern Greek.
J. Quant. Linguistics, 2011
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Pattern Recognit., 2010
The Segmental Bayesian Information Criterion and Its Applications to Speaker Diarization.
IEEE J. Sel. Top. Signal Process., 2010
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the 19th International Workshop on Database and Expert Systems Applications (DEXA 2008), 2008
2007
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007
Efficient combination of parametric spaces, models and metrics for speaker diarization.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007