Ville Hautamäki

Orcid: 0000-0002-5885-0003

Affiliations:
  • University of Eastern Finland


According to our database1, Ville Hautamäki authored at least 102 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Meta-Learning Approaches for Improving Detection of Unseen Speech Deepfakes.
CoRR, 2024

Improving Numerical Stability of Normalized Mutual Information Estimator on High Dimensions.
CoRR, 2024

Interpreting Deep Neural Network-Based Receiver Under Varying Signal-To-Noise Ratios.
CoRR, 2024

Natural Language as Polices: Reasoning for Coordinate-Level Embodied Control with LLMs.
CoRR, 2024

Zero-Shot Imitation Policy Via Search In Demonstration Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2024

Gradient Weighting for Speaker Verification in Extremely Low Signal-to-Noise Ratio.
Proceedings of the IEEE International Conference on Acoustics, 2024

Online Adaptation for Enhancing Imitation Learning Policies.
Proceedings of the IEEE Conference on Games, 2024

2023
GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters.
IEEE Trans. Games, December, 2023

Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Behavioral Cloning via Search in Embedded Demonstration Dataset.
CoRR, 2023

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition.
CoRR, 2023

2022
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Behavioral Cloning via Search in Video PreTraining Latent Space.
CoRR, 2022

The Transitive Information Theory and its Application to Deep Generative Models.
CoRR, 2022

Improving Behavioural Cloning with Human-Driven Dynamic Dataset Augmentation.
CoRR, 2022

Self-Supervised Speaker Recognition with Loss-Gated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Multi-Task Learning With Attention for End-to-End Autonomous Driving.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Distilling Reinforcement Learning Tricks for Video Games.
Proceedings of the 2021 IEEE Conference on Games (CoG), 2021

PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Voxceleb Enrichment for Age and Gender Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Maximal Figure-of-Merit Framework to Detect Multi-Label Phonetic Features for Spoken Language Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Semisupervised Generative Autoencoder for Single-Cell Data.
J. Comput. Biol., 2020

Policy Supervectors: General Characterization of Agents by their Behaviour.
CoRR, 2020

Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya.
CoRR, 2020

An Initial Investigation on Optimizing Tandem Speaker Verification and Countermeasure Systems Using Reinforcement Learning.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

From Video Game to Real Robot: The Transfer Between Action Spaces.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Action Space Shaping in Deep Reinforcement Learning.
Proceedings of the IEEE Conference on Games, 2020

Benchmarking End-to-End Behavioural Cloning on Video Games.
Proceedings of the IEEE Conference on Games, 2020

Cost Sensitive Optimization of Deepfake Detector.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Do Autonomous Agents Benefit from Hearing?
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Playing Minecraft with Behavioural Cloning.
Proceedings of the NeurIPS 2019 Competition and Demonstration Track, 2019

Towards Debugging Deep Neural Networks by Generating Speech Utterances.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Who Do I Sound like? Showcasing Speaker Recognition Technology by Youtube Voice Search.
Proceedings of the IEEE International Conference on Acoustics, 2019

ToriLLE: Learning Environment for Hand-to-Hand Combat.
Proceedings of the IEEE Conference on Games, 2019

2018
Staircase Network: structural language identification via hierarchical attentive units.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Perceptual Evaluation of the Effectiveness of Voice Disguise by Age Modification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Enabling Spoken Dialogue Systems for Low-Resourced Languages - End-to-End Dialect Recognition for North Sami.
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

Maximal Figure-of-Merit Embedding for Multi-Label Audio Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Acoustical and perceptual study of voice disguise by age modification in speaker verification.
Speech Commun., 2017


RedDots replayed: A new replay spoofing attack corpus for text-dependent speaker verification research.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Effects of gender information in text-independent and text-dependent speaker verification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Fantastic 4 system for NIST 2015 Language Recognition Evaluation.
CoRR, 2016

Deep learning with maximal figure-of-merit cost to advance multi-label speech attribute detection.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Deep Language: a comprehensive deep learning approach to end-to-end language recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Incorporating uncertainty as a Quality Measure in I-Vector Based Language Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Out-of-Set i-Vector Selection for Open-set Language Identification.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Utterance Verification for Text-Dependent Speaker Recognition: A Comparative Assessment Using the RedDots Corpus.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Variation in Spoken North Sami Language.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Automatic versus human speaker verification: The case of voice mimicry.
Speech Commun., 2015

Factors affecting i-vector based foreign accent recognition: A case study in spoken Finnish.
Speech Commun., 2015

Boosting universal speech attributes classification with deep neural network for foreign accent characterization.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
From single to multiple enrollment i-vectors: Practical PLDA scoring variants for speaker verification.
Digit. Signal Process., 2014

A Comparison of Categorical Attribute Data Clustering Methods.
Proceedings of the Structural, Syntactic, and Statistical Pattern Recognition, 2014

Comparison of human listeners and speaker verification systems using voice mimicry data.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Dialect levelling in Finnish: a universal speech attribute approach.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

An i-vector based descriptor for alphabetical gesture recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Introducing attribute features to foreign accent recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Sparse Classifier Fusion for Speaker Verification.
IEEE Trans. Speech Audio Process., 2013


Effect of multicondition training on i-vector PLDA configurations for speaker recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A blind segmentation approach to acoustic event detection based on i-vector.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic regularization of cross-entropy cost for speaker recognition fusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Merging human and automatic system decisions to improve speaker recognition performance.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Minimax i-vector extractor for short duration speaker verification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Foreign accent detection from spoken Finnish using i-vectors.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Review of a concise introduction to data compression by David Salomon.
SIGACT News, 2012

Random swap EM algorithm for Gaussian mixture models.
Pattern Recognit. Lett., 2012

Variational Bayes logistic regression as regularized fusion for NIST SRE 2010.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

2011
Spoken Language Recognition in the Latent Topic Simplex.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Regularized Logistic Regression Fusion for Speaker Verification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

RSEM: An Accelerated Algorithm on Repeated EM.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Classifier subset selection and fusion for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Ad-Hoc Georeferencing of Web-Pages Using Street-Name Prefix Trees.
Proceedings of the Web Information Systems and Technologies - 6th International Conference, 2010

Towards long-range prosodic attribute modeling for language recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Approaching human listener accuracy with modern speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Location-based search engine for multimedia phones.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
Comparative evaluation of maximum a Posteriori vector quantization and gaussian mixture models in speaker verification.
Pattern Recognit. Lett., 2009

Random swap EM algorithm for finite mixture models in image segmentation.
Proceedings of the International Conference on Image Processing, 2009

Comparing maximum a posteriori vector quantization and Gaussian mixture models in speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2009

Developing Speaker Recognition System: From Prototype to Practical Application.
Proceedings of the Forensics in Telecommunications, 2009

2008
Maximum a Posteriori Adaptation of the Centroid Model for Speaker Verification.
IEEE Signal Process. Lett., 2008

Text-independent speaker recognition using graph matching.
Pattern Recognit. Lett., 2008

Time-series clustering by approximate prototypes.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Probabilistic clustering by random swap algorithm.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Knee Point Detection in BIC for Detecting the Number of Clusters.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2008

2006
Fast Agglomerative Clustering Using a k-Nearest Neighbor Graph.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Speaker, Vocabulary and Context Independent Word Spotting System for Continuous Speech.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

2005
Accuracy of MFCC-Based Speaker Recognition in Series 60 Device.
EURASIP J. Adv. Signal Process., 2005

Improving K-Means by Outlier Removal.
Proceedings of the Image Analysis, 14th Scandinavian Conference, 2005

2004
Outlier Detection Using k-Nearest Neighbour Graph.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

2003
On the fusion of dissimilarity-based classifiers for speaker identification.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Fast PNN-based Clustering Using K-nearest Neighbor Graph.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003


  Loading...