David A. van Leeuwen

Orcid: 0000-0001-9704-6141

According to our database1, David A. van Leeuwen authored at least 107 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
The Effect of Batch Size on Contrastive Self-Supervised Speech Representation Learning.
CoRR, 2024

2023
Multi-task learning of speech and speaker recognition.
CoRR, 2023

Speaker and Language Change Detection using Wav2vec2 and Whisper.
CoRR, 2023

Towards Multi-task Learning of Speech and Speaker Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Training speaker recognition systems with limited data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Fine-Tuning Wav2Vec2 for Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2019
Large-Scale Speaker Diarization of Radio Broadcast Archives.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Graph Decoding for Code-Switching ASR.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Semi-supervised acoustic model training for speech with code-switching.
Speech Commun., 2018

Code-Switching Detection with Data-Augmented Acoustic and Language Models.
CoRR, 2018

Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Exploiting Untranscribed Broadcast Data for Improved Code-Switching Detection.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Longitudinal Speaker Clustering and Verification Corpus with Code-Switching Frisian-Dutch Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Language diarization for semi-supervised bilingual acoustic model training.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
A study of speaker clustering for speaker attribution in large telephone conversation datasets.
Comput. Speech Lang., 2016

Calibration of Phone Likelihoods in Automatic Speech Recognition.
CoRR, 2016

Investigating Bilingual Deep Neural Networks for Automatic Recognition of Code-switching Frisian Speech.
Proceedings of the SLTU-2016, 2016

Code-switching detection using multilingual DNNS.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

The "Sprekend Nederland" project and its application to accent location.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Open Source Speech and Language Resources for Frisian.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Quality measures based calibration with duration and noise dependency for speaker recognition.
Speech Commun., 2015

The reddots data collection for speaker recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Score calibration in face recognition.
IET Biom., 2014

Speaker age estimation using i-vectors.
Eng. Appl. Artif. Intell., 2014

NFI-FRITS: A forensic speaker recognition database and some first experiments.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

A comparison of linear and non-linear calibrations for speaker recognition.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Semi-automatic annotation of the UCU accents speech corpus.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Constrained speaker linking.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Effect of long-term ageing on i-vector speaker verification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
N-Best 2008: A Benchmark Evaluation for Large Vocabulary Speech Recognition in Dutch.
Proceedings of the Essential Speech and Language Technology for Dutch, 2013

Quality Measure Functions for Calibration of Speaker Recognition Systems in Various Duration Conditions.
IEEE Trans. Speech Audio Process., 2013


The distribution of calibrated likelihood-ratios in speaker recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic regularization of cross-entropy cost for speaker recognition fusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013


Knowing the non-target speakers: The effect of the i-vector population for PLDA training in speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Duration mismatch compensation for i-vector based speaker recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2013

Accent recognition using i-vector, Gaussian Mean Supervector and Gaussian posterior probability supervector for spontaneous telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Source-Normalized LDA for Robust Speaker Recognition Using i-Vectors From Multiple Speech Sources.
IEEE Trans. Speech Audio Process., 2012

Speaker Diarization Error Analysis Using Oracle Components.
IEEE Trans. Speech Audio Process., 2012

Large-Scale Speaker Diarization for Long Recordings and Small Collections.
IEEE Trans. Speech Audio Process., 2012

Speech-based recognition of self-reported and observed emotion in a dimensional space.
Speech Commun., 2012

Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Source normalization for language-independent speaker recognition using i-vectors.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Calibration of probabilistic age recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Age Estimation from Telephone Speech using i-vectors.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Gender-independent speaker recognition using source normalisation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

The effect of noise on modern automatic speaker recognition systems.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Automatic stress detection in emergency (telephone) calls.
Int. J. Intell. Def. Support Syst., 2011

An International English Speech Corpus for Longitudinal Study of Accent Development.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

To Weight or Not to Weight: Source-Normalised LDA for Speaker Recognition Using i-vectors.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Evaluation of i-vector Speaker Recognition Systems for Forensic Application.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Speaker Line-Up for the Likelihood Ratio.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Diarization-Based Speaker Retrieval for Broadcast Television Archives.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Improved speaker recognition when using i-vectors from multiple speech sources.
Proceedings of the IEEE International Conference on Acoustics, 2011

Source-normalised-and-weighted LDA for robust speaker recognition using i-vectors.
Proceedings of the IEEE International Conference on Acoustics, 2011

Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Emotion Recognition from Speech by Combining Databases and Fusion of Classifiers.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Speaker linking in large data sets.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Towards automatic speaker retrieval for large multimedia archives.
Proceedings of the 3rd International Workshop on Automated Information Extraction in Media Production, 2010

2009
Attuning speech-enabled interfaces to user and context for inclusive design: technology, methodology and practice.
Univers. Access Inf. Soc., 2009

Arousal and valence prediction in spontaneous emotional speech: felt versus perceived emotion.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A human benchmark for language recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Results of the n-best 2008 dutch speech recognition evaluation.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Overall performance metrics for multi-condition speaker recognition evaluations.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Speech overlap detection in a two-pass speaker diarization system.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

The majority wins: a method for combining speaker diarization systems.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
A human benchmark for the NIST language recognition evaluation 2005.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

Building language detectors using small amounts of training data.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

Assessing agreement of observer- and self-annotations in spontaneous multimodal emotion data.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Mediacampaign - A multimodal semantic analysis system for advertisement campaign detection.
Proceedings of the International Workshop on Content-Based Multimedia Indexing, 2008

2007
Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006.
IEEE Trans. Speech Audio Process., 2007

Automatic discrimination between laughter and speech.
Speech Commun., 2007

An Introduction to Application-Independent Evaluation of Speaker Recognition Systems.
Proceedings of the Speaker Classification I: Fundamentals, Features, and Methods, 2007

Affective multimodal mirror: sensing and eliciting laughter.
Proceedings of the International Workshop on Human-Centered Multimedia, 2007

Visualizing acoustic similarities between emotions in speech: an acoustic map of emotions.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Design and characterization of the non-native military air traffic communications database (nnMATC).
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

An open-set detection evaluation methodology applied to language and emotion recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

N-best: the northern- and southern-dutch benchmark evaluation of speech recognition technology.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

STBU System for the NIST 2006 Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Unobtrusive Multimodal Emotion Detection in Adaptive Interfaces: Speech and Facial Expressions.
Proceedings of the Foundations of Augmented Cognition, 2007

Progress in the AMIDA Speaker Diarization System for Meeting Data.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

The 2007 AMI(DA) System for Meeting Transcription.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Speech Indexing.
Proceedings of the Multimedia Retrieval, 2007

2006
NIST and NFI-TNO evaluations of automatic speaker recognition.
Comput. Speech Lang., 2006

Channel-dependent GMM and Multi-class Logistic Regression models for language recognition.
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006

On calibration of language recognition scores.
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006

The AMI Speaker Diarization System for NIST RT06s Meeting Data.
Proceedings of the Machine Learning for Multimodal Interaction, 2006


2005
The TNO Speaker Diarization System for NIST RT05s Meeting Data.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

Automatic detection of laughter.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Speaker adaptation in the NIST speaker recognition evaluation 2004.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Results of the 2003 NFI-TNO forensic speaker recognition evaluation.
Proceedings of the Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

2003
Speaker verification systems and security considerations.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
TREC Feature Extraction by Active Learning.
Proceedings of The Eleventh Text REtrieval Conference, 2002

"Do as I Say! . But Who Says What I Should Say - or Do?" On the Definition of a Standard Spoken Command Vocabulary for ICT Devices and Services.
Proceedings of the Mobile Human-Computer Interaction, 4th International Symposium, 2002

2001
Creating a Dutch Information Retrieval Test Corpus.
Proceedings of the Computational Linguistics in the Netherlands 2001, 2001

2000
Automatic speech recognition of non-native speakers using consonant-vowel-consonant (CVC) words.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Objective and subjective evaluation of the acoustic models of a continuous speech recognition system.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
TNO TREC7 Site Report: SDR and Filtering.
Proceedings of The Seventh Text REtrieval Conference, 1998

1997
Multilingual large vocabulary speech recognition: the European SQALE project.
Comput. Speech Lang., 1997

Speaker recognition by humans and machines.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Within-speaker variability of the word error rate for a continuous speech recognition system.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1995
Appropriate Context Association and Learning Parameters for Word Spotting with Partially Recurrent Neural Networks.
Proceedings of the Neural Networks: Artificial Intelligence and Industrial Applications, 1995

Multi-lingual assessment of speaker independent large vocabulary speech-recognition systems: THE SQALE-PROJECT.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Human benchmarks for speaker independent large vocabulary recognition performance.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995


  Loading...