Emre Yilmaz

Orcid: 0000-0001-7466-3358

Affiliations:
  • SRI International, Speech Technology and Research Laboratory, CA, USA
  • National University of Singapore, Department of Electrical and Computer Engineering, Singapore
  • Radboud University Nijmegen, CLS/CLST, Gelderland, Netherlands
  • Katholieke Universiteit Leuven, Department ESAT, Belgium (PhD 2015)


According to our database, Emre Yilmaz authored at least 57 papers between 2012 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
Code-switched automatic speech recognition in five South African languages.
Comput. Speech Lang., 2022

2020
Semi-supervised Acoustic Modelling for Five-lingual Code-switched ASR using Automatically-segmented Soap Opera Speech.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Semi-supervised Development of ASR Systems for Multilingual Code-switched Speech in Under-resourced Languages.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Convolutional Spiking Neural Networks for Keyword Spotting.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background Music Help?
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech.
Comput. Speech Lang., 2019

Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition.
CoRR, 2019

Automatic Lyrics Transcription in Polyphonic Music: Does Background Music Help?
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Large-Scale Speaker Diarization of Radio Broadcast Archives.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Graph Decoding for Code-Switching ASR.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Code-Switching Detection Using ASR-Generated Language Posteriors.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic Modeling for Automatic Lyrics-to-Audio Alignment.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Semi-Supervised Acoustic Model Training for Five-Lingual Code-Switched ASR.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

End-to-End Code-Switching ASR for Low-Resourced Language Pairs.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Semi-supervised acoustic model training for speech with code-switching.
Speech Commun., 2018

Code-Switching Detection with Data-Augmented Acoustic and Language Models.
CoRR, 2018

Code-Switching Detection with Data-Augmented Acoustic and Language Models.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

ASR-Free CNN-DTW Keyword Spotting Using Multilingual Bottleneck Features for Almost Zero-Resource Languages.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Generative X-Vectors for Text-Independent Speaker Verification.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

How to train your speaker embeddings extractor.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Articulatory Features for ASR of Pathological Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Building a Unified Code-Switching ASR System for South African Languages.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Multilingual Neural Network Acoustic Modelling for ASR of Under-Resourced English-isiZulu Code-Switched Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Exploiting Untranscribed Broadcast Data for Improved Code-Switching Detection.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-Stage DNN Training for Automatic Recognition of Dysarthric Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Longitudinal Speaker Clustering and Verification Corpus with Code-Switching Frisian-Dutch Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Language diarization for semi-supervised bilingual acoustic model training.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Noise-robust exemplar matching for rescoring query-by-example search.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Noise robust exemplar matching with alpha-beta divergence.
Speech Commun., 2016

Investigating Bilingual Deep Neural Networks for Automatic Recognition of Code-switching Frisian Speech.
Proceedings of the SLTU-2016, 2016

Code-switching detection using multilingual DNNs.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Prototype ASR-based Multimedia Game for Speech Therapy.
Proceedings of the 2016 ACM Workshop on Multimedia for Personal Health and Health Care, 2016

On the Development of an ASR-based Multimedia Game for Speech Therapy: Preliminary Results.
Proceedings of the 2016 ACM Workshop on Multimedia for Personal Health and Health Care, 2016

A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Open Source Speech and Language Resources for Frisian.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Combining Non-Pathological Data of Different Language Varieties to Improve DNN-HMM Performance on Pathological Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Data selection for noise robust exemplar matching.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Noise Robust Exemplar Matching for Speech Recognition and Enhancement; Ruisrobuuste exemplaar matching voor spraakherkenning en spraakverbetering.
PhD thesis, 2015

Noise robust exemplar matching for speech enhancement: applications to automatic speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Adaptive noise dictionary design for noise robust exemplar matching of speech.
Proceedings of the 23rd European Signal Processing Conference, 2015

Noise robust exemplar matching with coupled dictionaries for single-channel speech enhancement.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Noise Robust Exemplar Matching Using Sparse Representations of Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Learning Like a Toddler: Watching Television Series to Learn Vocabulary from Images and Audio.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Automatic assessment of children's reading with the FLaVoR decoding using a phone confusion model.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Noise-robust speech recognition with exemplar-based sparse representations using Alpha-Beta divergence.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Automating speech reception threshold measurements using automatic speech recognition.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Embedding time warping in exemplar-based sparse representations of speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Exemplar selection techniques for sparse representations of speech using multiple dictionaries.
Proceedings of the 21st European Signal Processing Conference, 2013

2012
Noise-robust digit recognition with exemplar-based sparse representations of variable length.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2012

Combining exemplar-based matching and exemplar-based sparse representations of speech.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Robust Tracking for Automatic Reading Tutors.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

