Martin Jansche

Orcid: 0000-0003-0484-4879

According to our database1, Martin Jansche authored at least 41 papers between 1998 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2020
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview.
CoRR, 2020

Towards Induction of Structured Phoneme Inventories.
CoRR, 2020

Linguistic Typology Features from Text: Inferring the Sparse Features of World Atlas of Language Structures.
CoRR, 2020

Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Cross-Lingual Consistency of Phonological Features: An Empirical Study.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Sampling from Stochastic Finite Automata with Applications to CTC Decoding.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
A Step-by-Step Process for Building TTS Voices Using Open Source Data and Frameworks for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Crowd-Sourced Speech Corpora for Javanese, Sundanese, Sinhala, Nepali, and Bangladeshi Bengali.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Brahmic Schwa-Deletion with Neural Classifiers: Experiments with Bengali.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Predicting the Features of World Atlas of Language Structures from Speech.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

A Unified Phonological Representation of South Asian Languages for Multilingual Text-to-Speech.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

FonBund: A Library for Combining Cross-lingual Phonological Segment Data.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017
Rapid Development of TTS Corpora for Four South African Languages.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Building Statistical Parametric Multi-speaker Synthesis for Bangladeshi Bangla.
Proceedings of the SLTU-2016, 2016

TTS for Low Resource Languages: A Bangla Synthesizer.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2014
Computer-Aided Quality Assurance of an Icelandic Pronunciation Dictionary.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2012
Google's cross-dialect Arabic voice search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Deploying Google Search by Voice in Cantonese.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Web-Based Tool for Developing Multilingual Pronunciation Lexicons.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Search by voice in Mandarin Chinese.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A Comparison of Features for Automatic Readability Assessment.
Proceedings of the COLING 2010, 2010

Reading difficulty in adults with intellectual disabilities: analysis with a hierarchical latent trait model.
Proceedings of the 12th International ACM SIGACCESS Conference on Computers and Accessibility, 2010

2009
Web derived pronunciations for spoken term detection.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

OpenFst: An Open-Source, Weighted Finite-State Transducer Library and its Applications to Speech and Language.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Restoring punctuation and capitalization in transcribed speech.
Proceedings of the IEEE International Conference on Acoustics, 2009

WEB-derived pronunciations.
Proceedings of the IEEE International Conference on Acoustics, 2009

Named Entity Transcription with Pair n-Gram Models.
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009

2007
A Support Vector Approach to Censored Targets.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

A Maximum Expected Utility Framework for Binary Sequence Labeling.
Proceedings of the ACL 2007, 2007

2005
Maximum Expected F-Measure Training of Logistic Regression Models.
Proceedings of the HLT/EMNLP 2005, 2005

Treebank transfer.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

Algorithms for Minimum Risk Chunking.
Proceedings of the Finite-State Methods and Natural Language Processing, 2005

2004
Learning Local Transductions Is Hard.
J. Log. Lang. Inf., 2004

2003
Parametric Models of Linguistic Count Data.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

2002
Information Extraction from Voicemail Transcripts.
Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, 2002

Named Entity Extraction with Conditional Markov Models and Classifiers.
Proceedings of the 6th Conference on Natural Language Learning, 2002

2001
Re-Engineering Letter-to-Sound Rules.
Proceedings of the Language Technologies 2001: The Second Meeting of the North American Chapter of the Association for Computational Linguistics, 2001

Information extraction via heuristics for a movie showtime query system.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

1998
Abductive Reasoning For Syntactic Realization.
Proceedings of the Ninth International Workshop on Natural Language Generation, 1998


  Loading...