Thorsten Brants

According to our database, Thorsten Brants authored at least 37 papers between 1995 and 2014.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.







One billion word benchmark for measuring progress in statistical language modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling.
CoRR, 2013

Query language modeling for voice search.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Study on interaction between entropy pruning and Kneser-Ney smoothing.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Distributed Language Models.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation.
Proceedings of the ACL 2008, 2008

Randomized Language Models via Perfect Hash Functions.
Proceedings of the ACL 2008, 2008

Large Language Models in Machine Translation.
Proceedings of the EMNLP-CoNLL 2007, 2007

A Context Pattern Induction Method for Named Entity Extraction.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

Test Data Likelihood for PLSA Models.
Inf. Retr., 2005

Multiple Similarity Measures and Source-Pair Information in Story Link Detection.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

A System for new event detection.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Story Link Detection and New Event Detection are Asymmetric.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Natural Language Processing in Information Retrieval.
Proceedings of the Computational Linguistics in the Netherlands 2003, 2003

Optimizing Story Link Detection is not Equivalent to Optimizing New Event Detection.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

The LinGO Redwoods Treebank: Motivation and Preliminary Applications.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Topic-based document segmentation with probabilistic latent semantic analysis.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

Interactive Corpus Annotation.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Inter-annotator Agreement for a German Newspaper Corpus.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Probabilistic Parsing and Psychological Plausibility.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

TnT - A Statistical Part-of-Speech Tagger.
Proceedings of the 6th Applied Natural Language Processing Conference, 2000

Tagging and parsing with cascaded Markov models: automation of corpus annotation.
PhD thesis, 1999

Cascaded Markov Models.
Proceedings of the EACL 1999, 1999

Studien zur performanzorientierten Linguistik: Aspekte der Relativsatzextraposition im Deutschen.
Kognitionswissenschaft, 1998

A Linguistically Interpreted Corpus of German Newspaper Text.
CoRR, 1998

Chunk Tagger - Statistical Recognition of Noun Phrases.
CoRR, 1998

A linguistically interpreted corpus of German newspaper text.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Automation of Treebank Annotation.
Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning, 1998

A Maximum-Entropy Partial Parser for Unrestricted Text.
Proceedings of the Sixth Workshop on Very Large Corpora, 1998

Internal and external tagsets in part-of-speech tagging.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Tagging Grammatical Functions.
Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, 1997

Software for Annotating Argument Structure.
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

An Annotation Scheme for Free Word Order Languages.
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

Estimating Markov model structures.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Better Language Models with Model Merging.
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 1996

Tagging the Teleman Corpus.
CoRR, 1995

Tagset Reduction without Information Loss.
Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, 1995
