Taraka Rama

Orcid: 0000-0002-4531-6733

According to our database1, Taraka Rama authored at least 48 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Dravidian language family through Universal Dependencies lens.
CoRR, 2024

Are Sounds Sound for Phylogenetic Reconstruction?
Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, 2024

2022
What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity.
CoRR, 2022

2021
Neural classification of Norwegian radiology reports: using NLP to detect findings in CT-scans of children.
BMC Medical Informatics Decis. Mak., 2021

Are pre-trained text representations useful for multilingual and multi-dimensional language proficiency modeling?
CoRR, 2021

Synthetic data for annotation and extraction of family history information from clinical text.
J. Biomed. Semant., 2021

2020
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Probing Multilingual BERT for Genetic and Typological Signals.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Regression or classification? Automated Essay Scoring for Norwegian.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

An Automated Framework for Fast Cognate Detection and Bayesian Phylogenetic Inference in Computational Historical Linguistics.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Tübingen-Oslo system: Linear regression works the best at Predicting Current and Future Psychological Health from Childhood Essays in the CLPsych 2018 Shared Task.
CoRR, 2018

Three tree priors and five datasets: A study of the effect of tree priors in Indo-European phylogenetics.
CoRR, 2018

Tübingen-Oslo Team at the VarDial 2018 Evaluation Campaign: An Analysis of N-gram Features in Language Variety Identification.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

A Telugu treebank based on a grammar book.
Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, 2018

Drug-Use Identification from Tweets with Word and Character N-Grams.
Proceedings of the 2018 EMNLP Workshop SMM4H: The 3rd Social Media Mining for Health Applications Workshop & Shared Task, 2018

Tübingen-Oslo at SemEval-2018 Task 2: SVMs perform better than RNNs in Emoji Prediction.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Are Automatic Methods for Cognate Detection Good Enough for Phylogenetic Reconstruction in Historical Linguistics?
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Tübingen-Oslo system at SIGMORPHON shared task on morphological inflection. A multi-tasking multilingual sequence to sequence model.
Proceedings of the CoNLL SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection, Brussels, October 31, 2018

Similarity Dependent Chinese Restaurant Process for Cognate Identification in Multilingual Wordlists.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Towards identifying the optimal datasize for lexically-based Bayesian inference of linguistic phylogenies.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Experiments with Universal CEFR Classification.
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications@NAACL-HLT 2018, 2018

Using Universal Dependencies in cross-linguistic complexity research.
Proceedings of the Second Workshop on Universal Dependencies, 2018

Iterative development of family history annotation guidelines using a synthetic corpus of clinical text.
Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis, 2018

2017
Fast and unsupervised methods for multilingual cognate clustering.
CoRR, 2017

Computational analysis of Gondi dialects.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

Fewer features perform well at Native Language Identification task.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

2016
Chinese Restaurant Process for cognate clustering: A threshold free approach.
CoRR, 2016

Siamese convolutional networks based on phonetic features for cognate identification.
CoRR, 2016

LSTM Autoencoders for Dialect Analysis.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

Discriminating Similar Languages with Linear SVMs and Neural Networks.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

Siamese Convolutional Networks for Cognate Identification.
Proceedings of the COLING 2016, 2016

2015
Automatic cognate identification with gap-weighted string subsequences.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Comparative Evaluation of String Similarity Measures for Automatic Language Classification.
Proceedings of the Sequences in Language and Text, 2015

2014
<i>N</i>-Gram Approaches to the Historical Dynamics of Basic Vocabulary.
J. Quant. Linguistics, 2014

Quantitative methods for Phylogenetic Inference in Historical Linguistics: An experimental case study of South Central Dravidian.
CoRR, 2014

Does Syntactic Knowledge help English-Hindi SMT?
CoRR, 2014

Properties of phoneme N -grams across the world's language families.
CoRR, 2014

Supertagging: Introduction, learning, and application.
CoRR, 2014

Empirical Evaluation of Tree distances for Parser Evaluation.
CoRR, 2014

Gap-weighted subsequences for automatic cognate identification and phylogenetic inference.
CoRR, 2014

Linguistic landscaping of South Asia using digital language resources: Genetic vs. areal linguistics.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2012
How Good are Typological Distances for Determining Genealogical Relationships among Languages?
Proceedings of the COLING 2012, 2012

2011
Estimating Language Lelationships from a Parallel Corpus. A Study of the Europarl Corpus.
Proceedings of the 18th Nordic Conference of Computational Linguistics, 2011

2010
Transliteration as Alignment vs. Transliteration as Generation for Crosslingual Information Retrieval.
Trait. Autom. des Langues, 2010

2009
From Bag of Languages to Family Trees From Noisy Corpus.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Modeling Letter-to-Phoneme Conversion as a Phrase Based Statistical Machine Translation Problem with Minimum Error Rate Training.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Modeling Machine Transliteration as a Phrase Based Statistical Machine Translation Problem.
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009


  Loading...