Timothy Baldwin

ORCID: 0000-0003-4525-6950

Affiliations:
  • Mohamed bin Zayed University of Artificial Intelligence, UAE
  • University of Melbourne, School of Computing and Information Systems, Australia


According to our database, Timothy Baldwin authored at least 404 papers between 2000 and 2024.

Bibliography

2024
Factuality challenges in the era of large language models and opportunities for fact-checking.
Nat. Mac. Intell., 2024

Arabic Dataset for LLM Safeguard Evaluation.
CoRR, 2024

ToolGen: Unified Tool Retrieval and Calling via Generation.
CoRR, 2024

Loki: An Open-Source Tool for Fact Verification.
CoRR, 2024

Unconditional Truthfulness: Learning Conditional Dependency for Uncertainty Quantification of Large Language Models.
CoRR, 2024

Inference-Time Selective Debiasing.
CoRR, 2024

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs.
CoRR, 2024

Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph.
CoRR, 2024

Evaluating Transparency of Machine Generated Fact Checking Explanations.
CoRR, 2024

Revisiting subword tokenization: A case study on affixal negation in large language models.
CoRR, 2024

IndoCulture: Exploring Geographically-Influenced Cultural Commonsense Reasoning Across Eleven Indonesian Provinces.
CoRR, 2024

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models.
CoRR, 2024

A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish.
CoRR, 2024

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT.
CoRR, 2024

PALO: A Polyglot Large Multimodal Model for 5B People.
CoRR, 2024

Eagle: Ethical Dataset Given from Real Interactions.
CoRR, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.
CoRR, 2024

Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents.
CoRR, 2024

Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting.
CoRR, 2024

The Gaps between Pre-train and Downstream Settings in Bias Evaluation and Debiasing.
CoRR, 2024

To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction.
Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, 2024

Revisiting subword tokenization: A case study on affixal negation in large language models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Psychometric Predictive Power of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

BiMediX: Bilingual Medical Mixture of Experts LLM.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Do-Not-Answer: Evaluating Safeguards in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Demystifying Instruction Mixing for Fine-tuning Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Emergent Word Order Universals from Cognitively-Motivated Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CMMLU: Measuring massive multitask language understanding in Chinese.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Disease progression modelling of Alzheimer's disease using probabilistic principal components analysis.
NeuroImage, September, 2023

On the Effectiveness of Images in Multi-modal Text Classification: An Annotation Study.
ACM Trans. Asian Low Resour. Lang. Inf. Process., March, 2023

Collective Human Opinions in Semantic Textual Similarity.
Trans. Assoc. Comput. Linguistics, 2023

Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 23191).
Dagstuhl Reports, 2023

LLM360: Towards Fully Transparent Open-Source LLMs.
CoRR, 2023

Multi-EuP: The Multilingual European Parliament Dataset for Analysis of Bias in Information Retrieval.
CoRR, 2023

Factuality Challenges in the Era of Large Language Models.
CoRR, 2023

Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings.
CoRR, 2023

Connecting the Dots in News Analysis: A Cross-Disciplinary Survey of Media Bias and Framing.
CoRR, 2023

Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models.
CoRR, 2023

Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs.
CoRR, 2023

Bactrian-X: A Multilingual Replicable Instruction-Following Model with Low-Rank Adaptation.
CoRR, 2023

Language models are not naysayers: an analysis of language models on negation benchmarks.
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics, 2023

Location Aware Modular Biencoder for Tourism Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023

Uncertainty Estimation for Debiased Models: Does Fairness Hurt Reliability?
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

It's not only What You Say, It's also Who It's Said to: Counterfactual Analysis of Interactive Behavior in the Courtroom.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Everybody Needs Good Neighbours: An Unsupervised Locality-based Method for Bias Mitigation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Robustness Tests for Automatic Machine Translation Metrics with Adversarial Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

More than Votes? Voting and Language based Partisanship in the US Supreme Court.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

LM-Polygraph: Uncertainty Estimation for Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Unsupervised Lexical Simplification with Context Augmentation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Promoting Fairness in Classification of Quality of Medical Evidence.
Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, 2023

Cost-effective Distillation of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023


Unsupervised Paraphrasing of Multiword Expressions.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression.
Trans. Assoc. Comput. Linguistics, 2022

FFCI: A Framework for Interpretable Automatic Evaluation of Summarization.
J. Artif. Intell. Res., 2022

NusaCrowd: Open Source Initiative for Indonesian NLP Resources.
CoRR, 2022

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.
CoRR, 2022

fairlib: A Unified Framework for Assessing and Improving Classification Fairness.
CoRR, 2022

Towards Equal Opportunity Fairness through Adversarial Learning.
CoRR, 2022

Improving negation detection with negation-focused pre-training.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Optimising Equal Opportunity Fairness in Model Training.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

MultiSpanQA: A Dataset for Multi-Span Question Answering.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CULG: Commercial Universal Language Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Not another Negation Benchmark: The NaN-NLI Test Suite for Sub-clausal Negation.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Does Representational Fairness Imply Empirical Fairness?
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

Systematic Evaluation of Predictive Fairness.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

FairLib: A Unified Framework for Assessing and Improving Fairness.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Balancing out Bias: Achieving Fairness Through Balanced Training.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

M3: Multi-level dataset for Multi-document summarisation of Medical studies.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents.
Proceedings of the Advances in Information Retrieval, 2022

Noisy Label Regularisation for Textual Regression.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

LipKey: A Large-Scale News Dataset for Absent Keyphrases Generation and Abstractive Summarization.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

LED down the rabbit hole: exploring the potential of global attention for biomedical multi-document summarisation.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Unsupervised Lexical Substitution with Decontextualised Embeddings.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Overview of ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2022

Extended Overview of ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

What does it take to bake a cake? The RecipeRef corpus and anaphora resolution in procedural text.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

The patient is more dead than alive: exploring the current state of the multi-document summarisation of the biomedical literature.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Online Examinations in a Large Australian CS1 Course.
Proceedings of the ACE '22: Australasian Computing Education Conference, Virtual Event, Australia, February 14, 2022

2021
Evaluating Document Coherence Modelling.
Trans. Assoc. Comput. Linguistics, 2021

ChEMU 2020: Natural Language Processing Methods Are Effective for Information Extraction From Chemical Patents.
Frontiers Res. Metrics Anal., 2021

Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 21351).
Dagstuhl Reports, 2021

Contrastive Learning for Fair Representations.
CoRR, 2021

Balancing out Bias: Achieving Fairness Through Training Reweighting.
CoRR, 2021

KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning.
CoRR, 2021

Automatic Claim Review for Climate Science via Explanation Generation.
CoRR, 2021

Impact of detecting clinical trial elements in exploration of COVID-19 literature.
CoRR, 2021

Spatial concepts in the conversation with a computer.
Commun. ACM, 2021

ITTC @ TREC 2021 Clinical Trials Track.
Proceedings of the Thirtieth Text REtrieval Conference, 2021

A Simple yet Effective Method for Sentence Ordering.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Discourse Probing of Pretrained Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Automatic Classification of Neutralization Techniques in the Narrative of Climate Change Scepticism.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Impact of detecting clinical trial elements in exploration of COVID-19 literature.
Proceedings of the 9th IEEE International Conference on Healthcare Informatics, 2021

Evaluating Debiasing Techniques for Intersectional Biases.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Fairness-aware Class Imbalanced Learning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

'Just What do You Think You're Doing, Dave?' A Checklist for Responsible Data Use in NLP.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Brief Description of COVID-SEE: The Scientific Evidence Explorer for COVID-19 Related Research.
Proceedings of the Advances in Information Retrieval, 2021

ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents.
Proceedings of the Advances in Information Retrieval, 2021

On the (In)Effectiveness of Images for Text Classification.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Top-down Discourse Parsing via Sequence Labelling.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Diverse Adversaries for Mitigating Bias in Training.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

ChEMU-Ref: A Corpus for Modeling Anaphora Resolution in the Chemical Domain.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2021

Extended Overview of ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Evaluating the Efficacy of Summarization Evaluation across Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Decoupling Adversarial Training for Fair NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Automatic Resolution of Domain Name Disputes.
Proceedings of the Natural Legal Language Processing Workshop 2021, 2021

Semi-automatic Triage of Requests for Free Legal Assistance.
Proceedings of the Natural Legal Language Processing Workshop 2021, 2021

2020
A General Approach to Multimodal Document Quality Assessment.
J. Artif. Intell. Res., 2020

Learning Contextualised Cross-lingual Word Embeddings for Extremely Low-Resource Languages Using Parallel Corpora.
CoRR, 2020

COVID-SEE: Scientific Evidence Explorer for COVID-19 Related Research.
CoRR, 2020

You are right. I am ALARMED - But by Climate Change Counter Movement.
CoRR, 2020

Liputan6: A Large-scale Indonesian Dataset for Text Summarization.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Improved Topic Representations of Medical Documents to Assist COVID-19 Literature Exploration.
Proceedings of the 1st Workshop on NLP for COVID-19@ EMNLP 2020, Online, December 2020, 2020

ChEMU: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents.
Proceedings of the Advances in Information Retrieval, 2020

WikiUMLS: Aligning UMLS to Wikipedia via Cross-lingual Neural Ranking.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Target Word Masking for Location Metonymy Resolution.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP.
Proceedings of the 28th International Conference on Computational Linguistics, 2020


Overview of ChEMU 2020: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2020

Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity.
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 2020

Domain Adaptation and Instance Selection for Disease Syndrome Classification over Veterinary Clinical Notes.
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 2020

Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Give Me Convenience and Give Her Death: Who Should Decide What Uses of NLP are Appropriate, and on What Basis?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning from Unlabelled Data for Clinical Semantic Textual Similarity.
Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020

Online Tutoring to Support Programming Exercises.
Proceedings of the ACE 2020, 2020

2019
Automatic Language Identification in Texts: A Survey.
J. Artif. Intell. Res., 2019

Evaluating the Utility of Document Embedding Vector Difference for Relation Learning.
CoRR, 2019

Target Based Speech Act Classification in Political Campaign Text.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

UniMelb at SemEval-2019 Task 12: Multi-model combination for toponym resolution.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Contextualization of Morphological Inflection.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Joint Model for Multimodal Document Quality Assessment.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019

Deep Ordinal Regression for Pledge Specificity Prediction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Place Questions and Human-Generated Answers: A Data Analysis Approach.
Proceedings of the Geospatial Technologies for Local and Regional Development, 2019

Differences in language use: Insights from job and talent search.
Proceedings of the 24th Australasian Document Computing Symposium, 2019

Modelling Uncertainty in Collaborative Document Quality Assessment.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Putting Evaluation in Context: Contextual Embeddings Improve Machine Translation Evaluation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Semi-supervised Stochastic Multi-Domain Learning using Variational Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Reevaluating Argument Component Extraction in Low Resource Settings.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

Detecting Chemical Reactions in Patents.
Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association, 2019

Feature-guided Neural Model Training for Supervised Document Representation Learning.
Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association, 2019

Improved Document Modelling with a Neural Discourse Parser.
Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association, 2019

Modelling Tibetan Verbal Morphology.
Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association, 2019

Does an LSTM forget more than a CNN? An empirical study of catastrophic forgetting in NLP.
Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association, 2019

2018
Predicting Online Islamophobic Behavior after #ParisAttacks.
J. Web Sci., 2018

Web Forum Retrieval and Text Analytics: A Survey.
Found. Trends Inf. Retr., 2018

The Company They Keep: Extracting Japanese Neologisms Using Language Patterns.
Proceedings of the 9th Global Wordnet Conference, 2018

Language and the Shifting Sands of Domain, Space and Time (Invited Talk).
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

A Living Lab Study of Query Amendment in Job Search.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

UniMelb at SemEval-2018 Task 12: Generative Implication using LSTMs, Siamese Networks and Semantic Representations with Synonym Fuzzing.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Hierarchical Structured Model for Fine-to-Coarse Manifesto Text Analysis.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Recurrent Entity Networks with Delayed Memory Update for Targeted Aspect-Based Sentiment Analysis.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Preliminary Comparison of Job, Talent, and Web Search.
Proceedings of the 9th Italian Information Retrieval Workshop, 2018

Detecting Misflagged Duplicate Questions in Community Question-Answering Archives.
Proceedings of the Twelfth International Conference on Web and Social Media, 2018

Multitask Learning for Query Segmentation in Job Search.
Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018

Topic Intrusion for Automatic Topic Model Evaluation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Encoding Sentiment Information into Word Vectors for Sentiment Analysis.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Preferred Answer Selection in Stack Overflow: Better Text Representations ... and Metadata, Metadata, Metadata.
Proceedings of the 4th Workshop on Noisy User-generated Text, 2018

Twitter Geolocation using Knowledge-Based Methods.
Proceedings of the 4th Workshop on Noisy User-generated Text, 2018

Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Narrative Modeling with Memory Chains and Semantic Supervision.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Towards Robust and Privacy-preserving Text Representations.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Semi-supervised User Geolocation via Graph Convolutional Networks.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Deep-speare: A joint neural model of poetic language, meter and rhyme.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

A Comparative Study of Embedding Models in Predicting the Compositionality of Multiword Expressions.
Proceedings of the Australasian Language Technology Association Workshop 2018, 2018

Towards Efficient Machine Translation Evaluation by Modelling Annotators.
Proceedings of the Australasian Language Technology Association Workshop 2018, 2018

2017
Unsupervised Acquisition of Comprehensive Multiword Lexicons using Competition in an n-gram Lattice.
Trans. Assoc. Comput. Linguistics, 2017

Can machine translation systems be evaluated by the crowd alone.
Nat. Lang. Eng., 2017

Evaluating topic representations for exploring document collections.
J. Assoc. Inf. Sci. Technol., 2017

Pairwise Webpage Coreference Classification Using Distant Supervision.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Understanding User Behavior in Job and Talent Search: An Initial Investigation.
Proceedings of the SIGIR 2017 Workshop On eCommerce co-located with the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

SemEval-2017 Task 3: Community Question Answering.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Semi-Automated Resolution of Inconsistency for a Harmonized Multiword Expression and Dependency Parse Annotation.
Proceedings of the 13th Workshop on Multiword Expressions, 2017

Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Continuous Representation of Location for Geolocation and Lexical Dialectology using Mixture Density Networks.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Sub-character Neural Language Modelling in Japanese.
Proceedings of the First Workshop on Subword and Character Level Models in NLP, 2017

Sequence Effects in Crowdsourced Annotations.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Further Investigation into Reference Bias in Monolingual Evaluation of Machine Translation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Improving Evaluation of Document-level Machine Translation Quality Estimation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Multimodal Topic Labelling.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Context-Aware Prediction of Derivational Word-forms.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Robust Training under Linguistic Adversity.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

An Automatic Approach for Document-level Topic Model Evaluation.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

A Neural Model for User Geolocation and Lexical Dialectology.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Topically Driven Neural Language Model.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Decoupling Encoder and Decoder Networks for Abstractive Document Summarization.
Proceedings of the Workshop on Summarization and Summary Evaluation Across Source Types and Genres, 2017

Joint Sentence-Document Model for Manifesto Text Analysis.
Proceedings of the Australasian Language Technology Association Workshop, 2017

A Hybrid Model for Quality Assessment of Wikipedia Articles.
Proceedings of the Australasian Language Technology Association Workshop, 2017

Improving End-to-End Memory Networks with Unified Weight Tying.
Proceedings of the Australasian Language Technology Association Workshop, 2017

Automatic Negation and Speculation Detection in Veterinary Clinical Text.
Proceedings of the Australasian Language Technology Association Workshop, 2017

2016
From Incremental Meaning to Semantic Unit (phrase by phrase).
CoRR, 2016

#ISISisNotIslam or #DeportAllMuslims?: predicting unspoken views.
Proceedings of the 8th ACM Conference on Web Science, 2016

Quit While Ahead: Evaluating Truncated Rankings.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

UNIMELB at SemEval-2016 Tasks 4A and 4B: An Ensemble of Neural Networks and a Word2Vec Based Model for Sentiment Classification.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

VectorWeavers at SemEval-2016 Task 10: From Incremental Meaning to Semantic Unit (phrase by phrase).
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Melbourne at SemEval 2016 Task 11: Classifying Type-level Word Complexity using Random Forests with Corpus and Word List Features.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

UniMelb at SemEval-2016 Task 3: Identifying Similar Questions by combining a CNN with String Similarity Measures.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation.
Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

The Sensitivity of Topic Coherence Evaluation to Topic Cardinality.
Proceedings of the NAACL HLT 2016, 2016

Evaluating a Topic Modelling Approach to Measuring Corpus Similarity.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Named Entity Recognition for Novel Types by Transfer Learning.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning Robust Representations of Text.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Determining the Multiword Expression Inventory of a Surprise Language.
Proceedings of the COLING 2016, 2016

Is all that Glitters in Machine Translation Quality Estimation really Gold?
Proceedings of the COLING 2016, 2016

Automatic Labelling of Topics with Neural Embeddings.
Proceedings of the COLING 2016, 2016

Twitter Geolocation Prediction Shared Task of the 2016 Workshop on Noisy User-generated Text.
Proceedings of the 2nd Workshop on Noisy User-generated Text, 2016

Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

pigeo: A Python Geotagging Tool.
Proceedings of ACL-2016 System Demonstrations, Berlin, Germany, August 7-12, 2016, 2016

Bootstrapped Text-level Named Entity Recognition for Literature.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

LexSemTm: A Semantic Dataset Based on All-words Unsupervised Sense Distribution Learning.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
gDelta: a missing link in the grammar engineering toolchain.
Lang. Resour. Evaluation, 2015

Big Data Small Data, In Domain Out-of Domain, Known Word Unknown Word: The Impact of Word Representation on Sequence Labelling Tasks.
CoRR, 2015

Collective Document Classification with Implicit Inter-document Semantic Relationships.
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, 2015

RoseMerry: A Baseline Message-level Sentiment Classification System.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

A Word Embedding Approach to Predicting the Compositionality of Multiword Expressions.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Exploiting Text and Network Context for Geolocation of Social Media Users.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Accurate Evaluation of Segment-level Machine Translation Metrics.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

The Impact of Multiword Expression Compositionality on Machine Translation Evaluation.
Proceedings of the 11th Workshop on Multiword Expressions, 2015

A Classification Schema for Fast Disambiguation of Spatial Prepositions.
Proceedings of the 6th ACM SIGSPATIAL International Workshop on GeoStreaming, 2015

Big Data Small Data, In Domain Out-of Domain, Known Word Unknown Word: The Impact of Word Representations on Sequence Labelling Tasks.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

A Probabilistic Rating Auto-encoder for Personalized Recommender Systems.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

TM 2015 - Topic Models: Post-Processing and Applications Workshop.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

Automatic Labelling of Topic Models Using Word Vectors and Letter Trigram Vectors.
Proceedings of the Information Retrieval Technology, 2015

CQADupStack: A Benchmark Data Set for Community Question-Answering Research.
Proceedings of the 20th Australasian Document Computing Symposium, 2015

Shared Tasks of the 2015 Workshop on Noisy User-generated Text: Twitter Lexical Normalization and Named Entity Recognition.
Proceedings of the Workshop on Noisy User-generated Text, 2015

Twitter User Geolocation Using a Unified Text and Network Prediction Model.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Understanding engagement with insurgents through retweet rhetoric.
Proceedings of the Australasian Language Technology Association Workshop, 2015

Domain Adaption of Named Entity Recognition to Support Credit Risk Assessment.
Proceedings of the Australasian Language Technology Association Workshop, 2015

2014
Automatic Detection and Language Identification of Multilingual Documents.
Trans. Assoc. Comput. Linguistics, 2014

Text-Based Twitter User Geolocation Prediction.
J. Artif. Intell. Res., 2014

Randomized Significance Tests in Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Exploring Methods and Resources for Discriminating Similar Languages.
Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, 2014

Automatic Zoom Level Prediction for Informal Location Descriptions.
Proceedings of the 4th International Workshop on Location and the Web, 2014

Automatic Identification of Locative Expressions from Social Media Text: A Comparative Analysis.
Proceedings of the 4th International Workshop on Location and the Web, 2014

Representing topics labels for exploring digital libraries.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

Building a corpus of spatial relational expressions extracted from web documents.
Proceedings of the 8th Workshop on Geographic Information Retrieval, 2014

Detecting Non-compositional MWE Components using Wiktionary.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Testing for Significance of Increased Correlation with Human Judgment.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Using Distributional Similarity of Multi-way Translations to Predict Multiword Expression Compositionality.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Is Machine Translation Getting Better over Time?
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

One Sense per Tweeter ... and Other Lexical Semantic Tales of Twitter.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Novel Word-sense Identification.
Proceedings of the COLING 2014, 2014

Learning Word Sense Distributions, Detecting Unattested Senses and Identifying Novel Senses Using Topic Models.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Automatic Detection of Multilingual Dictionaries on the Web.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
On collocations and topic models.
ACM Trans. Speech Lang. Process., 2013

Word sense and semantic relations in noun compounds.
ACM Trans. Speech Lang. Process., 2013

Lexical normalization for social media text.
ACM Trans. Intell. Syst. Technol., 2013

A lexical semantic approach to interpreting and bracketing English noun compounds.
Nat. Lang. Eng., 2013

Automatic keyphrase extraction from scientific articles.
Lang. Resour. Evaluation, 2013

UniMelb_NLP-CORE: Integrating predictions from multiple domains and feature sets for estimating semantic textual similarity.
Proceedings of the Second Joint Conference on Lexical and Computational Semantics, 2013

unimelb: Spanish Text Normalisation.
Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

unimelb: Topic Modelling-based Word Sense Induction.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

unimelb: Topic Modelling-based Word Sense Induction for Web Snippet Clustering.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Umelb: Cross-lingual Textual Entailment with Word Alignment and String Similarity Features.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Unsupervised Word Class Induction for Under-resourced Languages: A Case Study on Indonesian.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

How Noisy Social Media Text, How Diffrnt Social Media Sources?
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

The Utility of Discourse Structure in Forum Thread Retrieval.
Proceedings of the Information Retrieval Technology, 2013

Continuous Measurement Scales in Human Evaluation of Machine Translation.
Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, 2013

A Stacking-based Approach to Twitter User Geolocation Prediction.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Automatic Climate Classification of Environmental Science Literature.
Proceedings of the Australasian Language Technology Association Workshop, 2013

Crowd-Sourcing of Human Judgments of Machine Translation Fluency.
Proceedings of the Australasian Language Technology Association Workshop, 2013

2012
Detecting modification of biomedical events using a deep parsing approach.
BMC Medical Informatics Decis. Mak., 2012

The Effects of Semantic Annotations on Precision Parse Ranking.
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012

Combining resources for MWE-token classification.
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012

Deep Lexical Acquisition of Type Properties in Low-resource Languages: A Case Study in Wambaya.
Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, 2012

Classifying Dialogue Acts in Multi-party Live Chats.
Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, 2012

Extracting Keywords from Multi-party Live Chats.
Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, 2012

Social Media: Friend or Foe of Natural Language Processing?
Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, 2012

Evaluating a Morphological Analyser of Inuktitut.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Automatically Constructing a Normalisation Dictionary for Microblogs.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Word Sense Induction for Novel Sense Detection.
Proceedings of the EACL 2012, 2012

A Support Platform for Event Detection using Social Intelligence.
Proceedings of the EACL 2012, 2012

The Utility of Discourse Structure in Identifying Resolved Threads in Technical User Forums.
Proceedings of the COLING 2012, 2012

Bayesian Text Segmentation for Index Term Identification and Keyphrase Extraction.
Proceedings of the COLING 2012, 2012

On-line Trend Analysis with Topic Models: #twitter Trends Detection Topic Model Online.
Proceedings of the COLING 2012, 2012

Geolocation Prediction in Social Media Data by Finding Location Indicative Words.
Proceedings of the COLING 2012, 2012

langid.py: An Off-the-shelf Language Identification Tool.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, 2012

Classification of Study Region in Environmental Science Abstracts.
Proceedings of the Australasian Language Technology Association Workshop, 2012

Unsupervised Estimation of Word Usage Similarity.
Proceedings of the Australasian Language Technology Association Workshop, 2012

Measurement of Progress in Machine Translation.
Proceedings of the Australasian Language Technology Association Workshop, 2012

Segmentation and Translation of Japanese Multi-word Loanwords.
Proceedings of the Australasian Language Technology Association Workshop, 2012

Mining Micro-blogs: Opportunities and Challenges.
Proceedings of the Computational Social Networks, 2012

2011
Using ontological and document similarity to estimate museum exhibit relatedness.
ACM Journal on Computing and Cultural Heritage, 2011

Word sense disambiguation for event trigger word detection in biomedicine.
BMC Bioinform., 2011

Melbourne Language Group Microblog Track Report.
Proceedings of The Twentieth Text REtrieval Conference, 2011

Word classes in Indonesian: A linguistic reality or a convenient fallacy in natural language processing?
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation, 2011

In Situ Text Summarisation for Museum Visitors.
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation, 2011

MWEs and Topic Modelling: Enhancing Machine Learning with Linguistics.
Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011

Predicting and compensating for lexicon access errors.
Proceedings of the 16th International Conference on Intelligent User Interfaces, 2011

Treeblazing: Using External Treebanks to Filter Parse Forests for Parse Selection and Treebanking.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Cross-domain Feature Selection for Language Identification.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Fleshing it out: A Supervised Approach to MWE-token and MWE-type Classification.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Modelling and Predicting Movements of Museum Visitors: A Simulation Framework for Assessing the Impact of Sensor Noise on Model Performance.
Proceedings of the 9th Workshop on Intelligent Techniques for Web Personalization & Recommender Systems, 2011

Predicting Thread Discourse Structure over Technical Web Forums.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Relation Guided Bootstrapping of Semantic Lexicons.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Automatic Labelling of Topic Models.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Lexical Normalisation of Short Text Messages: Makn Sens a #twitter.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Collective Classification of Congressional Floor-Debate Transcripts.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Predicting Thread Linking Structure by Lexical Chaining.
Proceedings of the Australasian Language Technology Association Workshop 2011, 2011

2010
Multiword Expressions.
Proceedings of the Handbook of Natural Language Processing, Second Edition, 2010

Visualizing search results and document collections using topic maps.
J. Web Semant., 2010

A Reexamination of MRD-Based Word Sense Disambiguation.
ACM Trans. Asian Lang. Inf. Process., 2010

How to pick out token instances of English verb-particle constructions.
Lang. Resour. Evaluation, 2010

SemEval-2010 Task 5: Automatic Keyphrase Extraction from Scientific Articles.
Proceedings of the 5th International Workshop on Semantic Evaluation, 2010

Chart Mining-based Lexical Acquisition with Precision Grammars.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Automatic Evaluation of Topic Coherence.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Language Identification: The Long and the Short of the Matter.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Evaluating topic models for digital libraries.
Proceedings of the 2010 Joint International Conference on Digital Libraries, 2010

Classifying Dialogue Acts in One-on-One Live Chats.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Unsupervised Parse Selection for HPSG.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Tagging and Linking Web Forum Posts.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010

Best Topic Word Selection for Topic Labelling.
Proceedings of the COLING 2010, 2010

Evaluating N-gram based Evaluation Metrics for Automatic Keyphrase Extraction.
Proceedings of the COLING 2010, 2010

PanLex and LEXTRACT: Translating all Words of all Languages of the World.
Proceedings of the COLING 2010, 2010

Thread-level Analysis over Technical User Forum Data.
Proceedings of the Australasian Language Technology Association Workshop, 2010

Classifying User Forum Participants: Separating the Gurus from the Hacks, and Other Tales of the Internet.
Proceedings of the Australasian Language Technology Association Workshop, 2010

Multilingual Language Identification: ALTW 2010 Shared Task Data.
Proceedings of the Australasian Language Technology Association Workshop, 2010

2009
The hare and the tortoise: speed and accuracy in translation retrieval.
Mach. Transl., 2009

Hozumi Tanaka.
Comput. Linguistics, 2009

Prepositions in Applications: A Survey and Introduction to the Special Issue.
Comput. Linguistics, 2009

A Baseline Approach to the RTE5 Search Pilot.
Proceedings of the Second Text Analysis Conference, 2009

Web and Corpus Methods for Malay Count Classifier Prediction.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Recognising the Predicate-argument Structure of Tagalog.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Experiments on pattern-based relation learning.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Biomedical Event Annotation with CRFs and Precision Grammars.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

Restoring Punctuation and Casing in English Text.
Proceedings of the AI 2009: Advances in Artificial Intelligence, 2009

Automatic Satire Detection: Are You Having a Laugh?
Proceedings of the ACL 2009, 2009

Double Double, Morphology and Trouble: Looking into Reduplication in Indonesian.
Proceedings of the Australasian Language Technology Association Workshop, 2009

Extracting Domain-Specific Words - A Statistical Approach.
Proceedings of the Australasian Language Technology Association Workshop, 2009

Corpus-based Extraction of Japanese Compound Verbs.
Proceedings of the Australasian Language Technology Association Workshop, 2009

2008
Using interest and transition models to predict visitor locations in museums.
AI Commun., 2008

An unsupervised approach to interpreting noun compounds.
Proceedings of the 4th International Conference on Natural Language Processing and Knowledge Engineering, 2008

Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Benchmarking Noun Compound Interpretation.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

MRD-based Word Sense Disambiguation: Further Extending Lesk.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Orthographic similarity search for dictionary lookup of Japanese words.
Proceedings of the ECAI 2008, 2008

Measuring and Predicting Orthographic Associations: Modelling the Similarity of Japanese Kanji.
Proceedings of the COLING 2008, 2008

Applying Discourse Analysis and Data Mining Methods to Spoken OSCE Assessments.
Proceedings of the COLING 2008, 2008

Using Collaborative Models to Adaptively Predict Visitor Locations in Museums.
Proceedings of the Adaptive Hypermedia and Adaptive Web-Based Systems, 2008

Aspect-Based Personalized Text Summarization.
Proceedings of the Adaptive Hypermedia and Adaptive Web-Based Systems, 2008

Improving Parsing and PP Attachment Performance with Sense Information.
Proceedings of the ACL 2008, 2008

Learning Count Classifier Preferences of Malay Nouns.
Proceedings of the Australasian Language Technology Association Workshop, 2008

Automatic Event Reference Identification.
Proceedings of the Australasian Language Technology Association Workshop, 2008

Towards Automatic Animated Storyboarding.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Automatic Acquisition of Qualia Structure from Corpus Data.
IEICE Trans. Inf. Syst., 2007

Bootstrapping Deep Lexical Resources: Resources for Courses.
CoRR, 2007

MELB-YB: Preposition Sense Disambiguation Using Rich Semantic Features.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

MELB-MKB: Lexical Substitution system based on Relatives in Context.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

UBC-UMB: Combining unsupervised and supervised systems for all-words WSD.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

MELB-KB: Nominal Classification as Noun Compound Interpretation.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

Scalable Deep Linguistic Processing: Mind the Lexical Gap.
Proceedings of the 21st Pacific Asia Conference on Language, Information and Computation, 2007

Dynamic Path Prediction and Recommendation in a Museum Environment.
Proceedings of the Workshop on Language Technology for Cultural Heritage Data, 2007

The Impact of Deep Linguistic Processing on Parsing Technology.
Proceedings of the Tenth International Conference on Parsing Technologies, 2007

Word Sense Disambiguation Incorporating Lexical and Structural Semantic Information.
Proceedings of the EMNLP-CoNLL 2007, 2007

An Investigation into the Interaction Between Feature Selection and Discretization: Learning How and When to Read Numbers.
Proceedings of the AI 2007: Advances in Artificial Intelligence, 2007

Dictionary Alignment for Context-sensitive Word Glossing.
Proceedings of the Australasian Language Technology Workshop, 2007

Extending Sense Collocations in Interpreting Noun Compounds.
Proceedings of the Australasian Language Technology Workshop, 2007

Disambiguating Noun Compounds.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Semantic role labeling of prepositional phrases.
ACM Trans. Asian Lang. Inf. Process., 2006

Reconsidering Language Identification for Written Language Resources.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Open Source Corpus Analysis Tools for Malay.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Modelling the Orthographic Neighbourhood for Japanese Kanji.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006

Multilingual Deep Lexical Acquisition for HPSGs via Supertagging.
Proceedings of the EMNLP 2006, 2006

Interpreting Semantic Relations in Noun Compounds via Verb Semantics.
Proceedings of the ACL 2006, 2006

Verb Sense Disambiguation Using Selectional Preferences Extracted with a State-of-the-art Semantic Role Labeler.
Proceedings of the Australasian Language Technology Workshop, 2006

Die Morphologie (f): Targeted Lexical Acquisition for Languages other than English.
Proceedings of the Australasian Language Technology Workshop, 2006

Analysis and Prediction of User Behaviour in a Museum Environment.
Proceedings of the Australasian Language Technology Workshop, 2006

2005
Disambiguating Japanese compound verbs.
Comput. Speech Lang., 2005

Deep lexical acquisition of verb-particle constructions.
Comput. Speech Lang., 2005

Semantic Role Labelling of Prepositional Phrases.
Proceedings of the Natural Language Processing, 2005

Automatic Interpretation of Noun Compounds Using WordNet Similarity.
Proceedings of the Natural Language Processing, 2005

Efficient Grapheme-phoneme Alignment for Japanese.
Proceedings of the Australasian Language Technology Workshop, 2005

Statistical Interpretation of Compound Nominalisations.
Proceedings of the Australasian Language Technology Workshop, 2005

POS Tagging with a More Informative Tagset.
Proceedings of the Australasian Language Technology Workshop, 2005

2004
Automatic Discovery of Telic and Agentive Roles from Corpus Data.
Proceedings of the 18th Pacific Asia Conference on Language, Information and Computation, 2004

A Multilingual Database of Idioms.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Evaluating the FOKS Error Model.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Road-testing the English Resource Grammar Over the British National Corpus.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2003
Translation selection for Japanese-English noun-noun compounds.
Proceedings of Machine Translation Summit IX: Papers, 2003

A Plethora of Methods for Learning English Countability.
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2003

Crosslingual Countability Classification with EuroWordNet.
Proceedings of the Computational Linguistics in the Netherlands 2003, 2003

Learning the Countability of English Nouns from Corpus Data.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

The Ins and Outs of Dutch noun countability classification.
Proceedings of the Australasian Language Technology Workshop, 2003

2002
Multiword expressions: linguistic precision and reusability.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Enhanced Japanese Electronic Dictionary Look-up.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Extracting the Unextractable: A Case Study on Verb-particles.
Proceedings of the 6th Conference on Natural Language Learning, 2002

Bringing the Dictionary to the User: The FOKS System.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Multiword Expressions: A Pain in the Neck for NLP.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2002

2001
The Japanese Translation Task: Lexical and Structural Perspectives.
Proceedings of Second International Workshop on Evaluating Word Sense Disambiguation Systems, 2001

Low-cost, High-Performance Translation Retrieval: Dumber is Better.
Proceedings of the Association for Computational Linguistics, 2001

2000
Verb Alternations and Japanese: How, What and Where.
Proceedings of the 14th Pacific Asia Conference on Language, Information and Computation, 2000

The Effects of Word Order and Segmentation on Translation Retrieval Performance.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

