Trevor Cohn

Orcid: 0000-0003-4363-1673

  • University of Melbourne, School of Computing and Information Systems, Australia

According to our database, Trevor Cohn authored at least 240 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.






A Computational Approach to Identifying Cultural Keywords Across Languages.
Cogn. Sci., January, 2024

SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks.
Trans. Assoc. Comput. Linguistics, 2024

Overcoming Reward Model Noise in Instruction-Guided Reinforcement Learning.
CoRR, 2024

Planning in the Dark: LLM-Symbolic Planning Pipeline without Experts.
CoRR, 2024

Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM.
CoRR, 2024

Don't Throw Away Data: Better Sequence Knowledge Distillation.
CoRR, 2024

Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning.
CoRR, 2024

Revisiting subword tokenization: A case study on affixal negation in large language models.
CoRR, 2024

Backdoor Attack on Multilingual Machine Translation.
CoRR, 2024

Backdoor Attacks on Multilingual Machine Translation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Revisiting subword tokenization: A case study on affixal negation in large language models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Simpson's Paradox and the Accuracy-Fluency Tradeoff in Translation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Predicting Human Translation Difficulty with Neural Machine Translation.
CoRR, 2023

Multi-EuP: The Multilingual European Parliament Dataset for Analysis of Bias in Information Retrieval.
CoRR, 2023

A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents.
CoRR, 2023

IMBERT: Making BERT Immune to Insertion-based Backdoor Attacks.
CoRR, 2023

DeltaScore: Evaluating Story Generation with Differentiating Perturbations.
CoRR, 2023

Can Very Large Pretrained Language Models Learn Storytelling With A Few Examples?
CoRR, 2023

Language models are not naysayers: an analysis of language models on negation benchmarks.
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics, 2023

Seeking Clozure: Robust Hypernym extraction from BERT with Anchored Prompts.
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics, 2023

The Next Chapter: A Study of Large Language Models in Storytelling.
Proceedings of the 16th International Natural Language Generation Conference, 2023

It's not only What You Say, It's also Who It's Said to: Counterfactual Analysis of Interactive Behavior in the Courtroom.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Everybody Needs Good Neighbours: An Unsupervised Locality-based Method for Bias Mitigation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DeltaScore: Fine-Grained Story Evaluation with Perturbations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Boot and Switch: Alternating Distillation for Zero-Shot Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Noisy Self-Training with Synthetic Queries for Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

More than Votes? Voting and Language based Partisanship in the US Supreme Court.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Fingerprint Attack: Client De-Anonymization in Federated Learning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Performance Prediction via Bayesian Matrix Factorisation for Multilingual Natural Language Processing Tasks.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Probing Power by Prompting: Harnessing Pre-trained Language Models for Power Connotation Framing.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Don't Mess with Mister-in-Between: Improved Negative Search for Knowledge Graph Completion.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Rethinking Round-Trip Translation for Machine Translation Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Survey for Efficient Open Domain Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Predicting Human Translation Difficulty Using Automatic Word Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Cost-effective Distillation of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Detecting Backdoors in Deep Text Classifiers.
CoRR, 2022

Rethinking Round-trip Translation for Automatic Machine Translation Evaluation.
CoRR, 2022

fairlib: A Unified Framework for Assessing and Improving Classification Fairness.
CoRR, 2022

Towards Equal Opportunity Fairness through Adversarial Learning.
CoRR, 2022

Improving negation detection with negation-focused pre-training.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Optimising Equal Opportunity Fairness in Model Training.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Not another Negation Benchmark: The NaN-NLI Test Suite for Sub-clausal Negation.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Does Representational Fairness Imply Empirical Fairness?
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

WAX: A New Dataset for Word Association eXplanations.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Systematic Evaluation of Predictive Fairness.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Foiling Training-Time Attacks on Neural Machine Translation Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

FairLib: A Unified Framework for Assessing and Improving Fairness.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Balancing out Bias: Achieving Fairness Through Balanced Training.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents.
Proceedings of the Advances in Information Retrieval, 2022

LED down the rabbit hole: exploring the potential of global attention for biomedical multi-document summarisation.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Measuring and Mitigating Name Biases in Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Incorporating Constituent Syntax for Coreference Resolution.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

ChemTables: a dataset for semantic classification on tables in chemical patents.
J. Cheminformatics, 2021

ChEMU 2020: Natural Language Processing Methods Are Effective for Information Extraction From Chemical Patents.
Frontiers Res. Metrics Anal., 2021

Exploring Story Generation with Multi-task Objectives in Variational Autoencoders.
CoRR, 2021

Contrastive Learning for Fair Representations.
CoRR, 2021

Balancing out Bias: Achieving Fairness Through Training Reweighting.
CoRR, 2021

A Targeted Attack on Black-Box Neural Machine Translation with Parallel Data Poisoning.
Proceedings of the WWW '21: The Web Conference 2021, 2021

ITTC @ TREC 2021 Clinical Trials Track.
Proceedings of the Thirtieth Text REtrieval Conference, 2021

PTST-UoM at SemEval-2021 Task 10: Parsimonious Transfer for Sequence Tagging.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Incorporating Syntax and Semantics in Coreference Resolution with Heterogeneous Graph Attention Network.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Generating Diverse Descriptions from Semantic Graphs.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

It Is Not As Good As You Think! Evaluating Simultaneous Machine Translation on Interpretation Data.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Mitigating Data Poisoning in Text Classification with Differential Privacy.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Evaluating Debiasing Techniques for Intersectional Biases.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Fairness-aware Class Imbalanced Learning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ChEMU 2021: Reaction Reference Resolution and Anaphora Resolution in Chemical Patents.
Proceedings of the Advances in Information Retrieval, 2021

PPT: Parsimonious Parser Transfer for Unsupervised Cross-Lingual Adaptation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Diverse Adversaries for Mitigating Bias in Training.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Learning Coupled Policies for Simultaneous Machine Translation using Imitation Learning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Commonsense Knowledge in Word Associations and ConceptNet.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Putting words into the system's mouth: A targeted attack on neural machine translation using monolingual data poisoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

As Easy as 1, 2, 3: Behavioural Testing of NMT Systems for Numerical Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Decoupling Adversarial Training for Fair NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Targeted Poisoning Attacks on Black-Box Neural Machine Translation.
CoRR, 2020

Learning Coupled Policies for Simultaneous Machine Translation.
CoRR, 2020

Decoding As Dynamic Programming For Recurrent Autoregressive Models.
Proceedings of the 8th International Conference on Learning Representations, 2020

ChEMU: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents.
Proceedings of the Advances in Information Retrieval, 2020

Overview of ChEMU 2020: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2020

Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Gaussian Processes for Rumour Stance Classification in Social Media.
ACM Trans. Inf. Syst., 2019

Multilingual NER Transfer for Low-resource Languages.
CoRR, 2019

Truth Inference at Scale: A Bayesian Model for Adjudicating Highly Redundant Crowd Annotations.
Proceedings of the World Wide Web Conference, 2019

Neural Speech Translation using Lattice Transformations and Graph Networks.
Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing, 2019

Target Based Speech Act Classification in Political Campaign Text.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

Contextualization of Morphological Inflection.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Exploiting Worker Correlation for Label Aggregation in Crowdsourcing.
Proceedings of the 36th International Conference on Machine Learning, 2019

A Unified Neural Architecture for Instrumental Audio Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2019

On the Role of Scene Graphs in Image Captioning.
Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge, 2019

Deep Ordinal Regression for Pledge Specificity Prediction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Grounding learning of modifier dynamics: An application to color naming.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings.
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019

Massively Multilingual Transfer for NER.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Putting Evaluation in Context: Contextual Embeddings Improve Machine Translation Evaluation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Semi-supervised Stochastic Multi-Domain Learning using Variational Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

From Shakespeare to Li-Bai: Adapting a Sonnet Model to Chinese Poetry.
Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association, 2019

Discourse-aware rumour stance classification in social media using sequential classifiers.
Inf. Process. Manag., 2018

Evaluating the Utility of Hand-crafted Features in Sequence Labelling.
CoRR, 2018

Exploiting graph kernels for high performance biomedical relation extraction.
J. Biomed. Semant., 2018

Hierarchical Structured Model for Fine-to-Coarse Manifesto Text Analysis.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Recurrent Entity Networks with Delayed Memory Update for Targeted Aspect-Based Sentiment Analysis.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Evaluating Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Evaluating the Utility of Hand-crafted Features in Sequence Labeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Twitter Geolocation using Knowledge-Based Methods.
Proceedings of the 4th Workshop on Noisy User-generated Text, 2018

Iterative Back-Translation for Neural Machine Translation.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Content-based Popularity Prediction of Online Petitions Using a Deep Regression Model.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Narrative Modeling with Memory Chains and Semantic Supervision.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Towards Robust and Privacy-preserving Text Representations.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

A Stochastic Decoder for Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Graph-to-Sequence Learning using Gated Graph Neural Networks.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Semi-supervised User Geolocation via Graph Convolutional Networks.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Deep-speare: A joint neural model of poetic language, meter and rhyme.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Towards Efficient Machine Translation Evaluation by Modelling Annotators.
Proceedings of the Australasian Language Technology Association Workshop 2018, 2018

Improved Neural Machine Translation using Side Information.
Proceedings of the Australasian Language Technology Association Workshop 2018, 2018

DyNet: The Dynamic Neural Network Toolkit.
CoRR, 2017

Decoding as Continuous Optimization in Neural Machine Translation.
CoRR, 2017

Pairwise Webpage Coreference Classification Using Distant Supervision.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

End-to-end Network for Twitter Geolocation Prediction and Hashing.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Learning Kernels over Strings using Gaussian Processes.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Compressed Nonparametric Language Modelling.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Modelling the Working Week for Multi-Step Forecasting using Gaussian Process Regression.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Word Representation Models for Morphologically Rich Languages in Neural Machine Translation.
Proceedings of the First Workshop on Subword and Character Level Models in NLP, 2017

Continuous Representation of Location for Geolocation and Lexical Dialectology using Mixture Density Networks.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Sequence Effects in Crowdsourced Annotations.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Towards Decoding as Continuous Optimisation in Neural Machine Translation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Learning how to Active Learn: A Deep Reinforcement Learning Approach.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Multilingual Training of Crosslingual Word Embeddings.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Cross-Lingual Word Embeddings for Low-Resource Language Modeling.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Context-Aware Prediction of Derivational Word-forms.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Robust Training under Linguistic Adversity.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Multi-step prediction with missing smart sensor data using multi-task Gaussian processes.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Longitudinal Modeling of Social Media with Hawkes Process Based on Users and Networks.
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31, 2017

A Neural Model for User Geolocation and Lexical Dialectology.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Topically Driven Neural Language Model.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Model Transfer for Tagging Low-resource Languages using a Bilingual Dictionary.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Decoupling Encoder and Decoder Networks for Abstractive Document Summarization.
Proceedings of the Workshop on Summarization and Summary Evaluation Across Source Types and Genres, 2017

Joint Sentence-Document Model for Manifesto Text Analysis.
Proceedings of the Australasian Language Technology Association Workshop, 2017

Improving End-to-End Memory Networks with Unified Weight Tying.
Proceedings of the Australasian Language Technology Association Workshop, 2017

Phonemic Transcription of Low-Resource Tonal Languages.
Proceedings of the Australasian Language Technology Association Workshop, 2017

Fast, Small and Exact: Infinite-order Language Modelling with Compressed Suffix Trees.
Trans. Assoc. Comput. Linguistics, 2016

Using Gaussian Processes for Rumour Stance Classification in Social Media.
CoRR, 2016

Exploiting Tree Kernels for High Performance Chemical Induced Disease Relation Extraction.
Proceedings of the 7th International Symposium on Semantic Mining in Biomedicine, 2016

Incorporating Side Information into Recurrent Neural Network Language Models.
Proceedings of the NAACL HLT 2016, 2016

An Attentional Model for Speech Translation Without Transcription.
Proceedings of the NAACL HLT 2016, 2016

Incorporating Structural Alignment Biases into an Attentional Neural Translation Model.
Proceedings of the NAACL HLT 2016, 2016

Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Learning a Translation Model from Word Lattices.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Richer Interpolative Smoothing Based on Modified Kneser-Ney Language Modeling.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning Robust Representations of Text.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning Crosslingual Word Embeddings without Bilingual Corpora.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning a Lexicon and Translation Model from Phoneme Lattices.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning when to trust distant supervision: An application to low-resource POS tagging using cross-lingual projection.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Exploring Prediction Uncertainty in Machine Translation Quality Estimation.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Succinct Data Structures for NLP-at-Scale.
Proceedings of the COLING 2016, 2016

SeeDev Binary Event Extraction using SVMs and a Rich Feature Set.
Proceedings of the 4th BioNLP Shared Task Workshop, BioNLP 2016, 2016

Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

pigeo: A Python Geotagging Tool.
Proceedings of ACL-2016 System Demonstrations, Berlin, Germany, August 7-12, 2016

Hawkes Processes for Continuous Time Sequence Classification: an Application to Rumour Stance Classification in Twitter.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Improving Neural Translation Models with Linguistic Factors.
Proceedings of the Australasian Language Technology Association Workshop 2016, Melbourne, Australia, December 5, 2016

ASM Kernel: Graph Kernel using Approximate Subgraph Matching for Relation Extraction.
Proceedings of the Australasian Language Technology Association Workshop 2016, Melbourne, Australia, December 5, 2016

Convolution Kernels for Discriminative Learning from Streaming Text.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Learning Structural Kernels for Natural Language Processing.
Trans. Assoc. Comput. Linguistics, 2015

A Bayesian non-linear method for feature selection in machine translation quality estimation.
Mach. Transl., 2015

Day trading profit maximization with multi-task learning and technical analysis.
Mach. Learn., 2015

Depth-Gated LSTM.
CoRR, 2015

Estimating collective judgement of rumours in social media.
CoRR, 2015

Document Context Language Models.
CoRR, 2015

Structured Prediction of Sequences and Trees Using Infinite Contexts.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Exploiting Text and Network Context for Geolocation of Social Media Users.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Compact, Efficient and Unlimited Capacity: Language Modeling with Compressed Suffix Trees.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Modeling Tweet Arrival Times using Log-Gaussian Cox Processes.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Classifying Tweet Level Judgements of Rumours in Social Media.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

A Neural Network Model for Low-Resource Universal Dependency Parsing.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Cross-lingual Transfer for Unsupervised Dependency Parsing Without Parallel Data.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Twitter User Geolocation Using a Unified Text and Network Prediction Model.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Point Process Modelling of Rumour Dynamics in Social Media.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Low Resource Dependency Parsing: Cross-lingual Parameter Sharing in a Neural Network Parser.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Non-Linear Text Regression with a Deep Convolutional Neural Network.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Predicting Peer-to-Peer Loan Rates Using Bayesian Non-Linear Regression.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

What Can We Get From 1000 Tokens? A Case Study of Multilingual POS Tagging For Resource-Poor Languages.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Joint Emotion Analysis via Multi-task Gaussian Processes.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Data selection for discriminative training in statistical machine translation.
Proceedings of the 17th Annual conference of the European Association for Machine Translation, 2014

Predicting and Characterising User Impact on Twitter.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Factored Markov Translation with Robust Modeling.
Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014

Extracting Socioeconomic Patterns from the News: Modelling Text and Outlet Importance Jointly.
Proceedings of the Workshop on Language Technologies and Computational Social Science@ACL 2014, 2014

Simple extensions and POS Tags for a reparameterised IBM Model 2.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Gaussian Processes for Natural Language Processing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

An abstractive approach to sentence compression.
ACM Trans. Intell. Syst. Technol., 2013

BLEU Deconstructed: Designing a Better MT Evaluation Metric.
Int. J. Comput. Linguistics Appl., 2013

SHEF-Lite: When Less is More for Translation Quality Estimation.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Mining user behaviours: a study of check-in patterns in location based social networks.
Proceedings of the Web Science 2013 (co-located with ECRC), 2013

Adaptation of lecture speech recognition system with machine translation output.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

Where's @wally?: a classification approach to geolocating users based on their social ties.
Proceedings of the 24th ACM Conference on Hypertext and Social Media (part of ECRC), 2013

A temporal model of text periodicities using Gaussian Processes.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Topic-Oriented Words as Features for Named Entity Recognition.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

QuEst - A translation quality estimation framework.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

A user-centric model of voting intention from Social Media.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

A Markov Model of Machine Translation using Non-parametric Bayesian Inference.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Modelling Annotator Bias with Multi-task Gaussian Processes: An Application to Machine Translation Quality Estimation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

An Infinite Hierarchical Bayesian Model of Phrasal Translation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Reducing Annotation Effort for Quality Estimation via Active Learning.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Redundancy reduction for multi-document summaries using A* search and discriminative training.
Proceedings of the 2nd International Workshop on Exploiting Large Knowledge Repositories, 2012

Evaluating a Morphological Analyser of Inuktitut.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Left-to-Right Tree-to-String Decoding with Prediction.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Regression and Ranking based Optimisation for Sentence Level MT Evaluation.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

A Hierarchical Pitman-Yor Process HMM for Unsupervised Part of Speech Induction.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Inducing Tree-Substitution Grammars.
J. Mach. Learn. Res., 2010

Inducing Synchronous Grammars with Slice Sampling.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Unsupervised Induction of Tree Substitution Grammars for Dependency Parsing.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Multi-Document Summarization Using A* Search and Discriminative Learning.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Blocked Inference in Bayesian Tree Substitution Grammars.
Proceedings of the ACL 2010, 2010

Sentence Compression as Tree Transduction.
J. Artif. Intell. Res., 2009

Inducing Compact but Accurate Tree-Substitution Grammars.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2009

A Bayesian Model of Syntax-Directed Tree to String Grammar Induction.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Word Lattices for Multi-Source Translation.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, 2009

A Note on the Implementation of Hierarchical Dirichlet Processes.
Proceedings of the ACL 2009, 2009

A Gibbs Sampler for Phrasal Synchronous Grammar Induction.
Proceedings of the ACL 2009, 2009

Constructing Corpora for the Development and Evaluation of Paraphrase Systems.
Comput. Linguistics, 2008

Bayesian Synchronous Grammar Induction.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Sentence Compression Beyond Word Deletion.
Proceedings of the COLING 2008, 2008

ParaMetric: An Automatic Evaluation Metric for Paraphrasing.
Proceedings of the COLING 2008, 2008

A Discriminative Latent Variable Model for Statistical Machine Translation.
Proceedings of the ACL 2008, 2008

Scaling conditional random fields for natural language processing.
PhD thesis, 2007

Large Margin Synchronous Generation and its Application to Sentence Compression.
Proceedings of the EMNLP-CoNLL 2007, 2007

Machine Translation by Triangulation: Making Effective Use of Multi-Parallel Corpora.
Proceedings of the ACL 2007, 2007

Efficient Inference in Large Conditional Random Fields.
Proceedings of the Machine Learning: ECML 2006, 2006

Discriminative Word Alignment with Conditional Random Fields.
Proceedings of the ACL 2006, 2006

Semantic Role Labelling with Tree Conditional Random Fields.
Proceedings of the Ninth Conference on Computational Natural Language Learning, 2005

Logarithmic Opinion Pools for Conditional Random Fields.
Proceedings of the ACL 2005, 2005

Scaling Conditional Random Fields Using Error-Correcting Codes.
Proceedings of the ACL 2005, 2005

Performance metrics for word sense disambiguation.
Proceedings of the Australasian Language Technology Workshop, 2003
