Stephen Wan

ORCID: 0000-0001-7505-1417

  • CSIRO, Data61, Epping, Australia
  • Macquarie University, Sydney, Australia (PhD)

According to our database, Stephen Wan authored at least 60 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.




An adaptive approach to noisy annotations in scientific information extraction.
Inf. Process. Manag., 2024

What Causes the Failure of Explicit to Implicit Discourse Relation Recognition?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Detecting Online Community Practices with Large Language Models: A Case Study of Pro-Ukrainian Publics on Twitter.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SciHarvester: Searching Scientific Documents for Numerical Values.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Rethinking the Role of Entity Type in Relation Classification.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Impact of sample selection on in-context learning for entity extraction from scientific writing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Investigating Metric Diversity for Evaluating Long Document Summarisation.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Neural Rule-Execution Tracking Machine For Transformer-Based Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Integrating Lexical Information into Entity Neighbourhood Representations for Relation Prediction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Mention Flags (MF): Constraining Transformer-based Text Generators.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Image Captioning using Facial Expression and Attention.
J. Artif. Intell. Res., 2020

Social Media Relevance Filtering Using Perplexity-Based Positive-Unlabelled Learning.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020

'Watch the Flu': A Tweet Monitoring Tool for Epidemic Intelligence of Influenza in Australia.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Searching for musical features using natural language queries: the C@merata evaluations at MediaEval.
Lang. Resour. Evaluation, 2019

Towards Generating Stylized Image Captions via Adversarial Training.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Automatic Recognition of Student Engagement Using Deep Learning and Facial Expression.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

How to Best Use Syntax in Semantic Role Labelling.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Red-faced ROUGE: Examining the Suitability of ROUGE for Opinion Summary Evaluation.
Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association, 2019

Senti-Attend: Image Captioning using Sentiment and Attention.
CoRR, 2018

Distinguishing Individuals from Organisations on Twitter.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Twitter Content Eliciting User Engagement: A Case Study on Australian Organisations.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

CSIRO at 2017 TREC Precision Medicine Track.
Proceedings of The Twenty-Sixth Text REtrieval Conference, 2017

The CLAS System at the MediaEval 2017 C@merata Task.
Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

Demographic Inference on Twitter using Recursive Neural Networks.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Data61-CSIRO systems at the CLPsych 2016 Shared Task.
Proceedings of the 3rd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2016

Occupational Representativeness in Twitter.
Proceedings of the 21st Australasian Document Computing Symposium, 2016

CSIRO Data61 at the WNUT Geo Shared Task.
Proceedings of the 2nd Workshop on Noisy User-generated Text, 2016

Detecting Social Roles in Twitter.
Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, 2016

The Effects of Data Collection Methods in Twitter.
Proceedings of the First Workshop on NLP and Computational Social Science, 2016

The Role of Features and Context on Suicide Ideation Detection.
Proceedings of the Australasian Language Technology Association Workshop 2016, Melbourne, Australia, December 5, 2016

CLAS at the MediaEval 2015 C@merata Task.
Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Ranking election issues through the lens of social media.
Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, 2015

Social Media Data Aggregation and Mining for Internet-Scale Customer Relationship Management.
Proceedings of the 2015 IEEE International Conference on Information Reuse and Integration, 2015

Understanding Public Emotional Reactions on Twitter.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

Improving Government Services Using Social Media Feedback.
Proceedings of the Social Media for Government Services, 2015

The CLAS System at the MediaEval 2014 C@merata Task.
Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Improving government services with social media feedback.
Proceedings of the 19th International Conference on Intelligent User Interfaces, 2014

Filtering and Ranking for Social Media Monitoring.
Proceedings of the CORIA 2013 - Conférence en Recherche d'Information et Applications, 2013

A Study: From Electronic Laboratory Notebooks to Generated Queries for Literature Recommendation.
Proceedings of the Australasian Language Technology Association Workshop, 2013

Differences in Language and Style Between Two Social Media Communities.
Proceedings of the Sixth International Conference on Weblogs and Social Media, 2012

Listening to the community: social media monitoring tasks for improving government services.
Proceedings of the International Conference on Human Factors in Computing Systems, 2011

Supporting browsing-specific information needs: Introducing the Citation-Sensitive In-Browser Summariser.
J. Web Semant., 2010

Focused and aggregated search: a perspective from natural language generation.
Inf. Retr., 2010

Spanning Tree Approaches for Statistical Sentence Generation.
Proceedings of the Empirical Methods in Natural Language Generation: Data-oriented Methods and Empirical Evaluation, 2010

Capturing the User's Reading Context for Tailoring Summaries.
Proceedings of the User Modeling, 2009

Whetting the appetite of scientists: producing summaries tailored to the citation context.
Proceedings of the 2009 Joint International Conference on Digital Libraries, 2009

Improving Grammaticality in Statistical Sentence Generation: Introducing a Dependency Spanning Tree Algorithm with an Argument Satisfaction Model.
Proceedings of EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Athens, Greece, March 30, 2009

Experimenting with Clause Segmentation for Text Summarization.
Proceedings of the First Text Analysis Conference, 2008

Seed and Grow: Augmenting Statistically Generated Summary Sentences using Schematic Word Patterns.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

In-Browser Summarisation: Generating Elaborative Summaries Biased Towards the Reading Context.
Proceedings of the ACL 2008, 2008

GLEU: Automatic Evaluation of Sentence-Level Fluency.
Proceedings of the ACL 2007, 2007

Using Dependency-Based Features to Take the 'Para-farce' out of Paraphrase.
Proceedings of the Australasian Language Technology Workshop, 2006

Searching for Grammaticality: Propagating Dependencies in the Viterbi Algorithm.
Proceedings of the Tenth European Workshop on Natural Language Generation, 2005

Towards Statistical Paraphrase Generation: Preliminary Evaluations of Grammaticality.
Proceedings of the Third International Workshop on Paraphrasing, 2005

Generating Overview Summaries of Ongoing Email Thread Discussions.
Proceedings of the COLING 2004, 2004

Straight to the point: Discovering themes for summary generation.
Proceedings of the Australasian Language Technology Workshop, 2003

Generating Personal Travel Guides - And Who Wants Them?
Proceedings of the User Modeling 2001, 8th International Conference, 2001

Generating Personal Travel Guides from Discourse Plans.
Proceedings of the Adaptive Hypermedia and Adaptive Web-Based Systems, 2000

Automatic English-Chinese Name Transliteration for Development of Multilingual Resources.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998
