Andrew Caines

Orcid: 0000-0001-9647-4902

According to our database1, Andrew Caines authored at least 44 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes.
CoRR, 2024

Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Logging Keystrokes in Writing by English Learners.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Grammatical Error Correction for Code-Switched Sentences by Learners of English.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Recurrent Neural Collaborative Filtering for Knowledge Tracing.
Proceedings of the Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium and Blue Sky, 2024

Workshop on Automatic Evaluation of Learning and Assessment Content.
Proceedings of the Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium and Blue Sky, 2024

Prompting open-source and commercial language models for grammatical error correction of English learner text.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Automated hate speech detection and span extraction in underground hacking and extremist forums.
Nat. Lang. Eng., September, 2023

A Survey on Recent Approaches to Question Difficulty Estimation from Text.
ACM Comput. Surv., 2023

CLIMB: Curriculum Learning for Infant-inspired Model Building.
CoRR, 2023

On the application of Large Language Models for language teaching and assessment technology.
CoRR, 2023

Finding the Needle in a Haystack: Unsupervised Rationale Extraction from Long Text Classifiers.
CoRR, 2023

On the Application of Large Language Models for Language Teaching and Assessment Technology.
Proceedings of the Workshop on Empowering Education with LLMs, 2023

2022
PostCog: A tool for interdisciplinary research into underground forums at scale.
Proceedings of the IEEE European Symposium on Security and Privacy, 2022

Probing for targeted syntactic knowledge through grammatical error detection.
Proceedings of the 26th Conference on Computational Natural Language Learning, 2022

2021
Towards a parallel corpus of Portuguese and the Bantu language Emakhuwa of Mozambique.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

Efficient Unsupervised NMT for Related Languages with Cross-Lingual Language Models and Fidelity Objectives.
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects, 2021

2020
The Teacher-Student Chatroom Corpus.
CoRR, 2020

REPROLANG 2020: Automatic Proficiency Scoring of Czech, English, German, Italian, and Spanish Learner Essays.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

An Expectation Maximisation Algorithm for Automated Cognate Detection.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Grammatical error detection in transcriptions of spoken English.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Adaptive Forgetting Curves for Spaced Repetition Language Learning.
Proceedings of the Artificial Intelligence in Education - 21st International Conference, 2020

Detecting Trending Terms in Cybersecurity Forum Discussions.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Overview of the 2019 Spoken CALL Shared Task.
Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

CAMsterdam at SemEval-2019 Task 6: Neural and graph-based feature extraction for the identification of offensive tweets.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Automatic Grammatical Error Detection of Non-native Spoken Learner English.
Proceedings of the IEEE International Conference on Acoustics, 2019

Accurate Modelling of Language Learning Tasks and Students Using Representations of Grammatical Proficiency.
Proceedings of the 12th International Conference on Educational Data Mining, 2019

Skills Embeddings: A Neural Approach to Multicomponent Representations of Students and Tasks.
Proceedings of the 12th International Conference on Educational Data Mining, 2019

Behavioural Cloning of Teachers for Automatic Homework Selection.
Proceedings of the Artificial Intelligence in Education - 20th International Conference, 2019

2018
Characterizing Eve: Analysing Cybercrime Actors in a Large Underground Forum.
Proceedings of the Research in Attacks, Intrusions, and Defenses, 2018

Impact of ASR Performance on Free Speaking Language Assessment.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Overview of the 2018 Spoken CALL Shared Task.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Aggressive language in an online hacking forum.
Proceedings of the 2nd Workshop on Abusive Language Online, 2018

2017
Spoken CALL Shared Task system description.
Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017

Parsing transcripts of speech.
Proceedings of the Workshop on Speech-Centric Natural Language Processing, 2017

Collecting fluency corrections for spoken learner English.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

A Text Normalisation System for Non-Standard English Words.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2016
Predicting Author Age from Weibo Microblog Posts.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Automated speech-unit delimitation in spoken learner English.
Proceedings of the COLING 2016, 2016

2015
Incremental Dependency Parsing and Disfluency Detection in Spoken Learner English.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

2012
Annotating progressive aspect constructions in the spoken section of the British National Corpus.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Reclassifying subcategorization frames for experimental analysis and stimulus generation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012


  Loading...