Nigel Collier

Orcid: 0000-0002-7230-4164

Affiliations:
  • University of Cambridge, UK


According to our database1, Nigel Collier authored at least 200 papers between 1996 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
LoGU: Long-form Generation with Uncertainty Expressions.
CoRR, 2024

Atomic Calibration of LLMs in Long-Form Generations.
CoRR, 2024

Conformity in Large Language Models.
CoRR, 2024

Prompt Compression for Large Language Models: A Survey.
CoRR, 2024

Aligning with Logic: Measuring, Evaluating and Improving Logical Consistency in Large Language Models.
CoRR, 2024

500xCompressor: Generalized Prompt Compression for Large Language Models.
CoRR, 2024

Attention Instruction: Amplifying Attention in the Middle via Prompting.
CoRR, 2024

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators.
CoRR, 2024

Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Can We Instruct LLMs to Compensate for Position Bias?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LUQ: Long-text Uncertainty Quantification for LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

TopViewRS: Vision-Language Models as Top-View Spatial Reasoners.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Can LLM be a Personalized Judge?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

An Individualized News Affective Response Dataset.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Quantifying the Persona Effect in LLM Simulations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

BAND: Biomedical Alert News Dataset.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Contrastive Search Is What You Need For Neural Text Generation.
Trans. Mach. Learn. Res., 2023

Visual Spatial Reasoning.
Trans. Assoc. Comput. Linguistics, 2023

Instruct-SCTG: Guiding Sequential Controlled Text Generation through Instructions.
CoRR, 2023

Generative Language Models Exhibit Social Identity Biases.
CoRR, 2023

FireAct: Toward Language Agent Fine-tuning.
CoRR, 2023

Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models.
CoRR, 2023

BAND: Biomedical Alert News Dataset.
CoRR, 2023

Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder.
CoRR, 2023

COFFEE: A Contrastive Oracle-Free Framework for Event Extraction.
CoRR, 2023

A Stability Analysis of Fine-Tuning a Pre-Trained Model.
CoRR, 2023

Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

POSQA: Probe the World Models of LLMs with Size Comparisons.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

On Reality and the Limits of Language Data: Aligning LLMs with Human Norms.
Proceedings of the 45th Annual Meeting of the Cognitive Science Society, 2023

MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

DePlot: One-shot visual language reasoning by plot-to-table translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

On the Effectiveness of Parameter-Efficient Fine-Tuning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
A survey on clinical natural language processing in the United Kingdom from 2007 to 2022.
npj Digit. Medicine, 2022

Plug-and-Play Recipe Generation with Content Planning.
CoRR, 2022

On Reality and the Limits of Language Data.
CoRR, 2022

Language Models Can See: Plugging Visual Controls in Text Generation.
CoRR, 2022

Exposing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders.
CoRR, 2022

Exploiting document graphs for inter sentence relation extraction.
J. Biomed. Semant., 2022

PheneBank: a literature-based database of phenotypes.
Bioinform., 2022

BioCaster in 2021: automatic disease outbreaks detection from global news media.
Bioinform., 2022

A Contrastive Framework for Neural Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

A Conceptual Framework for Representing Events Under Public Health Surveillance.
Proceedings of the Challenges of Trustable AI and Added-Value on Health, 2022

Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

How to tackle an emerging topic? Combining strong and weak labels for Covid news NER.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Prix-LM: Pretraining for Multilingual Knowledge Base Construction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Improving Word Translation via Two-Stage Contrastive Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Incorporating Stock Market Signals for Twitter Stance Detection.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
PROTOTYPE-TO-STYLE: Dialogue Generation With Style-Aware Editing on Retrieval Memory.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning.
CoRR, 2021

Fast, Effective and Self-Supervised: Transforming Masked LanguageModels into Universal Lexical and Sentence Encoders.
CoRR, 2021

Synthetic Examples Improve Cross-Target Generalization: A Study on Stance Detection on a Twitter corpus.
Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

Learning Sparse Sentence Encoding without Supervision: An Exploration of Sparsity in Variational Autoencoders.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Self-Alignment Pretraining for Biomedical Entity Representations.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Plan-then-Generate: Controlled Data-to-Text Generation via Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Few-Shot Table-to-Text Generation with Prototype Memory.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Visually Grounded Reasoning across Languages and Cultures.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Non-Autoregressive Text Generation with Pre-trained Language Models.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Adversarial Training for News Stance Detection: Leveraging Signals from a Multi-Genre Corpus.
Proceedings of the EACL Hackashop on News Media Content Analysis and Automated Report Generation, 2021

MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Integrating Transformers and Knowledge Graphs for Twitter Stance Detection.
Proceedings of the Seventh Workshop on Noisy User-generated Text, 2021

Keep the Primary, Rewrite the Secondary: A Two-Stage Approach for Paraphrase Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Dialogue Response Selection with Hierarchical Curriculum Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Visual Pivoting for (Unsupervised) Entity Alignment.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
PheneBank: Processed Medline Abstracts and PMC full articles + Phenotype-Disease Associations.
Dataset, July, 2020

PheneBank: Processed Medline Abstracts and PMC full articles + Phenotype-Disease Associations.
Dataset, July, 2020

A pragmatic guide to geoparsing evaluation.
Lang. Resour. Evaluation, 2020

Self-alignment Pre-training for Biomedical Entity Representations.
CoRR, 2020

Hierarchical Sparse Variational Autoencoder for Text Encoding.
CoRR, 2020

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory.
CoRR, 2020

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy.
CoRR, 2020

STANDER: An Expert-Annotated Dataset for News Stance Detection and Evidence Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

COMETA: A Corpus for Medical Entity Linking in the Social Media.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Will-They-Won't-They: A Very Large Dataset for Stance Detection on Twitter.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Global Health Monitor: A Web-based System for Detecting and Mapping Infectious Diseases.
CoRR, 2019

An Empirical Study of Sections in Classifying Disease Outbreak Reports.
CoRR, 2019

Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Richer-but-Smarter Shortest Dependency Path with Attentive Augmentation for Relation Extraction.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

BioReddit: Word Embeddings for User-Generated Biomedical NLP.
Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis LOUHI@EMNLP 2019, 2019

Unseen Word Representation by Aligning Heterogeneous Lexical Semantic Spaces.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
PheneBank: Processed Medline Abstracts and PMC full articles.
Dataset, February, 2018

PheneBank: Processed Medline Abstracts and PMC full articles.
Dataset, February, 2018

PheneBank: Processed Medline Abstracts and PMC full articles.
Dataset, February, 2018

PheneBank: Processed Medline Abstracts.
Dataset, February, 2018

What's missing in geographical parsing?
Lang. Resour. Evaluation, 2018

Card-660: Cambridge Rare Word Dataset - a Reliable Benchmark for Infrequent Word Representation Models.
CoRR, 2018

Card-660: A Reliable Evaluation Framework for Rare Word Representation Models.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Large-scale Exploration of Neural Relation Classification Architectures.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Mapping Text to Knowledge Graph Entities using Multi-Sense LSTMs.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Towards Automatic Fake News Detection: Cross-Level Stance Detection in News Articles.
Proceedings of the First Workshop on Fact Extraction and VERification, 2018

Modeling the Fake News Challenge as a Cross-Level Stance Detection Task.
Proceedings of the CIKM 2018 Workshops co-located with 27th ACM International Conference on Information and Knowledge Management (CIKM 2018), 2018

Which Melbourne? Augmenting Geocoding with Maps.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Learning Rare Word Representations using Semantic Bridging.
CoRR, 2017

WSDM 2017 Workshop on Mining Online Health Reports: MOHRS 2017.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Improving chemical-induced disease relation extraction with learned features based on convolutional neural network.
Proceedings of the 9th International Conference on Knowledge and Systems Engineering, 2017

Inducing Embeddings for Rare and Unseen Words by Leveraging Lexical Resources.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Towards a Seamless Integration of Word Senses into Downstream NLP Applications.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Vancouver Welcomes You! Minimalist Location Metonymy Resolution.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Thematic issue of the Second combined Bio-ontologies and Phenotypes Workshop.
J. Biomed. Semant., 2016

Sieve-based coreference resolution enhances semi-supervised learning model for chemical-induced disease relation extraction.
Database J. Biol. Databases Curation, 2016

The digital revolution in phenotyping.
Briefings Bioinform., 2016

De-Conflated Semantic Representations.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning Orthographic Features in Bi-directional LSTM for Biomedical Named Entity Recognition.
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining, 2016

Improved Semantic Representation for Domain-Specific Entities.
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016

Modelling the Combination of Generic and Target Domain Embeddings in a Convolutional Neural Network for Sentence Classification.
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016

Bidirectional LSTM for Named Entity Recognition in Twitter Messages.
Proceedings of the 2nd Workshop on Noisy User-generated Text, 2016

Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

NLP and Online Health Reports: What do we say and what do we mean?
Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis, 2016

2015
Crowdsourcing Twitter annotations to identify first-hand experiences of prescription drug use.
J. Biomed. Informatics, 2015

Special issue on bio-ontologies and phenotypes.
J. Biomed. Semant., 2015

Concept selection for phenotypes and diseases using learn to rank.
J. Biomed. Semant., 2015

Automatic concept recognition using the Human Phenotype Ontology reference and test suite corpora.
Database J. Biol. Databases Curation, 2015

PhenoMiner: from text to a database of phenotypes associated with OMIM diseases.
Database J. Biol. Databases Curation, 2015

Adapting Phrase-based Machine Translation to Normalise Medical Terms in Social Media Messages.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Towards the Semantic Interpretation of Personal Health Messages from Social Media.
Proceedings of the ACM First International Workshop on Understanding the City with Urban Informatics, 2015

2014
Discriminating Rhetorical Analogies in Social Media.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

The impact of near domain transfer on biomedical named entity recognition.
Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis, 2014

2013
Change-point detection in time-series data by relative density-ratio estimation.
Neural Networks, 2013

Twitter Emotion Analysis in Earthquake Situations.
Int. J. Comput. Linguistics Appl., 2013

A partially supervised cross-collection topic model for cross-domain text classification.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Exploring a Probabilistic Earley Parser for Event Composition in Biomedical Texts.
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013

Using silver and semi-gold standard corpora to compare open named entity recognisers.
Proceedings of the 2013 IEEE International Conference on Bioinformatics and Biomedicine, 2013

2012
Recognition of medication information from discharge summaries using ensembles of classifiers.
BMC Medical Informatics Decis. Mak., 2012

GENI-DB: a database of global events for epidemic intelligence.
Bioinform., 2012

Enhancing Twitter Data Analysis with Simple Semantic Filtering: Example in Tracking Influenza-Like Illnesses.
Proceedings of the 2012 IEEE Second International Conference on Healthcare Informatics, 2012

On-line Trend Analysis with Topic Models: \#twitter Trends Detection Topic Model Online.
Proceedings of the COLING 2012, 2012

A Hybrid Approach to Finding Phenotype Candidates in Genetic Texts.
Proceedings of the COLING 2012, 2012

2011
Towards mature use of semantic resources for biomedical analyses.
J. Biomed. Semant., 2011

Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.
J. Biomed. Semant., 2011

OMG U got flu? Analysis of shared health messages for bio-surveillance.
J. Biomed. Semant., 2011

Towards cross-lingual alerting for bursty epidemic events.
J. Biomed. Semant., 2011

An Analysis of Twitter Messages in the 2011 Tohoku Earthquake.
Proceedings of the Electronic Healthcare - 4th International Conference, 2011

Syndromic Classification of Twitter Messages.
Proceedings of the Electronic Healthcare - 4th International Conference, 2011

2010
An Empirical Study of Sections in Classifying Disease Outbreak Reports.
Proceedings of the Web-Based Applications in Healthcare and Biomedicine, 2010

A framework for enhancing spatial and temporal granularity in report-based health surveillance systems.
BMC Medical Informatics Decis. Mak., 2010

Wrestling with Biomedical Research Results: Language Resources and Literature Analysis.
J. Bioinform. Comput. Biol., 2010

A methodology to enhance spatial understanding of disease outbreak events reported in news articles.
Int. J. Medical Informatics, 2010

What's unusual in online disease outbreak news?
J. Biomed. Semant., 2010

Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports.
J. Biomed. Semant., 2010

OMG U got flu? Analysis of shared health messages for bio-surveillance.
Proceedings of the Fourth International Symposium for Semantic Mining in Biomedicine, 2010

An ontology-driven system for detecting global health events.
Proceedings of the COLING 2010, 2010

2009
Towards role-based filtering of disease outbreak reports.
J. Biomed. Informatics, 2009

Classifying disease outbreak reports using n-grams and semantic features.
Int. J. Medical Informatics, 2009

The development of a schema for semantic annotation: Gain brought by a formal ontological method.
Appl. Ontology, 2009

Using Hedges to Enhance a Disease Outbreak Report Text Mining System.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

2008
Synonym set extraction from the biomedical literature by lexical pattern discovery.
BMC Bioinform., 2008

Structuring an event ontology for disease outbreak detection.
BMC Bioinform., 2008

BioCaster: detecting public health rumors with a Web-based text mining system.
Bioinform., 2008

Global Health Monitor - A Web-based System for Detecting and Mapping Infectious Diseases.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

The Choice of Features for Classification of Verbs in Biomedical Texts.
Proceedings of the COLING 2008, 2008

2007
Named entity recognition in Vietnamese using classifier voting.
ACM Trans. Asian Lang. Inf. Process., 2007

Construction of a Vietnamese Corpora for Named Entity Recognition.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 2007, 8th International Conference, Carnegie Mellon University, Pittsburgh, PA, USA, May 30, 2007

Towards a Methodology for Entity Error Analysis in Annotated Corpora.
Proceedings of the Semantic Authoring, 2007

The Role of Roles in Classifying Annotated Biomedical Text.
Proceedings of the Biological, translational, and clinical language processing, 2007

Topic-Based Vietnamese News Document Filtering in the BioCaster Project.
Proceedings of The Sixth International Conference on Advanced Language Processing and Web Information Technology, 2007

2006
A multilingual ontology for infectious disease surveillance: rationale, design and challenges.
Lang. Resour. Evaluation, 2006

Zone analysis in biology articles as a basis for information extraction.
Int. J. Medical Informatics, 2006

Recent advances in natural language processing for biomedical applications.
Int. J. Medical Informatics, 2006

The Development of a Schema for the Annotation of Terms in the Biocaster Disease Detecting/Tracking System.
Proceedings of the KR-MED 2006, 2006

Automatic Classification of Verbs in Biomedical Texts.
Proceedings of the ACL 2006, 2006

2005
A baseline feature set for learning rhetorical zones using full articles in the biomedical domain.
SIGKDD Explor., 2005

Bio-medical entity extraction using support vector machines.
Artif. Intell. Medicine, 2005

Exploring Predicate-Argument Relations for Named Entity Recognition in the Molecular Biology Domain.
Proceedings of the Discovery Science, 8th International Conference, 2005

Towards Semantic Role Labeling & IE in the Medical Literature.
Proceedings of the AMIA 2005, 2005

2004
A Visual Lexical Model of Caravanserais of Silk Roads, A Tool for Semantic Access to Architectural 3D Data.
J. Digit. Inf. Manag., 2004

Comparison of character-level and part of speech features for name recognition in biomedical texts.
J. Biomed. Informatics, 2004

PASBio: predicate-argument structures for event extraction in molecular biology.
BMC Bioinform., 2004

Integrating Event Frame Annotation into the Open Ontology Forge Annotation Tool.
Proceedings of the 4th International Workshop on Knowledge Markup and Semantic Annotation ( SemAnnot 2004 ) located at the 3rd International Semantic Web Conference ISWC 2004 8th November 2004, 2004

Managing the semantics of coreference relations with Open Ontology Forge.
Proceedings of the 4th International Workshop on Knowledge Markup and Semantic Annotation ( SemAnnot 2004 ) located at the 3rd International Semantic Web Conference ISWC 2004 8th November 2004, 2004

An Annotation Scheme for a Rhetorical Analysis of Biology Articles.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Annotation of Coreference Relations Among Linguistic Expressions and Images in Biological Articles.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Sentiment Analysis using Support Vector Machines with Diverse Information Sources.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing , 2004

Zone Identification in Biology Articles as a Basis for Information Extraction.
Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications, 2004

Introduction to the Bio-entity Recognition Task at JNLPBA.
Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications, 2004

Incorporating topic information into semantic analysis models.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26, 2004, 2004

2003
A Framework for Integrating Deep and Shallow Semantic Structures in Text Mining.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2003

2002
Progress on Multi-lingual Named Entity Annotation Guidelines using RDF (S).
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

PIA-Core: Semantic Annotation through Example-based Learning.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Use of Support Vector Machines in Extended Named Entity Recognition.
Proceedings of the 6th Conference on Natural Language Learning, 2002

2001
A Framework for Cross-Language Information Access: Application to English and Japanese.
Comput. Humanit., 2001

Machine Learning for Information Extraction from XML marked-up text on the Semantic Web.
Proceedings of the Second International Workshop on the Semantic Web, 2001

2000
Building an Annotated Corpus in the Molecular-Biology Domain.
Proceedings of the COLING-2000 Workshop on Semantic Annotation and Intelligent Content, 2000

Extracting the Names of Genes and Gene Products with a Hidden Markov Model.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

1999
A Comparison of Query Translation Methods for English-Japanese Cross-Language Information Retrieval (poster abstract).
Proceedings of the SIGIR '99: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1999

The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers.
Proceedings of the EACL 1999, 1999

1998
An Experiment in Hybrid Dictionary and Statistical Sentence Alignment.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Machine Translation versus Dictionary Term Translation - A Comparison for English-Japanese News Article Alignment.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997
Convergence Time Characteristics of an Associative Memory for Natural Language Processing.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

1996
Storage of Natural Language Sentences in a Hopfield Network
CoRR, 1996


  Loading...