Jun'ichi Tsujii

  • National Institute for Advanced Industrial Science and Technology, Japan

According to our database, Jun'ichi Tsujii authored at least 329 papers between 1973 and 2025.

ELAINE-medLLM: Lightweight English Japanese Chinese Trilingual Large Language Model for Bio-medical Domain.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Next Big Challenges in Core AI Technology.
Proceedings of the Reflections on Artificial Intelligence for Humanity, 2021

Transfer fine-tuning of BERT with phrasal paraphrases.
Comput. Speech Lang., 2021

Natural Language Processing and Computational Linguistics.
Comput. Linguistics, 2021

Compositional Phrase Alignment and Beyond.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Improving clinical named entity recognition in Chinese using the graphical and phonetic feature.
BMC Medical Informatics Decis. Mak., 2019

Mapping anatomical related entities to human body parts based on wikipedia in discharge summaries.
BMC Bioinform., 2019

Transfer Fine-Tuning: A BERT Case Study.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Annotation and detection of drug effects in text for pharmacovigilance.
J. Cheminformatics, 2018

SPADE: Evaluation Dataset for Monolingual Phrase Alignment.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Improve Chinese Clinical Named Entity Recognition Performance by Using the Graphical and Phonetic Feature.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Monolingual Phrase Alignment on Parse Forests.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Distributed Document and Phrase Co-embeddings for Descriptive Clustering.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

A Latent Concept Topic Model for Robust Topic Inference Using Word Embeddings.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Overview of the Cancer Genetics and Pathway Curation tasks of BioNLP Shared Task 2013.
BMC Bioinform., December, 2015

Bilingual term alignment from comparable corpora in English discharge summary and Chinese discharge summary.
BMC Bioinform., 2015

Estimating Numerical Attributes by Bringing Together Fragmentary Clues.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Generalization of Semantic Roles in Automatic Semantic Role Labeling.
Inf. Media Technol., 2014

Discovering robust Embeddings in (DIS)Similarity Space for High-Dimensional Linguistic Features.
Comput. Intell., 2014

Generalising semantic category disambiguation with large lexical resources for fun and profit.
J. Biomed. Semant., 2014

Combining String and Context Similarity for Bilingual Term Alignment from Comparable Corpora.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Using a Random Forest Classifier to Compile Bilingual Dictionaries of Technical Terms from Comparable Corpora.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Common Space Embedding of Primal-Dual Relation Semantic Spaces.
Proceedings of the COLING 2014, 2014

Learning Abbreviations from Chinese and English Terms by Modeling Non-Local Information.
ACM Trans. Asian Lang. Inf. Process., 2013

An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge.
J. Am. Medical Informatics Assoc., 2013

Probabilistic Chinese word segmentation with non-local information and stochastic training.
Inf. Process. Manag., 2013

Named entity recognition with multiple segment representations.
Inf. Process. Manag., 2013

Design and implementation of GXP make - A workflow system based on make.
Future Gener. Comput. Syst., 2013

Overview of the Pathway Curation (PC) task of BioNLP Shared Task 2013.
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013

Deep Context-Free Grammar for Chinese with Broad-Coverage.
Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, 2013

Using a Random Forest Classifier to recognise translations of biomedical terms across languages.
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, 2013

Statistical Extraction and Comparison of Pivot Words for Bilingual Lexicon Extension.
ACM Trans. Asian Lang. Inf. Process., 2012

Proximity-Based Frameworks for Generating Embeddings from Multi-Output Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Structure-guided supertagger learning.
Nat. Lang. Eng., 2012

Named entity recognition of follow-up and time information in 20 000 radiology reports.
J. Am. Medical Informatics Assoc., 2012

A classification approach to coreference in discharge summaries: 2011 i2b2 challenge.
J. Am. Medical Informatics Assoc., 2012

Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries.
J. Am. Medical Informatics Assoc., 2012

Overview of the ID, EPI and REL tasks of BioNLP Shared Task 2011.
BMC Bioinform., 2012

Improving protein coreference resolution by simple semantic classification.
BMC Bioinform., 2012

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011.
BMC Bioinform., 2012

Event extraction across multiple levels of biological organization.
Bioinform., 2012

Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

brat: a Web-based Tool for NLP-Assisted Text Annotation.
Proceedings of the EACL 2012, 2012

Coordination Structure Analysis using Dual Decomposition.
Proceedings of the EACL 2012, 2012

Akamon: An Open Source Toolkit for Tree/Forest-Based Statistical Machine Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, 2012

Bridging the Gap Between Scope-based and Event-based Negation/Speculation Annotations: A Bridge Not Too Far.
Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, 2012

Incremental Joint Approach to Word Segmentation, POS Tagging, and Dependency Parsing in Chinese.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea

Large-vocabulary Lexical Choice with Rich Context Features.
Int. J. Comput. Linguistics Appl., 2011

U-Compare: A modular NLP workflow construction and evaluation system.
IBM J. Res. Dev., 2011

Bio-molecular Event Extraction with Markov Logic.
Comput. Intell., 2011

Extracting Bio-molecular Events from literature - the BioNLP'09 Shared Task.
Comput. Intell., 2011

U-Compare bio-event meta-service: compatible BioNLP event extraction services.
BMC Bioinform., 2011

An analysis of gene/protein associations at PubMed scale.
J. Biomed. Semant., 2011

Event extraction for DNA methylation.
J. Biomed. Semant., 2011

Automatic extraction of angiogenesis bioprocess from text.
Bioinform., 2011

Discovering and visualizing indirect associations between biomedical concepts.
Bioinform., 2011

AGRA: analysis of gene ranking algorithms.
Bioinform., 2011

SMT Systems in the University of Tokyo for NTCIR-9 PatentMT.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Natural Language Understanding, Semantic-based Information Retrieval and Knowledge Management.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

NTT-UT Statistical Machine Translation in NTCIR-9 PatentMT.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Resource-rich research on natural language processing and understanding.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Analysis of the Difficulties in Chinese Deep Parsing.
Proceedings of the 12th International Conference on Parsing Technologies, 2011

Incremental Joint POS Tagging and Dependency Parsing in Chinese.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Exploring Difficulties in Parsing Imperatives and Questions.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Poster: Analysis of gene ranking algorithms with extraction of relevant biomedical concepts from PubMed publications.
Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, 2011

Computational Linguistics and Natural Language Processing.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011

Multi-topical Discussion Summarization Using Structured Lexical Chains and Cue Words.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011

Effective Use of Dependency Structure for Bilingual Lexicon Creation.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011

Automatic Acquisition of Huge Training Data for Bio-Medical Named Entity Recognition.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011

BioNLP Shared Task 2011: Supporting Resources.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011

SimSem: Fast Approximate String Matching in Relation to Semantic Category Disambiguation.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011

Overview of the Entity Relations (REL) supporting task of BioNLP Shared Task 2011.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011

Overview of the Infectious Diseases (ID) task of BioNLP Shared Task 2011.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011

Towards Exhaustive Event Extraction for Protein Modifications.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011

Overview of the Epigenetics and Post-translational Modifications (EPI) task of BioNLP Shared Task 2011.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011

From Pathways to Biomolecular Events: Opportunities and Challenges.
Proceedings of the 2011 Workshop on Biomedical Natural Language Processing, 2011

Overview of BioNLP 2011 Protein Coreference Shared Task.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011

Overview of BioNLP Shared Task 2011.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011

A Collaborative Annotation between Human Annotators and a Statistical Parser.
Proceedings of the Fifth Linguistic Annotation Workshop, 2011

Effective Use of Function Words for Rule Generalization in Forest-Based Translation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Learning the Optimal Use of Dependency-parsing Information for Finding Translations with Comparable Corpora.
Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web, 2011

Extracting Protein Interactions from Text with the Unified AkaneRE Event Extraction System.
IEEE ACM Trans. Comput. Biol. Bioinform., 2010

Improve syntax-based translation using deep syntactic structures.
Mach. Transl., 2010

Improving the Inter-Corpora Compatibility for protein Annotations.
J. Bioinform. Comput. Biol., 2010

A Re-Evaluation of Biomedical Named Entity-Term Relations.
J. Bioinform. Comput. Biol., 2010

Event Extraction with Complex Event Classification Using Rich Features.
J. Bioinform. Comput. Biol., 2010

Comparison of Chinese Treebanks for Corpus-oriented HPSG Grammar Development.
Inf. Media Technol., 2010

Medie and Info-pubmed: 2010 update.
BMC Bioinform., 2010

Disambiguating the species of biomedical named entities using natural language parsers.
Bioinform., 2010

Building a high-quality sense inventory for improved abbreviation disambiguation.
Bioinform., 2010

PathText: a text mining integrator for biological pathway visualizations.
Bioinform., 2010

Text mining meets workflow: linking U-Compare with Taverna.
Bioinform., 2010

Complex event extraction at PubMed scale.
Bioinform., 2010

A Modular Architecture for the Wide-Coverage Translation of Natural Language Texts into Predicate Logic Formulas.
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, 2010

A Simple Approach for HPSG Supertagging Using Dependency Information.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

U-Compare: An Integrated Language Resource Evaluation Platform Including a Comprehensive UIMA Resource Library.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

A Japanese Particle Corpus Built by Example-Based Annotation.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Forest-guided Supertagger Training.
Proceedings of the COLING 2010, 2010

Semi-automatically Developing Chinese HPSG Grammar from the Penn Chinese Treebank for Deep Parsing.
Proceedings of the COLING 2010, 2010

Simple and Efficient Algorithm for Approximate Dictionary Matching.
Proceedings of the COLING 2010, 2010

Imbalanced Classification Using Dictionary-based Prototypes and Hierarchical Decision Rules for Entity Sense Disambiguation.
Proceedings of the COLING 2010, 2010

Entity-Focused Sentence Simplification for Relation Extraction.
Proceedings of the COLING 2010, 2010

Evaluating Dependency Representations for Event Extraction.
Proceedings of the COLING 2010, 2010

Robust Measurement and Comparison of Context Similarity for Finding Translation Pairs.
Proceedings of the COLING 2010, 2010

Towards Event Extraction from Full Texts on Infectious Diseases.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

Event Extraction for Post-Translational Modifications.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

A Comparative Study of Syntactic Parsers for Event Extraction.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

Scaling up Biomedical Event Extraction to the Entire PubMed.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

The Deep Re-Annotation in a Chinese Scientific Treebank.
Proceedings of the Fourth Linguistic Annotation Workshop, 2010

Fine-Grained Tree-to-String Translation Rule Extraction.
Proceedings of the ACL 2010, 2010

Dependency Parsing and Domain Adaptation with Data-Driven LR Models and Parser Ensembles.
Proceedings of the Trends in Parsing Technology, 2010

HPSG Parsing with a Supertagger.
Proceedings of the Trends in Parsing Technology, 2010

Evaluating the Impact of Re-training a Lexical Disambiguation Model on Domain Adaptation of an HPSG Parser.
Proceedings of the Trends in Parsing Technology, 2010

A Chinese-Japanese Lexical Machine Translation through a Pivot Language.
ACM Trans. Asian Lang. Inf. Process., 2009

On Contribution of Sense Dependencies to Word Sense Disambiguation.
Inf. Media Technol., 2009

Protein-protein interaction extraction by leveraging multiple kernels and parsers.
Int. J. Medical Informatics, 2009

Tag-Annotated Text Search Using Extended Region Algebra.
IEICE Trans. Inf. Syst., 2009

Hozumi Tanaka.
Comput. Linguistics, 2009

Investigating heterogeneous protein annotations toward cross-corpora utilization.
BMC Bioinform., 2009

Evaluating contributions of natural language parsers to protein-protein interaction extraction.
Bioinform., 2009

U-Compare: share and compare text mining tools with UIMA.
Bioinform., 2009

Text Categorization with All Substring Features.
Proceedings of the SIAM International Conference on Data Mining, 2009

Design of Chinese HPSG Framework for Data-Driven Parsing.
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009

GuideLink: A Corpus Annotation System that Integrates the Management of Annotation Guidelines.
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009

Extracting Bilingual Dictionary from Comparable Corpora with Dependency Heterogeneity.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Semi-Supervised Lexicon Mining from Parenthetical Expressions in Monolingual Web Pages.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

A Discriminative Latent Variable Chinese Segmenter with Hybrid Word/Character Information.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Learning Combination Features with L1 Regularization.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Bilingual Dictionary Extraction from Wikipedia.
Proceedings of Machine Translation Summit XII: Posters, 2009

The UOT system.
Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2009, 2009

The UOT system: improve string-to-tree translation using head-driven phrase structure grammar and predicate-argument structures.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

HPSG Supertagging: A Sequence Labeling View.
Proceedings of the 11th International Workshop on Parsing Technologies (IWPT-2009), 2009

Evaluating Contribution of Deep Syntactic Information to Shallow Semantic Analysis.
Proceedings of the 11th International Workshop on Parsing Technologies (IWPT-2009), 2009

Effective Analysis of Causes and Inter-dependencies of Parsing Errors.
Proceedings of the 11th International Workshop on Parsing Technologies (IWPT-2009), 2009

Latent Variable Perceptron Algorithm for Structured Classification.
Proceedings of the IJCAI 2009, 2009

Classifying Relations for Biomedical Named Entity Disambiguation.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Supervised Learning of a Probabilistic Lexicon of Verb Semantic Classes.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

A Rich Feature Vector for Protein-Protein Interaction Extraction from Multiple Corpora.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Descriptive and Empirical Approaches to Capturing Underlying Dependencies among Parsing Errors.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Fast Full Parsing by Linear-Chain Conditional Random Fields.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

Sequential Labeling with Latent Variables: An Exact Inference Algorithm and its Efficient Approximation.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

Opinion classification with tree kernel SVM using linguistic modality analysis.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Bridging the Gap between Domain-Oriented and Linguistically-Oriented Semantics.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

From Protein-Protein Interaction to Molecular Event Extraction.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

A Markov Logic Approach to Bio-Molecular Event Extraction.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

Static Relations: a Piece in the Biomedical Information Extraction Puzzle.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

Incorporating GENETAG-style annotation to GENIA corpus.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

Overview of BioNLP'09 Shared Task on Event Extraction.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty.
Proceedings of the ACL 2009, 2009

Robust Approach to Abbreviating Terms: A Discriminative Latent Variable Model with Global Information.
Proceedings of the ACL 2009, 2009

A Comparative Study on Generalization of Semantic Roles in FrameNet.
Proceedings of the ACL 2009, 2009

A Novel Word Segmentation Approach for Written Languages with Word Boundary Markers.
Proceedings of the ACL 2009, 2009

Feature Forest Models for Probabilistic HPSG Parsing.
Comput. Linguistics, 2008

Accelerating the annotation of sparse named entities by dynamic sentence selection.
BMC Bioinform., 2008

New challenges for text mining: mapping between text and manually curated pathways.
BMC Bioinform., 2008

Corpus annotation for mining biomedical events from literature.
BMC Bioinform., 2008

Themes in biomedical natural language processing: BioNLP08.
BMC Bioinform., 2008

FACTA: a text search engine for finding associated biomedical concepts.
Bioinform., 2008

Kleio: a knowledge-enriched information retrieval system for biology.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Filling the Gaps Between Tools and Users: A Tool Comparator, Using Protein-Protein Interactions as an Example.
Proceedings of the Biocomputing 2008, 2008

Building Bilingual Lexicons using Lexical Translation Probabilities via Pivot Languages.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Connecting Text Mining and Pathways using the PathText Resource.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Challenges in Pronoun Resolution System for Biomedical Text.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Bilingual Synonym Identification with Spelling Variations.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

A Discriminative Approach to Japanese Abbreviation Extraction.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Towards Data and Goal Oriented Analysis: Tool Inter-operability and Combinatorial Comparison.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

A Discriminative Candidate Generator for String Transformations.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Coreference Resolution in Biomedical Texts: a Machine Learning Approach.
Proceedings of the Ontologies and Text Mining for Life Sciences: Current Status and Future Perspectives, March 24, 2008

Building a Bilingual Lexicon Using Phrase-based Statistical Machine Translation via a Pivot Language.
Proceedings of the COLING 2008, 2008

Parser Evaluation Across Frameworks without Format Conversion.
Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation@COLING 2008, 2008

Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference.
Proceedings of the COLING 2008, 2008

Shift-Reduce Dependency DAG Parsing.
Proceedings of the COLING 2008, 2008

A Discriminative Alignment Model for Abbreviation Recognition.
Proceedings of the COLING 2008, 2008

Exact Inference for Multi-label Classification using Sparse Graphical Models.
Proceedings of the COLING 2008, 2008

Comparative Parser Performance Analysis across Grammar Frameworks through Automatic Tree Conversion using Synchronous Grammars.
Proceedings of the COLING 2008, 2008

Word Sense Disambiguation for All Words using Tree-Structured Conditional Random Fields.
Proceedings of the COLING 2008, 2008

Nested region algebra extended with variables for tag-annotated text search.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Raising the Compatibility of Heterogeneous Annotations: A Case Study on.
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, 2008

Prediction of Protein Sub-cellular Localization using Information from Texts and Sequences.
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, 2008

From Text to Pathway: Corpus Annotation for Knowledge Acquisition from Biomedical Literature.
Proceedings of the 6th Asia-Pacific Bioinformatics Conference, 2008

Improving English-to-Chinese Translation for Technical Terms using Morphological Information.
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, 2008

Evaluating the Effects of Treebank Size in a Practical Application for Parsing.
Proceedings of the Software Engineering, 2008

Task-oriented Evaluation of Syntactic Parsers and Their Representations.
Proceedings of the ACL 2008, 2008

Learning string similarity measures for gene/protein name dictionary look-up using logistic regression.
Bioinform., 2007

Development of a Japanese-Chinese machine translation system.
Proceedings of Machine Translation Summit XI: Papers, 2007

Syntactic Features for Protein-Protein Interaction Extraction.
Proceedings of the Short Paper Proceedings of the 2nd International Symposium on Languages in Biology and Medicine (LBM 2007), 2007

A log-linear model with an n-gram reference distribution for accurate HPSG parsing.
Proceedings of the Tenth International Conference on Parsing Technologies, 2007

Evaluating Impact of Re-training a Lexical Disambiguation Model on Domain Adaptation of an HPSG Parser.
Proceedings of the Tenth International Conference on Parsing Technologies, 2007

Ambiguous Part-of-Speech Tagging for Improving Accuracy and Domain Portability of Syntactic Parsers.
Proceedings of the IJCAI 2007, 2007

Efficient HPSG Parsing with Supertagging and CFG-Filtering.
Proceedings of the IJCAI 2007, 2007

Dependency Parsing and Domain Adaptation with LR Models and Parser Ensembles.
Proceedings of the EMNLP-CoNLL 2007, 2007

Move Prediction in Go with the Maximum Entropy Method.
Proceedings of the 2007 IEEE Symposium on Computational Intelligence and Games, 2007

Reranking for Biomedical Named-Entity Recognition.
Proceedings of the Biological, translational, and clinical language processing, 2007

Combining statistical models with symbolic grammar in parsing.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

HPSG Parsing with Shallow Dependency Constraints.
Proceedings of the ACL 2007, 2007

A discriminative language model with pseudo-negative samples.
Proceedings of the ACL 2007, 2007

Automatic recognition of topic-classified relations between prostate cancer and genes using MEDLINE abstracts.
BMC Bioinform., 2006

Automatic Recognition of Topic-Classified Relations between Prostate Cancer and Genes from Medline Abstracts.
Proceedings of the Second International Symposium on Semantic Mining in Biomedicine, 2006

Extraction of Gene-Disease Relations from Medline Using Domain Dictionaries and Machine Learning.
Proceedings of the Biocomputing 2006, 2006

Linguistic and Biological Annotations of Biological Interaction Events.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Automatic Construction of Predicate-argument Structure Patterns for Biomedical Information Extraction.
Proceedings of the EMNLP 2006, 2006

Extremely Lexicalized Models for Accurate and Fast HPSG Parsing.
Proceedings of the EMNLP 2006, 2006

Subdomain adaptation of a POS tagger with a small corpus.
Proceedings of the Workshop on Linking Natural Language and Biology, 2006

Trimming CFG Parse Trees for Sentence Compression Using Machine Learning Approaches.
Proceedings of the ACL 2006, 2006

Translating HPSG-Style Outputs of a Robust Parser into Typed Dynamic Logic.
Proceedings of the ACL 2006, 2006

Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition.
Proceedings of the ACL 2006, 2006

An Intelligent Search Engine and GUI-based Efficient MEDLINE Search Tool Based on Deep Syntactic Parsing.
Proceedings of the ACL 2006, 2006

Semantic Retrieval for the Accurate Identification of Relational Concepts in Massive Textbases.
Proceedings of the ACL 2006, 2006

Maximum Entropy Models with Inequality Constraints: A Case Study on Text Categorization.
Mach. Learn., 2005

Thesaurus or Logical Ontology, Which One Do We Need for Text Mining?
Lang. Resour. Evaluation, 2005

MaSTerClass: a case-based reasoning system for the classification of biomedical terms.
Bioinform., 2005

Developing a Robust Part-of-Speech Tagger for Biomedical Text.
Proceedings of the Advances in Informatics, 2005

Bidirectional Inference with the Easiest-First Strategy for Tagging Sequence Data.
Proceedings of the HLT/EMNLP 2005, 2005

Chunk Parsing Revisited.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

Efficacy of Beam Thresholding, Unification Filtering and Hybrid Parsing in Probabilistic HPSG Parsing.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

Probabilistic Models for Disambiguation of an HPSG-Based Chart Generator.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

A Machine Learning Approach to Acronym Generation.
Proceedings of the ACL-ISMB Workshop on Linking Biological Literature, 2005

Syntax Annotation for the GENIA Corpus.
Proceedings of the Natural Language Processing - IJCNLP 2005, Second International Joint Conference, Jeju Island, Republic of Korea, October 11-13, 2005

Assigning Polarity Scores to Reviews Using Machine Learning Techniques.
Proceedings of the Natural Language Processing, 2005

Adapting a Probabilistic Disambiguation Model of an HPSG Parser to a New Domain.
Proceedings of the Natural Language Processing, 2005

Probabilistic Disambiguation Models for Wide-Coverage HPSG Parsing.
Proceedings of the ACL 2005, 2005

Probabilistic CFG with Latent Annotations.
Proceedings of the ACL 2005, 2005

Improving the performance of dictionary-based approaches in protein name recognition.
J. Biomed. Informatics, 2004

Introduction: named entity recognition in biomedicine.
J. Biomed. Informatics, 2004

Generalizing Subcategorization Frames Acquired from Corpora Using Lexicalized Grammars.
Proceedings of the 7th International Workshop on Tree Adjoining Grammar and Related Formalisms, 2004

Context-free Approximation of LTAG towards CFG Filtering.
Proceedings of the 7th International Workshop on Tree Adjoining Grammar and Related Formalisms, 2004

Thesaurus or Logical Ontology, Which do we Need for Mining Text?
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Part-of-Speech Annotation of Biology Research Abstracts.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

How long will we be able to ignore linguistic knowledge and their formalisms?
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Overview of the IWSLT04 evaluation campaign.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Iterative CKY Parsing for Probabilistic Context-Free Grammars.
Proceedings of the Natural Language Processing, 2004

A Persistent Feature-Object Database for Intelligent Text Archive Systems.
Proceedings of the Natural Language Processing, 2004

Corpus-Oriented Grammar Development for Acquiring a Head-Driven Phrase Structure Grammar from the Penn Treebank.
Proceedings of the Natural Language Processing, 2004

Word Folding: Taking the Snapshot of Words Instead of the Whole.
Proceedings of the Natural Language Processing, 2004

Deep Linguistic Analysis for the Accurate Identification of Predicate-Argument Relations.
Proceedings of the COLING 2004, 2004

Finding Anchor Verbs for Biomedical IE Using Predicate-Argument Structures.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26, 2004, 2004

Probabilistic term variant generator for biomedical terms.
SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

A Robust Retrieval Engine for Proximal and Structural Search.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

GENIA corpus - a semantically annotated corpus for bio-textmining.
Proceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology, June 29, 2003

Evaluation and Extension of Maximum Entropy Models with Inequality Constraints.
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2003

Lexicalized Grammar Acquisition.
Proceedings of the EACL 2003, 2003

Training a Naive Bayes Classifier via the EM Algorithm with a Class Distribution Constraint.
Proceedings of the Seventh Conference on Natural Language Learning, 2003

A model of syntactic disambiguation based on lexicalized grammars.
Proceedings of the Seventh Conference on Natural Language Learning, 2003

An efficient clustering algorithm for class-based language models.
Proceedings of the Seventh Conference on Natural Language Learning, 2003

Boosting Precision and Recall of Dictionary-Based Protein Name Recognition.
Proceedings of the Workshop on Natural Language Processing in Biomedicine, 2003

Encoding Biomedical Resources in TEI: The Case of the GENIA Corpus.
Proceedings of the Workshop on Natural Language Processing in Biomedicine, 2003

Comparison between CFG Filtering Techniques for LTAG and HPSG.
Proceedings of the ACL 2003, 2003

A Debug Tool for Practical Grammar Development.
Proceedings of the ACL 2003, 2003

Self-Organizing Markov Models and Their Application to Part-of-Speech Tagging.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

Stretching TEI: Converting the Genia Corpus.
Proceedings of 4th International Workshop on Linguistically Interpreted Corpora, 2003

Extracting Attributes and their Values from Web pages.
Proceedings of the Web Document Analysis, 2003

ACM Trans. Asian Lang. Inf. Process., 2002

Terminology-driven literature mining and knowledge acquisition in biomedicine.
Int. J. Medical Informatics, 2002

Accomplishments and challenges in literature data mining for biology.
Bioinform., 2002

A Formal Proof of Strong Equivalence for a Grammar Conversion from LTAG to HPSG-style.
Proceedings of the Sixth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2002

Clustering for obtaining syntactic classes of words from automatically extracted LTAG grammars.
Proceedings of the Sixth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2002

Measuring Term Representativeness.
Proceedings of the Information Extraction in the Web Era: Natural Language Communication for Knowledge Acquisition and Intelligent Information Agents, 2002

Literature Data Mining for Biology - Session Introduction.
Proceedings of the 7th Pacific Symposium on Biocomputing, 2002

An Indexing Scheme for Typed Feature Structures.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Lenient Default Unification for Robust Processing within Unification Based Grammar Formalisms.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

A Methodology for Terminology-based Knowledge Acquisition and Integration.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Tuning support vector machines for biomedical named entity recognition.
Proceedings of the ACL 2002 Workshop on Natural Language Processing in the Biomedical Domain, 2002

Event Extraction from Biomedical Papers Using a Full Parser.
Proceedings of the 6th Pacific Symposium on Biocomputing, 2001

Natural Language Processing for Biology - Session Introduction.
Proceedings of the 6th Pacific Symposium on Biocomputing, 2001

LiLFeS/GENIA Project --- NLP Tools and A Biology Domain Corpus ---.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

Are NLP technologies really ready for application? (Panel).
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

A Maximum Entropy Tagger with Unsupervised Hidden Markov Models.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

An HPSG parser with CFG filtering.
Nat. Lang. Eng., 2000

Introduction to this Special Issue.
Nat. Lang. Eng., 2000

The LiLFeS Abstract Machine and its evaluation with the LinGO grammar.
Nat. Lang. Eng., 2000

Building an Annotated Corpus in the Molecular-Biology Domain.
Proceedings of the COLING-2000 Workshop on Semantic Annotation and Intelligent Content, 2000

Lexicalized Hidden Markov Models for Part-of-Speech Tagging.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

A Hybrid Japanese Parser with Hand-crafted Grammar and Statistics.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

A Method of Measuring Term Representativeness - Baseline Method Using Co-occurrence Distribution.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

Extracting the Names of Genes and Gene Products with a Hidden Markov Model.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

Invited Talk: Generic NLP Technologies: Language, Knowledge and Information Extraction.
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 2000

Difficulty Indices for the Named Entity Task in Japanese.
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 2000

Hidden Markov Model-Based Korean Part-of-Speech Tagging Considering High Agglutinativity, Word-Spacing, and Lexical Correlativity.
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 2000

Part-of-Speech Tagging Based on Hidden Markov Model Assuming Joint Independence.
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 2000

Transfer in experience-guided machine translation.
Proceedings of Machine Translation Summit VII, 1999

Machine translation for the next century.
Proceedings of Machine Translation Summit VII, 1999

Classifying Technical Terms.
Proceedings of the Electronic Publishing '99, Redefining the Information Chain - New Ways and Voices: 3rd ICCC/IFIP conference held at the University of Karlskrona/Ronneby, 1999

The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers.
Proceedings of the EACL 1999, 1999

Translating the XTAG English grammar to HPSG.
Proceedings of the Fourth International Workshop on Tree Adjoining Grammars and Related Frameworks, 1998

Packing of feature structures for optimizing the HPSG-style grammar translated from TAG.
Proceedings of the Fourth International Workshop on Tree Adjoining Grammars and Related Frameworks, 1998

The C-value/NC-value Method of Automatic Recognition for Multi-Word Terms.
Proceedings of the Research and Advanced Technology for Digital Libraries, 1998

An Efficient Parallel Substrate for Typed Feature Structures on Shared Memory Parallel Machines.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

HPSG-Style Underspecified Japanese Grammar with Wide Coverage.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

LiLFeS - Towards a Practical HPSG Parser.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Computing Phrasal-signs in HPSG prior to Parsing.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

Automatic acquisition of semantic collocation from corpora.
Mach. Transl., 1995

An empirical study of MT: we knew nothing.
Proceedings of Machine Translation Summit V, 1995

An HPSG-based Parser for Automatic Knowledge Acquisition.
Proceedings of the Fourth International Workshop on Parsing Technologies, 1995

Quantitative Perceptual Representation of Prepositional Semantics.
Artif. Intell. Rev., 1994

Hypothesis Selection in Grammar Acquisition.
Proceedings of the 15th International Conference on Computational Linguistics, 1994

Breaking Down Rhetorical Relations for the purpose of Analysing Discourse Structures.
Proceedings of the 15th International Conference on Computational Linguistics, 1994

Automatic Recognition of Verbal Polysemy.
Proceedings of the 15th International Conference on Computational Linguistics, 1994

Combination of Symbolic and Statistical Approaches for Grammatical Knowledge Acquisition.
Proceedings of the 4th Applied Natural Language Processing Conference, 1994

A Computational View of the Cognitive Semantics of Spatial Prepositions.
Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994

Automatic Depiction of Spatial Descriptions.
Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA, USA, July 31, 1994

Linguistic Knowledge Acquisition from Parsing Failures.
Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, 1993

Interaction between Structural Changes in Machine Translation.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Visualisation: mediating the interchange of information from the verbal to the visual domain.
Proceedings of the Mensch und Maschine, 1992

Linguistic Knowledge Generator.
Proceedings of the 14th International Conference on Computational Linguistics, 1992

Automatic Learning for Semantic Collocation.
Proceedings of the 3rd Applied Natural Language Processing Conference, 1992

Lexical Transfer based on bilingual signs: Towards interaction during transfer.
Proceedings of the EACL 1991, 1991

Machine Translation without a source text.
Proceedings of the 13th International Conference on Computational Linguistics, 1990

GRADE: A software environment for machine translation.
Mach. Transl., 1988

Dialogue translation vs. text translation.
Proceedings of the 12th International Conference on Computational Linguistics, 1988

How to get preferred readings in natural language analysis.
Proceedings of the 12th International Conference on Computational Linguistics, 1988

Reasons why I do not care grammar formalism.
Proceedings of the 12th International Conference on Computational Linguistics, 1988

Machine translation from Japanese into English.
Proc. IEEE, 1986

Science and technology agency's Mu machine translation project.
Future Gener. Comput. Syst., 1986

Future Directions of Machine Translation.
Proceedings of the 11th International Conference on Computational Linguistics, 1986

Solutions for Problems of MT Parser. Methods used in Mu-Machine Translation Project.
Proceedings of the 11th International Conference on Computational Linguistics, 1986

The Transfer Phase of the Mu Machine Translation System.
Proceedings of the 11th International Conference on Computational Linguistics, 1986

The Japanese Government Project for Machine Translation.
Comput. Linguistics, 1985

Analysis Grammar of Japanese in the Mu-Project: A Procedural Approach to Analysis Grammar.
Proceedings of the 10th International Conference on Computational Linguistics and 22nd Annual Meeting of the Association for Computational Linguistics, 1984

Grammar Writing System (GRADE) of Mu-Machine Translation Project and its Characteristics.
Proceedings of the 10th International Conference on Computational Linguistics and 22nd Annual Meeting of the Association for Computational Linguistics, 1984

Dealing With Incompleteness Of Linguistic Knowledge In Language Translation - Transfer And Generation Stage Of MU Machine Translation Project.
Proceedings of the 10th International Conference on Computational Linguistics and 22nd Annual Meeting of the Association for Computational Linguistics, 1984

The Transfer Phase In an English-Japanese Translation System.
Proceedings of the 9th International Conference on Computational Linguistics, 1982

An English Japanese Machine Translation System Of The Titles Of Scientific And Engineering Papers.
Proceedings of the 9th International Conference on Computational Linguistics, 1982

An Attempt To Computerized Dictionary Data Bases.
Proceedings of the 8th International Conference on Computational Linguistics, 1980

A Machine Translation System From Japanese Into English - Another Perspective Of MT Systems.
Proceedings of the 8th International Conference on Computational Linguistics, 1980

LISP Machine NK3 and Measurement of Its Performance.
Proceedings of the Sixth International Joint Conference on Artificial Intelligence, 1979

S-Net: A Foundation for Knowledge Representation Languages.
Proceedings of the Sixth International Joint Conference on Artificial Intelligence, 1979

Mechanism of Deduction in a Question-Answering System with Natural Language Input.
Proceedings of the 3rd International Joint Conference on Artificial Intelligence. Stanford, 1973
