Chunyu Kit

Orcid: 0000-0002-6445-7400

According to our database, Chunyu Kit authored at least 73 papers between 1991 and 2024.

Bibliography

2024
Creation of a structured solar cell material dataset and performance prediction using large language models.
Patterns, 2024

Du Fu's conspicuous negativity and Li Bai's hidden positivity: a sentiment comparison and exploration.
Digit. Scholarsh. Humanit., 2024

Opinion Mining by Convolutional Neural Networks for Maximizing Discoverability of Nanomaterials.
J. Chem. Inf. Model., 2024

From Tokens to Materials: Leveraging Language Models for Scientific Discovery.
CoRR, 2024

SciQAG: A Framework for Auto-Generated Scientific Question Answering Dataset with Fine-grained Evaluation.
CoRR, 2024

2023
DARWIN Series: Domain Specific Large Language Models for Natural Science.
CoRR, 2023

Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT.
CoRR, 2023

2022
Interdisciplinary Discovery of Nanomaterials Based on Convolutional Neural Networks.
CoRR, 2022

2020
Multi-choice Relational Reasoning for Machine Reading Comprehension.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Chinese Word Segmentation: Another Decade Review (2007-2017).
CoRR, 2019

2018
Collaborative Matching for Sentence Alignment.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

2015
Towards non-monotonic sentence alignment.
Inf. Sci., 2015

Learning bilingual distributed phrase representations for statistical machine translation.
Proceedings of Machine Translation Summit XV: Papers, 2015

Short and Sparse Text Topic Modeling via Self-Aggregation.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

2013
Integrative Semantic Dependency Parsing via Efficient Large-scale Feature Selection.
J. Artif. Intell. Res., 2013

Combine Constituent and Dependency Parsing via Reranking.
Proceedings of the IJCAI 2013, 2013

Non-Monotonic Sentence Alignment via Semisupervised Learning.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Finding More Bilingual Webpages with High Credibility via Link Analysis.
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, 2013

2012
Extending Machine Translation Evaluation Metrics with Lexical Cohesion to Document Level.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Entropy-based Training Data Selection for Domain Adaptation.
Proceedings of the COLING 2012, 2012

Higher-order Constituent Parsing and Parser Combination.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Semi-automatic Annotation of Chinese Word Structure.
Proceedings of the Second CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2012

2011
Integrating unsupervised and supervised word segmentation: The role of goodness measures.
Inf. Sci., 2011

Learning Multiple Level Features for Opinion Analysis.
Int. J. Comput. Process. Orient. Lang., 2011

Guest Editors' Introduction.
Int. J. Comput. Process. Orient. Lang., 2011

Lexical cohesion for evaluation of machine translation at document level.
Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering, 2011

Comparative Evaluation of Term Informativeness Measures in Machine Translation Evaluation Metrics.
Proceedings of Machine Translation Summit XIII: Papers, 2011

Improving Part-of-speech Tagging for Context-free Parsing.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

2010
Cross Lingual Opinion Analysis via Transfer Learning.
Aust. J. Intell. Inf. Process. Syst., 2010

The Parameter-Optimized ATEC Metric for MT Evaluation.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

HITSZ_CITYU: Combine Collocation, Context Words and Neighboring Sentence Sentiment in Sentiment Adjectives Disambiguation.
Proceedings of the 5th International Workshop on Semantic Evaluation, 2010

Incorporating Feature-based and Similarity-based Opinion Mining - CTL in NTCIR-8 MOAT.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Opinion retrieval based on mutual reinforcement between opinion analysis and relevance estimation.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Does joint decoding really outperform cascade processing in English-to-Chinese transliteration generation? The role of syllabification.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

Two Cores in Chinese Negation System: A Corpus-Based View.
Proceedings of the International Conference on Asian Language Processing, 2010

Reranking with Multiple Features for Better Transliteration.
Proceedings of the 2010 Named Entities Workshop, 2010

Bigram HMM with Context Distribution Clustering for Unsupervised Chinese Part-of-Speech tagging.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010

Combine Person Name and Person Identity Recognition and Document Clustering for Chinese Person Name Disambiguation.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010

Active Learning Based Corpus Annotation.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010

Automatic Identification of Predicate Heads in Chinese Sentences.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010

2009
ATEC: automatic evaluation of machine translation via word choice and word order.
Mach. Transl., 2009

A Simple and Efficient Model Pruning Method for Conditional Random Fields.
Proceedings of the Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, 2009

Meta-evaluation of Machine Translation Using Parallel Legal Texts.
Proceedings of the Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, 2009

An Extractive Text Summarizer Based on Significant Words.
Proceedings of the Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, 2009

Semantic Dependency Parsing of NomBank and PropBank: An Efficient Integrated Approach via a Large-scale Feature Selection.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Multilingual Dependency Learning: A Huge Feature Engineering Method to Semantic Dependency Parsing.
Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, 2009

Transliteration of Name Entity via Improved Statistical Translation on Character Sequences.
Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration, 2009

Cross Language Dependency Parsing using a Bilingual Lexicon.
Proceedings of the ACL 2009, 2009

2008
Scaling Conditional Random Fields by One-Against-the-Other Decomposition.
J. Comput. Sci. Technol., 2008

Chinese word segmentation as morpheme-based lexical chunking.
Inf. Sci., 2008

An Improved Corpus Comparison Approach to Domain Specific Term Recognition.
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation, 2008

Unsupervised Segmentation Helps Supervised Learning of Character Tagging for Word Segmentation and Named Entity Recognition.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

An Empirical Comparison of Goodness Measures for Unsupervised Chinese Word Segmentation with a Unified Framework.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Parsing Syntactic and Semantic Dependencies with Two Single-Stage Maximum Entropy Models.
Proceedings of the Twelfth Conference on Computational Natural Language Learning, 2008

Automatic Chinese Multi-word Term Extraction.
Proceedings of the ALPIT 2008, 2008

2007
Scaling Conditional Random Field with Application to Chinese Word Segmentation.
Proceedings of the Third International Conference on Natural Computation, 2007

Improving Chinese Word Segmentation with Description Length Gain.
Proceedings of the 2007 International Conference on Artificial Intelligence, 2007

An Intelligent Web Agent to Mine Bilingual Parallel Pages via Automatic Discovery of URL Pairing Patterns.
Proceedings of the 2007 IEEE/WIC/ACM International Conference on Web Intelligence and International Conference on Intelligent Agent Technology, 2007

2006
A New Dictionary-based Word Alignment Algorithm.
J. Chin. Lang. Comput., 2006

Abbreviation Recognition with MaxEnt Model.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2006

2005
Harvesting the Bitexts of the Laws of Hong Kong From the Web.
Proceedings of the Fifth Workshop on Asian Language Resources and First Symposium on Asian Language Resources Network, 2005

Period Disambiguation with Maxent Model.
Proceedings of the Natural Language Processing, 2005

An Example-Based Chinese Word Segmentation System for CWSB-2.
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, 2005

2004
Integrating N-gram Model and Case-based Learning For Chinese Word Segmentation.
J. Chin. Lang. Comput., 2004

An Example-Based Study on Chinese Word Segmentation Using Critical Fragments.
Proceedings of the Natural Language Processing, 2004

Unsupervised Segmentation of Chinese Corpus Using Accessor Variety.
Proceedings of the Natural Language Processing, 2004

2003
Integrating Ngram Model and Case-based Learning for Chinese Word Segmentation.
Proceedings of the Second Workshop on Chinese Language Processing, 2003

2002
Learning Case-based Knowledge for Disambiguating Chinese Word Segmentation: A Preliminary Study.
Proceedings of the First Workshop on Chinese Language Processing, 2002

1999
Unsupervised Learning of Word Boundary with Description Length Gain.
Proceedings of the 1999 Workshop on Computational Natural Language Learning, 1999

1994
Automatic Terminology Extraction For Thematic Corpus Based On Subterm Co-Occurrence.
Proceedings of Rocling Computational Linguistics Conference VII, 1994

1992
Tokenization As The Initial Phase In NLP.
Proceedings of the 14th International Conference on Computational Linguistics, 1992

1991
Automatic Chinese Text Generation Based On Inference Trees.
Proceedings of Rocling Computational Linguistics Conference, 1991

