Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities.
CoRR, 2024
Building a Large Japanese Web Corpus for Large Language Models.
CoRR, 2024
Unsupervised Domain Adaptation for Sparse Retrieval by Filling Vocabulary and Word Frequency Gaps.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022
Incorporating Semantic Textual Similarity and Lexical Matching for Information Retrieval.
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021
The helix-inversion mechanism in double-stranded helical oligomers bridged by rotary cyclic boronate esters.
J. Comput. Chem., 2019