Tatsuya Hiraoka

According to our database1, Tatsuya Hiraoka authored at least 17 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization.
CoRR, 2024

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs.
CoRR, 2024

Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models.
CoRR, 2024

An Analysis of BPE Vocabulary Trimming in Neural Machine Translation.
CoRR, 2024

Knowledge of Pretrained Language Models on Surface Information of Tokens.
CoRR, 2024

2023
Tokenization Tractability for Human and Machine Learning Model: An Annotation Study.
CoRR, 2023

Downstream Task-Oriented Neural Tokenizer Optimization with Vocabulary Restriction as Post Processing.
CoRR, 2023

Vocabulary Replacement in SentencePiece for Domain Adaptation.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

2022
Recurrent Neural Hidden Markov Model for High-order Transition.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

MaxMatch-Dropout: Subword Regularization for WordPiece.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Word-level Perturbation Considering Word Length and Compositional Subwords.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks.
Proceedings of the Sixth Workshop on Structured Prediction for NLP, 2022

2021
Joint Optimization of Tokenization and Downstream Model.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Named Entity Recognition and Relation Extraction using Enhanced Table Filling by Contextualized Representations.
CoRR, 2020

Optimizing Word Segmentation for Downstream Task.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
Stochastic Tokenization with a Language Model for Neural Text Classification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019


  Loading...