Shun Kiyono

Orcid: 0000-0002-5063-9297

According to our database1, Shun Kiyono authored at least 25 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Large Vocabulary Size Improves Large Language Models.
CoRR, 2024

2023
Spike No More: Stabilizing the Pre-training of Large Language Models.
CoRR, 2023

Lessons on Parameter Sharing across Layers in Transformers.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

Bridging the Gap between Subword and Character Segmentation in Pretrained Language Models.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

B2T Connection: Serving Stability and Performance in Deep Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
On Layer Normalizations and Residual Connections in Transformers.
CoRR, 2022

Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model.
CoRR, 2022

NT5 at WMT 2022 General Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

2021
Rethinking Perturbations in Encoder-Decoders for Fast Training.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

SHAPE : Shifted Absolute Position Embedding for Transformers.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Massive Exploration of Pseudo Data for Grammatical Error Correction.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Tohoku-AIP-NTT at WMT 2020 News Translation Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ESPnet-ST: All-in-One Speech Translation Toolkit.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019
Riposte! A Large Corpus of Counter-Arguments.
CoRR, 2019

ESPnet How2 Speech Translation System for IWSLT 2019: Pre-training, Knowledge Distillation, and Going Deeper.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Effective Adversarial Regularization for Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Mixture of Expert/Imitator Networks: Scalable Semi-Supervised Learning Framework.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Reducing Odd Generation from Neural Headline Generation.
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, 2018

Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

2017
Source-side Prediction for Neural Headline Generation.
CoRR, 2017


  Loading...