Shun Kiyono

Orcid: 0000-0002-5063-9297

According to our database¹, Shun Kiyono authored at least 26 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

Self-Translate-Train: A Simple but Strong Baseline for Cross-lingual Transfer of Large Language Models.

,

,

CoRR, 2024

Large Vocabulary Size Improves Large Language Models.

,

,

,

CoRR, 2024

2023

Spike No More: Stabilizing the Pre-training of Large Language Models.

,

,

Sosuke Kobayashi

,

CoRR, 2023

Lessons on Parameter Sharing across Layers in Transformers.

,

Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

Bridging the Gap between Subword and Character Segmentation in Pretrained Language Models.

,

,

,

Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

B2T Connection: Serving Stability and Performance in Deep Transformers.

,

,

Sosuke Kobayashi

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

On Layer Normalizations and Residual Connections in Transformers.

,

,

Sosuke Kobayashi

,

CoRR, 2022

Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model.

Sosuke Kobayashi

,

,

,

CoRR, 2022

NT5 at WMT 2022 General Translation Task.

Makoto Morishita

,

,

,

,

,

,

Proceedings of the Seventh Conference on Machine Translation, 2022

2021

Rethinking Perturbations in Encoder-Decoders for Fast Training.

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution.

,

,

Yuichiroh Matsubayashi

,

,

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

SHAPE: Shifted Absolute Position Embedding for Transformers.

,

Sosuke Kobayashi

,

,

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Massive Exploration of Pseudo Data for Grammatical Error Correction.

,

,

Tomoya Mizumoto

,

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Tohoku-AIP-NTT at WMT 2020 News Translation Task.

,

,

,

Makoto Morishita

,

Proceedings of the Fifth Conference on Machine Translation, 2020

A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction.

,

,

Masahiro Kaneko

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution.

,

Yuichiroh Matsubayashi

,

,

,

,

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction.

Masahiro Kaneko

,

,

,

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ESPnet-ST: All-in-One Speech Translation Toolkit.

Hirofumi Inaguma

,

,

,

,

,

,

Shinji Watanabe

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019

Riposte! A Large Corpus of Counter-Arguments.

,

Benjamin Heinzerling

,

,

,

CoRR, 2019

ESPnet How2 Speech Translation System for IWSLT 2019: Pre-training, Knowledge Distillation, and Going Deeper.

Hirofumi Inaguma

,

,

Nelson Enrique Yalta Soplin

,

,

,

Shinji Watanabe

Proceedings of the 16th International Conference on Spoken Language Translation, 2019

An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction.

,

,

,

Tomoya Mizumoto

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Effective Adversarial Regularization for Neural Machine Translation.

,

,

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Mixture of Expert/Imitator Networks: Scalable Semi-Supervised Learning Framework.

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Reducing Odd Generation from Neural Headline Generation.

,

,

,

,

,

Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, 2018

Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models.

,

,

,

,

,

Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

2017

Source-side Prediction for Neural Headline Generation.

,

,

,

,

,

CoRR, 2017
