We stand with Ukraine

We stand with Ukraine

Kaito Ariu

Orcid: 0000-0001-6286-9906

According to our database¹, Kaito Ariu authored at least 37 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

The Power of Perturbation under Sampling in Solving Extensive-Form Games.

[BibT_eX]

[DOI]

,

Mitsuki Sakamoto

,

,

,

Tuomas Sandholm

,

Atsushi Iwasaki

CoRR, January, 2025

Efficient Creative Selection in Online Advertising using Top-Two Thompson Sampling.

[BibT_eX]

[DOI]

Daiki Katsuragawa

,

,

,

Proceedings of the Eighteenth ACM International Conference on Web Search and Data Mining, 2025

2024

Optimal clustering from noisy binary feedback.

[BibT_eX]

[DOI]

,

,

Alexandre Proutière

,

Mach. Learn., May, 2024

Rate-Optimal Bayesian Simple Regret in Best Arm Identification.

[BibT_eX]

[DOI]

Junpei Komiyama

,

,

,

Math. Oper. Res., 2024

Time-Varyingness in Auction Breaks Revenue Equivalence.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Last Iterate Convergence in Monotone Mean Field Games.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games.

[BibT_eX]

[DOI]

,

Mitsuki Sakamoto

,

,

Atsushi Iwasaki

CoRR, 2024

Synchronization behind Learning in Periodic Zero-Sum Games Triggers Divergence from Nash equilibrium.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Global Behavior of Learning Dynamics in Zero-Sum Games with Memory Asymmetry.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment.

[BibT_eX]

[DOI]

,

Tetsuro Morimura

,

,

CoRR, 2024

Nash Equilibrium and Learning Dynamics in Three-Player Matching m-Action Games.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Return-Aligned Decision Transformer.

[BibT_eX]

[DOI]

Tsunehiko Tanaka

,

,

,

Tetsuro Morimura

,

Edgar Simo-Serra

CoRR, 2024

On Universally Optimal Algorithms for A/B Testing.

[BibT_eX]

[DOI]

,

,

Alexandre Proutière

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Matroid Semi-Bandits in Sublinear Time.

[BibT_eX]

[DOI]

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Model-Based Minimum Bayes Risk Decoding for Text Generation.

[BibT_eX]

[DOI]

,

Tetsuro Morimura

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adaptively Perturbed Mirror Descent for Learning in Games.

[BibT_eX]

[DOI]

,

,

Mitsuki Sakamoto

,

Atsushi Iwasaki

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Filtered Direct Preference Optimization.

[BibT_eX]

[DOI]

Tetsuro Morimura

,

Mitsuki Sakamoto

,

,

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding.

[BibT_eX]

[DOI]

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Memory Asymmetry Creates Heteroclinic Orbits to Nash Equilibrium in Learning in Zero-Sum Games.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Model-Based Minimum Bayes Risk Decoding.

[BibT_eX]

[DOI]

,

Tetsuro Morimura

,

,

,

CoRR, 2023

On Uniformly Optimal Algorithms for Best Arm Identification in Two-Armed Bandits with Fixed Budget.

[BibT_eX]

[DOI]

,

,

Alexandre Proutière

CoRR, 2023

Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model.

[BibT_eX]

[DOI]

,

Alexandre Proutière

,

CoRR, 2023

A Slingshot Approach to Learning in Monotone Games.

[BibT_eX]

[DOI]

,

,

Mitsuki Sakamoto

,

Atsushi Iwasaki

CoRR, 2023

Memory Asymmetry: A Key to Convergence in Zero-Sum Games.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

Exploration of Unranked Items in Safe Online Learning to Re-Rank.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games.

[BibT_eX]

[DOI]

,

,

Mitsuki Sakamoto

,

Kentaro Toyoshima

,

Atsushi Iwasaki

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Last-Iterate Convergence with Full- and Noisy-Information Feedback in Two-Player Zero-Sum Games.

[BibT_eX]

[DOI]

,

,

Mitsuki Sakamoto

,

Kentaro Toyoshima

,

Atsushi Iwasaki

CoRR, 2022

Optimal Fixed-Budget Best Arm Identification using the Augmented Inverse Probability Weighting Estimator in Two-Armed Gaussian Bandits with Unknown Variances.

[BibT_eX]

[DOI]

,

,

Masaaki Imaizumi

,

Masatoshi Uehara

,

Masahiro Nomura

,

CoRR, 2022

Thresholded Lasso Bandit.

[BibT_eX]

[DOI]

,

,

Alexandre Proutière

Proceedings of the International Conference on Machine Learning, 2022

2021

Optimal Simple Regret in Bayesian Best Arm Identification.

[BibT_eX]

[DOI]

Junpei Komiyama

,

,

,

CoRR, 2021

Policy Choice and Best Arm Identification: Comments on "Adaptive Treatment Assignment in Experiments for Policy Choice".

[BibT_eX]

[DOI]

,

,

Junpei Komiyama

,

Kenichiro McAlinn

CoRR, 2021

The Role of Contextual Information in Best Arm Identification.

[BibT_eX]

[DOI]

,

CoRR, 2021

2020

A Practical Guide of Off-Policy Evaluation for Bandit Problems.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Regret in Online Recommendation Systems.

[BibT_eX]

[DOI]

,

,

,

Alexandre Proutière

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Optimal Algorithms for Multiplayer Multi-Armed Bandits.

[BibT_eX]

[DOI]

,

Alexandre Proutière

,

,

,

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2017

Chance-Constrained Path Planning with Continuous Time Safety Guarantees.

[BibT_eX]

[DOI]

,

,

Márcio da Silva Arantes

,

Cláudio Toledo

,

Brian Charles Williams

Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

Loading...