Yuu Jinnai

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Filtered Direct Preference Optimization.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding.

Findings of the Association for Computational Linguistics, 2024

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding.

Kaito Ariu

Findings of the Association for Computational Linguistics, 2024

2023

Model-Based Minimum Bayes Risk Decoding.

CoRR, 2023

On the Depth between Beam Search and Exhaustive Search for Text Generation.

Tetsuro Morimura

Ukyo Honda

CoRR, 2023

Blind Signal Separation for Fast Ultrasound Computed Tomography.

CoRR, 2023

2021

Lipschitz Lifelong Reinforcement Learning.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Exploration in Reinforcement Learning with Deep Covering Options.

Jee Won Park

Marlos C. Machado

Proceedings of the 8th International Conference on Learning Representations, 2020

Neural Architecture Search Using Deep Neural Networks and Monte Carlo Tree Search.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search.

CoRR, 2019

Discovering Options for Exploration by Minimizing Cover Time.

Jee Won Park

Proceedings of the 36th International Conference on Machine Learning, 2019

Finding Options that Minimize Planning Time.

David Ellis Hershkowitz

Michael L. Littman

Proceedings of the 36th International Conference on Machine Learning, 2019

State Abstraction as Compression in Apprenticeship Learning.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Finding Options that Minimize Planning Time.

Michael L. Littman

CoRR, 2018

AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search.

Linnan Wang

Yiyang Zhao

CoRR, 2018

Policy and Value Transfer in Lifelong Reinforcement Learning.

Yue (Sophie) Guo

Michael L. Littman

Proceedings of the 35th International Conference on Machine Learning, 2018

Parallel A* for State-Space Search.

Handbook of Parallel Constraint Reasoning, 2018

2017

On Hash-Based Work Distribution Methods for Parallel Best-First Search.

Alex Fukunaga

J. Artif. Intell. Res., 2017

A Survey of Parallel A*.

CoRR, 2017

Learning to Avoid Dominated Action Sequences in Planning for Black-Box Domains.

Alex Fukunaga

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Learning to Prune Dominated Action Sequences in Online Black-Box Planning.

Alex S. Fukunaga

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Automated Creation of Efficient Work Distribution Functions for Parallel Best-First Search.

Alex S. Fukunaga

Proceedings of the Twenty-Sixth International Conference on Automated Planning and Scheduling, 2016

Abstract Zobrist Hashing: An Efficient Work Distribution Method for Parallel Best-First Search.
