Yuu Jinnai

According to our database1, Yuu Jinnai authored at least 28 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

2016
2017
2018
2019
2020
2021
2022
2023
2024
0
1
2
3
4
5
6
7
8
9
1
3
3
1
2
2
5
1
2
3
1
2
2

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models?
CoRR, 2024

Annotation-Efficient Preference Optimization for Language Model Alignment.
CoRR, 2024

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment.
CoRR, 2024

On the True Distribution Approximation of Minimum Bayes-Risk Decoding.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Model-Based Minimum Bayes Risk Decoding for Text Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Filtered Direct Preference Optimization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Model-Based Minimum Bayes Risk Decoding.
CoRR, 2023

On the Depth between Beam Search and Exhaustive Search for Text Generation.
CoRR, 2023

Blind Signal Separation for Fast Ultrasound Computed Tomography.
CoRR, 2023

2021
Lipschitz Lifelong Reinforcement Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Exploration in Reinforcement Learning with Deep Covering Options.
Proceedings of the 8th International Conference on Learning Representations, 2020

Neural Architecture Search Using Deep Neural Networks and Monte Carlo Tree Search.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search.
CoRR, 2019

Discovering Options for Exploration by Minimizing Cover Time.
Proceedings of the 36th International Conference on Machine Learning, 2019

Finding Options that Minimize Planning Time.
Proceedings of the 36th International Conference on Machine Learning, 2019

State Abstraction as Compression in Apprenticeship Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Finding Options that Minimize Planning Time.
CoRR, 2018

AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search.
CoRR, 2018

Policy and Value Transfer in Lifelong Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Parallel A* for State-Space Search.
Proceedings of the Handbook of Parallel Constraint Reasoning., 2018

2017
On Hash-Based Work Distribution Methods for Parallel Best-First Search.
J. Artif. Intell. Res., 2017

A Survey of Parallel A.
CoRR, 2017

Learning to Avoid Dominated Action Sequences in Planning for Black-Box Domains.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Learning to Prune Dominated Action Sequences in Online Black-Box Planning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Automated Creation of Efficient Work Distribution Functions for Parallel Best-First Search.
Proceedings of the Twenty-Sixth International Conference on Automated Planning and Scheduling, 2016

Abstract Zobrist Hashing: An Efficient Work Distribution Method for Parallel Best-First Search.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016


  Loading...