Sina Ghiassian

According to our database¹, Sina Ghiassian authored at least 19 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

2017

2018

2019

2020

2021

2022

2023

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

On the Importance of Uncertainty in Decision-Making with Large Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Learning in complex action spaces without policy gradients.

[BibT_eX]

[DOI]

Arash Tavakoli

Sina Ghiassian

Nemanja Rakicevic

CoRR, 2024

Soft Preference Optimization: Aligning Language Models to Expert Distributions.

[BibT_eX]

[DOI]

CoRR, 2024

In-context Exploration-Exploitation for Reinforcement Learning.

[BibT_eX]

[DOI]

Zhenwen Dai

Federico Tomasi

Sina Ghiassian

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

From eye-blinks to state construction: Diagnostic benchmarks for online representation learning.

[BibT_eX]

[DOI]

Adapt. Behav., February, 2023

Auxiliary task discovery through generate-and-test.

[BibT_eX]

[DOI]

Proceedings of the Conference on Lifelong Learning Agents, 2023

2022

Importance Sampling Placement in Off-Policy Temporal-Difference Methods.

[BibT_eX]

[DOI]

Eric Graves

Sina Ghiassian

CoRR, 2022

2021

An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment.

[BibT_eX]

[DOI]

Sina Ghiassian

Richard S. Sutton

CoRR, 2021

An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task.

[BibT_eX]

[DOI]

Sina Ghiassian

Richard S. Sutton

CoRR, 2021

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Does Standard Backpropagation Forget Less Catastrophically Than Adam?

[BibT_eX]

[DOI]

Dylan R. Ashley

Sina Ghiassian

Richard S. Sutton

CoRR, 2021

2020

Gradient Temporal-Difference Learning with Regularized Corrections.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

Overcoming Catastrophic Interference in Online Reinforcement Learning with Dynamic Self-Organizing Maps.

[BibT_eX]

[DOI]

Yat Long Lo

Sina Ghiassian

CoRR, 2019

Should All Temporal Difference Learning Use Emphasis?

[BibT_eX]

[DOI]

Xiang Gu

Sina Ghiassian

Richard S. Sutton

CoRR, 2019

Prediction in Intelligence: An Empirical Comparison of Off-policy Algorithms on Robots.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018

Online Off-policy Prediction.

[BibT_eX]

[DOI]

CoRR, 2018

Two geometric input transformation methods for fast online reinforcement learning with neural nets.

[BibT_eX]

[DOI]

CoRR, 2018

2017

A First Empirical Study of Emphatic Temporal Difference Learning.

[BibT_eX]

[DOI]

Sina Ghiassian

Banafsheh Rafiee

Richard S. Sutton

CoRR, 2017

Sina Ghiassian

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...