Sina Ghiassian

According to our database1, Sina Ghiassian authored at least 19 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
On the Importance of Uncertainty in Decision-Making with Large Language Models.
Trans. Mach. Learn. Res., 2024

Learning in complex action spaces without policy gradients.
CoRR, 2024

Soft Preference Optimization: Aligning Language Models to Expert Distributions.
CoRR, 2024

In-context Exploration-Exploitation for Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
From eye-blinks to state construction: Diagnostic benchmarks for online representation learning.
Adapt. Behav., February, 2023

Auxiliary task discovery through generate-and-test.
Proceedings of the Conference on Lifelong Learning Agents, 2023

2022
Importance Sampling Placement in Off-Policy Temporal-Difference Methods.
CoRR, 2022

2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment.
CoRR, 2021

An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task.
CoRR, 2021

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning.
CoRR, 2021

Does Standard Backpropagation Forget Less Catastrophically Than Adam?
CoRR, 2021

2020
Gradient Temporal-Difference Learning with Regularized Corrections.
Proceedings of the 37th International Conference on Machine Learning, 2020

Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Overcoming Catastrophic Interference in Online Reinforcement Learning with Dynamic Self-Organizing Maps.
CoRR, 2019

Should All Temporal Difference Learning Use Emphasis?
CoRR, 2019

Prediction in Intelligence: An Empirical Comparison of Off-policy Algorithms on Robots.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Online Off-policy Prediction.
CoRR, 2018

Two geometric input transformation methods for fast online reinforcement learning with neural nets.
CoRR, 2018

2017
A First Empirical Study of Emphatic Temporal Difference Learning.
CoRR, 2017


  Loading...