Rasool Fakoor

According to our database, Rasool Fakoor authored at least 33 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens.
CoRR, 2024

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents.
CoRR, 2024

AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and Tree Search.
CoRR, 2024

EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data.
CoRR, 2024

Learning the Target Network in Function Space.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Time-Varying Propensity Score to Bridge the Gap between the Past and Present.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Flexible Model Aggregation for Quantile Regression.
J. Mach. Learn. Res., 2023

TD Convergence: An Optimization Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Resetting the Optimizer in Deep RL: An Empirical Study.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Budgeting Counterfactual for Offline RL.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Task-Agnostic Continual Reinforcement Learning: Gaining Insights and Overcoming Challenges.
Proceedings of the Conference on Lifelong Learning Agents, 2023

2022
Data drift correction via time-varying importance weight estimator.
CoRR, 2022

Task-Agnostic Continual Reinforcement Learning: In Praise of a Simple Baseline.
CoRR, 2022

Adaptive Interest for Emphatic Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Faster Deep Reinforcement Learning with Slower Online Network.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Deep Q-Network with Proximal Iteration.
CoRR, 2021

Deep Quantile Aggregation.
CoRR, 2021

Continuous Doubly Constrained Batch Reinforcement Learning.
CoRR, 2021

Continuous Doubly Constrained Batch Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning.
CoRR, 2020

TraDE: Transformers for Density Estimation.
CoRR, 2020

Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Meta-Q-Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
P3O: Policy-on Policy-off Policy Optimization.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

2018
Differentiable Greedy Networks.
CoRR, 2018

Direct Optimization of F-Measure for Retrieval-Based Personal Question Answering.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Constrained Convolutional-Recurrent Networks to Improve Speech Quality with Low Impact on Recognition Accuracy.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017
Reinforcement Learning To Adapt Speech Enhancement to Instantaneous Input Signal Quality.
CoRR, 2017

2016
Memory-augmented Attention Modelling for Videos.
CoRR, 2016

2012
Improving tractability of POMDPs by separation of decision and perceptual processes.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2012

An integrated cloud-based framework for mobile phone sensing.
Proceedings of the first edition of the MCC workshop on Mobile cloud computing, 2012

A Sampling-Based Approach to Reducing the Complexity of Continuous State Space POMDPs by Decomposition Into Coupled Perceptual and Decision Processes.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

