We stand with Ukraine

We stand with Ukraine

Rasool Fakoor

According to our database¹, Rasool Fakoor authored at least 33 papers between 2012 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens.

[BibT_eX]

[DOI]

,

,

,

Pratik Chaudhari

,

Huzefa Rangwala

,

,

CoRR, 2024

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents.

[BibT_eX]

[DOI]

,

,

Sapana Chaudhary

,

,

Pratik Chaudhari

,

,

Huzefa Rangwala

CoRR, 2024

AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and Tree Search.

[BibT_eX]

[DOI]

,

,

Yaroslav Kharkov

,

,

,

CoRR, 2024

EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Learning the Target Network in Function Space.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Time-Varying Propensity Score to Bridge the Gap between the Past and Present.

[BibT_eX]

[DOI]

,

,

Zachary Chase Lipton

,

Pratik Chaudhari

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Flexible Model Aggregation for Quantile Regression.

[BibT_eX]

[DOI]

,

,

,

Alexander J. Smola

,

Ryan J. Tibshirani

J. Mach. Learn. Res., 2023

TD Convergence: An Optimization Perspective.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Resetting the Optimizer in Deep RL: An Empirical Study.

[BibT_eX]

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Budgeting Counterfactual for Offline RL.

[BibT_eX]

[DOI]

,

Pratik Chaudhari

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Task-Agnostic Continual Reinforcement Learning: Gaining Insights and Overcoming Challenges.

[BibT_eX]

[DOI]

,

,

,

Laurent Charlin

,

Proceedings of the Conference on Lifelong Learning Agents, 2023

2022

Data drift correction via time-varying importance weight estimator.

[BibT_eX]

[DOI]

,

,

Zachary C. Lipton

,

Pratik Chaudhari

,

Alexander J. Smola

CoRR, 2022

Task-Agnostic Continual Reinforcement Learning: In Praise of a Simple Baseline.

[BibT_eX]

[DOI]

,

,

,

Laurent Charlin

,

CoRR, 2022

Adaptive Interest for Emphatic Reinforcement Learning.

[BibT_eX]

[DOI]

Martin Klissarov

,

,

Jonas W. Mueller

,

,

,

Alexander J. Smola

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Faster Deep Reinforcement Learning with Slower Online Network.

[BibT_eX]

[DOI]

,

,

,

,

Michael L. Littman

,

Alexander J. Smola

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Deep Q-Network with Proximal Iteration.

[BibT_eX]

[DOI]

,

,

,

Michael L. Littman

,

Alexander J. Smola

CoRR, 2021

Deep Quantile Aggregation.

[BibT_eX]

[DOI]

,

,

,

Alexander J. Smola

,

Ryan J. Tibshirani

CoRR, 2021

Continuous Doubly Constrained Batch Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Pratik Chaudhari

,

Alexander J. Smola

CoRR, 2021

Continuous Doubly Constrained Batch Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Pratik Chaudhari

,

Alexander J. Smola

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning.

[BibT_eX]

[DOI]

,

Pratik Chaudhari

,

Alexander J. Smola

CoRR, 2020

TraDE: Transformers for Density Estimation.

[BibT_eX]

[DOI]

,

Pratik Chaudhari

,

,

Alexander J. Smola

CoRR, 2020

Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation.

[BibT_eX]

[DOI]

,

,

,

Pratik Chaudhari

,

Alexander J. Smola

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Meta-Q-Learning.

[BibT_eX]

[DOI]

,

Pratik Chaudhari

,

,

Alexander J. Smola

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

P3O: Policy-on Policy-off Policy Optimization.

[BibT_eX]

[DOI]

,

Pratik Chaudhari

,

Alexander J. Smola

Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

2018

Differentiable Greedy Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

Abdel-rahman Mohamed

,

CoRR, 2018

Direct Optimization of F-Measure for Retrieval-Based Personal Question Answering.

[BibT_eX]

[DOI]

,

,

,

Christopher Winestock

,

Abdel-rahman Mohamed

,

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Constrained Convolutional-Recurrent Networks to Improve Speech Quality with Low Impact on Recognition Accuracy.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Reinforcement Learning To Adapt Speech Enhancement to Instantaneous Input Signal Quality.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2017

2016

Memory-augmented Attention Modelling for Videos.

[BibT_eX]

[DOI]

,

Abdel-rahman Mohamed

,

Margaret Mitchell

,

,

CoRR, 2016

2012

Improving tractability of POMDPs by separation of decision and perceptual processes.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2012

An integrated cloud-based framework for mobile phone sensing.

[BibT_eX]

[DOI]

,

,

,

Mario Di Francesco

,

Proceedings of the first edition of the MCC workshop on Mobile cloud computing, 2012

A Sampling-Based Approach to Reducing the Complexity of Continuous State Space POMDPs by Decomposition Into Coupled Perceptual and Decision Processes.

[BibT_eX]

[DOI]

,

Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Loading...