Yinlam Chow

ORCID: 0000-0002-7533-8300

According to our database, Yinlam Chow authored at least 57 papers between 2011 and 2024.

Bibliography

2024
Discovering Personalized Semantics for Soft Attributes in Recommender Systems Using Concept Activation Vectors.
Trans. Recomm. Syst., December, 2024

Embedding-Aligned Language Models.
CoRR, 2024

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning.
CoRR, 2024

Demystifying Embedding Spaces using Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Preference Elicitation with Soft Attributes in Interactive Recommendation.
CoRR, 2023

Factual and Personalized Recommendations using Language Models and Reinforcement Learning.
CoRR, 2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
CoRR, 2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Mixture-of-Expert Approach to RL-based Dialogue Management.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning.
CoRR, 2022

A Mixture-of-Expert Approach to RL-based Dialogue Management.
CoRR, 2022

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition.
CoRR, 2022

Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Efficient Risk-Averse Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Safe Reinforcement Learning with Natural Language Constraints.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Variational Model-based Policy Optimization.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Control-Aware Representations for Model-based Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Non-Stationary Off-Policy Optimization.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Non-Stationary Latent Bandits.
CoRR, 2020

Piecewise-Stationary Off-Policy Optimization.
CoRR, 2020

Latent Bandits Revisited.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

CoinDICE: Off-Policy Confidence Interval Estimation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

BRPO: Batch Residual Policy Optimization.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Predictive Coding for Locally-Linear Control.
Proceedings of the 37th International Conference on Machine Learning, 2020

CAQL: Continuous Action Q-Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control.
Proceedings of the 8th International Conference on Learning Representations, 2020

Safe Policy Learning for Continuous Control.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
A Framework for Time-Consistent, Risk-Sensitive Model Predictive Control: Theory and Algorithms.
IEEE Trans. Autom. Control., 2019

AlgaeDICE: Policy Gradient from Arbitrary Experience.
CoRR, 2019

Lyapunov-based Safe Policy Optimization for Continuous Control.
CoRR, 2019

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Risk-Sensitive Generative Adversarial Imitation Learning.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Lyapunov-based Approach to Safe Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

More Robust Doubly Robust Off-policy Evaluation.
Proceedings of the 35th International Conference on Machine Learning, 2018

Path Consistency Learning in Tsallis Entropy Regularized MDPs.
Proceedings of the 35th International Conference on Machine Learning, 2018

Imitation Learning from Visual Data with Multiple Intentions.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Sequential Decision Making With Coherent Risk.
IEEE Trans. Autom. Control., 2017

Weighted SGD for $\ell_p$ Regression with Randomized Preconditioning.
J. Mach. Learn. Res., 2017

Risk-Constrained Reinforcement Learning with Percentile Risk Criteria.
J. Mach. Learn. Res., 2017

A Framework for Time-Consistent, Risk-Averse Model Predictive Control: Theory and Algorithms.
CoRR, 2017

Sequential Multiple Hypothesis Testing with Type I Error Control.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Distributed Online Modified Greedy Algorithm for Networked Storage Operation Under Uncertainty.
IEEE Trans. Smart Grid, 2016

Weighted SGD for $\ell_p$ Regression with Randomized Preconditioning.
Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, 2016

Safe Policy Improvement by Minimizing Robust Baseline Regret.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Risk aversion in finite Markov Decision Processes using total cost criteria and average value at risk.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

2015
Trading Safety Versus Performance: Rapid Deployment of Robotic Swarms with Robust Performance Constraints.
CoRR, 2015

Policy Gradient for Coherent Risk Measures.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Real-time Bidding based Vehicle Sharing.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

2014
Algorithms for CVaR Optimization in MDPs.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Modeling and online control of generalized energy storage networks.
Proceedings of the Fifth International Conference on Future Energy Systems, 2014

Weighted difference approximation of value functions for slow-discounting Markov Decision Processes.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

A framework for time-consistent, risk-averse model predictive control: Theory and algorithms.
Proceedings of the American Control Conference, 2014

2013
A uniform-grid discretization algorithm for stochastic optimal control with risk constraints.
Proceedings of the 52nd IEEE Conference on Decision and Control, 2013

Stochastic optimal control with dynamic, time-consistent risk constraints.
Proceedings of the American Control Conference, 2013

2011
A decentralized control technique for vehicle chassis control.
Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011

