Tengyang Xie

According to our database¹, Tengyang Xie authored at least 27 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2018

2019

2020

2021

2022

2023

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF.

[BibT_eX]

[DOI]

CoRR, 2024

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences.

[BibT_eX]

[DOI]

CoRR, 2024

Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Principled Representation Learning from Videos for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Harnessing Density Ratios for Online Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Adversarial Model for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Role of Coverage in Online Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data.

[BibT_eX]

[DOI]

CoRR, 2022

Interaction-Grounded Learning with Action-Inclusive Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adversarially Trained Actor Critic for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2021

Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency.

[BibT_eX]

[DOI]

CoRR, 2021

Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Bellman-consistent Pessimism for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Batch Value-function Approximation with Only Realizability.

[BibT_eX]

[DOI]

Tengyang Xie

Nan Jiang

Proceedings of the 38th International Conference on Machine Learning, 2021

Interaction-Grounded Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

A Variant of the Wang-Foster-Kakade Lower Bound for the Discounted Setting.

[BibT_eX]

[DOI]

Philip Amortila

Nan Jiang

Tengyang Xie

CoRR, 2020

Q<sup>*</sup> Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison.

[BibT_eX]

[DOI]

Tengyang Xie

Nan Jiang

CoRR, 2020

Q* Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison.

[BibT_eX]

[DOI]

Tengyang Xie

Nan Jiang

Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

2019

Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling.

[BibT_eX]

[DOI]

Tengyang Xie

Yifei Ma

Yu-Xiang Wang

CoRR, 2019

Privacy Preserving Off-Policy Evaluation.

[BibT_eX]

[DOI]

Tengyang Xie

Philip S. Thomas

Gerome Miklau

CoRR, 2019

Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling.

[BibT_eX]

[DOI]

Tengyang Xie

Yifei Ma

Yu-Xiang Wang

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Provably Efficient Q-Learning with Low Switching Cost.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

A Block Coordinate Ascent Algorithm for Mean-Variance Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Tengyang Xie

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...