Jincheng Mei

According to our database¹, Jincheng Mei authored at least 30 papers between 2014 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

Faster WIND: Accelerating Iterative Best-of-N Distillation for LLM Alignment.

CoRR, 2024

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF.

CoRR, 2024

Beyond Expectations: Learning with Stochastic Dominance Made Practical.

CoRR, 2024

Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation.

Christopher K. Harris

A. Rupam Mahmood

Dale Schuurmans

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Ordering-based Conditions for Global Convergence of Policy Gradient Methods.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Stochastic Gradient Succeeds for Bandits.

Proceedings of the International Conference on Machine Learning, 2023

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.

Mohammad Gheshlaghi Azar

Proceedings of the International Conference on Machine Learning, 2023

2022

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal.

Mohammad Gheshlaghi Azar

CoRR, 2022

On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems.

CoRR, 2022

Understanding and mitigating the limitations of prioritized experience replay.

Yangchen Pan

Jincheng Mei

Amir-massoud Farahmand

Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2022

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Role of Baselines in Policy Gradient Optimization.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding and Leveraging Overparameterization in Recursive Value Estimation.

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Understanding the Effect of Stochasticity in Policy Optimization.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Optimality of Batch Policy Optimization Algorithms.

Proceedings of the 38th International Conference on Machine Learning, 2021

Leveraging Non-uniformity in First-order Non-convex Optimization.

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities.

Jincheng Mei

Yangchen Pan

Martha White

Amir-massoud Farahmand

Hengshuai Yao

CoRR, 2020

Escaping the Gravitational Pull of Softmax.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the Global Convergence Rates of Softmax Policy Gradient Methods.

Proceedings of the 37th International Conference on Machine Learning, 2020

Frequency-based Search-control in Dyna.

Yangchen Pan

Jincheng Mei

Amir-massoud Farahmand

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Maximum Entropy Monte-Carlo Planning.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Principled Entropy Exploration in Policy Optimization.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018

Memory-Augmented Monte Carlo Tree Search.

Chenjun Xiao

Jincheng Mei

Martin Müller

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Identifying and Tracking Sentiments and Topics from Social Media Texts during Natural Disasters.

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016

Discovering Author Interest Evolution in Topic Modeling.

Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2016

On the Reducibility of Submodular Functions.

Jincheng Mei

Hao Zhang

Bao-Liang Lu

Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015

On Unconstrained Quasi-Submodular Function Optimization.

Jincheng Mei

Kang Zhao

Bao-Liang Lu

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Saliency Level Set Evolution.

Jincheng Mei

Bao-Liang Lu

Proceedings of the 21st International Conference on Neural Information Processing, 2014

Locality Preserving Hashing.

Kang Zhao

Hongtao Lu

Jincheng Mei

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
