Jincheng Mei

According to our database1, Jincheng Mei authored at least 29 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Faster WIND: Accelerating Iterative Best-of-<i>N</i> Distillation for LLM Alignment.
CoRR, 2024

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF.
CoRR, 2024

Beyond Expectations: Learning with Stochastic Dominance Made Practical.
CoRR, 2024

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Ordering-based Conditions for Global Convergence of Policy Gradient Methods.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Stochastic Gradient Succeeds for Bandits.
Proceedings of the International Conference on Machine Learning, 2023

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.
Proceedings of the International Conference on Machine Learning, 2023

2022
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal.
CoRR, 2022

On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems.
CoRR, 2022

Understanding and mitigating the limitations of prioritized experience replay.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Role of Baselines in Policy Gradient Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding and Leveraging Overparameterization in Recursive Value Estimation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Understanding the Effect of Stochasticity in Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Optimality of Batch Policy Optimization Algorithms.
Proceedings of the 38th International Conference on Machine Learning, 2021

Leveraging Non-uniformity in First-order Non-convex Optimization.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities.
CoRR, 2020

Escaping the Gravitational Pull of Softmax.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the Global Convergence Rates of Softmax Policy Gradient Methods.
Proceedings of the 37th International Conference on Machine Learning, 2020

Frequency-based Search-control in Dyna.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Maximum Entropy Monte-Carlo Planning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Principled Entropy Exploration in Policy Optimization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
Memory-Augmented Monte Carlo Tree Search.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Identifying and Tracking Sentiments and Topics from Social Media Texts during Natural Disasters.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Discovering Author Interest Evolution in Topic Modeling.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

On the Reducibility of Submodular Functions.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
On Unconstrained Quasi-Submodular Function Optimization.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Saliency Level Set Evolution.
Proceedings of the Neural Information Processing - 21st International Conference, 2014

Locality Preserving Hashing.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014


  Loading...