Shuai Li

Orcid: 0000-0002-3935-0708

Affiliations:
  • Shanghai Jiao Tong University, John Hopcroft Center for Computer Science, Shanghai, China
  • Chinese University of Hong Kong, Hong Kong (PhD)


According to our database1, Shuai Li authored at least 76 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Toward joint utilization of absolute and relative bandit feedback for conversational recommendation.
User Model. User Adapt. Interact., November, 2024

Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization.
Trans. Mach. Learn. Res., 2024

Optimal analysis for bandit learning in matching markets with serial dictatorship.
Theor. Comput. Sci., 2024

Learning Versatile Skills with Curriculum Masking.
CoRR, 2024

Extracting Essential and Disentangled Knowledge for Recommendation Enhancement.
CoRR, 2024

Calibrating Reasoning in Language Models with Internal Consistency.
CoRR, 2024

Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs.
CoRR, 2024

Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits.
Proceedings of the ACM on Web Conference 2024, 2024

Interact with the Explanations: Causal Debiased Explainable Recommendation System.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Hallucination Diversity-Aware Active Learning for Text Summarization.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

In-context Learning on Function Classes Unveiled for Transformers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

On Stationary Point Convergence of PPO-Clip.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Exploring Soft Prompt Initialization Strategy for Few-Shot Continual Text Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Sequential Optimum Test with Multi-armed Bandits for Online Experimentation.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.
User Model. User Adapt. Interact., November, 2023

bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks.
Bioinform., March, 2023

Adversarial Attacks on Cooperative Multi-agent Bandits.
CoRR, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.
CoRR, 2023

The Closeness of In-Context Learning and Weight Shifting for Softmax Regression.
CoRR, 2023

Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Player-optimal Stable Regret for Bandit Learning in Matching Markets.
Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, 2023

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online Corrupted User Detection and Regret Minimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online Clustering of Bandits with Misspecified User Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

User-Regulation Deconfounded Conversational Recommender System with Bandit Feedback.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.
Proceedings of the International Conference on Machine Learning, 2023

Stochastic No-regret Learning for General Games with Variance Reduction.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Personalized Diversification for Neural Re-ranking in Recommendation.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Best-of-three-worlds Analysis for Linear Bandits with Follow-the-regularized-leader Algorithm.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Online Influence Maximization under Decreasing Cascade Model.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Understanding Representation Learnability of Nonlinear Self-Supervised Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Few-Shot Composition Learning for Image Retrieval with Prompt Tuning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Efficient Explorative Key-Term Selection Strategies for Conversational Contextual Bandits.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
A Robust and Generalizable Immune-Related Signature for Sepsis Diagnostics.
IEEE ACM Trans. Comput. Biol. Bioinform., 2022

Pretraining in Deep Reinforcement Learning: A Survey.
CoRR, 2022

Knowledge-aware Conversational Preference Elicitation with Bandit Feedback.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Combinatorial Bandits under Strategic Manipulations.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Federated online clustering of bandits.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Dynamics-Aware Adaptation for Reinforcement Learning Based Cross-Domain Interactive Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Spatial-Temporal Aligned Multi-Agent Learning for Visual Dialog Systems.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Thompson Sampling for Bandit Learning in Matching Markets.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback.
Proceedings of the International Conference on Machine Learning, 2022

Cascading Bandit Under Differential Privacy.
Proceedings of the IEEE International Conference on Acoustics, 2022

Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Context-aware Information-theoretic Causal De-biasing for Interactive Sequence Labeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Hierarchical Conversational Preference Elicitation with Bandit Feedback.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Bandit Learning in Many-to-One Matching Markets.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Incentivizing an Unknown Crowd.
CoRR, 2021

Cascading Bandit under Differential Privacy.
CoRR, 2021

Conservative Contextual Combinatorial Cascading Bandit.
CoRR, 2021

An Adversarial Imitation Click Model for Information Retrieval.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Comparison-based Conversational Recommender System with Relative Bandit Feedback.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

A Graph-Enhanced Click Model for Web Search.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Understanding Bandits with Graph Feedback.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Deconfounded and Explainable Interactive Vision-Language Retrieval of Complex Scenes.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Clustering of Conversational Bandits for User Preference Learning and Elicitation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Online Influence Maximization under Linear Threshold Model.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

The Gambler's Problem and Beyond.
Proceedings of the 8th International Conference on Learning Representations, 2020

Stochastic Online Learning with Probabilistic Graph Feedback.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Contextual Combinatorial Conservative Bandits.
CoRR, 2019

Predicting associations among drugs, targets and diseases by tensor decomposition for drug repositioning.
BMC Bioinform., 2019

Improving prediction of phenotypic drug response on cancer cell lines using deep convolutional network.
BMC Bioinform., 2019

Improved Algorithm on Online Clustering of Bandits.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
Offline Evaluation of Ranking Policies with Click Models.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Contextual Dependent Click Bandit Algorithm for Web Recommendation.
Proceedings of the Computing and Combinatorics - 24th International Conference, 2018

Drug-Protein-Disease Association Prediction and Drug Repositioning Based on Tensor Decomposition.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Online Clustering of Contextual Cascading Bandits.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2016
Contextual Combinatorial Cascading Bandits.
Proceedings of the 33nd International Conference on Machine Learning, 2016


  Loading...