Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback.

[BibT_eX]

[DOI]

Yu Xia

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

In-context Learning on Function Classes Unveiled for Transformers.

[BibT_eX]

[DOI]

Zhijie Wang

Bo Jiang

Shuai Li

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

On Stationary Point Convergence of PPO-Clip.

[BibT_eX]

[DOI]

Ruinan Jin

Shuai Li

Baoxiang Wang

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Exploring Soft Prompt Initialization Strategy for Few-Shot Continual Text Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Sequential Optimum Test with Multi-armed Bandits for Online Experimentation.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility.

[BibT_eX]

[DOI]

Fang Kong

Shuai Li

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.

[BibT_eX]

[DOI]

User Model. User Adapt. Interact., November, 2023

bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks.

[BibT_eX]

[DOI]

Bioinform., March, 2023

Adversarial Attacks on Cooperative Multi-agent Bandits.

[BibT_eX]

[DOI]

CoRR, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.

[BibT_eX]

[DOI]

CoRR, 2023

The Closeness of In-Context Learning and Weight Shifting for Softmax Regression.

[BibT_eX]

[DOI]

CoRR, 2023

Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization.

[BibT_eX]

[DOI]

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Player-optimal Stable Regret for Bandit Learning in Matching Markets.

[BibT_eX]

[DOI]

Fang Kong

Shuai Li

Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, 2023

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online Corrupted User Detection and Regret Minimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online Clustering of Bandits with Misspecified User Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

User-Regulation Deconfounded Conversational Recommender System with Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Stochastic No-regret Learning for General Games with Variance Reduction.

[BibT_eX]

[DOI]

Yichi Zhou

Fang Kong

Shuai Li

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Personalized Diversification for Neural Re-ranking in Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Best-of-three-worlds Analysis for Linear Bandits with Follow-the-regularized-leader Algorithm.

[BibT_eX]

[DOI]

Fang Kong

Canzhe Zhao

Shuai Li

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Online Influence Maximization under Decreasing Cascade Model.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Understanding Representation Learnability of Nonlinear Self-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Few-Shot Composition Learning for Image Retrieval with Prompt Tuning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Efficient Explorative Key-Term Selection Strategies for Conversational Contextual Bandits.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

A Robust and Generalizable Immune-Related Signature for Sepsis Diagnostics.

[BibT_eX]

[DOI]

IEEE ACM Trans. Comput. Biol. Bioinform., 2022

Pretraining in Deep Reinforcement Learning: A Survey.

[BibT_eX]

[DOI]

CoRR, 2022

Knowledge-aware Conversational Preference Elicitation with Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Combinatorial Bandits under Strategic Manipulations.

[BibT_eX]

[DOI]

Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Federated online clustering of bandits.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Dynamics-Aware Adaptation for Reinforcement Learning Based Cross-Domain Interactive Recommendation.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Spatial-Temporal Aligned Multi-Agent Learning for Visual Dialog Systems.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Thompson Sampling for Bandit Learning in Matching Markets.

[BibT_eX]

[DOI]

Fang Kong

Junming Yin

Shuai Li

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback.

[BibT_eX]

[DOI]

Fang Kong

Yichi Zhou

Shuai Li

Proceedings of the International Conference on Machine Learning, 2022

Cascading Bandit Under Differential Privacy.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Context-aware Information-theoretic Causal De-biasing for Interactive Sequence Labeling.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Hierarchical Conversational Preference Elicitation with Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Bandit Learning in Many-to-One Matching Markets.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model.

[BibT_eX]

[DOI]

Cheng Chen

Canzhe Zhao

Shuai Li

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Incentivizing an Unknown Crowd.

[BibT_eX]

[DOI]

Jing Dong

Shuai Li

Baoxiang Wang

CoRR, 2021

Cascading Bandit under Differential Privacy.

[BibT_eX]

[DOI]

CoRR, 2021

Conservative Contextual Combinatorial Cascading Bandit.

[BibT_eX]

[DOI]

CoRR, 2021

An Adversarial Imitation Click Model for Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the WWW '21: The Web Conference 2021, 2021

Comparison-based Conversational Recommender System with Relative Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

A Graph-Enhanced Click Model for Web Search.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Understanding Bandits with Graph Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Deconfounded and Explainable Interactive Vision-Language Retrieval of Complex Scenes.

[BibT_eX]

[DOI]

Junda Wu

Tong Yu

Shuai Li

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Clustering of Conversational Bandits for User Preference Learning and Elicitation.

[BibT_eX]

[DOI]

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020

Online Influence Maximization under Linear Threshold Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

The Gambler's Problem and Beyond.

[BibT_eX]