Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret.

[BibT_eX]

[DOI]

Jiawei Huang

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Data Mining, 2022

Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

2021

Demonstration actor critic.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Curriculum Offline Imitation Learning.

[BibT_eX]

[DOI]

CoRR, 2021

DRL4IR: 2nd Workshop on Deep Reinforcement Learning for Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Distributional Reinforcement Learning for Multi-Dimensional Reward Functions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Curriculum Offline Imitating Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Independence-aware Advantage Estimation.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Return-Based Contrastive Representation Learning for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Suphx: Mastering Mahjong with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Leveraging Demonstrations for Reinforcement Recommendation Reasoning over Knowledge Graphs.

[BibT_eX]

[DOI]

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Deep Reinforcement Learning for Information Retrieval: Fundamentals and Advances.

[BibT_eX]

[DOI]

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

RD$^2$: Reward Decomposition with Representation Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

Semi-Supervised Neural Machine Translation via Marginal Distribution Estimation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Fully Parameterized Quantile Function for Distributional Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Distributional Reward Decomposition for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Individualized Indicator for All: Stock-wise Technical Indicator Optimization with Stock Embedding.

[BibT_eX]

[DOI]

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Investment Behaviors Can Tell What Inside: Exploring Stock Intrinsic Properties for Stock Trend Prediction.

[BibT_eX]

[DOI]

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Unified Policy Optimization for Robust Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of The 11th Asian Conference on Machine Learning, 2019

Trust Region Evolution Strategies.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Efficient Sequence Learning with Group Recurrent Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Adversarial Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of The 10th Asian Conference on Machine Learning, 2018

Learning Structured Representation for Text Classification via Reinforcement Learning.

[BibT_eX]

[DOI]

Tianyang Zhang

Minlie Huang

Li Zhao

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Word Attention for Sequence to Sequence Text Understanding.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Dual Transfer Learning for Neural Machine Translation with Marginal Distribution Regularization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Reinforcement Learning for Relation Classification From Noisy Data.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Sequence Prediction with Unlabeled Data by Reward Function Learning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016

Attention-based LSTM for Aspect-level Sentiment Classification.

[BibT_eX]

[DOI]

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Semi-Supervised Multinomial Naive Bayes for Text Classification by Leveraging Word-Level Statistical Constraint.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Sentiment Extraction by Leveraging Aspect-Opinion Association Structure.

[BibT_eX]

[DOI]

Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014

Clustering Aspect-related Phrases by Leveraging Sentiment Distribution Consistency.

[BibT_eX]

[DOI]

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Li Zhao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...