Yi Ma
Orcid: 0000-0001-9375-6605Affiliations:
- Tianjin University, College of Intelligence and Computing, China
According to our database1,
Yi Ma
authored at least 24 papers
between 2019 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
2019
2020
2021
2022
2023
2024
0
1
2
3
4
5
6
7
8
1
4
1
1
1
5
3
1
2
4
1
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making.
CoRR, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning.
CoRR, 2023
Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration.
CoRR, 2023
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach.
CoRR, 2023
Reining Generalization in Offline Reinforcement Learning via Representation Distinction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
SplitNet: A Reinforcement Learning Based Sequence Splitting Method for the MinMax Multiple Travelling Salesman Problem.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
CoRR, 2022
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
2021
A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
A Multi-Graph Attributed Reinforcement Learning based Optimization Algorithm for Large-scale Hybrid Flow Shop Scheduling Problem.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
2020
Combining sequence and network information to enhance protein-protein interaction prediction.
BMC Bioinform., 2020
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising.
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020
2019
Integrating Sequence and Network Information to Enhance Protein-Protein Interaction Prediction Using Graph Convolutional Networks.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019