2025
STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization.
CoRR, June, 2025
Scalable In-Context Q-Learning.
CoRR, June, 2025
One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration.
CoRR, May, 2025
Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization.
CoRR, May, 2025
Conditioning Matters: Training Diffusion Policies is Faster Than You Think.
CoRR, May, 2025
EmbodiedMAE: A Unified 3D Multi-Modal Representation for Robot Manipulation.
CoRR, May, 2025
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation.
CoRR, May, 2025
Advancing Symbolic Discovery on Unsupervised Data: A Pre-training Framework for Non-degenerate Implicit Equation Discovery.
CoRR, May, 2025
Accelerating Large Language Model Reasoning via Speculative Search.
CoRR, May, 2025
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking.
CoRR, May, 2025
Robust Multiobjective Reinforcement Learning Considering Environmental Uncertainties.
IEEE Trans. Neural Networks Learn. Syst., April, 2025
MENTOR: Guiding Hierarchical Reinforcement Learning With Human Feedback and Dynamic Distance Constraint.
IEEE Trans. Emerg. Top. Comput. Intell., April, 2025
DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering.
CoRR, April, 2025
Few-Shot Vision-Language Action-Incremental Policy Learning.
CoRR, April, 2025
ViMo: A Generative Visual GUI World Model for App Agent.
CoRR, April, 2025
From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models.
,
,
,
,
,
,
,
,
,
,
CoRR, March, 2025
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation.
CoRR, March, 2025
AhaRobot: A Low-Cost Open-Source Bimanual Mobile Manipulator for Embodied AI.
CoRR, March, 2025
Generative Models in Decision Making: A Survey.
,
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning.
CoRR, February, 2025
AppVLM: A Lightweight Vision Language Model for Online App Control.
CoRR, February, 2025
ARIES: Stimulating Self-Refinement of Large Language Models by Iterative Preference Optimization.
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference.
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
Trajectory World Models for Heterogeneous Environments.
CoRR, February, 2025
3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow.
CoRR, January, 2025
SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
Hierarchical task network-enhanced multi-agent reinforcement learning: Toward efficient cooperative strategies.
Neural Networks, 2025
SheetAgent: Towards a Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.
Proceedings of the ACM on Web Conference 2025, 2025
CoopRide: Cooperate All Grids in City-Scale Ride-Hailing Dispatching with Multi-Agent Reinforcement Learning.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025
Computing Circuits Optimization via Model-Based Circuit Genetic Evolution.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Differentiable Integer Linear Programming.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
LaMPlace: Learning to Optimize Cross-Stage Metrics in Macro Placement.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Lightweight Neural App Control.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Spa-Bench: a comprehensive Benchmark for Smartphone Agent Evaluation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
A Graph Enhanced Symbolic Discovery Framework For Efficient Logic Optimization.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Apollo-MILP: An Alternating Prediction-Correction Neural Solving Framework for Mixed-Integer Linear Programming.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Circuit Synthesis based on Hierarchical Conditional Diffusion.
Proceedings of the Great Lakes Symposium on VLSI 2025, GLSVLSI 2025, New Orleans, LA, USA, 30 June 2025, 2025
PCBAgent: An Agent-based Framework for High-Density Printed Circuit Board Placement.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 30th Asia and South Pacific Design Automation Conference, 2025
SWAMamba: A Sliding Window Attention Mamba Framework for Predicting Translation Elongation Rates.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Improving Generalization in Offline Reinforcement Learning via Latent Distribution Representation Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Sample Efficient Deep Reinforcement Learning With Online State Abstraction and Causal Transformer Model Prediction.
IEEE Trans. Neural Networks Learn. Syst., November, 2024
Prototypical Context-Aware Dynamics for Generalization in Visual Control With Model-Based Reinforcement Learning.
IEEE Trans. Ind. Informatics, September, 2024
WToE: Learning When to Explore in Multiagent Reinforcement Learning.
IEEE Trans. Cybern., August, 2024
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain.
IEEE Trans. Neural Networks Learn. Syst., July, 2024
A survey on interpretable reinforcement learning.
Mach. Learn., July, 2024
Cooperative Multiagent Transfer Learning With Coalition Pattern Decomposition.
,
,
,
,
,
,
,
,
,
,
IEEE Trans. Games, June, 2024
Exploiting counter-examples for active learning with partial labels.
Mach. Learn., June, 2024
Learning from Hierarchical Structure of Knowledge Graph for Recommendation.
ACM Trans. Inf. Syst., January, 2024
Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning.
Artif. Intell., January, 2024
Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models.
CoRR, 2024
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
The Graph's Apprentice: Teaching an LLM Low Level Knowledge for Circuit Quality Estimation.
CoRR, 2024
SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation.
CoRR, 2024
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents.
CoRR, 2024
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data.
CoRR, 2024
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning.
CoRR, 2024
Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning.
CoRR, 2024
Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning.
CoRR, 2024
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
A Survey on Vision-Language-Action Models for Embodied AI.
CoRR, 2024
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.
CoRR, 2024
Reinforced In-Context Black-Box Optimization.
CoRR, 2024
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models.
CoRR, 2024
LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation.
CoRR, 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey.
CoRR, 2024
Machine Learning Insides OptVerse AI Solver: Design Principles and Applications.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
The MMO Economist: AI Empowers Robust, Healthy, and Sustainable P2W MMO Economies.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024
Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-based Monte Carlo Tree Search.
Proceedings of the Uncertainty in Artificial Intelligence, 2024
FlexPlanner: Flexible 3D Floorplanning via Deep Reinforcement Learning in Hybrid Action Space with Multi-Modality Representation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Towards Next-Generation Logic Synthesis: A Scalable Neural Circuit Generation Framework.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
The Ladder in Chaos: Improving Policy Learning by Harnessing the Parameter Evolving Path in A Low-dimensional Space.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Unlock the Intermittent Control Ability of Model Free Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
DiffuserLite: Towards Real-time Diffusion Planning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
GraSS: Combining Graph Neural Networks with Expert Knowledge for SAT Solver Selection.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource Allocation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Safe Table Tennis Swing Stroke with Low-Cost Hardware.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Improving Generalization in Offline Reinforcement Learning via Adversarial Data Splitting.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
A Hierarchical Adaptive Multi-Task Reinforcement Learning Framework for Multiplier Circuit Design.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Rethinking Decision Transformer via Hierarchical Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
HarmonyDream: Task Harmonization Inside World Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Value-Evolutionary-Based Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Towards General Algorithm Discovery for Combinatorial Optimization: Learning Symbolic Branching Policy from Bipartite Graph.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Reinforcement Learning within Tree Search for Fast Macro Placement.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Sample-Efficient Multiagent Reinforcement Learning with Reset Replay.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Rethinking Branching on Exact Combinatorial Optimization Solver: The First Deep Symbolic Discovery Framework.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Sample-Efficient Quality-Diversity by Cooperative Coevolution.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
JigsawPlanner: Jigsaw-like Floorplanner for Eliminating Whitespace and Overlap among Complex Rectilinear Modules.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024
Generalizable and Relation Sensitive Netlist Representation for Analog Circuit Design.
Proceedings of the Great Lakes Symposium on VLSI 2024, 2024
Generate Subgoal Images Before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Improving Unsupervised Hierarchical Representation With Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
EWEK-QA : Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Debiased Recommendation with User Feature Balancing.
ACM Trans. Inf. Syst., October, 2023
Empirical Policy Optimization for n-Player Markov Games.
IEEE Trans. Cybern., October, 2023
ASN: action semantics network for multiagent reinforcement learning.
,
,
,
,
,
,
,
,
,
,
,
,
Auton. Agents Multi Agent Syst., October, 2023
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., August, 2023
Accelerating deep reinforcement learning via knowledge-guided policy network.
Auton. Agents Multi Agent Syst., June, 2023
A Unified Framework for Layout Pattern Analysis With Deep Causal Estimation.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., April, 2023
A benchmark for automatic medical consultation system: frameworks, tasks and datasets.
Bioinform., January, 2023
Differentiable Logic Machines.
Trans. Mach. Learn. Res., 2023
Contrastive-ACE: Domain Generalization Through Alignment of Causal Mechanisms.
IEEE Trans. Image Process., 2023
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
VOLTA: Diverse and Controllable Question-Answer Pair Generation with Variational Mutual Information Maximizing Autoencoder.
CoRR, 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning.
CoRR, 2023
Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning.
CoRR, 2023
Multi-agent Policy Reciprocity with Theoretical Guarantee.
CoRR, 2023
DR-Label: Improving GNN Models for Catalysis Systems by Label Deconstruction and Reconstruction.
CoRR, 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting.
CoRR, 2023
Reweighted Interacting Langevin Diffusions: an Accelerated Sampling Methodfor Optimization.
CoRR, 2023
Breaking Filter Bubble: A Reinforcement Learning Framework of Controllable Recommender System.
Proceedings of the ACM Web Conference 2023, 2023
Uncertainty-aware Consistency Learning for Cold-Start Item Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
RLMixer: A Reinforcement Learning Approach for Integrated Ranking with Contrastive User Preference Modeling.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023
Transfer Reinforcement Learning Based Negotiating Agent Framework.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023
Generalized Universal Domain Adaptation with Generative Flow Networks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
T3S: Improving Multi-Task Reinforcement Learning with Task-Specific Feature Selector and Scheduler.
Proceedings of the International Joint Conference on Neural Networks, 2023
Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Achieving Last-Mile Functional Coverage in Testing Chip Design Software Implementations.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.
Proceedings of the International Conference on Machine Learning, 2023
RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution.
Proceedings of the International Conference on Machine Learning, 2023
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer.
Proceedings of the International Conference on Machine Learning, 2023
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Out-of-distribution Detection with Implicit Outlier Transformation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Regularized Offline GFlowNets.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023
CFlowNets: Continuous Control with Generative Flow Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
GFlowNets with Human Feedback.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023
DAG Matters! GFlowNets Enhanced Explainer for Graph Neural Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Traj-MAE: Masked Autoencoders for Trajectory Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
EasySO: Exploration-enhanced Reinforcement Learning for Logic Synthesis Sequence Optimization and a Comprehensive RL Environment.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023
EasyMap: Improving Technology Mapping via Exploration-Enhanced Heuristics and Adaptive Sequencing.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023
Limited Information Opponent Modeling.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
TOFU: A Two-Step Floorplan Refinement Framework for Whitespace Reduction.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023
RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Fifth International Conference on Distributed Artificial Intelligence, 2023
Co-speech Gesture Synthesis by Reinforcement Learning with Contrastive Pretrained Rewards.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Dual-Process Graph Neural Network for Diversified Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
Transfer Learning based Agent for Automated Negotiation.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Off-Beat Multi-Agent Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Spectral Augmentations for Graph Contrastive Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023
Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Neighbor Auto-Grouping Graph Neural Networks for Handover Parameter Configuration in Cellular Network.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
SplitNet: A Reinforcement Learning Based Sequence Splitting Method for the MinMax Multiple Travelling Salesman Problem.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Learning to select cuts for efficient mixed-integer programming.
Pattern Recognit., 2022
SCC-rFMQ: a multiagent reinforcement learning method in cooperative Markov games with continuous actions.
Int. J. Mach. Learn. Cybern., 2022
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents.
Frontiers Inf. Technol. Electron. Eng., 2022
HEBO: An Empirical Study of Assumptions in Bayesian Optimisation.
,
,
,
,
,
,
,
,
,
,
J. Artif. Intell. Res., 2022
Transformer in Transformer as Backbone for Deep Reinforcement Learning.
CoRR, 2022
Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents.
CoRR, 2022
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning.
CoRR, 2022
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning.
CoRR, 2022
RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
ERL-Re<sup>2</sup>: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.
CoRR, 2022
PTDE: Personalized Training with Distillated Execution for Multi-Agent Reinforcement Learning.
CoRR, 2022
GFlowCausal: Generative Flow Networks for Causal Discovery.
CoRR, 2022
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning.
CoRR, 2022
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies.
CoRR, 2022
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes.
CoRR, 2022
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.
CoRR, 2022
API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks.
CoRR, 2022
Generalizable Information Theoretic Causal Representation.
CoRR, 2022
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization.
CoRR, 2022
Introduction to The Dynamic Pickup and Delivery Problem Benchmark - ICAPS 2021 Competition.
CoRR, 2022
Debiased Recommendation with User Feature Balancing.
CoRR, 2022
A review and performance evaluation of clustering frameworks for single-cell Hi-C data.
,
,
,
,
,
,
,
,
,
,
,
Briefings Bioinform., 2022
Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022
Cross-domain adaptive transfer reinforcement learning based on state-action correspondence.
Proceedings of the Uncertainty in Artificial Intelligence, 2022
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-Based Policy Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022
Versatile Multi-stage Graph Neural Network for Circuit Representation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Plan To Predict: Learning an Uncertainty-Foreseeing Model For Model-Based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Multiagent Q-learning with Sub-Team Coordination.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Flat-Aware Cross-Stage Distilled Framework for Imbalanced Medical Image Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022
Generalizable Floorplanner through Corner Block List Representation and Hypergraph Embedding.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
RCANet: Root Cause Analysis via Latent Variable Interaction Modeling for Yield Improvement.
Proceedings of the IEEE International Test Conference, 2022
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Promoting Quality and Diversity in Population-based Reinforcement Learning via Hierarchical Trajectory Space Exploration.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022
Individual Reward Assisted Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.
Proceedings of the International Conference on Machine Learning, 2022
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022
Neuro-Symbolic Hierarchical Rule Induction.
Proceedings of the International Conference on Machine Learning, 2022
Learning State Representations via Retracing in Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Online Ad Hoc Teamwork under Partial Observability.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Invariant Factor Graph Neural Networks.
Proceedings of the IEEE International Conference on Data Mining, 2022
Heterogeneous Graph Neural Network-Based Imitation Learning for Gate Sizing Acceleration.
Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022
Batch Sequential Black-Box Optimization with Embedding Alignment Cells for Logic Synthesis.
Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022
Efficient Dual-Process Cognitive Recommender Balancing Accuracy and Diversity.
Proceedings of the Database Systems for Advanced Applications, 2022
Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator.
Proceedings of the Distributed Artificial Intelligence - 4th International Conference, 2022
LHNN: lattice hypergraph neural network for VLSI congestion prediction.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022
Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System.
Proceedings of the Conference on Robot Learning, 2022
Multiagent Q-learning with Sub-Team Coordination.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning.
IEEE Trans. Software Eng., 2021
Generalized Centered 2-D Principal Component Analysis.
IEEE Trans. Cybern., 2021
SC2disease: a manually curated database of single-cell transcriptome for human diseases.
Nucleic Acids Res., 2021
ED2: An Environment Dynamics Decomposition Framework for World Model Construction.
CoRR, 2021
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines.
CoRR, 2021
Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization.
CoRR, 2021
Exploration in Deep Reinforcement Learning: A Comprehensive Survey.
CoRR, 2021
Contrastive ACE: Domain Generalization Through Alignment of Causal Mechanisms.
CoRR, 2021
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021
Learning Symbolic Rules for Interpretable Deep Reinforcement Learning.
CoRR, 2021
Integrating multi-network topology for gene function prediction using deep neural networks.
Briefings Bioinform., 2021
An end-to-end heterogeneous graph representation learning-based framework for drug-target interaction prediction.
Briefings Bioinform., 2021
Efficient policy detecting and reusing for non-stationarity in Markov games.
Auton. Agents Multi Agent Syst., 2021
An Adversarial Imitation Click Model for Information Retrieval.
Proceedings of the WWW '21: The Web Conference 2021, 2021
A Graph-Enhanced Click Model for Web Search.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021
Detecting and Learning Against Unknown Opponents for Automated Negotiations.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021
Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Adaptive Online Packing-guided Search for POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Model-Based Reinforcement Learning via Imagination with Derived Memory.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
A Multi-Graph Attributed Reinforcement Learning based Optimization Algorithm for Large-scale Hybrid Flow Shop Scheduling Problem.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
FIGCPS: Effective Failure-inducing Input Generation for Cyber-Physical Systems with Deep Reinforcement Learning.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021
Ordering-Based Causal Discovery with Reinforcement Learning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
A deep reinforcement learning-based agent for negotiation with multiple communication channels.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021
Automatic Web Testing Using Curiosity-Driven Reinforcement Learning.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021
Relational Navigation Learning in Continuous Action Space among Crowds.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021
Principled Exploration via Optimistic Bootstrapping and Backward Induction.
Proceedings of the 38th International Conference on Machine Learning, 2021
Coalition-based Task Assignment in Spatial Crowdsourcing.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021
Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021
SEIHAI: A Sample-Efficient Hierarchical AI for the MineRL Competition.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021
CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Addressing Action Oscillations through Learning Policy Inertia.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Environments.
J. Comput. Sci. Technol., 2020
HEBO: Heteroscedastic Evolutionary Bayesian Optimisation.
CoRR, 2020
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning.
CoRR, 2020
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2020
What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator.
CoRR, 2020
Event-Triggered Multi-agent Reinforcement Learning with Communication under Limited-bandwidth Constraint.
CoRR, 2020
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning.
CoRR, 2020
CausalVAE: Structured Causal Disentanglement in Variational Autoencoder.
CoRR, 2020
Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework.
CoRR, 2020
Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning.
CoRR, 2020
A Method for Deploying Distributed Denial of Service Attack Defense Strategies on Edge Servers Using Reinforcement Learning.
IEEE Access, 2020
Cross-data Automatic Feature Engineering via Meta-learning and Reinforcement Learning.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
A Multi-Task Reinforcement Learning Approach for Navigating Unsignalized Intersections.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Generating Behavior-Diverse Game AIs with Evolutionary Multi-Objective Deep Reinforcement Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 37th International Conference on Machine Learning, 2020
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems.
Proceedings of the 8th International Conference on Learning Representations, 2020
An Empirical Study on Correlation between Coverage and Robustness for Deep Neural Networks.
Proceedings of the 25th International Conference on Engineering of Complex Computer Systems, 2020
Faster Convention Emergence by Avoiding Local Conventions in Reinforcement Social Learning.
Proceedings of the Artificial Intelligence and Soft Computing, 2020
MGHRL: Meta Goal-Generation for Hierarchical Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 4th Conference on Robot Learning, 2020
Large Scale Deep Reinforcement Learning in War-games.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020
Efficient Deep Reinforcement Learning through Policy Transfer.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
Mastering Basketball With Deep Reinforcement Learning: An Integrated Curriculum Training Approach.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Continuous Multiagent Control Using Collective Behavior Entropy for Large-Scale Home Energy Management.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Multi-Agent Game Abstraction via Graph Attention Neural Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
LoopFix: an approach to automatic repair of buggy loops.
J. Syst. Softw., 2019
There is Limited Correlation between Coverage and Robustness for Deep Neural Networks.
CoRR, 2019
Efficient meta reinforcement learning via meta goal generation.
CoRR, 2019
Spectral-based Graph Convolutional Network for Directed Graphs.
CoRR, 2019
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction.
CoRR, 2019
Attention-based recurrent neural network for influenza epidemic prediction.
,
,
,
,
,
,
,
,
,
,
BMC Bioinform., 2019
Using deep reinforcement learning to speed up collective cell migration.
BMC Bioinform., 2019
A learning-based framework for miRNA-disease association identification using neural networks.
Bioinform., 2019
SA-IGA: a multiagent reinforcement learning method towards socially optimal outcomes.
Auton. Agents Multi Agent Syst., 2019
An Efficient Handover Authentication Mechanism for 5G Wireless Network.
Proceedings of the 2019 IEEE Wireless Communications and Networking Conference, 2019
Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019
Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Towards Efficient Detection and Optimal Response against Sophisticated Opponents.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Building Personalized Simulator for Interactive Search.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Explicitly Coordinated Policy Iteration.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas.
Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019
Learning Adaptive Display Exposure for Real-Time Advertising.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019
Automatic Feature Engineering by Deep Reinforcement Learning.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
Reinforcement Learning for Cooperative Overtaking.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Reinforcement Learning Framework.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
ONECG: Online Negotiation Environment for Coalitional Games.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
An Optimal Rewiring Strategy for Cooperative Multiagent Social Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
An Adaptive Markov Strategy for Defending Smart Grid False Data Injection From Malicious Attackers.
IEEE Trans. Smart Grid, 2018
Efficient and Robust Emergence of Norms through Heuristic Collective Learning.
ACM Trans. Auton. Adapt. Syst., 2018
An Adaptive Learning Based Network Selection Approach for 5G Dynamic Environments.
Entropy, 2018
Hierarchical Deep Multiagent Reinforcement Learning.
CoRR, 2018
SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions.
CoRR, 2018
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
CoRR, 2018
Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning.
,
,
,
,
,
,
,
,
,
,
CoRR, 2018
An Optimal Rewiring Strategy for Reinforcement Social Learning in Cooperative Multiagent Systems.
CoRR, 2018
Hierarchical Heuristic Learning towards Effcient Norm Emergence.
CoRR, 2018
SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes.
CoRR, 2018
Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach.
CoRR, 2018
Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.
CoRR, 2018
Effective norm emergence in cell systems under limited communication.
BMC Bioinform., 2018
ESRQ: An Efficient Secure Routing Method in Wireless Sensor Networks Based on Q-Learning.
Proceedings of the 17th IEEE International Conference On Trust, 2018
Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018
Achieving Multiagent Coordination Through CALA-rFMQ Learning in Continuous Action Space.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018
A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Recurrent Deep Multiagent Q-Learning for Autonomous Brokers in Smart Grid.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Speeding up Collective Cell Migration Using Deep Reinforcement Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018
Attention-Based Recurrent Multi-Channel Neural Network for Influenza Epidemic Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018
The Dynamics of Opinion Evolution in Gossiper-Media Model with WoLS-CALA Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018
SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018
Recurrent Deep Multiagent Q-Learning for Autonomous Agents in Future Smart Grid.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018
Efficient Convention Emergence through Decoupled Reinforcement Social Learning with Teacher-Student Mechanism.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018
2017
Blind Image Denoising via Dependent Dirichlet Process Tree.
IEEE Trans. Pattern Anal. Mach. Intell., 2017
Reciprocal Social Strategy in Social Repeated Games and Emergence of Social Norms.
Int. J. Artif. Intell. Tools, 2017
The dynamics of reinforcement social learning in networked cooperative multiagent systems.
Eng. Appl. Artif. Intell., 2017
Automated Software Security Requirements Recommendation Based on FT-SR Model.
Proceedings of the 29th International Conference on Software Engineering and Knowledge Engineering, 2017
FESR: A Framework for Eliciting Security Requirements Based on Integration of Common Criteria and Weakness Detection Formal Model.
Proceedings of the 2017 IEEE International Conference on Software Quality, 2017
An Adaptive Handover Trigger Strategy for 5G C/U Plane Split Heterogeneous Network.
Proceedings of the 14th IEEE International Conference on Mobile Ad Hoc and Sensor Systems, 2017
Defending Against Man-In-The-Middle Attack in Repeated Games.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
An Improved Android Collusion Attack Detection Method Based on Program Slicing.
Proceedings of the Formal Methods and Software Engineering, 2017
TLSsem: A TLS Security-Enhanced Mechanism against MITM Attacks in Public WiFis.
Proceedings of the 22nd International Conference on Engineering of Complex Computer Systems, 2017
Towards Solving Decision Making Problems Using Probabilistic Model Checking.
Proceedings of the 22nd International Conference on Engineering of Complex Computer Systems, 2017
A Prediction and Learning Based Approach to Network Selection in Dynamic Environments.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2017, 2017
A real-time ensemble classification algorithm for time series data.
Proceedings of the IEEE International Conference on Agents, 2017
Effective norm emergence in cell systems under limited communication.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017
Optimal Personalized Defense Strategy Against Man-In-The-Middle Attack.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Improved EGT-Based Robustness Analysis of Negotiation Strategies in Multiagent Systems via Model Checking.
IEEE Trans. Hum. Mach. Syst., 2016
Fepchecker: An Automatic Model Checker for Verifying Fairness and Non-Repudiation of Security Protocols in Web Service.
Int. J. Softw. Eng. Knowl. Eng., 2016
Formal Modeling and Verification of Security Protocols on Cloud Computing Systems Based on UML 2.3.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016
E-SSL: An SSL Security-Enhanced Method for Bypassing MITM Attacks in Mobile Internet.
Proceedings of the Structured Object-Oriented Formal Language and Method, 2016
Designing minimal effective normative systems with the help of lightweight formal methods.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016
Accelerating Norm Emergence Through Hierarchical Heuristic Learning.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016
Socially-Aware Multiagent Learning: Towards Socially Optimal Outcomes.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016
Dynamic analysis of cell interactions in biological environments under multiagent social learning framework.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016
An Adaptive Learning Framework for Efficient Emergence of Social Norms: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016
2015
Reinforcement social learning of social optimality with influencer agents.
Web Intell., 2015
Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems.
ACM Trans. Auton. Adapt. Syst., 2015
Introducing decision entrustment mechanism into repeated bilateral agent interactions to achieve social optimality.
Auton. Agents Multi Agent Syst., 2015
An Adaptive Markov Strategy for Effective Network Intrusion Detection.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015
Toward Efficient Agreements in Real-Time Multilateral Agent-Based Negotiations.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015
Reciprocal Social Strategy in Social Repeated Games.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015
Hierarchical Learning for Emergence of Social Norms in Networked Multiagent Systems.
Proceedings of the AI 2015: Advances in Artificial Intelligence, 2015
Heuristic Collective Learning for Efficient and Robust Emergence of Social Norms.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015
2014
CUHKAgent: An Adaptive Negotiation Strategy for Bilateral Negotiations over Multiple Items.
Proceedings of the Novel Insights in Agent-based Complex Automated Negotiation, 2014
An efficient and robust negotiating strategy in bilateral negotiations over multiple items.
Eng. Appl. Artif. Intell., 2014
Robustness Analysis of Negotiation Strategies through Multiagent Learning in Repeated Negotiation Games.
Proceedings of the Multiagent System Technologies - 12th German Conference, 2014
Evaluating Practical Automated Negotiation Based on Spatial Evolutionary Game Theory.
Proceedings of the KI 2014: Advances in Artificial Intelligence, 2014
Networked Reinforcement Social Learning towards Coordination in Cooperative Multiagent Systems.
Proceedings of the 26th IEEE International Conference on Tools with Artificial Intelligence, 2014
Spatial evolutionary game-theoretic perspective on agent-based complex negotiations.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014
Adaptive Defending Strategy for Smart Grid Attacks.
Proceedings of the 2nd Workshop on Smart Energy Grid Security, 2014
2013
Fairness, social optimality and individual rationality in agent interactions.
PhD thesis, 2013
Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning.
ACM Trans. Auton. Adapt. Syst., 2013
The Dynamics of Reinforcement Social Learning in Cooperative Multiagent Systems.
Proceedings of the IJCAI 2013, 2013
Reinforcement social learning of coordination in cooperative multiagent systems.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013
2012
Maintaining cooperation in homogeneous multi-agent system.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2012
Probabilistic Model Checking Multi-agent Behaviors in Dispersion Games Using Counter Abstraction.
Proceedings of the PRIMA 2012: Principles and Practice of Multi-Agent Systems, 2012
An Efficient Negotiation Protocol to Achieve Socially Optimal Allocation.
Proceedings of the PRIMA 2012: Principles and Practice of Multi-Agent Systems, 2012
Incorporating Fairness into Agent Interactions Modeled as Two-Player Normal-Form Games.
Proceedings of the PRICAI 2012: Trends in Artificial Intelligence, 2012
Learning to Achieve Socially Optimal Solutions in General-Sum Games.
Proceedings of the PRICAI 2012: Trends in Artificial Intelligence, 2012
Incorporating Fairness into Infinitely Repeated Games with Conflicting Interests for Conflicts Elimination.
Proceedings of the IEEE 24th International Conference on Tools with Artificial Intelligence, 2012
Analyzing multi-agent systems with probabilistic model checking approach.
Proceedings of the 34th International Conference on Software Engineering, 2012
ABiNeS: An Adaptive Bilateral Negotiating Strategy over Multiple Items.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Intelligent Agent Technology, 2012
Achieving Social Optimality with Influencer Agents.
Proceedings of the Complex Sciences - Second International Conference, 2012
2011
Learning to Achieve Social Rationality Using Tag Mechanism in Repeated Interactions.
Proceedings of the IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011
2010
Strategy and Fairness in Repeated Two-agent Interaction.
Proceedings of the 22nd IEEE International Conference on Tools with Artificial Intelligence, 2010
2009
Bus-Based and NoC Infrastructure Performance Emulation and Comparison.
Proceedings of the Sixth International Conference on Information Technology: New Generations, 2009
2007
Theoretical Investigation on Post-Processed LDA for Face and Palmprint Recognition.
Proceedings of the Computational Intelligence and Security, International Conference, 2007