Bo An

Orcid: 0000-0002-7064-7438

Affiliations:
  • Nanyang Technological University, Singapore


According to our database1, Bo An authored at least 323 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Online binary classification from similar and dissimilar data.
Mach. Learn., June, 2024

TWOSOME: An Efficient Online Framework to Align LLMs with Embodied Environments via Reinforcement Learning.
Int. J. Artif. Intell. Robotics Res., June, 2024

Offline policy reuse-guided anytime online collective multiagent planning and its application to mobility-on-demand systems.
Auton. Agents Multi Agent Syst., June, 2024

On the Robustness of Average Losses for Partial-Label Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Robust Multi-Agent Communication With Graph Information Bottleneck Optimization.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Egoism, utilitarianism and egalitarianism in multi-agent reinforcement learning.
Neural Networks, 2024

Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models.
CoRR, 2024

Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games.
CoRR, 2024

Deep Graph Anomaly Detection: A Survey and New Perspectives.
CoRR, 2024

Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection.
CoRR, 2024

In-Context Exploiter for Extensive-Form Games.
CoRR, 2024

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning.
CoRR, 2024

MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts.
CoRR, 2024

S<sup>2</sup>AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic.
CoRR, 2024

AgentStudio: A Toolkit for Building General Virtual Agents.
CoRR, 2024

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study.
CoRR, 2024

Leveraging Team Correlation for Approximating Equilibrium in Two-Team Zero-Sum Games.
CoRR, 2024

True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning.
CoRR, 2024

Debiased Sample Selection for Combating Noisy Labels.
CoRR, 2024

Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift.
CoRR, 2024

keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM.
CoRR, 2024

Deep reinforcement learning for multi-objective game strategy selection.
Comput. Oper. Res., 2024

Reinforcement Learning with Maskable Stock Representation for Portfolio Management in Customizable Stock Pools.
Proceedings of the ACM on Web Conference 2024, 2024

MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Scheduling Generative-AI Job DAGs with Model Serving in Data Centers.
Proceedings of the 32nd IEEE/ACM International Symposium on Quality of Service, 2024

vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Reinforcement Learning from Diverse Human Preferences.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Reinforcement Nash Equilibrium Solver.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Self-adaptive PSRO: Towards an Automatic Population-based Game Solver.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

PoRank: A Practical Framework for Learning to Rank Policies.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Latent Logic Tree Extraction for Event Sequence Explanation from LLMs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Configurable Mirror Descent: Towards a Unification of Decision Making.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Safe and Robust Subgame Exploitation in Imperfect Information Games.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DAG-Based Column Generation for Adversarial Team Games.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Consistent Multi-Class Classification from Multiple Unlabeled Datasets.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

S2AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Improving Unsupervised Hierarchical Representation With Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Grasper: A Generalist Pursuer for Pursuit-Evasion Problems.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Mitigating Underfitting in Learning to Defer with Consistent Losses.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Consistent Hierarchical Classification with A Generalized Metric.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Deep Learning From Multiple Noisy Annotators as A Union.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Multiple-Instance Learning From Unlabeled Bags With Pairwise Similarity.
IEEE Trans. Knowl. Data Eng., November, 2023

Classifying ambiguous identities in hidden-role Stochastic games with multi-agent reinforcement learning.
Auton. Agents Multi Agent Syst., October, 2023

Multi-agent dueling Q-learning with mean field and value decomposition.
Pattern Recognit., July, 2023

PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets.
Trans. Mach. Learn. Res., 2023

Reinforcement Learning for Quantitative Trading.
ACM Trans. Intell. Syst. Technol., 2023

Effective Interpretable Policy Distillation via Critical Experience Point Identification.
IEEE Intell. Syst., 2023

AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement.
CoRR, 2023

Efficient Last-iterate Convergence Algorithms in Solving Games.
CoRR, 2023

Synapse: Leveraging Few-Shot Exemplars for Human-Level Computer Control.
CoRR, 2023

Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning.
CoRR, 2023

State Regularized Policy Optimization on Data with Dynamics Shift.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On the Importance of Feature Separability in Predicting Out-Of-Distribution Error.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Offline RL with Discrete Proxy Representations for Generalizability in POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Few-shot Generation via Recalling Brain-Inspired Episodic-Semantic Memory.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Regression with Cost-based Rejection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Computing Optimal Nash Equilibria in Multiplayer Games.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Mastering Stock Markets with Efficient Mixture of Diversified Trading Experts.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Exploring Leximin Principle for Fair Core-Selecting Combinatorial Auctions: Payment Rule Design and Implementation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Complex Contagion Influence Maximization: A Reinforcement Learning Approach.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification.
Proceedings of the International Conference on Machine Learning, 2023

Mitigating Memorization of Noisy Labels by Clipping the Model Prediction.
Proceedings of the International Conference on Machine Learning, 2023

Weakly Supervised Regression with Interval Targets.
Proceedings of the International Conference on Machine Learning, 2023

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Population-size-Aware Policy Optimization for Mean-Field Games.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Structural Credit Assignment-Guided Coordinated MCTS: An Efficient and Scalable Method for Online Multiagent Planning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Off-Beat Multi-Agent Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

A Learning Approach to Complex Contagion Influence Maximization.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Finding Optimal Nash Equilibria in Multiplayer Games via Correlation Plans.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Consistent Complementary-Label Learning via Order-Preserving Losses.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Games.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Solving Large-Scale Pursuit-Evasion Games Using Pre-trained Strategies.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Partial-Label Regression.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Batch Crowdsourcing for Complex Tasks Based on Distributed Team Formation in E-Markets.
IEEE Trans. Parallel Distributed Syst., 2022

SemiNLL: A Framework of Noisy-Label Learning by Semi-Supervised Learning.
Trans. Mach. Learn. Res., 2022

Regularized Matrix Factorization for Multilabel Learning With Missing Labels.
IEEE Trans. Cybern., 2022

Planning sequential interventions to tackle depression in large uncertain social networks using deep reinforcement learning.
Neurocomputing, 2022

Deep Reinforcement Learning for Quantitative Trading: Challenges and Opportunities.
IEEE Intell. Syst., 2022

Logit Clipping for Robust Learning against Label Noise.
CoRR, 2022

PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement.
CoRR, 2022

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning.
CoRR, 2022

Quantitative Stock Investment by Routing Uncertainty-Aware Trading Experts: A Multi-Task Learning Approach.
CoRR, 2022

Offline Equilibrium Finding.
CoRR, 2022

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor.
CoRR, 2022

A Unified Perspective on Deep Equilibrium Finding.
CoRR, 2022

DeepScalper: A Risk-Aware Deep Reinforcement Learning Framework for Intraday Trading with Micro-level Market Embedding.
CoRR, 2022

Quantal Correlated Equilibrium in Normal Form Games.
Proceedings of the EC '22: The 23rd ACM Conference on Economics and Computation, Boulder, CO, USA, July 11, 2022

Out-of-Distribution Detection with An Adaptive Likelihood Ratio on Informative Hierarchical VAE.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Deep Attentive Belief Propagation: Integrating Reasoning and Learning for Solving Constraint Optimization Problems.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

L2E: Learning to Exploit Your Opponent.
Proceedings of the International Joint Conference on Neural Networks, 2022

Correlation-Based Algorithm for Team-Maxmin Equilibrium in Multiplayer Extensive-Form Games.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Mitigating Neural Network Overconfidence with Logit Normalization.
Proceedings of the International Conference on Machine Learning, 2022

Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets.
Proceedings of the International Conference on Machine Learning, 2022

Learning Pseudometric-based Action Representations for Offline Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Online Ad Hoc Teamwork under Partial Observability.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Flexi Partner Selection Model for the Emergence of Cooperation in N-person Social Dilemmas.
Proceedings of the Distributed Artificial Intelligence - 4th International Conference, 2022

DO-GAN: A Double Oracle Framework for Generative Adversarial Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DeepScalper: A Risk-Aware Reinforcement Learning Framework to Capture Fleeting Intraday Trading Opportunities.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Multi-Faceted Hierarchical Multi-Task Learning for Recommender Systems.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Online Collective Multiagent Planning by Offline Policy Reuse with Applications to City-Scale Mobility-on-Demand Systems.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

NSGZero: Efficiently Learning Non-exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

GearNet: Stepwise Dual Learning for Weakly Supervised Domain Adaptation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Pretrained Cost Model for Distributed Constraint Optimization Problems.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Group-Oriented Task Allocation for Crowdsourcing in Social Networks.
IEEE Trans. Syst. Man Cybern. Syst., 2021

Toward Efficient City-Scale Patrol Planning Using Decomposition and Grafting.
IEEE Trans. Intell. Transp. Syst., 2021

Embedding-Augmented Generalized Matrix Factorization for Recommendation With Implicit Feedback.
IEEE Intell. Syst., 2021

Orientational variable-length strip covering problem: A branch-and-price-based algorithm.
Eur. J. Oper. Res., 2021

Cooperation, Retaliation and Forgiveness in Revision Games.
CoRR, 2021

Multi-Class Classification from Single-Class Data with Confidences.
CoRR, 2021

On the Robustness of Average Losses for Partial-Label Learning.
CoRR, 2021

L2E: Learning to Exploit Your Opponent.
CoRR, 2021

Electric vehicle charging strategy study and the application on charging station placement.
Auton. Agents Multi Agent Syst., 2021

Utility distribution matters: enabling fast belief propagation for multi-agent optimization with dense local utility function.
Auton. Agents Multi Agent Syst., 2021

Contingency-aware influence maximization: A reinforcement learning approach.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Open-set Label Noise Can Improve Robustness Against Inherent Label Noise.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multiple-Instance Learning from Similar and Dissimilar Bags.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Neural Regret-Matching for Distributed Constraint Optimization Problems.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Pointwise Binary Classification with Pairwise Confidence Comparisons.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning from Similarity-Confidence Data.
Proceedings of the 38th International Conference on Machine Learning, 2021

Personalized Adaptive Meta Learning for Cold-start User Preference Prediction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Commission Fee is not Enough: A Hierarchical Reinforced Framework for Portfolio Management.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Computing Quantal Stackelberg Equilibrium in Extensive-Form Games.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Context-Aware Reliable Crowdsourcing in Social Networks.
IEEE Trans. Syst. Man Cybern. Syst., 2020

Randomized Security Patrolling for Link Flooding Attack Detection.
IEEE Trans. Dependable Secur. Comput., 2020

Optimal Spot-Checking for Improving the Evaluation Quality of Crowdsourcing: Application to Peer Grading Systems.
IEEE Trans. Comput. Soc. Syst., 2020

MetaInfoNet: Learning Task-Guided Information for Sample Reweighting.
CoRR, 2020

Efficient Reservoir Management through Deep Reinforcement Learning.
CoRR, 2020

Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games.
CoRR, 2020

Learning Behaviors with Uncertain Human Feedback.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Finite State Machines Play Extensive-Form Games.
Proceedings of the EC '20: The 21st ACM Conference on Economics and Computation, 2020

Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without Communication.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

Provably Consistent Partial-Label Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

I²HRL: Interactive Influence-based Hierarchical Reinforcement Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Can Cross Entropy Loss Be Robust to Label Noise?
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Speeding Up Incomplete GDL-based Algorithms for Multi-agent Optimization with Dense Local Utilities.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Dinkelbach-Type Algorithm for Computing Quantal Stackelberg Equilibrium.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Learning Efficient Multi-agent Communication: An Information Bottleneck Approach.
Proceedings of the 37th International Conference on Machine Learning, 2020

Learning with Multiple Complementary Labels.
Proceedings of the 37th International Conference on Machine Learning, 2020

Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games.
Proceedings of the 37th International Conference on Machine Learning, 2020

Learning Expensive Coordination: An Event-Based Deep RL Approach.
Proceedings of the 8th International Conference on Learning Representations, 2020

User detection of threats with different security measures.
Proceedings of the IEEE International Conference on Human-Machine Systems, 2020

Context-Aware Multi-agent Coordination with Loose Couplings and Repeated Interaction.
Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020

Battery Management for Automated Warehouses via Deep Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020

Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

We Mind Your Well-Being: Preventing Depression in Uncertain Social Networks by Sequential Interventions.
Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling, 2020

Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Batch Allocation for Tasks with Overlapping Skill Requirements in Crowdsourcing.
IEEE Trans. Parallel Distributed Syst., 2019

Strategic Social Team Crowdsourcing: Forming a Team of Truthful Workers for Crowdsourcing in Social Networks.
IEEE Trans. Mob. Comput., 2019

Competitive Cloud Pricing for Long-Term Revenue Maximization.
J. Comput. Sci. Technol., 2019

Protecting internet infrastructure against link flooding attacks: A techno-economic perspective.
Inf. Sci., 2019

Learning from Multiple Complementary Labels.
CoRR, 2019

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning.
CoRR, 2019

A branch and price algorithm for EOS constellation imaging and downloading integrated scheduling problem.
Comput. Oper. Res., 2019

Securing the Deep Fraud Detector in Large-Scale E-Commerce Platform via Adversarial Machine Learning Approach.
Proceedings of the World Wide Web Conference, 2019

Social Cost Guarantees in Smart Route Guidance.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Manipulating a Learning Defender and Ways to Counteract.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Who Should Pay the Cost: A Game-theoretic Model for Government Subsidized Investments to Improve National Cybersecurity.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dynamic Electronic Toll Collection via Multi-Agent Deep Reinforcement Learning with Edge-Based Graph Convolutional Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Partial Label Learning by Semantic Difference Maximization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

InBEDE: Integrating Contextual Bandit with TD Learning for Joint Pricing and Dispatch of Ride-Hailing Platforms.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Choosing Protection: User Investments in Security Measures for Cyber Risk Management.
Proceedings of the Decision and Game Theory for Security - 10th International Conference, 2019

When Players Affect Target Values: Modeling and Solving Dynamic Partially Observable Security Games.
Proceedings of the Decision and Game Theory for Security - 10th International Conference, 2019

Efficient City-Scale Patrolling Using Decomposition and Grafting.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Competitive Bridge Bidding with Deep Neural Networks.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Optimal Interdiction of Urban Criminals with the Aid of Real-Time Information.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

A Memetic Approach for Sequential Security Games on a Plane with Moving Targets.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

On the Inducibility of Stackelberg Equilibrium for Security Games.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Collaboration Based Multi-Label Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Partial Label Learning with Self-Guided Retraining.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Optimal Electric Vehicle Fast Charging Station Placement Based on Game Theoretical Framework.
IEEE Trans. Intell. Transp. Syst., 2018

ACM TIST Special Issue on Urban Intelligence.
ACM Trans. Intell. Syst. Technol., 2018

Understanding Crowdsourcing Systems from a Multiagent Perspective and Approach.
ACM Trans. Auton. Adapt. Syst., 2018

Active learning with confidence-based answers for crowdsourcing labeling tasks.
Knowl. Based Syst., 2018

Camera Placement Based on Vehicle Traffic for Better City Security Surveillance.
IEEE Intell. Syst., 2018

Optimal defense against election control by deleting voter groups.
Artif. Intell., 2018

Predicting Human Decision-Making: From Prediction to Action, Ariel Rosenfeld, Sarit Kraus. Morgan & Claypool Publishers (2018).
Artif. Intell., 2018

Impression Allocation for Combating Fraud in E-commerce Via Deep Reinforcement Learning with Action Norm Penalty.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Stackelberg Security Games: Looking Beyond a Decade of Success.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Leveraging Latent Label Distributions for Partial Label Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Equilibrium Refinement in Security Games with Arbitrary Scheduling Constraints.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Inducible Equilibrium for Security Games.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Data Poisoning Attacks on Multi-Task Relationship Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

HogRider: Champion Agent of Microsoft Malmo Collaborative AI Challenge.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Catching Captain Jack: Efficient Time and Space Dependent Patrols to Combat Oil-Siphoning in International Waters.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Optimal Spot-Checking for Improving Evaluation Accuracy of Peer Grading Systems.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Dynamic Pricing for Reusable Resources in Competitive Market With Stochastic Demand.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Data-Driven Frequency-Based Airline Profit Maximization.
ACM Trans. Intell. Syst. Technol., 2017

Toward Efficient Team Formation for Crowdsourcing in Noncooperative Social Networks.
IEEE Trans. Cybern., 2017

Workload Factoring and Resource Sharing via Joint Vertical and Horizontal Cloud Federation Networks.
IEEE J. Sel. Areas Commun., 2017

Measuring the social influences of scientist groups based on multiple types of collaboration relations.
Inf. Process. Manag., 2017

Game-Theoretic Considerations for Optimizing Taxi System Efficiency.
IEEE Intell. Syst., 2017

Vehicle Traffic Driven Camera Placement for Better Metropolis Security Surveillance.
CoRR, 2017

Artificial Intelligence Research in Singapore: Assisting the Development of a Smart Nation.
AI Mag., 2017

PAWS - A Deployed Game-Theoretic Application to Combat Poaching.
AI Mag., 2017

Human-computer negotiation in a three player market setting.
Artif. Intell., 2017

Efficient Label Contamination Attacks Against Black-Box Learning Models.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Optimal Escape Interdiction on Transportation Networks.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Defending Against Man-In-The-Middle Attack in Repeated Games.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Playing Repeated Network Interdiction Games with Semi-Bandit Feedback.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Comparing Strategic Secrecy and Stackelberg Commitment in Security Games.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Game Theoretic Analysis of Security and Sustainability.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Nash Equilibrium and Decentralized Pricing for QoS Aware Service Composition in Cloud Computing Environments.
Proceedings of the 2017 IEEE International Conference on Web Services, 2017

Stop Nuclear Smuggling Through Efficient Container Inspection.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Pricing Optimization for Selling Reusable Resources.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Revenue Maximization for Finitely Repeated Ad Auctions.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Optimal Personalized Defense Strategy Against Man-In-The-Middle Attack.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Strategic Information Revelation and Commitment in Security Games.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

Security Games on a Plane.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

POI2Vec: Geographical Latent Representation for Predicting Future Visitors.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Recent advances in Support Vector Machines.
Neurocomputing, 2016

Alternating-offers bargaining in one-to-many and many-to-many settings.
Ann. Math. Artif. Intell., 2016

A simulation framework for measuring robustness of incentive mechanisms and its implementation in reputation systems.
Auton. Agents Multi Agent Syst., 2016

MAP: Frequency-Based Maximization of Airline Profits based on an Ensemble Forecasting Approach.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Optimally Protecting Elections.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Efficient Resource Allocation for Protecting Coral Reef Ecosystems.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Optimal Interdiction of Illegal Network Flow.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Modeling Bounded Rationality for Sponsored Search Auctions.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Optimal Pricing for Efficient Electric Vehicle Charging Station Management.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Measuring the Distance Between Finite Markov Decision Processes.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Optimal Sample Size for Adword Auctions: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Coalitional Security Games.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Optimizing Personalized Email Filtering Thresholds to Mitigate Sequential Spear Phishing Attacks.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Efficient Average Reward Reinforcement Learning Using Constant Shifting Values.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Computing Optimal Monitoring Strategy for Detecting Terrorist Plots.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Deploying PAWS: Field Optimization of the Protection Assistant for Wildlife Security.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Deploying PAWS to Combat Poaching: Game-Theoretic Patrolling in Areas with Complex Terrain (Demonstration).
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Accelerating Multiagent Reinforcement Learning by Equilibrium Transfer.
IEEE Trans. Cybern., 2015

Multiagent Reinforcement Learning With Unshared Value Functions.
IEEE Trans. Cybern., 2015

Computing Optimal Mixed Strategies for Security Games with Dynamic Payoffs.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Optimal Electric Vehicle Charging Station Placement.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Learning Personality Modeling for Regulating Learning Feedback.
Proceedings of the 15th IEEE International Conference on Advanced Learning Technologies, 2015

Modeling Learner's Emotions with PAD.
Proceedings of the 15th IEEE International Conference on Advanced Learning Technologies, 2015

Computing Quantal Response Equilibrium for Sponsored Search Auctions.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Learning in Multi-agent Systems with Sparse Interactions by Knowledge Transfer and Game Abstraction.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Optimizing Efficiency of Taxi Systems: Scaling-up and Handling Arbitrary Constraints.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Security Games with Protection Externalities.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Game Theoretic Considerations for Optimizing Efficiency of Taxi Systems.
Proceedings of the Computational Sustainability, 2015

2014
Filtering trust opinions through reinforcement learning.
Decis. Support Syst., 2014

An extended study on multi-objective security games.
Auton. Agents Multi Agent Syst., 2014

A New Paradigm for the Study of Corruption in Different Cultures.
Proceedings of the Social Computing, Behavioral-Cultural Modeling and Prediction, 2014

Human-Computer Negotiation in Three-Player Market Settings.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Reputation-aware task allocation for human trustees.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Dynamic allocation of security resources for protecting public events.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Modeling curiosity for virtual learning companions.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

A practical robustness measure of incentive mechanisms.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

An efficient algorithm for taxi system optimization.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Computing Solutions in Infinite-Horizon Discounted Adversarial Patrolling Games.
Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

Game-Theoretic Resource Allocation for Protecting Large Public Events.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Regret-Based Optimization and Preference Elicitation for Stackelberg Security Games with Uncertainty.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Security Games Applied to Real-World: Research Contributions and Challenges.
Proceedings of the Moving Target Defense II, 2013

A Deployed Quantal Response-Based Patrol Planning System for the U.S. Coast Guard.
Interfaces, 2013

Bilateral bargaining with one-sided uncertain reserve prices.
Auton. Agents Multi Agent Syst., 2013

A Reputation Management Approach for Resource Constrained Trustee Agents.
Proceedings of the IJCAI 2013, 2013

Optimal Pricing for Improving Efficiency of Taxi Systems.
Proceedings of the IJCAI 2013, 2013

A reputation-aware decision-making approach for improving the efficiency of crowdsourcing systems.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Security games with surveillance cost and optimal timing of attack execution.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012
Yushu: A Heuristic-Based Agent for Automated Negotiating Competition.
Proceedings of the New Trends in Agent-Based Complex Automated Negotiations, 2012

An Overview of Recent Application Trends at the AAMAS Conference: Security, Sustainability and Safety.
AI Mag., 2012

PROTECT - A Deployed Game Theoretic System for Strategic Security Allocation for the United States Coast Guard.
AI Mag., 2012

Reports of the AAAI 2012 Spring Symposia.
AI Mag., 2012

An Adaptive Witness Selection Method for Reputation-Based Trust Models.
Proceedings of the PRIMA 2012: Principles and Practice of Multi-Agent Systems, 2012

Challenges and Opportunities for Trust Management in Crowdsourcing.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Intelligent Agent Technology, 2012

Adversarial patrolling games.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

PROTECT: a deployed game theoretic system to protect the ports of the United States.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Multi-objective optimization for security games.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Game Theory for Security: A Real-World Challenge Problem for Multiagent Systems and Beyond.
Proceedings of the Game Theory for Security, 2012

Getting Started on a Real-World Challenge Problem in Computational Game Theory and Beyond.
Proceedings of the AI, 2012

Security Games with Limited Surveillance: An Initial Report.
Proceedings of the Game Theory for Security, 2012

PROTECT: An Application of Computational Game Theory for the Security of the Ports of the United States.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Security Games with Limited Surveillance.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
GUARDS and PROTECT: next generation applications of security games.
SIGecom Exch., 2011

Strategic agents for multi-resource negotiation.
Auton. Agents Multi Agent Syst., 2011

Game Theory for Security: An Important Challenge for Multiagent Systems.
Proceedings of the Multi-Agent Systems - 9th European Workshop, 2011

Agent-mediated multi-step optimization for resource allocation in distributed sensor networks.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Negotiation over decommitment penalty.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Mixed-Initiative Optimization in Security Games: A Preliminary Report.
Proceedings of the Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, 2011

Refinement of Strong Stackelberg Equilibria in Security Games.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Characterizing Contract-Based Multiagent Resource Allocation in Networks.
IEEE Trans. Syst. Man Cybern. Part B, 2010

Multi-Agent Asynchronous Negotiation Based on Time-Delay.
Proceedings of the Life System Modeling and Intelligent Computing, 2010

Automated negotiation with decommitment for dynamic resource allocation in cloud computing.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Searching for pure strategy equilibria in bilateral bargaining with one-sided uncertainty.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009
Evolving Best-Response Strategies for Market-Driven Agents Using Aggregative Fitness GA.
IEEE Trans. Syst. Man Cybern. Part C, 2009

Extending Alternating-Offers Bargaining in One-to-Many and Many-to-Many Settings.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2009

Bilateral Bargaining with One-Sided Two-Type Uncertainty.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2009

2008
Decommitment in multi-resource negotiation.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Heuristics for negotiation schedules in multi-plan optimization.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

2007
Algorithms for Transitive Dependence-Based Coalition Formation.
IEEE Trans. Ind. Informatics, 2007

An Agent Reinforcement Learning Model Based on Neural Networks.
Proceedings of the Bio-Inspired Computational Intelligence and Applications, 2007

Market Based Resource Allocation with Incomplete Information.
Proceedings of the IJCAI 2007, 2007

An Application of Automated Negotiation to Distributed Task Allocation.
Proceedings of the 2007 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2007

Evolving the best-response strategy to decide when to make a proposal.
Proceedings of the IEEE Congress on Evolutionary Computation, 2007

2006
Continuous-Time Negotiation Mechanism for Software Agents.
IEEE Trans. Syst. Man Cybern. Part B, 2006

Fuzzy cognitive maps for dynamic grid service negotiation.
Multiagent Grid Syst., 2006

2005
A Coalition Formation Framework Based on Transitive Dependence.
IEICE Trans. Inf. Syst., 2005

A Transitive Dependence Based Social Reasoning Mechanism for Coalition Formation.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2005

Toward Transitive Dependence in MAS.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2005

Transitive Dependence Based Formation of Virtual Organizations.
Proceedings of the Computational Intelligence and Security, International Conference, 2005

2004
A Negotiation Strategy based on Uncompromising Degree.
Proceedings of the 2004 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT 2004), 2004


  Loading...