Jun Wang

Orcid: 0000-0002-4021-4228

Affiliations:
  • University College London, Department of Computer Science, UCL Centre for Artificial Intelligence, London, UK
  • Delft University of Technology, The Netherlands (PhD 2008)


According to our database1, Jun Wang authored at least 369 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Review of Safe Reinforcement Learning: Methods, Theories, and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Self-Supervised MAFENN for Classifying Low-Labeled Distorted Images Over Mobile Fading Channels.
IEEE Trans. Mob. Comput., August, 2024

An Empirical Study on Google Research Football Multi-agent Scenarios.
Mach. Intell. Res., June, 2024

Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Task-Oriented 6G Native-AI Network Architecture.
IEEE Netw., January, 2024

Manas: multi-agent neural architecture search.
Mach. Learn., January, 2024

Cross-Utterance Conditioned VAE for Speech Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

A survey on algorithms for Nash equilibria in finite normal-form games.
Comput. Sci. Rev., 2024

Lightweight Neural App Control.
CoRR, 2024

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation.
CoRR, 2024

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents.
CoRR, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models.
CoRR, 2024

Efficient Reinforcement Learning with Large Language Model Priors.
CoRR, 2024

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks.
CoRR, 2024

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking.
CoRR, 2024

Mixture of Attentions For Speculative Decoding.
CoRR, 2024

Human-like Episodic Memory for Infinite Context LLMs.
CoRR, 2024

ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning.
CoRR, 2024

Logic Synthesis with Generative Deep Neural Networks.
CoRR, 2024

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf.
CoRR, 2024

Reinforcing Language Agents via Policy Optimization with Action Decomposition.
CoRR, 2024

Correlated Mean Field Imitation Learning.
CoRR, 2024

Circuit Transformer: End-to-end Circuit Design by Predicting the Next Gate.
CoRR, 2024

Learning Macroeconomic Policies based on Microfoundations: A Stackelberg Mean Field Game Approach.
CoRR, 2024

Looking Ahead to Avoid Being Late: Solving Hard-Constrained Traveling Salesman Problem.
CoRR, 2024

ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization.
CoRR, 2024

Bayesian Reward Models for LLM Alignment.
CoRR, 2024

Natural Language Reinforcement Learning.
CoRR, 2024

Entropy-Regularized Token-Level Policy Optimization for Large Language Models.
CoRR, 2024

Learning State-Specific Action Masks for Reinforcement Learning.
Algorithms, 2024

InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization.
Proceedings of the ACM on Web Conference 2024, 2024

Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RACon: Retrieval-Augmented Simulated Character Locomotion Control.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: The Past, Present, and Future.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

TaxAI: A Dynamic Economic Simulator and Benchmark for Multi-agent Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Summary of Online Markov Decision Processes with Non-oblivious Strategic Adversary.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Human-Guided Moral Decision Making in Text-Based Games.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Large Language Models Are Neurosymbolic Reasoners.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Large sequence models for sequential decision-making: a survey.
Frontiers Comput. Sci., December, 2023

Debiased Recommendation with User Feature Balancing.
ACM Trans. Inf. Syst., October, 2023

6G Network AI Architecture for Everyone-Centric Customized Services.
IEEE Netw., September, 2023

GEO: A Computational Design Framework for Automotive Exterior Facelift.
ACM Trans. Knowl. Discov. Data, July, 2023

Online Markov decision processes with non-oblivious strategic adversary.
Auton. Agents Multi Agent Syst., June, 2023

Offline Pre-trained Multi-agent Decision Transformer.
Mach. Intell. Res., April, 2023

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games.
Trans. Mach. Learn. Res., 2023

A Comparison of Topologically Associating Domain Callers Based on Hi-C Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2023

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
J. Mach. Learn. Res., 2023

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning.
CoRR, 2023

Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach.
CoRR, 2023

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models.
CoRR, 2023

Why Can Large Language Models Generate Correct Chain-of-Thoughts?
CoRR, 2023

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training.
CoRR, 2023

Invariant Learning via Probability of Sufficient and Necessary Causes.
CoRR, 2023

GRD: A Generative Approach for Interpretable Reward Redistribution in Reinforcement Learning.
CoRR, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.
CoRR, 2023

Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach.
Proceedings of the 21st International Symposium on Modeling and Optimization in Mobile, 2023

Rectifying Unfairness in Recommendation Feedback Loop.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Invariant Learning via Probability of Sufficient and Necessary Causes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Lending Interaction Wings to Recommender Systems with Conversational Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ChessGPT: Bridging Policy Learning and Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online PCA in Converging Self-consistent Field Equations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Specify Robust Causal Representation from Mixed Observations.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Multi-embodiment Legged Robot Control as a Sequence Modeling Problem.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models.
Proceedings of the International Conference on Machine Learning, 2023

A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems.
Proceedings of the International Conference on Machine Learning, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.
Proceedings of the International Conference on Machine Learning, 2023

Order Matters: Agent-by-agent Policy Optimization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Enhanced Discrete Multi-modal Hashing: More Constraints yet Less Time to Learn (Extended Abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

PPO-Based Energy-Efficient Power Control and Spectrum Allocation in In-Vehicle HetNets.
Proceedings of the IEEE Global Communications Conference, 2023

How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Is Nash Equilibrium Approximator Learnable?
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Learning to Shape Rewards Using a Game of Two Partners.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multi-Agent Feedback Enabled Neural Networks for Intelligent Communications.
IEEE Trans. Wirel. Commun., 2022

Beyond Relevance Ranking: A General Graph Matching Framework for Utility-Oriented Learning to Rank.
ACM Trans. Inf. Syst., 2022

Online Double Oracle.
Trans. Mach. Learn. Res., 2022

AutoHash: Learning Higher-Order Feature Interactions for Deep CTR Prediction.
IEEE Trans. Knowl. Data Eng., 2022

Enhanced Discrete Multi-Modal Hashing: More Constraints Yet Less Time to Learn.
IEEE Trans. Knowl. Data Eng., 2022

Learning to select cuts for efficient mixed-integer programming.
Pattern Recognit., 2022

CSCD2: an integrated interactional database of cancer-specific circular RNAs.
Nucleic Acids Res., 2022

SAMBA: safe model-based & active reinforcement learning.
Mach. Learn., 2022

HEBO: An Empirical Study of Assumptions in Bayesian Optimisation.
J. Artif. Intell. Res., 2022

A bias-variance evaluation framework for information retrieval systems.
Inf. Process. Manag., 2022

On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective.
CoRR, 2022

Contextual Transformer for Offline Meta Reinforcement Learning.
CoRR, 2022

Structured Q-learning For Antibody Design.
CoRR, 2022

Joint Caching and Transmission in the Mobile Edge Network: A Multi-Agent Learning Approach.
CoRR, 2022

Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training.
CoRR, 2022

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL.
CoRR, 2022

Fully Decentralized Model-based Policy Optimization for Networked Systems.
CoRR, 2022

Learning Risk-Averse Equilibria in Multi-Agent Systems.
CoRR, 2022

SEREN: Knowing When to Explore and When to Exploit.
CoRR, 2022

Sample-Efficient Optimisation with Probabilistic Transformer Surrogates.
CoRR, 2022

A Review of Safe Reinforcement Learning: Methods, Theory and Applications.
CoRR, 2022

BI-GreenNet: Learning Green's functions by boundary integral network.
CoRR, 2022

Generalizable Information Theoretic Causal Representation.
CoRR, 2022

AD-NEGF: An End-to-End Differentiable Quantum Transport Simulator for Sensitivity Analysis and Inverse Problems.
CoRR, 2022

Settling the Communication Complexity for Distributed Offline Reinforcement Learning.
CoRR, 2022

Self-consistent Gradient-like Eigen Decomposition in Solving Schrödinger Equations.
CoRR, 2022

AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation.
CoRR, 2022

Debiased Recommendation with User Feature Balancing.
CoRR, 2022

Bilevel learning for large-scale flexible flow shop scheduling.
Comput. Ind. Eng., 2022

AlzCode: a platform for multiview analysis of genes related to Alzheimer's disease.
Bioinform., 2022

An integrated brain-specific network identifies genes associated with neuropathologic and clinical traits of Alzheimer's disease.
Briefings Bioinform., 2022

circRIP: an accurate tool for identifying circRNA-RBP interactions.
Briefings Bioinform., 2022

Measuring the Non-Transitivity in Chess.
Algorithms, 2022

Inhomogeneous deep Q-network for time sensitive applications.
Artif. Intell., 2022

Masked Gradient-Based Causal Structure Learning.
Proceedings of the 2022 SIAM International Conference on Data Mining, 2022

Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-Based Policy Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Enhancing Safe Exploration Using Safety State Augmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

M2N: Mesh Movement Networks for PDE Solvers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Optimistic Tree Searches for Combinatorial Black-Box Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multiagent Q-learning with Sub-Team Coordination.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scalable Model-based Policy Optimization for Decentralized Networked Systems.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

On the Convergence of Fictitious Play: A Decomposition Approach.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Promoting Quality and Diversity in Population-based Reinforcement Learning via Hierarchical Trajectory Space Exploration.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation.
Proceedings of the International Conference on Machine Learning, 2022

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022

Learning State Representations via Retracing in Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Reinforcement Learning in Presence of Discrete Markovian Context Evolution.
Proceedings of the Tenth International Conference on Learning Representations, 2022

LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

LPCSE: Neural Speech Enhancement through Linear Predictive Coding.
Proceedings of the IEEE Global Communications Conference, 2022

BOiLS: Bayesian Optimisation for Logic Synthesis.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

A Game-Theoretic Approach to Multi-agent Trust Region Optimization.
Proceedings of the Distributed Artificial Intelligence - 4th International Conference, 2022

GCS: Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Multiagent Q-learning with Sub-Team Coordination.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Learning to Identify Top Elo Ratings: A Dueling Bandits Approach.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
HeRA: an atlas of enhancer RNAs across human tissues.
Nucleic Acids Res., 2021

Are We Forgetting about Compositional Optimisers in Bayesian Optimisation?
J. Mach. Learn. Res., 2021

On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games.
Electron. Colloquium Comput. Complex., 2021

Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning.
CoRR, 2021

Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks.
CoRR, 2021

A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers.
CoRR, 2021

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention.
CoRR, 2021

Multi-Agent Constrained Policy Optimisation.
CoRR, 2021

BINet: Learning to Solve Partial Differential Equations with Boundary Integral Networks.
CoRR, 2021

Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics.
CoRR, 2021

Learning to Compute Approximate Nash Equilibrium for Normal-form Games.
CoRR, 2021

Implicit Variational Conditional Sampling with Normalizing Flows.
CoRR, 2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
CoRR, 2021

High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning.
CoRR, 2021

Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games.
CoRR, 2021

Learning to Shape Rewards using a Game of Switching Controls.
CoRR, 2021

Modelling Behavioural Diversity for Learning in Open-Ended Games.
CoRR, 2021

Online Double Oracle.
CoRR, 2021

Efficient Semi-Implicit Variational Inference.
CoRR, 2021

Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee.
Autom., 2021

An Adversarial Imitation Click Model for Information Retrieval.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Reinforcement Recommendation with User Multi-aspect Preference.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Settling the Variance of Multi-Agent Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Auto-Curricula in Two-Player Zero-Sum Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Ordering-Based Causal Discovery with Reinforcement Learning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Modelling Behavioural Diversity for Learning in Open-Ended Games.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning in Nonzero-Sum Stochastic Games with Potentials.
Proceedings of the 38th International Conference on Machine Learning, 2021

Estimating α-Rank from A Few Entries with Low Rank Matrix Completion.
Proceedings of the 38th International Conference on Machine Learning, 2021

Retrospective Thinking based Multi-Agent System for Wireless Video Transmissions.
Proceedings of the ICC 2021, 2021

Joint Caching and Transmission in the Mobile Edge Network: An Multi-Agent Learning Approach.
Proceedings of the IEEE Global Communications Conference, 2021

MAFENN: Multi-Agent Feedback Enabled Neural Network for Wireless Channel Equalization.
Proceedings of the IEEE Global Communications Conference, 2021

Deep Reinforcement Learning for Resource Allocation in Massive MIMO.
Proceedings of the 29th European Signal Processing Conference, 2021

Adaptive Multi-objective Reinforcement Learning for Pareto Frontier Approximation: A Case Study of Resource Allocation Network in Massive MIMO.
Proceedings of the 29th European Signal Processing Conference, 2021

CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Top-N Recommendation with Counterfactual User Preference Simulation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Grassland: A Rapid Algebraic Modeling System for Million-variable Optimization.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Learning Correlated Communication Topology in Multi-Agent Reinforcement learning.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
Strongly Constrained Discrete Hashing.
IEEE Trans. Image Process., 2020

Actor-Critic Reinforcement Learning for Control With Stability Guarantee.
IEEE Robotics Autom. Lett., 2020

Causal World Models by Unsupervised Deconfounding of Physical Dynamics.
CoRR, 2020

HEBO: Heteroscedastic Evolutionary Bayesian Optimisation.
CoRR, 2020

An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective.
CoRR, 2020

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving.
CoRR, 2020

Learning to Model Opponent Learning.
CoRR, 2020

CausalVAE: Structured Causal Disentanglement in Variational Autoencoder.
CoRR, 2020

Compositional ADAM: An Adaptive Compositional Solver.
CoRR, 2020

GREASE: A Generative Model for Relevance Search over Knowledge Graphs.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

A Deep Recurrent Survival Model for Unbiased Ranking.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

JIT2R: A Joint Framework for Item Tagging and Tag-based Recommendation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Replica-Exchange Nosé-Hoover Dynamics for Bayesian Learning on Large Datasets.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Multi-Agent Determinantal Q-Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Multi-Agent Interactions Modeling with Correlated Policies.
Proceedings of the 8th International Conference on Learning Representations, 2020

Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search.
Proceedings of the Computer Vision - ECCV 2020, 2020


Learning to Infer User Hidden States for Online Sequential Advertising.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

U-rank: Utility-oriented Learning to Rank with Implicit Feedback.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

3rd International Workshop on EntitY Retrieval and lEarning (EYRE 2020).
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Sequential Advertising Agent with Interpretable User Hidden Intents.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Bi-Level Actor-Critic for Multi-Agent Coordination.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Learning to Communicate Implicitly by Actions.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Learning to Model Opponent Learning (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
基于四元数小波变换和Copula模型的图像融合 (Image Fusion Using Quaternion Wavelet Transform and Copula Model).
计算机科学, 2019

H<sub>∞</sub> Model-free Reinforcement Learning with Robust Stability Guarantee.
CoRR, 2019

MANAS: Multi-Agent Neural Architecture Search.
CoRR, 2019

Wasserstein Robust Reinforcement Learning.
CoRR, 2019

Neural Variational Inference For Estimating Uncertainty in Knowledge Graph Embeddings.
CoRR, 2019

Optimizing Object-based Perception and Control by Free-Energy Principle.
CoRR, 2019

Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents.
CoRR, 2019

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning.
CoRR, 2019

Multi-Agent Generalized Recursive Reasoning.
CoRR, 2019

Fog Services and Enabling Technologies.
IEEE Commun. Mag., 2019

CircView: a visualization and exploration tool for circular RNAs.
Briefings Bioinform., 2019

Layout Design for Intelligent Warehouse by Evolution With Fitness Approximation.
IEEE Access, 2019

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning.
Proceedings of the World Wide Web Conference, 2019

Multi-View Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Neural Variational Inference For Estimating Knowledge Graph Embedding Uncertainty.
Proceedings of the 2019 International Workshop on Neural-Symbolic Learning and Reasoning (NeSy 2019), 2019

A Regularized Opponent Model with Maximum Entropy Objective.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

BayesNAS: A Bayesian Approach for Neural Architecture Search.
Proceedings of the 36th International Conference on Machine Learning, 2019

Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Know Your Enemies and Know Yourself in the Real-Time Bidding Function Optimisation.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

Modeling User Return Time Using Inhomogeneous Poisson Process.
Proceedings of the Advances in Information Retrieval, 2019

Factorized Q-learning for large-scale multi-agent systems.
Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019

MarlRank: Multi-agent Reinforced Learning to Rank.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Learning Adaptive Display Exposure for Real-Time Advertising.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

EYRE 2019: 2nd International Workshop on EntitY REtrieval.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Learning to Advertise for Organic Traffic Maximization in E-Commerce Product Feeds.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising.
IEEE Trans. Knowl. Data Eng., 2018

DEBTS: Delay Energy Balanced Task Scheduling in Homogeneous Fog Networks.
IEEE Internet Things J., 2018

Learning Shared Dynamics with Meta-World Models.
CoRR, 2018

Learning Multi-agent Implicit Communication Through Actions: A Case Study in Contract Bridge, a Collaborative Imperfect-Information Game.
CoRR, 2018

Factorized Q-Learning for Large-Scale Multi-Agent Systems.
CoRR, 2018

Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning.
CoRR, 2018

Neural Text Generation: Past, Present and Beyond.
CoRR, 2018

DECCO: Deep-Learning Enabled Coverage and Capacity Optimization for Massive MIMO Systems.
IEEE Access, 2018

Texygen: A Benchmarking Platform for Text Generation Models.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Explanation Mining: Post Hoc Interpretability of Latent Factor Models for Recommendation Systems.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Learning to Design Games: Strategic Environments in Reinforcement Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

On the Equilibrium of Query Reformulation and Document Retrieval.
Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018

Mean Field Multi-Agent Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Activation Maximization Generative Adversarial Nets.
Proceedings of the 6th International Conference on Learning Representations, 2018

Learning Multi-touch Conversion Attribution with Dual-attention Mechanisms for Online Advertising.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

A Study of AI Population Dynamics with Million-agent Reinforcement Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

A Neural Stochastic Volatility Model.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Long Text Generation via Adversarial Training with Leaked Information.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Efficient Architecture Search by Network Transformation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting.
Found. Trends Inf. Retr., 2017

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence.
CoRR, 2017

An Empirical Study of AI Population Dynamics with Million-agent Reinforcement Learning.
CoRR, 2017

Generative Adversarial Nets with Labeled Data by Activation Maximization.
CoRR, 2017

Learning to Design Games: Strategic Environments in Deep Reinforcement Learning.
CoRR, 2017

Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games.
CoRR, 2017

Reinforcement Learning for Architecture Search by Network Transformation.
CoRR, 2017

Comprehensive characterization of tissue-specific circular RNAs in the human and mouse genomes.
Briefings Bioinform., 2017

Managing Risk of Bidding in Display Advertising.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Real-Time Bidding by Reinforcement Learning in Display Advertising.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Dynamic Attention Deep Model for Article Recommendation by Learning Human Editors' Demonstration.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Content Recommendation by Noise Contrastive Transfer Learning of Feature Representation.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Dynamic Information Retrieval Modeling
Synthesis Lectures on Information Concepts, Retrieval, and Services, Morgan & Claypool Publishers, ISBN: 978-3-031-02301-9, 2016

Optimal Real-Time Bidding Frameworks Discussion.
CoRR, 2016

Learning text representation using recurrent convolutional neural network with highway layers.
CoRR, 2016

Feature Selection as a Multiagent Coordination Problem.
CoRR, 2016

Sprites: detection of deletions from sequencing data by re-aligning split reads.
Bioinform., 2016

Feedback Control of Real-Time Display Advertising.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Bayesian Performance Comparison of Text Classifiers.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Functional Bid Landscape Forecasting for Display Advertising.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016

Value at Risk for Risk Evaluation in Information Retrieval.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Bid-aware Gradient Descent for Unbiased Learning with Censored Data in Display Advertising.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Product-Based Neural Networks for User Response Prediction.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Resource Allocation Based on Double Auction for Cloud Computing System.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

Deep Learning over Multi-field Categorical Data - - A Case Study on User Response Prediction.
Proceedings of the Advances in Information Retrieval, 2016

Implicit Look-Alike Modelling in Display Ads - Transfer Collaborative Filtering to CTR Estimation.
Proceedings of the Advances in Information Retrieval, 2016

Real-Time Bidding Based Display Advertising: Mechanisms and Algorithms.
Proceedings of the Advances in Information Retrieval, 2016

User Response Learning for Directly Optimizing Campaign Performance in Display Advertising.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
Multi-Keyword Multi-Click Advertisement Option Contracts for Sponsored Search.
ACM Trans. Intell. Syst. Technol., 2015

A term-based methodology for query reformulation understanding.
Inf. Retr. J., 2015

Resource allocation algorithm based on hybrid particle swarm optimization for multiuser cognitive OFDM network.
Expert Syst. Appl., 2015

A lattice framework for pricing display advertisement options with the stochastic volatility underlying model.
Electron. Commer. Res. Appl., 2015

An Empirical Study on Display Ad Impression Viewability Measurements.
CoRR, 2015

Real-Time Bidding: A New Frontier of Computational Advertising Research.
Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, 2015

Risk-Hedged Venture Capital Investment Recommendation.
Proceedings of the 9th ACM Conference on Recommender Systems, 2015

Statistical Arbitrage Mining for Display Advertising.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

A Theoretical Analysis of Two-Stage Recommendation for Cold-Start Collaborative Filtering.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

Estimating the Uncertainty of Average F1 Scores.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

Dynamic Information Retrieval: Theoretical Framework and Application.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

A Bayesian Hierarchical Model for Comparing Average F1 Scores.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Document Boltzmann Machines for Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2015

2014
Introduction to the Special Issue on Diversity and Discovery in Recommender Systems.
ACM Trans. Intell. Syst. Technol., 2014

Bias-variance analysis in estimating true query model for information retrieval.
Inf. Process. Manag., 2014

Real-Time Bidding Benchmarking with iPinYou Dataset.
CoRR, 2014

A Lattice Framework for Pricing Display Ad Options with the Stochastic Volatility Underlying Model.
CoRR, 2014

Dynamic information retrieval modeling.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Optimal real-time bidding for display advertising.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

An empirical study of reserve price optimisation in real-time bidding.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

A Dynamic Pricing Model for Unifying Programmatic Guarantee and Real-Time Bidding in Display Advertising.
Proceedings of the Eighth International Workshop on Data Mining for Online Advertising, 2014

Generalized Bias-Variance Evaluation of TREC Participated Systems.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013
Bridging memory-based collaborative filtering and text retrieval.
Inf. Retr., 2013

Iterative Expectation for Multi Period Information Retrieval
CoRR, 2013

Multi-Keyword Multi-Click Option Contracts for Sponsored Search Advertising.
CoRR, 2013

Interactive exploratory search for multi page search results.
Proceedings of the 22nd International World Wide Web Conference, 2013

Probabilistic group recommendation via information matching.
Proceedings of the 22nd International World Wide Web Conference, 2013

Bias-variance decomposition of ir evaluation.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Optimizing top-n collaborative filtering via dynamic negative item sampling.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

To personalize or not: a risk management perspective.
Proceedings of the Seventh ACM Conference on Recommender Systems, 2013

Interactive collaborative filtering.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Real-time bidding for online advertising: measurement and analysis.
Proceedings of the Seventh International Workshop on Data Mining for Online Advertising, 2013

2012
Internet Advertising: An Interplay among Advertisers, Online Publishers, Ad Exchanges and Web Users
CoRR, 2012

A Theory of Information Matching
CoRR, 2012

Selling futures online advertising slots via option contracts.
Proceedings of the 21st World Wide Web Conference, 2012

Dynamical information retrieval modelling: a portfolio-armed bandit machine approach.
Proceedings of the 21st World Wide Web Conference, 2012

Using control theory for stable and efficient recommender systems.
Proceedings of the 21st World Wide Web Conference 2012, 2012

2nd international workshop on diversity in document retrieval (DDR 2012).
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Precoding with Known Transmit Coupling and Spatial Covariance Matrices.
Proceedings of the 75th IEEE Vehicular Technology Conference, 2012

Adaptive diversification of recommendation results via latent factor portfolio.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Top-k Retrieval Using Facility Location Analysis.
Proceedings of the Advances in Information Retrieval, 2012

Sequential selection of correlated ads by POMDPs.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
The 1st international workshop on diversity in document retrieval.
SIGIR Forum, 2011

A Unified Relevance Retrieval Model by Eliteness Hypothesis
CoRR, 2011

Workshop on novelty and diversity in recommender systems - DiveRS 2011.
Proceedings of the 2011 ACM Conference on Recommender Systems, 2011

Contextual Image Annotation via Projection and Quantum Theory Inspired Measurement for Integration of Text and Visual Features.
Proceedings of the Quantum Interaction - 5th International Symposium, 2011

Text Retrieval Methods for Item Ranking in Collaborative Filtering.
Proceedings of the Advances in Information Retrieval, 2011

Statistical information retrieval modelling: from the probability ranking principle to recent advances in diversity, portfolio theory, and beyond.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Structured collaborative filtering.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

On Modeling Rank-Independent Risk in Estimating Probability of Relevance.
Proceedings of the Information Retrieval Technology, 2011

2010
Personalization of tagging systems.
Inf. Process. Manag., 2010

Self-taught hashing for fast similarity search.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

On statistical analysis and optimization of information retrieval effectiveness metrics.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Optimizing multiple objectives in collaborative filtering.
Proceedings of the 2010 ACM Conference on Recommender Systems, 2010

Laplacian Co-hashing of Terms and Documents.
Proceedings of the Advances in Information Retrieval, 2010

Goal-Driven Collaborative Filtering - A Directional Error Based Approach.
Proceedings of the Advances in Information Retrieval, 2010

RGU at ImageCLEF2010 Wikipedia Retrieval Task.
Proceedings of the CLEF 2010 LABs and Workshops, 2010

Automata Modeling for Cognitive Interference in Users' Relevance Judgment.
Proceedings of the Quantum Informatics for Cognitive, 2010

Explanation of Relevance Judgement Discrepancy with Quantum Interference.
Proceedings of the Quantum Informatics for Cognitive, 2010

Tensor Product of Correlated Textual and Visual Features: A Quantum Theory Inspired Image Retrieval Framework.
Proceedings of the Quantum Informatics for Cognitive, 2010


2009
Topic (query) selection for IR evaluation.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Risky business: modeling and exploiting uncertainty in information retrieval.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Portfolio theory of information retrieval.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Risk-Aware Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2009

Mean-Variance Analysis: A New Document Ranking Theory in Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2009

Bridging the gap: complex networks meet information and knowledge management.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Language Models of Collaborative Filtering.
Proceedings of the Information Retrieval Technology, 2009

Personalization on a Peer-to-Peer Television System.
Proceedings of the Handbook of Multimedia for Digital Entertainment and Arts, 2009

2008
Wi-Fi Walkman.
Encyclopedia of Wireless and Mobile Communications, 2008

Unified relevance models for rating prediction in collaborative filtering.
ACM Trans. Inf. Syst., 2008

Personalization on a peer-to-peer television system.
Multim. Tools Appl., 2008

Probabilistic relevance ranking for collaborative filtering.
Inf. Retr., 2008

TRIBLER: a social-based peer-to-peer system.
Concurr. Comput. Pract. Exp., 2008

Generation of Query-Biased Concepts Using Content and Structure for Query Reformulation.
Proceedings of the Natural Language and Information Systems, 2008

2006
Experiential Sampling on Multiple Data Streams.
IEEE Trans. Multim., 2006

Experiential Sampling in Multimedia Systems.
IEEE Trans. Multim., 2006

Unifying user-based and item-based collaborative filtering approaches by similarity fusion.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Distributed collaborative filtering for peer-to-peer file sharing systems.
Proceedings of the 2006 ACM Symposium on Applied Computing (SAC), 2006

A User-Item Relevance Model for Log-Based Collaborative Filtering.
Proceedings of the Advances in Information Retrieval, 2006

2005
Automatic video logo detection and removal.
Multim. Syst., 2005

Analogies based video editing.
Multim. Syst., 2005

Self-organizing distributed collaborative filtering.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

2004
Video content representation on tiny devices.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003
Experience based sampling technique for multimedia analysis.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Experiential sampling for monitoring.
Proceedings of the 2003 ACM SIGMM Workshop on Experiential Telepresence, 2003

A hierarchical framework for face tracking using state vector fusion for compressed video.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003


  Loading...