Xiaoteng Ma

Orcid: 0000-0002-7250-6268

According to our database, Xiaoteng Ma authored at least 41 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
KEPC-Push: A Knowledge-Enhanced Proactive Content Push Strategy for Edge-Assisted Video Feed Streaming.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

Smart Data-Driven Proactive Push to Edge Network for User-Generated Videos.
Proceedings of the IEEE INFOCOM 2024, 2024

Single-Trajectory Distributionally Robust Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SEABO: A Simple Search-Based Method for Offline Imitation Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Efficient Multi-agent Reinforcement Learning by Planning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Diverse Risk Preferences in Population-Based Self-Play.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A unified algorithm framework for mean-variance optimization in discounted Markov decision processes.
Eur. J. Oper. Res., December, 2023

VRCT: A Viewport Reconstruction-Based 360° Video Caching Solution for Tile-Adaptive Streaming.
IEEE Trans. Broadcast., September, 2023

What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
CoRR, 2023

Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning.
CoRR, 2023

Single-Trajectory Distributionally Robust Reinforcement Learning.
CoRR, 2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning (Extended Abstract).
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Proceedings of the International Conference on Machine Learning, 2023

Uncertainty-Driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

2022
Learning-Based Joint QoE Optimization for Adaptive Video Streaming Based on Smart Edge.
IEEE Trans. Netw. Serv. Manag., 2022

QAVA: QoE-Aware Adaptive Video Bitrate Aggregation for HTTP Live Streaming Based on Smart Edge Computing.
IEEE Trans. Broadcast., 2022

Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning.
J. Artif. Intell. Res., 2022

Exploiting Reward Shifting in Value-Based Deep RL.
CoRR, 2022

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation.
CoRR, 2022

Knowledge-based Temporal Fusion Network for Interpretable Online Video Popularity Prediction.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

MagNet: Cooperative Edge Caching by Automatic Content Congregating.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

RORL: Robust Offline Reinforcement Learning via Conservative Smoothing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mildly Conservative Q-Learning for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Offline Reinforcement Learning with Value-based Episodic Memory.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Efficient Continuous Control with Double Actors and Regularized Critics.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Video Super-Resolution and Caching - An Edge-Assisted Adaptive Video Streaming Solution.
IEEE Trans. Broadcast., 2021

Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning.
IEEE Robotics Autom. Lett., 2021

MGPSN: Motion-Guided Pseudo Siamese Network for Indoor Video Head Detection.
CoRR, 2021

Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning.
CoRR, 2021

Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Average-Reward Reinforcement Learning with Trust Region Methods.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
SOAC: The Soft Option Actor-Critic Architecture.
CoRR, 2020

Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration.
CoRR, 2020

Distributional Soft Actor Critic for Risk Sensitive Learning.
CoRR, 2020

Fairness Control of Traffic Light via Deep Reinforcement Learning.
Proceedings of the 16th IEEE International Conference on Automation Science and Engineering, 2020

2019
Steward: smart edge based joint QoE optimization for adaptive video streaming.
Proceedings of the 29th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video, 2019

Bi-level Proximal Policy optimization for Stochastic Coordination of EV Charging Load with Uncertain Wind Power.
Proceedings of the 2019 IEEE Conference on Control Technology and Applications, 2019

2018
Attendance and Security System Based on Building Video Surveillance.
Proceedings of the Advancements in Smart City and Intelligent Building, 2018

