Mengwei Xu

Orcid: 0000-0001-6271-6993

According to our database1, Mengwei Xu authored at least 124 papers between 2014 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
CAN-Verify: Automated analysis for BDI agents.
Sci. Comput. Program., 2025

2024
Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers.
IEEE Trans. Mob. Comput., December, 2024

Accelerating Vertical Federated Learning.
IEEE Trans. Big Data, December, 2024

Communication-Efficient Satellite-Ground Federated Learning Through Progressive Weight Quantization.
IEEE Trans. Mob. Comput., September, 2024

Benchmarking Mobile Deep Learning Software.
GetMobile Mob. Comput. Commun., September, 2024

Seamless Cross-Edge Service Migration for Real-Time Rendering Applications.
IEEE Trans. Mob. Comput., June, 2024

A Comprehensive Deep Learning Library Benchmark and Optimal Library Selection.
IEEE Trans. Mob. Comput., May, 2024

Quantitative modelling and analysis of BDI agents.
Softw. Syst. Model., April, 2024

FLASH: Heterogeneity-Aware Federated Learning at Scale.
IEEE Trans. Mob. Comput., January, 2024

Small Language Models: Survey, Measurements, and Insights.
CoRR, 2024

Recall: Empowering Multimodal Embedding for Edge Devices.
CoRR, 2024

MobileViews: A Large-Scale Mobile GUI Dataset.
CoRR, 2024

ELMS: Elasticized Large Language Models On Mobile Devices.
CoRR, 2024

FedMoE: Personalized Federated Learning via Heterogeneous Mixture of Experts.
CoRR, 2024

Empowering 1000 tokens/second on-device LLM prefilling with mllm-NPU.
CoRR, 2024

ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents.
CoRR, 2024

The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving.
CoRR, 2024

LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Automation Task Evaluation.
CoRR, 2024

LLM as a System Service on Mobile Devices.
CoRR, 2024

A First Look at GPT Apps: Landscape and Vulnerability.
CoRR, 2024

Lightweight Protection for Privacy in Offloaded Speech Understanding.
CoRR, 2024

A Survey of Resource-efficient LLM and Multimodal Foundation Models.
CoRR, 2024

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security.
CoRR, 2024

Towards Energy-efficient Federated Learning via INT8-based Training on Mobile DSPs.
Proceedings of the ACM on Web Conference 2024, 2024

High-density Mobile Cloud Gaming on Edge SoC Clusters.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

More is Different: Prototyping and Analyzing a New Form of Edge Server with Massive Mobile SoCs.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

FwdLLM: Efficient Federated Finetuning of Large Language Models with Perturbed Inferences.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

An Empirical Study of Rust-for-Linux: The Success, Dissatisfaction, and Compromise.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks.
Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems, 2024

Poster: Efficient and Accurate Mobile Task Automation through Learning from Code.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024

Mobile Foundation Model as Firmware.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

Deciphering the Enigma of Satellite Computing with COTS Devices: Measurement and Analysis.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

Resource-efficient In-orbit Detection of Earth Objects.
Proceedings of the IEEE INFOCOM 2024, 2024

FedRDMA: Communication-Efficient Cross-Silo Federated LLM via Chunked RDMA Transmission.
Proceedings of the 4th Workshop on Machine Learning and Systems, 2024

WiP: Efficient LLM Prefilling with Mobile NPU.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

Large Language Models on Mobile Devices: Measurements, Analysis, and Insights.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

A Practical Operational Semantics for Classical Planning in BDI Agents.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Safeguard Privacy for Minimal Data Collection with Trustworthy Autonomous Agents.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

5G Edge Computing - Technologies, Applications and Future Visions
Springer, ISBN: 978-981-97-0212-1, 2024

2023
Demystifying the QoS and QoE of Edge-hosted Video Streaming Applications in the Wild with SNESet.
Proc. ACM Manag. Data, December, 2023

A large-scale holistic measurement of crowdsourced edge cloud platform.
World Wide Web (WWW), September, 2023

The First Verification Test of Space-Ground Collaborative Intelligence via Cloud-Native Satellites.
CoRR, 2023

LLMCad: Fast and Scalable On-device Large Language Model Inference.
CoRR, 2023

Rethinking Mobile AI Ecosystem in the LLM Era.
CoRR, 2023

EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models.
CoRR, 2023

Federated Fine-tuning of Billion-Sized Language Models across Mobile Devices.
CoRR, 2023

A Comprehensive Survey on Orbital Edge Computing: Systems, Applications, and Algorithms.
CoRR, 2023

ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

Successful Swarms: Operator Situational Awareness with Modelling and Verification at Runtime.
Proceedings of the 32nd IEEE International Conference on Robot and Human Interactive Communication, 2023

Quantitative Verification and Strategy Synthesis for BDI Agents.
Proceedings of the NASA Formal Methods - 15th International Symposium, 2023

Boosting DNN Cold Inference on Edge Devices.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

Federated Few-Shot Learning for Mobile NLP.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

Efficient Federated Learning for Modern NLP.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

How Far Have Edge Clouds Gone? A Spatial-Temporal Analysis of Edge Network Latency In the Wild.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

A Holistic QoS View of Crowdsourced Edge Cloud Platform.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

Evaluating and Enhancing the Robustness of Federated Learning System against Realistic Data Corruption.
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023

Privacy as a Resource in Differentially Private Federated Learning.
Proceedings of the IEEE INFOCOM 2023, 2023

CAN-verify: A Verification Tool For BDI Agents.
Proceedings of the iFM 2023 - 18th International Conference, 2023

Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors.
Proceedings of the Service-Oriented Computing - 21st International Conference, 2023

Tango: Harmonious Management and Scheduling for Mixed Services Co-located among Distributed Edge-Clouds.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

Towards Practical Few-shot Federated NLP.
Proceedings of the 3rd Workshop on Machine Learning and Systems, 2023

Uncertain Machine Ethical Decisions Using Hypothetical Retrospection.
Proceedings of the Coordination, Organizations, Institutions, Norms, and Ethics for Governance of Multi-Agent Systems XVI, 2023

FedAdapter: Efficient Federated Learning for Mobile NLP.
Proceedings of the ACM Turing Award Celebration Conference - China 2023, 2023

2022
Modelling and verifying BDI agents with bigraphs.
Sci. Comput. Program., 2022

SoC-Cluster as an Edge Server: an Application-driven Measurement Study.
CoRR, 2022

Federated NLP in Few-shot Scenarios.
CoRR, 2022

AUG-FedPrompt: Practical Few-shot Federated NLP with Data-augmented Prompts.
CoRR, 2022

Device-centric Federated Analytics At Ease.
CoRR, 2022

Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading.
CoRR, 2022

Understanding and Optimizing Deep Learning Cold-Start Latency on Edge Devices.
CoRR, 2022

AutoFedNLP: An efficient FedNLP framework.
CoRR, 2022

A Comprehensive Benchmark of Deep Learning Libraries on Mobile Devices.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Commutativity-guaranteed Docker Image Reconstruction towards Effective Layer Sharing.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Towards Robust Intelligence in Space.
Proceedings of the IEEE Smartworld, 2022

Verifying BDI Agents in Dynamic Environments.
Proceedings of the 34th International Conference on Software Engineering and Knowledge Engineering, 2022

Melon: breaking the memory wall for resource-efficient on-device machine learning.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

Mandheling: mixed-precision on-device DNN training with DSP offloading.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

Position Paper: Renovating Edge Servers with ARM SoCs.
Proceedings of the 7th IEEE/ACM Symposium on Edge Computing, 2022

2021
A Case for Camera-as-a-Service.
IEEE Pervasive Comput., 2021

Joint Placement of UPF and Edge Server for 6G Network.
IEEE Internet Things J., 2021

Autonomous Learning System Towards Mobile Intelligence.
Int. J. Softw. Informatics, 2021

Observable and Attention-Directing BDI Agents for Human-Autonomy Teaming.
Proceedings of the Proceedings Third Workshop on Formal Methods for Autonomous Systems, 2021

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Video Analytics with Zero-streaming Cameras.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

TaintStream: fine-grained taint tracking for big data platforms through dynamic code translation.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Probabilistic BDI Agents: Actions, Plans, and Intentions.
Proceedings of the Software Engineering and Formal Methods - 19th International Conference, 2021

Towards Ubiquitous Learning: A First Measurement of On-Device Training Performance.
Proceedings of the EMDL@MobiSys 2021: Proceedings of the 5th International Workshop on Embedded and Mobile Deep Learning, 2021

Boosting Mobile CNN Inference through Semantic Memory.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

From cloud to edge: a first look at public edge platforms.
Proceedings of the IMC '21: ACM Internet Measurement Conference, 2021

Tiansuan Constellation: An Open Research Platform.
Proceedings of the IEEE International Conference on Edge Computing, 2021

2020
Extending BDI agents with robust program execution, adaptive plan library, and efficient intention progression.
PhD thesis, 2020

DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning.
IEEE Trans. Mob. Comput., 2020

Relaxed constant positive linear dependence constraint qualification and its application to bilevel programs.
J. Glob. Optim., 2020

Hierarchical Federated Learning through LAN-WAN Orchestration.
CoRR, 2020

Heterogeneity-Aware Federated Learning.
CoRR, 2020

Neural Architecture Search over Decentralized Data.
CoRR, 2020

Approximate query service on autonomous IoT cameras.
Proceedings of the MobiSys '20: The 18th Annual International Conference on Mobile Systems, 2020

A query engine for zero-streaming cameras.
Proceedings of the MobiCom '20: The 26th Annual International Conference on Mobile Computing and Networking, 2020

2019
MUIT: A Domain-Specific Language and its Middleware for Adaptive Mobile Web-Based User Interfaces in WS-BPEL.
IEEE Trans. Serv. Comput., 2019

Approximate Query Processing on Autonomous Cameras.
CoRR, 2019

Supporting Video Queries on Zero-Streaming Cameras.
CoRR, 2019

A First Look at Deep Learning Apps on Smartphones.
Proceedings of the World Wide Web Conference, 2019

Intention Interleaving Via Classical Replanning.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

2018
DeepType: On-Device Deep Learning for Input Personalization Service with Minimal Privacy Concern.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2018

PrivacyShield: A Mobile System for Supporting Subtle Just-in-time Privacy Provisioning through Off-Screen-based Touch Gestures.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2018

When Mobile Apps Going Deep: An Empirical Study of Mobile Deep Learning.
CoRR, 2018

Quantitative Stability of Two-Stage Linear Second-Order Conic Stochastic Programs with Full Random Recourse.
Asia Pac. J. Oper. Res., 2018

A Formal Approach to Embedding First-Principles Planning in BDI Agent Systems.
Proceedings of the Scalable Uncertainty Management - 12th International Conference, 2018

DeepCache: Principled Cache for Mobile Deep Vision.
Proceedings of the 24th Annual International Conference on Mobile Computing and Networking, 2018

A Framework for Plan Library Evolution in BDI Agent Systems.
Proceedings of the IEEE 30th International Conference on Tools with Artificial Intelligence, 2018

Using Touch-screen Gestures for Just-in-time Privacy Provisioning.
Proceedings of the 2018 ACM International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable Computers, 2018

Power sandbox: power awareness redefined.
Proceedings of the Thirteenth EuroSys Conference, 2018

2017
ShuffleDog: Characterizing and Adapting User-Perceived Latency of Android Apps.
IEEE Trans. Mob. Comput., 2017

Enabling Cooperative Inference of Deep Learning on Wearables and Smartphones.
CoRR, 2017

Accelerating Convolutional Neural Networks for Continuous Mobile Vision via Cache Reuse.
CoRR, 2017

AppHolmes: Detecting and Characterizing App Collusion among Third-Party Android Markets.
Proceedings of the 26th International Conference on World Wide Web, 2017

2016
MUIT: A Middleware for Adaptive Mobile Web-based User Interfaces in WS-BPEL.
CoRR, 2016

2015
Smoothing SQP Methods for Solving Degenerate Nonsmooth Constrained Optimization Problems with Applications to Bilevel Programs.
SIAM J. Optim., 2015

Smoothing augmented Lagrangian method for nonsmooth constrained optimization problems.
J. Glob. Optim., 2015

2014
On solving simple bilevel programs with a nonconvex lower level program.
Math. Program., 2014

A smoothing augmented Lagrangian method for solving simple bilevel programs.
Comput. Optim. Appl., 2014

Solving semi-infinite programs by smoothing projected gradient method.
Comput. Optim. Appl., 2014


  Loading...