Multiagent Meta-Reinforcement Learning for Optimized Task Scheduling in Heterogeneous Edge Computing Systems

Liwen Niu, Xianfu Chen*, Ning Zhang, Yongdong Zhu, Rui Yin, Celimuge Wu, Yangjie Cao

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

34 Citations (Scopus)

Abstract

Mobile-edge computing (MEC) brings the potential to address the ever increasing computation demands from the mobile users (MUs). In addition to local processing, the resource-constrained MUs in an MEC system can also offload computation to the nearby servers for remote execution. With the explosive growth of mobile devices, computation offloading faces the challenge of spectrum congestion, which, in turn, deteriorates the overall quality of computation experience. This article, hence, investigates computation task scheduling in a heterogeneous cellular and WiFi MEC system. Such a system provides both licensed and unlicensed spectrum opportunities. Due to the sharing of communication and computation resources as well as the uncertainties, we formulate the problem of computation task scheduling among the competing MUs in a stationary heterogeneous edge computing system as a noncooperative stochastic game. We propose an approximation-based multiagent Markov decision process without the global system state observations, under which a multiagent proximal policy optimization (PPO) algorithm is derived to solve the corresponding Nash equilibrium. When expanding to a nonstationary heterogeneous edge computing system, the obtained algorithm suffers from the slow convergence due to constrained adaptability. Accordingly, we explore meta-learning and propose a multiagent meta-PPO algorithm, which rapidly adapts the control policy learning to the nonstationarity. Numerical experiments demonstrate performance gains from our proposed algorithms.
Original languageEnglish
Article number10032543
Pages (from-to)10519-10531
JournalIEEE Internet of Things Journal
Volume10
Issue number12
DOIs
Publication statusPublished - 7 Jun 2023
MoE publication typeA1 Journal article-refereed

Funding

This work was supported in part by the Zhejiang Lab Open Program under Grant 2021LC0AB06; in part by the National Key Research and Development Program of China under Grant 2021YFB2900200; in part by the Key Research and Development Program of Zhejiang Province under Grant 2021C01197; in part by the National Natural Science Foundation of China under Grant 62271438, Grant 62062031, and Grant 61972092; in part by the Zhejiang Provincial Natural Science Foundation of China under Grant LGG22F010008; in part by the ROIS NII Open Collaborative Research under Grant 22S0601; in part by the JSPS KAKENHI under Grant 21H03424; and in part by the Collaborative Innovation Major Project of Zhengzhou under Grant 20XTZX06013.

Keywords

  • Wireless fidelity
  • Task analysis
  • Processor scheduling
  • Edge computing
  • Servers
  • Scheduling
  • Training

Fingerprint

Dive into the research topics of 'Multiagent Meta-Reinforcement Learning for Optimized Task Scheduling in Heterogeneous Edge Computing Systems'. Together they form a unique fingerprint.

Cite this