Learning to Entangle Radio Resources in Vehicular Communications: An Oblivious Game-Theoretic Perspective

  • Xianfu Chen
  • , Celimuge Wu*
  • , Mehdi Bennis
  • , Zhifeng Zhao
  • , Zhu Han
  • *Corresponding author for this work

    Research output: Contribution to journalArticleScientificpeer-review

    Abstract

    In this paper, we investigate non-cooperative radio resource management in a vehicle-to-vehicle communication network. The technical challenges lie in high-vehicle mobility and data traffic variations. Over the discrete scheduling slots, each vehicle user equipment (VUE)-pair competes with other VUE-pairs in the coverage of a road side unit (RSU) for the limited frequency to transmit queued data packets, aiming to optimize the expected long-term performance. The frequency allocation at the beginning of each slot at the RSU is regulated by a sealed second-price auction. Such interactions among VUE-pairs are modeled as a stochastic game with a semi-continuous global network state space. By defining a partitioned control policy, we transform the original game into an equivalent stochastic game with a global queue state space of finite size. We adopt an oblivious equilibrium (OE) to approximate the Markov perfect equilibrium, which characterizes the optimal solution to the equivalent game. The OE solution is theoretically proven to be with an asymptotic Markov equilibrium property. Due to the lack of a priori knowledge of network dynamics, we derive an online algorithm to learn the OE solution. Numerical simulations validate the theoretical analysis and show the effectiveness of the proposed online learning algorithm.

    Original languageEnglish
    Article number8674536
    Pages (from-to)4262-4274
    Number of pages13
    JournalIEEE Transactions on Vehicular Technology
    Volume68
    Issue number5
    DOIs
    Publication statusPublished - 1 May 2019
    MoE publication typeA1 Journal article-refereed

    Funding

    Manuscript received September 1, 2018; revised February 10, 2019; accepted March 11, 2019. Date of publication March 26, 2019; date of current version May 28, 2019. This work was supported in part by the Finnish Funding Agency for Innovation (TEKES) under the Project “Wireless for Verticals (WIVE),” in part by the Academy of Finland under Grants 319759, 319758, and 289611, in part by the Telecommunications Advanced Foundation, and in part by the US MURI AFOSR MURI 18RT0073 and the National Science Foundation under Grants 1717454, 1731424, 1702850, and 1646607. The review of this paper was coordinated by the Guest Editors of Special Section on Machine Learning-Based Internet of Vehicles. (Corresponding author: Celimuge Wu.) X. Chen is with the VTT Technical Research Centre of Finland, Oulu 1000, Finland (e-mail:,[email protected]).

    Keywords

    • Markov decision process
    • Markov perfect equilibrium
    • oblivious equilibrium
    • reinforcement learning
    • stochastic games
    • Vehicle-to-vehicle communications

    Fingerprint

    Dive into the research topics of 'Learning to Entangle Radio Resources in Vehicular Communications: An Oblivious Game-Theoretic Perspective'. Together they form a unique fingerprint.

    Cite this