Abstract
In this paper, a reinforcement learning based approach is proposed to realize the distributed power and spectrum allocation for the Device-to-Device (D2D) communications in unlicensed bands, named as D2D-U. To guarantee the harmonious coexistence with the WiFi networks, the conventional duty-cycle muting (DCM) is employed by the D2D-U links. With the proposed learning approach, D2D-U links can optimally select the time fraction on unlicensed channels without knowing the accurate WiFi traffic in a dynamic WiFi working environment. To address the state space explosion during the learning process, the Deep Q-learning network (DQN) is adopted by combining a deep neural network (DNN) with the traditional Q-learning mechanism. After obtaining the available time fraction on unlicensed channels, the spectrum and power allocation on licensed and unlicensed bands can be optimized jointly via the classic convex optimization methods at each D2D-U link. Numerical results are demonstrated to verify the effectiveness of the proposed approach.
Original language | English |
---|---|
Title of host publication | 2019 IEEE/CIC International Conference on Communications Workshops in China, ICCC Workshops 2019 |
Publisher | IEEE Institute of Electrical and Electronic Engineers |
Pages | 42-47 |
Number of pages | 6 |
ISBN (Electronic) | 978-1-7281-0738-7, 978-1-7281-0737-0 |
ISBN (Print) | 978-1-7281-0739-4 |
DOIs | |
Publication status | Published - 2019 |
MoE publication type | A4 Article in a conference publication |
Event | 2019 IEEE/CIC International Conference on Communications Workshops in China, ICCC Workshops 2019 - Changchun, China Duration: 11 Aug 2019 → 13 Aug 2019 |
Conference
Conference | 2019 IEEE/CIC International Conference on Communications Workshops in China, ICCC Workshops 2019 |
---|---|
Country/Territory | China |
City | Changchun |
Period | 11/08/19 → 13/08/19 |