Abstract
Volumetric video provides users with a good viewing experience of six degrees of freedom (DoF) and has wide applications in many fields such as teleconferencing and online games. However, the huge data volume and strict latency requirements of point cloud video, the most popular representative of volumetric video, pose a challenge to its transmission. Existing point cloud video transmission algorithms usually segment a long video by every one or several group of frames, predict network bandwidth and FoV information, then perform adaptive transmission by solving the quality of experience (QoE) optimization problem. However, such segmentation neglects the impact of current optimization decisions on the subsequent video streaming process, as well as the large prediction error for a long interval, severely degrading user’s QoE. Moreover, the complex constrained optimization problem makes the solution time too long to meet the real-time video streaming requirements. To this end, in this paper, we propose a rolling prediction-optimization-transmission (POT) framework, which makes predictions of network bandwidth and FoV in each short rolling window to reduce prediction error. And our framework takes into account the upper bounded QoE contribution of the subsequent point cloud video to improve the system performance. In addition, we design a deep reinforcement learning based real-time solver to make decisions of the fixed structure optimization problem in each roll, allowing our system to run in real-time. We have performed simulations and experiments, and the results show that our solution outperforms existing methods.
Original language | English |
---|---|
Pages (from-to) | 7870-7883 |
Journal | IEEE Transactions on Circuits and Systems for Video Technology |
Volume | 33 |
Issue number | 12 |
DOIs | |
Publication status | Published - 1 Dec 2023 |
MoE publication type | A1 Journal article-refereed |
Funding
This work was supported in part by the National Natural Science Foundation of China under Grant 52077049 and in part by the Anhui Provincial Natural Science Foundation under Grant 2008085UD04.
Keywords
- Bandwidth
- deep reinforcement learning
- Optimization
- Point cloud compression
- Point cloud video
- Prediction algorithms
- QoE model
- Quality of experience
- Real-time systems
- real-time transmission
- rolling optimization
- Streaming media
- video streaming