TY - JOUR
T1 - A hybrid classical-quantum approach to speed-up Q-learning
AU - Sannia, A.
AU - Giordano, A.
AU - Gullo, Nicolino Lo
AU - Mastroianni, C.
AU - Plastina, F.
PY - 2023/12
Y1 - 2023/12
N2 - We introduce a classical-quantum hybrid approach to computation, allowing for a quadratic performance improvement in the decision process of a learning agent. Using the paradigm of quantum accelerators, we introduce a routine that runs on a quantum computer, which allows for the encoding of probability distributions. This quantum routine is then employed, in a reinforcement learning set-up, to encode the distributions that drive action choices. Our routine is well-suited in the case of a large, although finite, number of actions and can be employed in any scenario where a probability distribution with a large support is needed. We describe the routine and assess its performance in terms of computational complexity, needed quantum resource, and accuracy. Finally, we design an algorithm showing how to exploit it in the context of Q-learning.
AB - We introduce a classical-quantum hybrid approach to computation, allowing for a quadratic performance improvement in the decision process of a learning agent. Using the paradigm of quantum accelerators, we introduce a routine that runs on a quantum computer, which allows for the encoding of probability distributions. This quantum routine is then employed, in a reinforcement learning set-up, to encode the distributions that drive action choices. Our routine is well-suited in the case of a large, although finite, number of actions and can be employed in any scenario where a probability distribution with a large support is needed. We describe the routine and assess its performance in terms of computational complexity, needed quantum resource, and accuracy. Finally, we design an algorithm showing how to exploit it in the context of Q-learning.
UR - http://www.scopus.com/inward/record.url?scp=85149618477&partnerID=8YFLogxK
U2 - 10.1038/s41598-023-30990-5
DO - 10.1038/s41598-023-30990-5
M3 - Article
C2 - 36890198
AN - SCOPUS:85149618477
SN - 2045-2322
VL - 13
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 3913
ER -