Autonomous Industrial Management via Reinforcement Learning

Leonardo Espinosa-Leal*, Anthony Chapman, Magnus Westerlund

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

12 Citations (Scopus)

Abstract

Industry has always been in the pursuit of becoming more economically efficient and the current focus has been to reduce human labour using modern technologies. Even with cutting edge technologies, which range from packaging robots to AI for fault detection, there is still some ambiguity on the aims of some new systems, namely, whether they are automated or autonomous. In this paper, we indicate the distinctions between automated and autonomous systems as well as review the current literature and identify the core challenges for creating learning mechanisms of autonomous agents. We discuss using different types of extended realities, such as digital twins, how to train reinforcement learning agents to learn specific tasks through generalisation. Once generalisation is achieved, we discuss how these can be used to develop self-learning agents. We then introduce self-play scenarios and how they can be used to teach self-learning agents through a supportive environment that focuses on how the agents can adapt to different environments. We introduce an initial prototype of our ideas by solving a multi-armed bandit problem using two ε-greedy algorithms. Further, we discuss future applications in the industrial management realm and propose a modular architecture for improving the decision-making process via autonomous agents.
Original languageEnglish
Pages (from-to)8427-8439
Number of pages13
JournalJournal of Intelligent and Fuzzy Systems
Volume39
Issue number6
DOIs
Publication statusPublished - 4 Dec 2020
MoE publication typeA1 Journal article-refereed

Keywords

  • Autonomous systems
  • digital twin
  • industry 4.0
  • reinforcement learning
  • self-play

Fingerprint

Dive into the research topics of 'Autonomous Industrial Management via Reinforcement Learning'. Together they form a unique fingerprint.

Cite this