Abstract
Business Finland and European Commission have announced that industrial metaverse and virtual worlds are new focus areas. Main goal of the project is to build the first version of roadmap and proof-of-concept of an AI supported human-robot operation platform for industrial metaverse.
Utilization of large language models (LLM) such as ChatGPT have become popular tools in everyday life. As usage is progressing more within various industries, the question arises how to connect these into multi-user/device environments. For the LLM to function as an assistant to the operators, and provide value within the given task, the user-interface (UI) needs to be straightforward.
Motivation for the project thrives from the possibility to provide specific instructions to a novice operator about previously unknown task in a way that operator can effortlessly interact with an entity about the progress. The target is a LLM supported human-robot work allocation on a concept level.
Since the development of LLM based assistants in industrial metaverse context is a new topic, the project provides a novel approach for building a data pipeline from operator to a robot via a LLM as a conversational agent. The approach provides scaling options to multiple robots and operators.
The project utilizes ChatGPT4.0 APIs with a use-case specific context to provide the LLM a scope and OpenAI Whisper for text-to-speech and speech-to-text. LLM communicates with both a human operator and a robotic manipulator. The operator communicates via speech to the LLM either on a desktop application or via augmented reality (AR) headset, while LLM communicates through a socket to the manipulator, giving it commands to perform specific steps of the use-case. The robotic manipulator informs the LLM of its progress, which the LLM then articulates to the operator, thus providing interim information about the overall progress.
Project provides a pipeline and a proof-of-concept to manage a ChatGPT4.0 based virtual assistant in an assembly use case together with an operator and a robotic manipulator. The operator can interact with the assistant either on a desktop interface or with an AR-headset, while receiving information about the use case and how to progress in it. A robotic manipulator provides materials for the operator without operator needing to specify the robot, as the LLM watches over the process under the specified use case context.
Utilization of large language models (LLM) such as ChatGPT have become popular tools in everyday life. As usage is progressing more within various industries, the question arises how to connect these into multi-user/device environments. For the LLM to function as an assistant to the operators, and provide value within the given task, the user-interface (UI) needs to be straightforward.
Motivation for the project thrives from the possibility to provide specific instructions to a novice operator about previously unknown task in a way that operator can effortlessly interact with an entity about the progress. The target is a LLM supported human-robot work allocation on a concept level.
Since the development of LLM based assistants in industrial metaverse context is a new topic, the project provides a novel approach for building a data pipeline from operator to a robot via a LLM as a conversational agent. The approach provides scaling options to multiple robots and operators.
The project utilizes ChatGPT4.0 APIs with a use-case specific context to provide the LLM a scope and OpenAI Whisper for text-to-speech and speech-to-text. LLM communicates with both a human operator and a robotic manipulator. The operator communicates via speech to the LLM either on a desktop application or via augmented reality (AR) headset, while LLM communicates through a socket to the manipulator, giving it commands to perform specific steps of the use-case. The robotic manipulator informs the LLM of its progress, which the LLM then articulates to the operator, thus providing interim information about the overall progress.
Project provides a pipeline and a proof-of-concept to manage a ChatGPT4.0 based virtual assistant in an assembly use case together with an operator and a robotic manipulator. The operator can interact with the assistant either on a desktop interface or with an AR-headset, while receiving information about the use case and how to progress in it. A robotic manipulator provides materials for the operator without operator needing to specify the robot, as the LLM watches over the process under the specified use case context.
| Original language | English |
|---|---|
| Publication status | Published - 21 Oct 2024 |
| MoE publication type | Not Eligible |
| Event | FCAI AI Day + Nordic AI Meet 2024 - University of Helsinki, Helsinki, Finland Duration: 21 Oct 2024 → 22 Oct 2024 https://fcai.fi/ai-day-2024 |
Conference
| Conference | FCAI AI Day + Nordic AI Meet 2024 |
|---|---|
| Country/Territory | Finland |
| City | Helsinki |
| Period | 21/10/24 → 22/10/24 |
| Internet address |
Keywords
- Large Language Model
- Human-Robot Interaction
- Augmented reality
- Prompt-engineering
Fingerprint
Dive into the research topics of 'LLM based virtual assistant for human-robot interaction'. Together they form a unique fingerprint.Projects
- 1 Finished
-
HiFive: Meaningful industrial work in hybrid human-technology-AI teams
Aromaa, S. (Manager), Helaakoski, H. (Owner), Wahlström, M. (Participant), Hakanen, T. (Participant), Tikka, P. (Participant), Goriachev, V. (Participant), Tammela, A. (Participant), Heikkilä, P. (Participant), Lammi, H. (Participant), Salonen, T.-T. (Participant), Tammentie, B. (Participant), Kääriäinen, J. (Participant), Rainio, K. (Participant), Vierimaa, M. (Participant), Siltanen, P. (Participant) & Gotcheva, N. (Owner)
1/04/24 → 31/03/26
Project: Business Finland project
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver