Prompting to Gather Object Categories in NeRF Scenes Related to Manufacturing

  • Selen Pehlivan*
  • Santeri Hyvärinen
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceedings › Scientific › peer-review

Abstract

Despite the effectiveness of closed-set object detectors, recent advancements have introduced zero-shot detectors that can recognize a wide range of object categories across different environments. These detectors rely on text prompts, such as object tags. This study explores using multimodal large language models (MLLMs) to gather and refine object information from NeRF scenes into tags. We propose a training-free pipeline for extracting object-specific details, such as category, color, material, and functionality, from 3D scenes via prompting. Subsequently, we investigate how to apply the object tagging problem to NeRF-reconstructed scenes, particularly in a manufacturing context. This pipeline is evaluated in manufacturing environments for object recognition, with the resulting categories serving as inputs for zero-shot object detection and other tasks.
Original language: English
Title of host publication: Advances in Artificial Intelligence in Manufacturing II - Proceedings of the 2nd European Symposium on Artificial Intelligence in Manufacturing, 2024
Publisher: Springer
Pages: 242-250
ISBN (Print): 9783031864889
DOIs
Publication status: Published - 2025
MoE publication type: A4 Article in a conference publication
Event: 2nd European Symposium on Artificial Intelligence in Manufacturing, ESAIM 2024 - Athens, Greece
Duration: 16 Oct 2024 - 16 Oct 2024

Publication series

Series: Lecture Notes in Mechanical Engineering
ISSN: 2195-4356

Conference

Conference: 2nd European Symposium on Artificial Intelligence in Manufacturing, ESAIM 2024
Country/Territory: Greece
City: Athens
Period: 16/10/24 - 16/10/24

Funding

This research was funded by the VTT Technical Research Centre of Finland.

Keywords

  • 3D Scene Understanding
  • Multimodal Large Language Models
  • NeRF
  • Object Recognition
  • Prompting
