Object detection in design diagrams with machine learning

Jukka K. Nurminen, Kari Rainio (Corresponding author), Jukka Pekka Numminen, Timo Syrjänen, Niklas Paganus, Karri Honkoila

Research output: Chapter in Book/Report/Conference proceedingChapter or book articleScientificpeer-review

Abstract

Over the years companies have accumulated large amounts of legacy data. With modern data mining and machine learning techniques the data is increasingly valuable. Therefore being able to convert legacy data into a computer understandable form is important. In this work, we investigate how to convert schematic diagrams, such as process and instrumentation diagrams (P&I diagrams). We use modern machine learning based approaches, in particular, the Yolo neural network system, to detect high-level objects, e.g. pumps or valves, in diagrams which are scanned from paper archives or stored in pixel or vector form. Together with connection detection and OCR this is an essential step for the reuse of old planning data. Our results show that Yolo, as an instance of modern machine learning based object detection systems, works well with schematic diagrams. In our concept, we use a simulator to automatically generate labeled training material to the system. We then retrain a previously trained network to detect the components of our interest. Detection of large components is accurate but small components with sizes below 15% of page size are missed. However, this can be worked around by dividing a big diagram into a set of smaller subdiagrams with different scales, processing them separately, and combining the results.

Original languageEnglish
Title of host publicationCORES 2019
Subtitle of host publicationProgress in Computer Recognition Systems
PublisherSpringer
Pages27-36
Number of pages10
ISBN (Electronic)978-3-030-19738-4
ISBN (Print)978-3-030-19737-7
DOIs
Publication statusPublished - 1 Jan 2020
MoE publication typeA3 Part of a book or another research book
EventInternational Conference on Computer Recognition Systems, CORES 2019 - Polanica-Zdrój, Poland
Duration: 20 May 202022 May 2020

Publication series

SeriesAdvances in Intelligent Systems and Computing
Volume977
ISSN2194-5357

Conference

ConferenceInternational Conference on Computer Recognition Systems, CORES 2019
Abbreviated titleCORES 2019
CountryPoland
CityPolanica-Zdrój
Period20/05/2022/05/20

    Fingerprint

Keywords

  • Legacy data
  • Machine learning
  • Object detection
  • Schematic diagrams

Cite this

Nurminen, J. K., Rainio, K., Numminen, J. P., Syrjänen, T., Paganus, N., & Honkoila, K. (2020). Object detection in design diagrams with machine learning. In CORES 2019: Progress in Computer Recognition Systems (pp. 27-36). Springer. Advances in Intelligent Systems and Computing, Vol.. 977 https://doi.org/10.1007/978-3-030-19738-4_4