Skip to main navigation Skip to search Skip to main content

A systematic data preprocessing approach based on Three-Tier architecture: Ensuring reproducibility, version control, and use of cleaned data for digital twins in mineral processing

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

Abstract

Clean data is crucial for creating dependable digital twins in mineral processing. Incorporating Machine Learning (ML) algorithms within digital twins further strengthens the significance of utilizing clean data. A systematic and organised data preprocessing approach based on Three-Tier architecture is utilized for methodical and transparent data transformations in this work. Such an approach ensures that the data used by ML algorithms is consistently clean and reliable, enhancing the overall effectiveness of digital twins. The approach has been conceptually well-grounded in a Three-Tier architecture (User tier, Computation tier, Data tier), where different tiers can independently deal with the distinct aspects of data preprocessing. The User tier primarily handles the interaction with the user interface. Data preprocessing fits into the Computation tier, where it handles core preprocessing tasks (different tasks are organised into different modules) related to data cleaning, transformation, feature engineering, and data integration. Lastly, the Data tier focuses on storing and retrieving data. Such separation ensures a modular, extensible, and maintainable approach to managing data processing tasks within the application, thereby significantly enhancing the reproducibility, version control, and reliability of cleaned data in the project.

Original languageEnglish
Title of host publication2025 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA)
PublisherIEEE Institute of Electrical and Electronic Engineers
ISBN (Electronic)9798331535629
ISBN (Print)979-8-3315-3563-6
DOIs
Publication statusPublished - 2025
MoE publication typeA4 Article in a conference publication
Event2nd International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2025 - Antalya, Turkey
Duration: 7 Aug 20259 Aug 2025

Conference

Conference2nd International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2025
Country/TerritoryTurkey
CityAntalya
Period7/08/259/08/25

Funding

The authors sincerely acknowledge the support of Business Finland and VTT for funding this research.

Keywords

  • data pipeline
  • data preprocessing
  • data versioning
  • machine learning
  • mineral processing

Fingerprint

Dive into the research topics of 'A systematic data preprocessing approach based on Three-Tier architecture: Ensuring reproducibility, version control, and use of cleaned data for digital twins in mineral processing'. Together they form a unique fingerprint.

Cite this