Abstract
Clean data is crucial for creating dependable digital twins in mineral processing. Incorporating Machine Learning (ML) algorithms within digital twins further strengthens the significance of utilizing clean data. A systematic and organised data preprocessing approach based on Three-Tier architecture is utilized for methodical and transparent data transformations in this work. Such an approach ensures that the data used by ML algorithms is consistently clean and reliable, enhancing the overall effectiveness of digital twins. The approach has been conceptually well-grounded in a Three-Tier architecture (User tier, Computation tier, Data tier), where different tiers can independently deal with the distinct aspects of data preprocessing. The User tier primarily handles the interaction with the user interface. Data preprocessing fits into the Computation tier, where it handles core preprocessing tasks (different tasks are organised into different modules) related to data cleaning, transformation, feature engineering, and data integration. Lastly, the Data tier focuses on storing and retrieving data. Such separation ensures a modular, extensible, and maintainable approach to managing data processing tasks within the application, thereby significantly enhancing the reproducibility, version control, and reliability of cleaned data in the project.
| Original language | English |
|---|---|
| Title of host publication | 2025 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA) |
| Publisher | IEEE Institute of Electrical and Electronic Engineers |
| ISBN (Electronic) | 9798331535629 |
| ISBN (Print) | 979-8-3315-3563-6 |
| DOIs | |
| Publication status | Published - 2025 |
| MoE publication type | A4 Article in a conference publication |
| Event | 2nd International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2025 - Antalya, Turkey Duration: 7 Aug 2025 → 9 Aug 2025 |
Conference
| Conference | 2nd International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2025 |
|---|---|
| Country/Territory | Turkey |
| City | Antalya |
| Period | 7/08/25 → 9/08/25 |
Funding
The authors sincerely acknowledge the support of Business Finland and VTT for funding this research.
Keywords
- data pipeline
- data preprocessing
- data versioning
- machine learning
- mineral processing
Fingerprint
Dive into the research topics of 'A systematic data preprocessing approach based on Three-Tier architecture: Ensuring reproducibility, version control, and use of cleaned data for digital twins in mineral processing'. Together they form a unique fingerprint.Projects
- 1 Finished
-
AIMODE: Development of Artificial Intelligence and Machine Learning for Online Perception and Operating Mode Optimization in Process Industry
Linnosmaa, J. (Manager), Seppi, M. (Participant), Zeb, A. (Participant), Saarela, O. (Participant), Verma, N. (Participant), Freimane, L. (Participant), Aho, J. (Participant) & Tahkola, M. (Participant)
1/09/22 → 31/08/25
Project: Business Finland project
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver