Privacy compliant health data as a service for AI development

Project: EU project

Project Details

Description

Artificial intelligence (AI) enables data-driven innovations in health care. AI systems, which process vast amounts of data quickly and in detail, show promise both as a tool for preventive health care and clinical decision-making. However, the distributed storage and limited access to health data form a barrier to innovation, as developing trustworthy AI systems requires large datasets for training and validation. Furthermore, the availability of anonymous datasets would increase the adoption of AI-powered tools by supporting health technology assessments and education.

Secure, privacy compliant data utilization is key for unlocking the full potential of AI and data analytics.In this proposal, we will advance the current state-of-the-art data synthesis methods towards a more generalized approach of synthetic data generation. We will also develop metrics for testing and validation, as well as protocols that enable synthetic data generation without access to real-world data (through multi-party computation). We aim to provide:
1) Improved methods and technical pipelines for privacy-preserving data synthesis including different data formats such as EHRs and medical images,
2) Easy to use and configurable data services to enable AI developers’ access to larger pools of decentralized de-identified data through multi-party computing,
3) Provide anonymous data on demand or from a (temporary) repository,
4) Establish a Data Market – facilitating data sharing and monetization incl. incentives-based provision of data to the services,
5) Integrate the data market and the data service ecosystem as a X-European health data hub in the European Health Data Space, and
6) Validate the results with real-world use-cases focusing on high impact diseases, cancer types in particular.

In the PHASE IV AI project, VTT is leading the work package focused on defining and collecting requirements and specifications for the project's methods, data, and solutions. Additionally, VTT is exploring the quality and validity of synthetic medical data, with a particular emphasis on methods for generating synthetic medical images. From a utilization standpoint, VTT investigates the limitations and advantages of using synthetic data in the development of decision support systems for medical staff. Moreover, VTT is overseeing how the PHASE IV AI solutions align with the upcoming EHDS Act's standards and requirements.
AcronymPHASE IV AI
StatusActive
Effective start/end date1/10/2330/09/26

Collaborative partners

  • VTT Technical Research Centre of Finland
  • University of Vienna
  • Institute for Systems and Computer Engineering, Technology, and Science (INESC-TEC)
  • Centre Hospitalier Universitaire Vaudois
  • Sabancı University
  • Turku University of Applied Sciences
  • Fujitsu Technology Solutions S.A.
  • Nottingham University Hospitals NHS Trust
  • Nottingham Trent University
  • AINIGMA Technologies
  • Varsinais-Suomen hyvinvointialue
  • Resilience Guard GmbH
  • Fujitsu Belgium
  • Fundació Eurecat
  • Engineering Ingegneria Informatica S.p.A.
  • Katholieke Universiteit Leuven (KU Leuven)
  • Vall d'Hebron University Hospital
  • INPHER Sàrl
  • University of Turku (lead)
  • LeanXcale SL

Funding category

  • Horizon Europe

Keywords

  • HORIZON-HLTH-2022-IND-13-02
  • artificial intelligence
  • oncology
  • ecosystems