Abstract
AI Alignment is a term used to summarize the aim of making artificial intelligence (AI) systems behave in line with human intentions and values. There has been little consideration in previous AI Alignment studies of the need for AI Alignment to be adaptive in order to contribute to the survival of human organizations in changing environments. This research gap is addressed here by defining human intentions and values in terms of survival biophysics: entropy, complexity, and adaptive behavior. Furthermore, although technology alignment has been a focus of studies for more than thirty years, there has been little consideration in AI Alignment studies of established resources for aligning technologies. Unlike the current focus of AI Alignment on addressing potential AI risks, technology alignment is generally focused on aligning with opportunities. Established resources include the critical realist philosophy of science, scientific theories, total quality management practices, technology alignment methods, engineering techniques, and technology standards. Here, these established resources are related to the alignment of different types of machine learning with different levels of human organizations. In addition, established resources are related to a well-known hypothetical extreme example of AI Misalignment, and to major constructs in the AI Alignment literature. Overall, it is argued that AI Alignment needs to be adaptive in order for human organizations to be able to survive in changing environments, and that established resources can facilitate Adaptive AI Alignment which addresses risks while focusing on opportunities.
Original language | English |
---|---|
Pages (from-to) | 2570–2600 |
Number of pages | 31 |
Journal | Machine Learning and Knowledge Extraction |
Volume | 6 |
Issue number | 4 |
Publication status | Published - 6 Nov 2024 |
MoE publication type | A1 Journal article-refereed |
Funding
This research was funded by the Research Council of Finland grant number 357221, and by VTT Technical Research Centre of Finland Ltd.
Fingerprint
Dive into the research topics of 'Adaptive AI Alignment: Established resources for aligning machine learning with human intentions and values in changing environments'. Together they form a unique fingerprint.
Projects
DOMINIC: Developmental Multi-Robot Systems in Cognitive Manufacturing
Heikkilä, T. (CoPI), Halbach, E. (Manager) & Känsäkoski, N. (Participant)
1/09/23 → 31/08/26
Project: Academy of Finland project