TY - GEN
T1 - Distributed Memory-Efficient Algorithm for Extreme Learning Machines Based on Spark
AU - Akusok, Anton
AU - Espinosa-Leal, Leonardo
AU - Björk, Kaj-Mikael
AU - Lendasse, Amaury
PY - 2024
Y1 - 2024
N2 - This work presents a distributed limited-memory algorithm for Extreme Learning Machines (ELM) with training data stored in Spark. The method runs batch matrix computations to obtain the direct non-iterative solution of ELM, reads data only once, and uses the least network bandwidth, making it very computationally efficient. This is achieved by extensive use of lazy evaluation and generators in Spark, deliberately avoiding operations that may lead to data caching. The method scales to virtually infinite datasets and any number of ELM neurons, with runtime being the only restricting factor.
AB - This work presents a distributed limited-memory algorithm for Extreme Learning Machines (ELM) with training data stored in Spark. The method runs batch matrix computations to obtain the direct non-iterative solution of ELM, reads data only once, and uses the least network bandwidth, making it very computationally efficient. This is achieved by extensive use of lazy evaluation and generators in Spark, deliberately avoiding operations that may lead to data caching. The method scales to virtually infinite datasets and any number of ELM neurons, with runtime being the only restricting factor.
UR - https://github.com/akusok/pyspark-elm
U2 - 10.1007/978-3-031-55056-0_1
DO - 10.1007/978-3-031-55056-0_1
M3 - Conference article in proceedings
SN - 978-3-031-55055-3
SN - 978-3-031-55058-4
T3 - Proceedings in Adaptation, Learning and Optimization
SP - 1
EP - 8
BT - Proceedings of ELM 2022
A2 - Björk, Kaj-Mikael
PB - Springer
CY - Cham
T2 - 12th International Conference on Extreme Learning Machines (ELM 2022)
Y2 - 8 December 2022 through 9 December 2022
ER -