TY - GEN
T1 - Automatic Speech Recognition of Finnish-Swedish Dialects
T2 - 21st International Conference on Smart Technologies & Education (STE-2024)
AU - Espinosa-Leal, Leonardo
AU - Adolfsson, Kristoffer Kuvaja
AU - Shcherbakov, Andrey
PY - 2024
Y1 - 2024
N2 - This paper explores the performance of two different automatic speech recognition models for the Finnish-Swedish language. The first model, Whisper V1 released by OpenAI and the second, the KBLab model trained using a large dataset by the National Library of Sweden. These models were trained initially using data from the Swedish language from Sweden, and the results were compared with previous work trained using a dataset of Finnish-Swedish audio. Our results indicate that general models perform at the same level, opening up the possibility of using these in Finland for the inclusion of the Finnish-Swedish minority.
AB - This paper explores the performance of two different automatic speech recognition models for the Finnish-Swedish language. The first model, Whisper V1 released by OpenAI and the second, the KBLab model trained using a large dataset by the National Library of Sweden. These models were trained initially using data from the Swedish language from Sweden, and the results were compared with previous work trained using a dataset of Finnish-Swedish audio. Our results indicate that general models perform at the same level, opening up the possibility of using these in Finland for the inclusion of the Finnish-Swedish minority.
UR - http://www.scopus.com/inward/record.url?scp=85197434824&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-61905-2_30
DO - 10.1007/978-3-031-61905-2_30
M3 - Conference article in proceedings
SN - 978-3-031-61904-5
VL - 2
T3 - Lecture Notes in Networks and Systems
SP - 309
EP - 315
BT - Smart Technologies for a Sustainable Future
A2 - Auer, Michael E.
A2 - Langmann, Reinhard
A2 - May, Dominik
A2 - Roos, Kim
PB - Springer
CY - Cham
Y2 - 6 March 2024 through 8 March 2024
ER -