Comparing machine learning methods on Raman spectra from eight different spectrometers

Christoph Lange*, Maxim Borisyak, Martin Kögler, Stefan Born, Andreas Ziehe, Peter Neubauer, M. Nicolas Cruz Bournazou

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

Abstract

In biotechnology, Raman Spectroscopy is becoming increasingly popular as a process analytical technology (PAT) for measuring substrates, metabolites, and product-related concentrations. By recording the vibrational modes of molecular bonds, it provides information non-invasively in a high-dimensional spectrum. Machine learning models are used to transform these spectral data into meaningful concentrations of species. Typically, one assumes a linear relationship between intensity and concentrations and learns these relationships using a partial least squares (PLS) model. However, in biological cultivations with a very large number of components, nonlinear models such as convolutional neural networks (CNN) offer significant advantages. In this work, we show that training one CNN on spectra from eight different spectrometers significantly outperforms PLS models. Specifically, we created samples with known concentrations of glucose, sodium acetate and magnesium sulfate and measured more than 2200 spectra of these samples with eight different spectrometers. We trained one CNN on the spectra from all eight datasets simultaneously. This shows great potential for laboratories with data from more than one spectrometer as they do not need to spend extra effort in calibrating individual PLS models, but they can use a joint CNN, which even improves the overall accuracy. In addition, we compare the eight different spectrometers against each other. The results suggest that three spectrometers are better suited for quantifying glucose, sodium acetate, and magnesium sulfate given the models.

Original languageEnglish
Article number125861
JournalSpectrochimica Acta - Part A: Molecular and Biomolecular Spectroscopy
Volume334
DOIs
Publication statusPublished - 5 Jun 2025
MoE publication typeA1 Journal article-refereed

Funding

CL, MB, SB, PN, and MNCB gratefully acknowledge the financial support of the German Federal Ministry of Education and Research ( 01DD20002A - KIWI biolab). MK work is part of the Research Council of Finland Flagship Programme, Photonics Research and Innovation (PREIN) , decision number 346545 . The work was also supported by the Printed intelligence infrastructure funding, decision 358621 . AZ gratefully acknowledges funding from the German Federal Ministry of Education and Research under the grant BIFOLD25B .

Keywords

  • Convolutional neural network
  • Machine learning
  • Partial least squares
  • Raman spectroscopy

Fingerprint

Dive into the research topics of 'Comparing machine learning methods on Raman spectra from eight different spectrometers'. Together they form a unique fingerprint.

Cite this