News
Article
Author(s):
This tutorial guides spectroscopy practitioners through the integration of Raman spectroscopy and machine learning (ML) techniques for detecting microplastics (MPs) in aquatic and environmental samples.
Introduction and Relevance
The pervasive presence of microplastics in aquatic environments poses ecological and health risks due to their ingestion by marine organisms and potential to bioaccumulate toxic substances (1). Identifying and classifying MPs accurately is critical for monitoring pollution levels and understanding environmental impacts. Raman spectroscopy is a well-established vibrational spectroscopic technique that provides detailed molecular fingerprints of polymers, enabling the identification of microplastic particles based on their characteristic vibrational modes (2,9).
However, traditional Raman analysis faces challenges such as overlapping spectra from mixed samples, weak signals from very small particles, and time-consuming manual interpretation.
Recent advances combine Raman spectroscopy with machine learning (ML) algorithms to enhance detection accuracy and speed by automating spectral classification and feature extraction (1,7,8). This tutorial introduces readers to the fundamental Raman spectral features of common microplastics, explores ML methods suitable for this purpose, and provides practical guidelines to apply these tools effectively in the laboratory or field studies.
Machine Learning Enhanced Raman Spectroscopy for Microplastics Detection in Environmental Samples: A Practical Tutorial © IndigoElf -chronicles-stock.adobe.com
Core Tutorial Content
1. Principles and Definitions:
Raman spectroscopy is a vibrational spectroscopic technique that probes molecular structure through the inelastic scattering of monochromatic light, typically from a laser. When photons interact with molecular vibrations, most are elastically scattered (Rayleigh scattering), but a small fraction experience an energy shift due to vibrational energy changes—this is the Raman effect. The resulting Raman spectrum displays intensity versus Raman shift (in cm⁻¹), producing a unique molecular fingerprint for each material. In the case of polymers, characteristic vibrational modes—such as C–H stretching, C–C skeletal stretching, or aromatic ring breathing—produce peaks at specific wavenumbers that can be used for microplastics polymer identification.
Machine learning (ML), in this context, refers to computational methods that enable automated pattern recognition in Raman spectra without explicit programming of chemical knowledge. ML algorithms learn from examples of known spectra, extracting key features—either manually through techniques like principal component analysis (PCA) or automatically via deep learning networks—and then classify unknown spectra into categories such as polymer type. Common algorithms in this field include support vector machines (SVM) for robust decision boundary creation, Random Forest (RF) for ensemble-based classification, K-Nearest Neighbors (KNN) for similarity-based recognition, and Autoencoders for unsupervised feature learning. The synergy between Raman spectroscopy’s high chemical specificity and ML’s ability to handle complex spectral data enables rapid, accurate identification of microplastics in environmental samples.
2. Raman Spectroscopy and Polymer Vibrations Assignments
Raman spectroscopy exploits the inelastic scattering of monochromatic light to provide molecular vibrational information unique to chemical structures. Polymers in microplastics produce characteristic Raman peaks corresponding to molecular bond vibrations such as C–C stretches, CH₂ bending, and aromatic ring modes (2,3,5). Typical polymers found in MPs include polyethylene (PE), polypropylene (PP), polystyrene (PS), and polyethylene terephthalate (PET), each with signature vibrational bands (Table I).
Table I: Common microplastic polymers with Raman band assignments
3. Machine Learning Algorithms for Spectral Classification
ML involves algorithms that learn patterns from data, making them suitable for automating the classification of complex Raman spectra. Common algorithms include:
4. How It Works in Practice
Sample Preparation and Data Acquisition
Microplastic samples are isolated from water or sediment using filtration or density separation. Raman spectra are acquired using a microscope-coupled Raman system with an excitation wavelength typically between 532 nm and 785 nm to minimize fluorescence background (2).
Spectral Preprocessing
Raw spectra require preprocessing steps such as baseline correction, smoothing, normalization, and cosmic ray removal to improve data quality for ML input (1,7).
Feature Extraction and Model Training
Preprocessed spectra are subjected to PCA or other methods to extract key features. A labeled training dataset with known polymer types is used to train classification algorithms like RF or SVM. The model learns spectral patterns associated with each polymer class (1).
Classification and Validation
New spectra from unknown samples are input into the trained model, which predicts polymer identity based on learned patterns. Model performance is validated using metrics like accuracy, precision, recall, and cross-validation methods (1,7).
5. Application Examples
Sunil and colleagues demonstrated the use of RF and SVM classifiers on Raman spectral datasets for microplastics detection in environmental samples, achieving high classification accuracy and reduced analysis time compared to manual interpretation (1). Jin and colleagues used deep learning models on Raman data, achieving robust polymer identification even in mixed microplastic samples (7). These studies illustrate the synergy between Raman spectral detail and ML's pattern recognition power.
6. Tips and Common Pitfalls
6. Illustrative Examples to Support Understanding
To complement the concepts described in this tutorial, a series of illustrations would greatly aid in visualizing the integration of Raman spectroscopy and ML for microplastics detection.
The first illustration would depict a schematic diagram of a Raman microscope setup tailored for microplastic analysis. This would show the laser source emitting monochromatic light, the optical path through microscope objectives focusing on microplastic particles on a sample stage, and the scattered light collected and directed to the spectrometer for detection. Such a diagram helps readers grasp the experimental configuration and key components involved in acquiring Raman spectra from tiny environmental particles.
Next, a set of example Raman spectra for common polymers such as polyethylene (PE), polypropylene (PP), polystyrene (PS), and polyethylene terephthalate (PET) would be presented. Each spectrum would be annotated with labeled peaks corresponding to specific vibrational modes—for instance, the styrene ring breathing mode near 1000 cm⁻¹ in PS, the C–C skeletal stretch around 1064–1100 cm⁻¹ in PE, and the carbonyl (C=O) stretch near 1720 cm⁻¹ in PET. This visual comparison would reinforce understanding of how molecular structure influences the spectral fingerprint and the importance of these bands in identifying polymer types.
A third illustration would outline the workflow of the ML process applied to Raman spectral data. It would depict the stages from raw spectral acquisition, through preprocessing steps such as baseline correction and normalization, to feature extraction via PCA. This would be followed by a branch showing different classification algorithms—RF, SVM, K-Nearest Neighbors, and deep learning models—applied to the extracted features, culminating in the classification results that identify polymer types. Such a flowchart clarifies how spectral data transforms into actionable information using ML.
Finally, a score plot from PCA would visually demonstrate how spectra from different microplastic polymers cluster in reduced-dimensional space. This plot highlights how the algorithm discerns patterns and groupings that separate polymer classes based on their spectral variance, illustrating the effectiveness of ML in differentiating microplastics even when spectral differences are subtle.
Together, these illustrations would provide readers with a clear visual roadmap of both the instrumentation and data analysis steps, enhancing comprehension and practical application of Raman spectroscopy coupled with ML for microplastic detection.
7. Conclusion and Practical Takeaways
Combining Raman spectroscopy with ML provides a practical, powerful approach for identifying microplastics in environmental samples with improved speed and accuracy. Understanding key vibrational bands and their polymer assignments is essential to interpreting spectral data. ML algorithms such as RF and SVM enable automated classification by recognizing subtle spectral patterns beyond manual detection. Careful spectral preprocessing, diverse training datasets, and validation ensure robust model performance. Practitioners can apply these techniques in environmental monitoring, pollution assessment, and material science research to better characterize microplastics and their impact.
References
(1) Sunil, M.; Pallikkavaliyaveetil, N.; Gopinath, A.; Chidangil, S.; Kumar, S.; Lukose, J. Machine learning assisted Raman spectroscopy: A viable approach for the detection of microplastics. J. Water Process Eng. 2024, 60, 105150. DOI: 10.1016/j.jwpe.2024.105150
(2) Nava, V.; Frezzotti, M. L.; Leoni, B.; et al. Raman Spectroscopy for the Analysis of Microplastics in Aquatic Systems. Appl. Spectrosc. 2021, 75 (11), 1341–1357. DOI: 10.1177/00037028211043119
(3) Thermo Fisher Scientific. Classification of polyethylene by Raman spectroscopy (Application Note AN52301). Thermo Fisher Scientific, 2019. Available at: https://assets.thermofisher.com/TFS-Assets/MSD/Application-Notes/AN52301-classification-polyethylene-Raman-spectroscopy-app-note.pdf (accessed 2025-08-13).
(4) Laramée, A. W.; Lanthier, C.; Pellerin, C. Raman Investigation of the Processing Structure Relations in Individual Poly(ethylene terephthalate) Electrospun Fibers. Appl. Spectrosc. 2022, 76 (1), 51–60. DOI: 10.1177/00037028211049242
(5) Sagitova, E. A.; Donfack, P.; Nikolaeva, G. Y.; Prokhorov, K. A.; et al. Regularity Modes in Raman Spectra of Polyolefins: Part II. Polyethylene and Ethylene Copolymers. Vibr. Spectrosc. 2016, 84, 139–145. DOI: 10.1016/j.vibspec.2016.03.013
(6) Prokhorov, K. A.; Nikolaeva, G. Y.; Sagitova, E. A. Regularity Modes in Raman Spectra of Polyolefins: Part I. Propylene/Olefin Copolymers. Vibr. Spectrosc. 2016, 85, 22–28. DOI: 10.1016/j.vibspec.2016.03.021
(7) Jin, N.; Song, Y.; Ma, R.; Li, J.; Li, G.; Zhang, D. Characterization and Identification of Microplastics Using Raman Spectroscopy Coupled with Multivariate Analysis. Anal. Chim. Acta 2022, 1197, 339519. DOI: 10.1016/j.aca.2022.339519
(8) Ramanna, S.; Morozovskii, D.; Swanson, S.; Bruneau, J. Machine Learning of Polymer Types from the Spectral Signature of Raman Spectroscopy Microplastics Data. arXiv 2022, arXiv:2201.05445. Available at: https://arxiv.org/abs/2201.05445 (accessed 2025-08-13).
(9) Araujo, C. F.; Nolasco, M. M.; Ribeiro, A. M. P.; Ribeiro-Claro, P. J. A. Identification of Microplastics Using Raman Spectroscopy: Latest Developments and Future Prospects. Water Res. 2018, 142, 426–440. DOI: 10.1016/j.watres.2018.05.060
Get essential updates on the latest spectroscopy technologies, regulatory standards, and best practices—subscribe today to Spectroscopy.