Using machine learning methods and spectroscopy, scientists from Central South University in Hunan, China created a unique method of analyzing empty puparia to identify insect species. Their research was published in Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy (1).
Set of insects | Image Credit: © Alekss - stock.adobe.com
Species identification, specifically within entomological surveys, can have a great impact on biodiversity assessment to environmental management, to forensic investigations (1). Insect species identification can be done using a variety of objects, including eggs, larvae, and pupae. Empty puparia, for example, can be the sole source of entomological evidence available when an insect dies, and this aspect of species identification is relatively unstudied.
Empty puparia are the exoskeletons that remain after insect eclosion, safeguarding intra-puparium tissue from damage. There have been many studies on the composition of empty puparia, leading to its use in multiple fields, such as developing antibacterial drugs and in postmortem interval (PMI) estimation (1). That said, traditional analysis methods fall to tell the difference between incomplete empty puparia and species that are morphologically similar. This has led to a need in easier and faster techniques for detecting empty puparia.
In this study, attenuated total reflectance-Fourier transform infrared spectroscopy (ATR-FTIR) was used to acquire the spectral information from empty puparia of five different species of fly. The data was then subjected to spectral pre-processing to obtain average spectra for preliminary analysis. Following this, principal component analysis (PCA) and orthogonal partial least squares-discriminant analysis (OPLS-DA) were used for clustering and classifying the spectra. Afterwards, three machine learning models–Support Vector Machines (SVM), K-nearest neighbor (KNN), and Random Forest (RF)–were used to analyze spectra from different waveband groups.
During the clustering and classification process, two wavebands (3000–2800 cm−1 and 1800–1300 cm−1) were deemed significant in distinguishing one of the species, Aldrichina graham. As for the machine learning models, the biological fingerprint region (1800–1300 cm−1) showed a great ability in identifying empty puparia species. Notably, the SVM model exhibited a 100% accuracy in identifying all five fly species. Overall, the scientists view this as a notable first step in identifying insect species with empty puparia, specifically using infrared spectroscopy and machine learning methods for the process. According to them, this study provides “a robust research foundation for future investigations in this area” (1).
(1) Zhang, X.; Yang, F.; Xiao, J.; Qu, H.; Jocelin, N. F.; Ren, L.; Guo, Y. Analysis and Comparison of Machine Learning Methods for Species Identification Utilizing ATR-FTIR Spectroscopy. Spectrochim. Acta Part B At. Spectrosc. 2024, 308, 123713. DOI: https://doi.org/10.1016/j.saa.2023.123713
Get essential updates on the latest spectroscopy technologies, regulatory standards, and best practices—subscribe today to Spectroscopy.
Drone-Mounted Infrared Camera Sees Invisible Methane Leaks in Real Time
July 9th 2025Researchers in Scotland have developed a drone-mounted infrared imaging system that can detect and map methane gas leaks in real time from up to 13.6 meters away. The innovative approach combines laser spectroscopy with infrared imaging, offering a safer and more efficient tool for monitoring pipeline leaks and greenhouse gas emissions.
How Spectroscopy Drones Are Detecting Hidden Crop Threats in China’s Soybean Fields
July 8th 2025Researchers in Northeast China have demonstrated a new approach using drone-mounted multispectral imaging to monitor and predict soybean bacterial blight disease, offering a promising tool for early detection and yield protection.
ATR-FTIR Spectroscopy Enhances Accuracy in Identifying Asphyxial Deaths, Study Finds
July 8th 2025Researchers at Xi’an Jiaotong University have demonstrated that ATR-FTIR spectroscopy, combined with histological analysis and machine learning, can accurately distinguish between drowning and strangulation in forensic cases.
Radar and Soil Spectroscopy Boost Soil Carbon Predictions in Brazil’s Semi-Arid Regions
July 7th 2025A new study published in Geoderma demonstrates that combining soil spectroscopy with radar-derived vegetation indices and environmental data significantly improves the accuracy of soil organic carbon predictions in Brazil’s semi-arid regions.