Data Analytics, Statistics, Chemometrics, and Artificial Intelligence

Latest News


Unsolved Problems in Spectroscopy - Part 8

This tutorial explores the challenges posed by nonlinearities in spectroscopic calibration models, including physical origins, detection strategies, and correction approaches. Linear regression methods such as partial least squares (PLS) dominate chemometrics, but real-world data often violate linear assumptions due to Beer–Lambert law deviations, scattering, and instrumental artifacts. We examine extensions beyond linearity, including polynomial regression, kernel partial least squares (K-PLS), Gaussian process regression (GPR), and artificial neural networks (ANNs). Equations are provided in full matrix notation for clarity. Practical applications across near-infrared (NIR), mid-infrared (MIR), Raman, and atomic spectroscopies are discussed, and future research directions are outlined with emphasis on hybrid models that integrate physical and statistical knowledge.

NIR aquaphotomics is used for biofluid and food analysis © By Sona-chronicles-stock.adobe.com

Near-infrared (NIR) spectroscopy combined with aquaphotomics shows potential for a rapid, non-invasive approach to detect subtle biochemical changes in biofluids and agricultural products. By monitoring water molecular structures through water matrix coordinates (WAMACs) and visualizing water absorption spectrum patterns (WASPs) via aquagrams, researchers can identify disease biomarkers, food contaminants, and other analytes with high accuracy. This tutorial introduces the principles, practical workflow, and applications of NIR aquaphotomics for everyday laboratory use.

Unsolved Problems in Spectroscopy - Part 6

This tutorial provides an in-depth discussion of methods to make machine learning (ML) models interpretable in the context of spectroscopic data analysis. As atomic and molecular spectroscopy increasingly incorporates advanced ML techniques, the black-box nature of these models can limit their utility in scientific research and practical applications. We present explainable artificial intelligence (XAI) approaches such as SHAP, LIME, and saliency maps, demonstrating how they can help identify chemically meaningful spectral features. This tutorial also explores the trade-off between model complexity and interpretability.

Unsolved Problems in Spectroscopy - Part 5

This tutorial contrasts classical analytical error propagation with modern Bayesian and resampling approaches, including bootstrapping and jackknifing. Uncertainty estimation in multivariate calibration remains an unsolved problem in spectroscopy, as traditional, Bayesian, and resampling approaches yield differing error bars for chemometric models like PLS and PCR, highlighting the need for deeper theoretical and practical solutions.

Rear view of senior farmer standing in soybean field examining crop at sunset. | Image Credit: © Zoran Zeremski - stock.adobe.com

A new review article highlights how Explainable Artificial Intelligence (XAI) can enhance transparency, trust, and innovation in agricultural spectroscopy, paving the way for smarter and more sustainable food quality assessment.

Unsolved Problems in Spectroscopy, Part 4

This tutorial investigates the persistent issue of sample heterogeneity—chemical and physical—during spectroscopic analysis. Focus will be placed on understanding how spatial variation, surface texture, and particle interactions influence spectral features. Imaging spectroscopy, localized sampling strategies, and adaptive averaging algorithms will be reviewed as tools to manage this problem, as one of the remaining unsolved problems in spectroscopy.