Data Analytics, Statistics, Chemometrics, and Artificial Intelligence

Latest News


Unsolved Problems in Spectroscopy - Part 2

This tutorial addresses the critical issue of analyte specificity in multivariate spectroscopy using the concept of Net Analyte Signal (NAS). NAS allows chemometricians to isolate the portion of the signal that is unique to the analyte of interest, thereby enhancing model interpretability and robustness in the presence of interfering species. While this tutorial introduces the foundational concepts for beginners, it also includes selected advanced topics to bridge toward expert-level applications and future research. The tutorial covers the mathematical foundation of NAS, its application in regression models like partial least squares (PLS), and emerging methods to optimize specificity and variable selection. Applications in pharmaceuticals, clinical diagnostics, and industrial process control are also discussed.

Unsolved Problems in Spectroscopy - Part 1

This tutorial examines the modeling of diffuse reflectance (DR) in complex particulate samples, such as powders and granular solids. Traditional theoretical frameworks like empirical absorbance, Kubelka-Munk, radiative transfer theory (RTT), and the Hapke model are presented in standard and matrix notation where applicable. Their advantages and limitations are highlighted, particularly for heterogeneous particle size distributions and real-world variations in the optical properties of particulate samples. Hybrid and emerging computational strategies, including Monte Carlo methods, full-wave numerical solvers, and machine learning (ML) models, are evaluated for their potential to produce more generalizable prediction models.

Thomas Mayerhöfer and Jürgen Popp

In this tutorial, Thomas G. Mayerhöfer and Jürgen Popp introduce complex-valued chemometrics as a more physically grounded alternative to traditional intensity-based spectroscopy measurement methods. By incorporating both the real and imaginary parts of the complex refractive index of a sample, this approach preserves phase information and improves linearity with sample analyte concentration. The result is more robust and interpretable multivariate models, especially in systems affected by nonlinear effects or strong solvent and analyte interactions.