This tutorial addresses the critical issue of analyte specificity in multivariate spectroscopy using the concept of Net Analyte Signal (NAS). NAS allows chemometricians to isolate the portion of the signal that is unique to the analyte of interest, thereby enhancing model interpretability and robustness in the presence of interfering species. While this tutorial introduces the foundational concepts for beginners, it also includes selected advanced topics to bridge toward expert-level applications and future research. The tutorial covers the mathematical foundation of NAS, its application in regression models like partial least squares (PLS), and emerging methods to optimize specificity and variable selection. Applications in pharmaceuticals, clinical diagnostics, and industrial process control are also discussed.
Abstract
In multivariate spectral analysis, overlapping signals from multiple components often obscure the specific contribution of a target analyte. The Net Analyte Signal (NAS) is a theoretical and practical construct developed to isolate and quantify this unique contribution. NAS has become a fundamental tool for assessing sensitivity, selectivity, and signal-to-noise ratio (SNR) in chemometric modeling. Importantly, as the number and diversity of interferents increase in a dataset, the NAS component for an analyte typically shrinks, leading to lower signal strength and increased noise sensitivity. This has critical implications for constructing global versus local models. This tutorial presents the derivation of NAS in matrix notation, explores its role in improving specificity in PLS and PCR models, and discusses strategies for integrating NAS with modern data-driven techniques such as sparse regression and multi-block data fusion. The significance of NAS in quality control, process monitoring, and method validation is illustrated through practical applications.
Introduction
In multivariate spectroscopic calibration, overlapping spectra from multiple constituents often complicate the task of isolating signals for individual analytes. Classical univariate selectivity definitions fall short in multicomponent systems where collinearity is the rule rather than the exception. To address this, Lorber, Kowalski, and others introduced the concept of the Net Analyte Signal (NAS), a vector-based metric that isolates the part of the spectral signal that is unique to the analyte of interest (1–4).
NAS provides insight into the specificity and interpretability of multivariate models. Its framework allows us to decompose a measured spectrum into orthogonal contributions from the analyte, interfering components, and residual noise. NAS attempts to quantify how much of the measured signal is truly unique to the selected analyte of interest. (1).
Mathematical Formulation of the Net Analyte Signal
The Net Analyte Signal (NAS) is a method for extracting the portion of a signal that is uniquely attributable to the analyte of interest, independent of the effects from other chemical species or background interferences. This concept is critical in multivariate calibration, where spectral data is typically influenced by multiple overlapping components (1–4).
Why NAS?
In classical univariate calibration, the signal from a single analyte can be isolated at a specific wavelength. In multivariate systems, however, spectral overlap prevents this isolation. The NAS approach effectively projects out the interference contributions, leaving a residual component that contains information from the target analyte. Table I shows the various symbols used for NAS computation (1–4).
1. Project Out the Interference Space
To isolate the signal uniquely associated with analyte k, remove the components explained by other analytes.
2. Compute the Net Analyte Signal Direction
As more interfering components are added to ŝk,net, the NAS vector may decrease in magnitude, eventually approaching the noise floor. This property is essential when evaluating the feasibility of global calibration models.
3. Compute the NAS Vector for an Unknown Sample
4. Estimate the Analyte Concentration
Interpretation
Relation of NAS to Other Methods
Net Analyte Signal (NAS) Performance Metrics
The paper by Lorber, Faber, and Kowalski (1) defines three key analytical performance metrics derived directly from the Net Analyte Signal (NAS) formalism. Table II summarizes these NAS performance metrics (1–4).
1. Selectivity (SELk)
Selectivity quantifies how uniquely the analyte's signal stands apart from interfering components. It is defined as the cosine of the angle between the analyte signal and its NAS vector.
Formula:
Interpretation:
This is a normalized projection, indicating how much of uklies in its unique, interference-free subspace.
2. Sensitivity (SENk)
Sensitivity reflects the magnitude of the NAS response per unit concentration of analyte k. It's simply the norm of the NAS direction vector, assuming the analyte signal is linear with concentration. When SENk is small, even moderate noise levels can severely degrade prediction performance, guiding decisions on model scope and acceptable levels of interference.
Formula:
Interpretation:
3. Limit of Detection (LODk)
LOD is defined as the minimum detectable concentration of analyte k, based on the level of noise in the measurement and the sensitivity of the system.
Formula:
Where:
1. Partial Least Squares (PLS) Regression Vector
Partial least squares (PLS) and principal component regression (PCR) derive predictive calibration models based on directions of maximum variance or correlation (5,6). These methods may achieve high predictive accuracy but can blur interpretability due to loading vectors that mix spectral analyte and interferent contributions.
The PLS regression vector for analyte k is:
Where:
wₖ is the weight vector to latent variable k
tₖ is the latent variable score (component) for k
This equation implies that the PLS regression vector is a linear combination of component weight vectors and their associated scores.
Matrix Form:
Let:
2. Net Analyte Signal (NAS) and Orthogonal Projection
Orthogonal Signal Correction (OSC) removes variation in X that is orthogonal to the response y. This improves specificity by filtering out unrelated interferents (1–6).
Standard Form:
The orthogonal signal-corrected matrix (XOSC) is obtained by removing variation in Xorthogonal to the analyte signal space for analyte j in this case:
3. Sparse Modeling (LASSO / Elastic Net)
Sparse methods such as LASSO and Elastic Net enhance NAS by reducing collinearity and improving interpretability. These methods find solutions by minimizing interferences:
Why the Net Analyte Signal (NAS) Was Needed
In traditional spectroscopy, quantification of analyte concentration has relied heavily on either univariate calibration (for example, the Beer-Lambert law applied to a single wavelength) or multivariate methods such as principal component regression (PCR) and partial least squares (PLS). While these multivariate techniques improved prediction accuracy by leveraging entire spectra, they lacked explicit quantification of how much of the signal was truly attributable to the specific analyte of interest (1–4).
The key challenge arises when spectra exhibit significant collinearity—that is, spectral overlap—between analyte and matrix components. Under such conditions, regression methods often yield models that custom-fit a specific data set quite well but remain vulnerable to systematic bias or overfitting; thus do not provide a generalized solution. These models may also respond to variance from interferents rather than analyte-specific features, especially when different analyte concentrations are correlated in the calibration set.
To simplify the problem, think of a typical wet chemical reference method that measures a specific analyte of interest; if none of the analyte is present, a sound analytical method should yield none of that analyte as a result. When using spectra and multivariate analysis, there is typically an analytical result greater than zero for an analyte even when none is present.
Because of this serious specificity issue, there was (and is) a pressing need for a metric that would:
This was the gap that the Net Analyte Signal (NAS) concept was developed to address.
Research Origins and Development
The NAS concept was introduced and formalized in the 1980s and 1990s by Lorber, Kowalski, and co-workers (1,3). Their insight was to reinterpret the measured spectrum as a vector in multidimensional space and to partition this vector into three orthogonal components:
By constructing an orthogonal projection onto the complement of the interferent subspace, they mathematically isolated the "net" contribution of the analyte signal. This projection is conceptually akin to a “specificity filter,” separating shared (collinear) variance from unique variance.
Later developments expanded the NAS theory to assess figures of merit (FOMs) described in Table II, such as (3):
The method was also extended to multiple analytes, multicomponent resolution, and wavelength selection strategies.
Effectiveness of the NAS Approach
NAS has proven to be a powerful and interpretable metric for:
In contrast to latent variable models, NAS provides a physically interpretable signal component and helps prevent reliance on coincidental correlations. For this reason, NAS-based validation is especially useful in:
Example Applications
The NAS method has been employed across a wide variation of analytical contexts:
1. Pharmaceutical Quality Control
2. Clinical Chemistry
3. Industrial Process Control
4. Food and Agricultural Analysis
Future Research Directions
Despite its robust mathematical foundation, the NAS method continues to evolve as spectroscopy intersects with modern data science. Several emerging research directions include:
1. Integration with Deep Learning
While deep learning models can deliver high prediction accuracy, they often lack interpretability. Combining NAS with neural networks may enhance explainability by constraining models to NAS-derived subspaces. This can improve confidence in clinical or forensic applications.
2. NAS in Multi-Block Data Fusion
In systems where multiple data sources (e.g., NIR + Raman, or spectral + imaging) are available, NAS could help isolate analyte-specific features across blocks. Multi-block NAS formulations may improve diagnostics in omics, metabolomics, and environmental sensing.
3. Real-Time Adaptive Models
In adaptive calibration or model updating scenarios, NAS can guide selective retraining of the model by highlighting spectral regions where specificity has degraded. This could prove vital in wearable devices or embedded analyzers.
4. Sparse and Orthogonal Fusion
Modern trends favor sparse models for interpretability and robustness. NAS can be extended to sparse representations, allowing variable selection based on analyte specificity rather than regression weight alone. Orthogonal NAS components may also enhance interpretability in hybrid modeling.
5. Automated Variable Selection and Interpretability Tools
NAS can serve as a criterion for selecting variables (wavelengths or features) that are both predictive and specific to the analyte. Integration with recursive feature elimination or genetic algorithms could further improve model transparency and reduce overfitting.
Conclusion
Although the NAS makes a valiant effort to resolve analyte specificity in multivariate calibrations, a robust method to create models that are as specific to the analyte as univariate primary analysis methods that will pass regulatory scrutiny has yet to be developed and widely demonstrated.
References
(1) Lorber, A.; Faber, K.; Kowalski, B. R. Net Analyte Signal Calculation in Multivariate Calibration. Anal. Chem. 1997, 69 (8), 1620–1626. DOI: 10.1021/ac960862b
(2) Ferré, J.; Faber, N. K. M. Net Analyte Signal Calculation for Multivariate Calibration. Chemom. Intell. Lab. Syst. 2003, 69 (1–2), 123–136. DOI: 10.1016/S0169-7439(03)00118-7
(3) Lorber, A. Error Propagation and Figures of Merit for Quantification by Solving Matrix Equations. Anal. Chem. 1986, 58 (6), 1167–1172. PDF available at this link (accessed 2025-06-20).
(4) Ferré, J.; Brown, S. D.; Rius, F. X. Improved Calculation of the Net Analyte Signal in Inverse Multivariate Calibration. J. Chemom. 2001, 15 (6), 537–553. DOI: 10.1002/cem.647
(5) Haaland, D. M.; Thomas, E. V. Partial Least-Squares Methods for Spectral Analyses. Anal. Chem. 1988, 60 (11), 1193–1202. DOI: 10.1021/ac00162a020
(6) Wold, S.; Sjöström, M.; Eriksson, L. PLS-Regression: A Basic Tool of Chemometrics. Chemom. Intell. Lab. Syst. 2001, 58 (2), 109–130. DOI: 10.1016/S0169-7439(01)00155-1
(7) Rajalahti, T.; Kvalheim, O. M. Multivariate Data Analysis in Pharmaceutics: A Tutorial Review. Int. J. Pharm. 2011, 417 (1–2), 280–290. DOI: 10.1016/j.ijpharm.2011.02.019
(8) Wise, B. M.; Gallagher, N. B. The Process Chemometrics Approach to Process Monitoring and Fault Detection. J. Process Control 1996, 6 (6), 329–348. DOI: 10.1016/0959-1524(96)00009-1
_ _ _
This article was partially constructed with the assistance of a generative AI model and has been carefully edited and reviewed for accuracy and clarity.
Get essential updates on the latest spectroscopy technologies, regulatory standards, and best practices—subscribe today to Spectroscopy.
AI-Powered Fusion Model Improves Detection of Microplastics in the Atmosphere
July 17th 2025Researchers from Nanjing University of Information Science & Technology have introduced a breakthrough AI-enhanced multimodal strategy for real-time detection of polyamide microplastics contaminated with heavy metals.
High-Speed Immune Cell Identification Using New Advanced Raman BCARS Spectroscopy Technique
July 16th 2025Irish researchers have developed a lightning-fast, label-free spectroscopic imaging method capable of classifying immune cells in just 5 milliseconds. Their work with broadband coherent anti-Stokes Raman scattering (BCARS) pushes the boundaries of cellular analysis, potentially transforming diagnostics and flow cytometry.