Chemometrics in Spectroscopy - Linearity in Calibration: Other Tests for Non-Linearity

Jerome Workman Jr.;

Chemometrics in Spectroscopy - Linearity in Calibration: Other Tests for Non-Linearity

April 1, 2005

By Jerome Workman Jr.

Article

Spectroscopy

SpectroscopySpectroscopy-04-01-2005

Volume 20

Issue 4

This third part in a series on non-linearity looks at other tests and how they can be applied in laboratories that must meet FDA regulations.

We continue here what our last column started (1): discussions of other ways to test data for non-linearity. We'll begin by reviewing what we want to test. FDA/ICH guidelines, starting from a univariate perspective, considers the relationship between the actual analyte concentration and what they generically call the "test result," a term that is independent of the technology used to ascertain the analyte concentration. This term therefore holds good for every analytical methodology from manual wet chemistry to the latest high-tech instrument. In the end, even the latest instrumental methods have to produce a number, representing the final answer for that instrument's quantitative assessment of the concentration, which is the test result from that instrument. This is a univariate concept to be sure, but the same concept that applies to all other analytical methods. Things might change in the future, but currently this is the way analytical results are reported and evaluated.

The question to be answered, then, is that for any given method of analysis, is the relationship between instrument readings (test results) and the actual concentration linear?

Three tests of this characteristic were discussed in previous columns on this topic — the FDA/ICH recommendation of linear regression with a report of various regression statistics, visual inspection of a plot of test results versus the actual concentrations, and use of the Durbin-Watson statistic. Because we analyzed these tests previously we will not discuss them further here, but a summary is provided in Table I, along with other tests for non-linearity that we explain and discuss in this column.

We now proceed to present various linearity tests that can be found in the statistical literature.

F-Test

Figure 1 shows a schematic representation of the

-test for linearity. Note that there are some similarities to the Durbin-Watson test. The key difference between this test and the Durbin-Watson test is that in order to use the

-test as a test for (non)linearity, you must have measured many repeat samples at each value of the analyte. The variabilities of the readings for each sample are pooled, providing an estimate of the within-sample variance. This is indicated by the label "Operative difference for denominator." By analysis of variance, we know that the total variation of residuals around the calibration line is the sum of the within-sample variance (

_within

) plus the variance of the means around the calibration line. Now, if the residuals truly are random, unbiased, and in particular if the model is linear, then we know that the means for each sample will cluster randomly around the calibration line, and that their variance will equal

_within

^1/2

(indicated by the label "Operative difference for numerator"). The ratio of these two variances will be distributed as the F-distribution, with an expected value of unity. If there is non-linearity, such as is shown in Figure 1, then the variance corresponding to the means will be inflated by the systematic offset of each sample, and the computed F-ratio will be statistically significantly larger than unity.

Table I. Various tests for (non)linearity that have been proposed and a summary of their characteristics.

This test thus shares several characteristics with the Durbin-Watson test. It is based on well-known and rigorously sound statistics. It is amenable to automated computerized calculation and suitable for automatic operation in an automated process situation. It does not have the "fatal flaw" of the Durbin-Watson statistic.

On the other hand, it shares some of the disadvantages of the Durbin-Watson statistic. It also is based upon a comparison of variances, so that it is of low statistical power. It requires many more samples and readings than the Durbin-Watson statistic, because each sample must be measured many times. In general it is not applicable to historical data, because the data must have been collected using the proper protocols, and rarely are so many readings taken for each sample as this test requires. It also is not specific for non-linearity. Outliers, poorly fitting models, bias, or error in the reference values, or other defects of the data can appear to be non-linearity.

Figure 1. Schematic representation of the residuals of the F-test.

Normality of Residuals

In a well-behaved calibration model, residuals will have a normal (that is, Gaussian) distribution. In fact, as we have previously discussed, least-squares regression analysis also is a maximum likelihood method, but only when the errors are normally distributed. If the data does not follow the straight line model then there will be an excessive number of residuals with too-large values, and the residuals then will not follow the normal distribution. It follows, then, that a test for normality of residuals also will detect non-linearity.

Over time, statisticians have devised many tests for the distributions of data, including one that relies on visual inspection of a particular type of graph. Of course, this is no more than the direct visual inspection of the data or of the calibration residuals themselves. However, a statistical test also is available — the χ² test for distributions, which we have described previously. This test could be applied to the question, but shares many of the disadvantages of the F-test and other tests. The main difficulty is the practical one: this test is very insensitive and therefore requires a large number of samples and a large departure from linearity in order for this test to be able to detect it. Also, like the F-test it is not specific for non-linearity, false positive indication can be triggered by other types of defects in the data.

We continue in our next column with an explanation of a new test that has been devised, which overcomes the limitations of the various tests we already have described.

Reference

1. H. Mark and J. Workman,

Spectroscopy

(3), 34-39 (2005).

Jerome Workman Jr. serves on the Editorial Advisory Board of Spectroscopy and is director of research, technology, and applications development for the Molecular Spectroscopy & Microanalysis division of Thermo Electron Corp. He can be reached by e-mail at: jerry.workman@thermo.com.

Jerome Workman Jr.

Howard Mark serves on the Editorial Advisory Board of Spectroscopy and runs a consulting service, Mark Electronics (Suffern, NY). He can be reached via e-mail at: hlmark@prodigy.net.

Howard Mark

Articles in this issue

Chemometrics in Spectroscopy - Linearity in Calibration: Other Tests for Non-Linearity

Photonic Bandgap Fibers in Gas Detection

Characterization of Archaeological Findings by FT-IR Spectroscopy

The Baseline - Eyes: The Body's Own Spectroscopes

Tech Note: Diffractive MEMS in Spectroscopy

End of the Spectrum: Science Searches for the Truth about Tut

Get essential updates on the latest spectroscopy technologies, regulatory standards, and best practices—subscribe today to Spectroscopy.

Subscribe Now!

Related Content

Drone-mountrd Infrared camera sees invisible methane leaks in real time © DigitalSpace -chronicles-stock.adobe.com

Drone-Mounted Infrared Camera Sees Invisible Methane Leaks in Real Time

Jerome Workman, Jr.

July 9th 2025

Article

Researchers in Scotland have developed a drone-mounted infrared imaging system that can detect and map methane gas leaks in real time from up to 13.6 meters away. The innovative approach combines laser spectroscopy with infrared imaging, offering a safer and more efficient tool for monitoring pipeline leaks and greenhouse gas emissions.

Drone with spectroscopy reveals hidden threats to soybean crops in China © Та -chronicles-stock.adobe.com

How Spectroscopy Drones Are Detecting Hidden Crop Threats in China’s Soybean Fields

Jerome Workman, Jr.

July 8th 2025

Article

Researchers in Northeast China have demonstrated a new approach using drone-mounted multispectral imaging to monitor and predict soybean bacterial blight disease, offering a promising tool for early detection and yield protection.

Hand of male holding soil in the hands for planting. | Image Credit: © krisana - stock.adobe.com

Radar and Soil Spectroscopy Boost Soil Carbon Predictions in Brazil’s Semi-Arid Regions

Will Wetzel

July 7th 2025

Article

A new study published in Geoderma demonstrates that combining soil spectroscopy with radar-derived vegetation indices and environmental data significantly improves the accuracy of soil organic carbon predictions in Brazil’s semi-arid regions.

How spectroscopy-based drones and AI are transforming deep soil moisture monitoring © aubriella -chronicles-stock.adobe.com

Advancing Deep Soil Moisture Monitoring with AI-Powered Spectroscopy Drones

Jerome Workman, Jr.

July 7th 2025

Article

A Virginia Tech study has combined drone-mounted NIR hyperspectral imaging (400 nm to 1100 nm) and AI to estimate soil moisture at root depths with remarkable accuracy, paving the way for smarter irrigation and resilient farming.

AI Boosts SERS for Next Generation Biomedical Breakthroughs

Jerome Workman, Jr.

July 2nd 2025

Article

Researchers from Shanghai Jiao Tong University are harnessing artificial intelligence to elevate surface-enhanced Raman spectroscopy (SERS) for highly sensitive, multiplexed biomedical analysis, enabling faster diagnostics, imaging, and personalized treatments.

Artificial Intelligence Accelerates Molecular Vibration Analysis, Study Finds

Jerome Workman, Jr.

July 1st 2025

Article

A new review led by researchers from MIT and Oak Ridge National Laboratory outlines how artificial intelligence (AI) is transforming the study of molecular vibrations and phonons, making spectroscopic analysis faster, more accurate, and more accessible.

Chemometrics in Spectroscopy - Linearity in Calibration: Other Tests for Non-Linearity

F-Test

Normality of Residuals

Reference

Newsletter