Trueness is the closeness of agreement between the average of an infinite number of replicate measured quantity values and a reference quantity value.
Trueness is not a quantity and therefore cannot be expressed numerically. Rather it is expressed as bias.
Bias is a measure of a systematic measurement error, the component of measurement error that remains constant in replicate measurements.
The bias can be expressed in absolute measurement units or as a percentage relative to the known value. A point estimate is a single value that is the best estimate of the true unknown parameter; a confidence interval is a range of values and indicates the uncertainty of the estimate.
The bias is an estimate of the true unknown bias in a single study. If the study were repeated, the estimate would be expected to vary from study to study. Therefore, if a single estimate is compared directly to 0 or compared to the allowable bias the statement is only applicable to the single study. To make inferences about the true unknown bias you must perform a hypothesis test:
The null hypothesis states that the bias is equal to 0, against the alternative hypothesis that it is not equal zero. When the test p-value is small, you can reject the null hypothesis and conclude that the bias is different to zero.
It is important to remember that a statistically significant p-value tells you nothing about the practical importance of what was observed. For a large sample, the bias for a statistically significant hypothesis test may be so small as to be practically useless. Conversely, although there may some evidence of bias, the sample size may be too small for the test to reach statistical significance, and you may miss an opportunity to discover a true meaningful bias. Lack of evidence against the null hypothesis does not mean it has been proven to be true, the belief before you perform the study is that the null hypothesis is true and the purpose is to look for evidence against it. An equality test at the 5% significance level is equivalent to comparing a 95% confidence interval to see if it includes zero.
The null hypothesis states that the bias is outside an interval of practical equivalence, against the alternative hypothesis that the bias is within the interval considered practically equivalent. When the test p-value is small, you can reject the null hypothesis and conclude that the bias is practically equivalent, and within the specified interval.
An equivalence test is used to prove a bias requirement can be met. The null hypothesis states the methods are not equivalent and looks for evidence that they are in fact equivalent. An equivalence hypothesis test at the 5% significance level is the same as comparing the 90% confidence interval to the allowable bias interval.
Estimate the trueness of a measurement system or measurement procedure.
Test if the value of the reference material is equal to the assigned value; that is the bias is zero.
It is possible for the bias from a study to be different to 0, but for the difference to be due to the random error in your study. The smaller your study, the larger the bias has to be to be declared statistically significant. Also, just because bias is statistically significant does not mean that it is of concern, it may still meet your allowable bias performance requirements.
Test if the bias meets performance requirements; that is the bias is less than allowable bias.
If the bias from a study is less than the allowable bias, it meets the performance requirements, but you cannot make any statements relating to probability. An equivalence test allows you to make statements with a given level of confidence about what you observed.
Nonlinear bias is a component of bias that cannot be represented by a linear relationship between the measured and true values.
A measurement procedure is linear when there is a mathematically verified straight-line relationship between the measured and true values. It is an important parameter as it allows linear interpolation of results between points.
Bias due to nonlinearity is measured as the difference between the linear fit and the best fitting polynomial fit.
Determine whether a measurement system or procedure provides measured quantity values (within a measuring interval) that are directly proportional to the true value.
Determine whether a measurement system or procedure provides measured quantity values (within a measuring interval) that are directly proportional.
Interference bias is a component of bias caused by nonspecificity attributable to the presence of a specific interfering substance.
Measurement systems analysis study requirements and dataset layout.
Use a column for the measured variable (Measured value) and optionally a by variable (Level); each row is a separate measurement.
| Level (optional) | Measured value |
|---|---|
| 120 | 121 |
| 120 | 118 |
| 120 | 124 |
| 120 | 120 |
| 120 | 116 |
| … | … |
| 240 | 240 |
| 240 | 246 |
| 240 | 232 |
| 240 | 241 |
| 240 | 240 |
| … | … |
Use a column for the measured variable (Measured value), and 1 columns for the random factor (Run), and optionally a by variable (Level); each row is a separate measurement.
| Level (optional) | Run | Measured value |
|---|---|---|
| 120 | 1 | 121 |
| 120 | 1 | 118 |
| 120 | 1 | … |
| 120 | 2 | 120 |
| 120 | 2 | 116 |
| 120 | 2 | … |
| 120 | 3 | … |
| 120 | … | … |
| … | … | … |
| 240 | 1 | 240 |
| 240 | 1 | 242 |
| 240 | 1 | … |
| 240 | 2 | 260 |
| 240 | 2 | 238 |
| 240 | 2 | … |
| 240 | 3 | … |
| 240 | … | … |
| … | … | … |
Use a column for the measured variable (Measured value), and 2 columns for the random factors (Day, Run), and optionally a by variable (Level); each row is a separate measurement.
| Level (optional) | Day | Run | Measured value |
|---|---|---|---|
| 120 | 1 | 1 | 121 |
| 120 | 1 | 1 | 118 |
| 120 | 1 | 1 | … |
| 120 | 1 | 2 | 120 |
| 120 | 1 | 2 | 116 |
| 120 | 1 | 2 | … |
| 120 | 1 | 3 | … |
| 120 | 1 | … | … |
| 120 | 2 | 1 | 124 |
| 120 | 2 | 1 | 119 |
| 120 | 2 | 1 | … |
| 120 | 2 | 2 | 118 |
| 120 | 2 | 2 | 121 |
| 120 | 2 | 2 | … |
| 120 | 2 | 3 | … |
| 120 | 2 | … | … |
| … | … | … |
Use multiple columns for the replicates of the measured variable (Measured value) and a by variable (Level), each row is a separate combination of level and factors.
| Level | Measured value | ||
|---|---|---|---|
| 1 | 0.7 | 0.9 | … |
| 2 | 4.6 | 4.1 | … |
| 3 | 6.5 | 6.9 | … |
| 4 | 11 | 12.2 | … |