Measurement uncertainty should not be confused with Measurement error.
In metrology, measurement uncertainty is the expression of the statistical dispersion of the values attributed to a quantity measured on an interval or ratio scale.
All measurements are subject to uncertainty and a measurement result is complete only when it is accompanied by a statement of the associated uncertainty, such as the standard deviation. By international agreement, this uncertainty has a probabilistic basis and reflects incomplete knowledge of the quantity value. It is a non-negative parameter.
The measurement uncertainty is often taken as the standard deviation of a state-of-knowledge probability distribution over the possible values that could be attributed to a measured quantity. Relative uncertainty is the measurement uncertainty relative to the magnitude of a particular single choice for the value for the measured quantity, when this choice is nonzero. This particular single choice is usually called the measured value, which may be optimal in some well-defined sense (e.g., a mean, median, or mode). Thus, the relative measurement uncertainty is the measurement uncertainty divided by the absolute value of the measured value, when the measured value is not zero.
The purpose of measurement is to provide information about a quantity of interest – a measurand. Measurands on ratio or interval scales include the size of a cylindrical feature, the volume of a vessel, the potential difference between the terminals of a battery, or the mass concentration of lead in a flask of water.
No measurement is exact. When a quantity is measured, the outcome depends on the measuring system, the measurement procedure, the skill of the operator, the environment, and other effects.[1] Even if the quantity were to be measured several times, in the same way and in the same circumstances, a different measured value would in general be obtained each time, assuming the measuring system has sufficient resolution to distinguish between the values.
The dispersion of the measured values would relate to how well the measurement is performed. If measured on a ratio or interval scale, their average would provide an estimate of the true value of the quantity that generally would be more reliable than an individual measured value. The dispersion and the number of measured values would provide information relating to the average value as an estimate of the true value. However, this information would not generally be adequate.
The measuring system may provide measured values that are not dispersed about the true value, but about some value offset from it. Take a domestic bathroom scale. Suppose it is not set to show zero when there is nobody on the scale, but to show some value offset from zero. Then, no matter how many times the person's mass were re-measured, the effect of this offset would be inherently present in the average of the values.
The to the Expression of Uncertainty in Measurement" (commonly known as the GUM) is the definitive document on this subject. The GUM has been adopted by all major National Measurement Institutes (NMIs) and by international laboratory accreditation standards such as ISO/IEC 17025 General requirements for the competence of testing and calibration laboratories, which is required for international laboratory accreditation, and is employed in most modern national and international documentary standards on measurement methods and technology. See Joint Committee for Guides in Metrology.
Measurement uncertainty has important economic consequences for calibration and measurement activities. In calibration reports, the magnitude of the uncertainty is often taken as an indication of the quality of the laboratory, and smaller uncertainty values generally are of higher value and of higher cost. The American Society of Mechanical Engineers (ASME) has produced a suite of standards addressing various aspects of measurement uncertainty. For example, ASME standards are used to address the role of measurement uncertainty when accepting or rejecting products based on a measurement result and a product specification,[2] to provide a simplified approach (relative to the GUM) to the evaluation of dimensional measurement uncertainty,[3] to resolve disagreements over the magnitude of the measurement uncertainty statement,[4] and to provide guidance on the risks involved in any product acceptance/rejection decision.[5]
The above discussion concerns the direct measurement of a quantity, which incidentally occurs rarely. For example, the bathroom scale may convert a measured extension of a spring into an estimate of the measurand, the mass of the person on the scale. The particular relationship between extension and mass is determined by the calibration of the scale. A measurement model converts a quantity value into the corresponding value of the measurand.
There are many types of measurement in practice and therefore many models. A simple measurement model (for example for a scale, where the mass is proportional to the extension of the spring) might be sufficient for everyday domestic use. Alternatively, a more sophisticated model of a weighing, involving additional effects such as air buoyancy, is capable of delivering better results for industrial or scientific purposes. In general there are often several different quantities, for example temperature, humidity and displacement, that contribute to the definition of the measurand, and that need to be measured.
Correction terms should be included in the measurement model when the conditions of measurement are not exactly as stipulated. These terms correspond to systematic errors. Given an estimate of a correction term, the relevant quantity should be corrected by this estimate. There will be an uncertainty associated with the estimate, even if the estimate is zero, as is often the case. Instances of systematic errors arise in height measurement, when the alignment of the measuring instrument is not perfectly vertical, and the ambient temperature is different from that prescribed. Neither the alignment of the instrument nor the ambient temperature is specified exactly, but information concerning these effects is available, for example the lack of alignment is at most 0.001° and the ambient temperature at the time of measurement differs from that stipulated by at most 2 °C.
As well as raw data representing measured values, there is another form of data that is frequently needed in a measurement model. Some such data relate to quantities representing physical constants, each of which is known imperfectly. Examples are material constants such as modulus of elasticity and specific heat. There are often other relevant data given in reference books, calibration certificates, etc., regarded as estimates of further quantities.
The items required by a measurement model to define a measurand are known as input quantities in a measurement model. The model is often referred to as a functional relationship. The output quantity in a measurement model is the measurand.
Formally, the output quantity, denoted by
Y
X1,\ldots,XN
Y=f(X1,\ldots,XN),
where
f
h(Y,
X1,\ldots,XN)=0.
It is taken that a procedure exists for calculating
Y
X1,\ldots,XN
Y
See also: Propagation of uncertainty. The true values of the input quantities
X1,\ldots,XN
X1,\ldots,XN
X1,\ldots,XN
Consider estimates
x1,\ldots,xN
X1,\ldots,XN
X1,\ldots,XN
x1,\ldots,xN
X1,\ldots,XN
i
u(xi)
Xi
xi
The use of available knowledge to establish a probability distribution to characterize each quantity of interest applies to the
Xi
Y
Y
Xi
Y
The figure below depicts a measurement model
Y=X1+X2
X1
X2
Y
Once the input quantities
X1,\ldots,XN
Y
Y
Y
Y
Often an interval containing
Y
Y
Prior knowledge about the true value of the output quantity
Y
Y
Y
Y
Knowledge about an input quantity
Xi
In Type A evaluations of measurement uncertainty, the assumption is often made that the distribution best describing an input quantity
X
X
For a Type B evaluation of uncertainty, often the only available information is that
X
a
b
See main article: Sensitivity analysis.
Sensitivity coefficients
c1,\ldots,cN
y
Y
x1,\ldots,xN
X1,\ldots,XN
Y=f(X1,\ldots,XN)
ci
f
Xi
X1=x1
X2=x2
Y=c1X1+ … +cNXN,
with
X1,\ldots,XN
xi
u(xi)
ciu(xi)
y.
Y=f(X1,\ldots,XN)
|ci|u(xi)
u(y)
y
u(y)
y
Y
|ci|u(xi)
Y=f(X1,\ldots,XN)
u2(y)=
2u | |
c | |
1 |
2(x | |
1) |
+ … +
2u | |
c | |
N |
2(x | |
N), |
which is known as the law of propagation of uncertainty.
When the input quantities
Xi
u(y)
See also: Uncertainty analysis and Quality of analytical results. The main stages of uncertainty evaluation constitute formulation and calculation, the latter consisting of propagation and summarizing.The formulation stage constitutes
Y
Y
Y
The calculation stage consists of propagating the probability distributions for the input quantities through the measurement model to obtain the probability distribution for the output quantity
Y
Y
y
Y
Y
u(y)
y
Y
The propagation stage of uncertainty evaluation is known as the propagation of distributions, various approaches for which are available, including
Y
t
Y
Y
For any particular uncertainty evaluation problem, approach 1), 2) or 3) (or some other approach) is used, 1) being generally approximate, 2) exact, and 3) providing a solution with a numerical accuracy that can be controlled.
When the measurement model is multivariate, that is, it has any number of output quantities, the above concepts can be extended.[13] The output quantities are now described by a joint probability distribution, the coverage interval becomes a coverage region, the law of propagation of uncertainty has a natural generalization, and a calculation procedure that implements a multivariate Monte Carlo method is available.
See also: Confidence interval, Credible interval and Fiducial interval. The most common view of measurement uncertainty uses random variables as mathematical models for uncertain quantities and simple probability distributions as sufficient for representing measurement uncertainties. In some situations, however, a mathematical interval might be a better model of uncertainty than a probability distribution. This may include situations involving periodic measurements, binned data values, censoring, detection limits, or plus-minus ranges of measurements where no particular probability distribution seems justified or where one cannot assume that the errors among individual measurements are completely independent.
A more robust representation of measurement uncertainty in such cases can be fashioned from intervals.[14] [15] An interval [''a'', ''b''] is different from a rectangular or uniform probability distribution over the same range in that the latter suggests that the true value lies inside the right half of the range [(''a'' + ''b'')/2, ''b''] with probability one half, and within any subinterval of [''a'', ''b''] with probability equal to the width of the subinterval divided by b − a. The interval makes no such claims, except simply that the measurement lies somewhere within the interval. Distributions of such measurement intervals can be summarized as probability boxes and Dempster–Shafer structures over the real numbers, which incorporate both aleatoric and epistemic uncertainties.