In statistics, the strictly standardized mean difference (SSMD) is a measure of effect size. It is the mean divided by the standard deviation of a difference between two random values each from one of two groups. It was initially proposed for quality control[1] and hit selection[2] in high-throughput screening (HTS) and has become a statistical parameter measuring effect sizes for the comparison of any two groups with random values.[3]
In high-throughput screening (HTS), quality control (QC) is critical. An important QC characteristic in a HTS assay is how much the positive controls, test compounds, and negative controls differ from one another. This QC characteristic can be evaluated using the comparison of two well types in HTS assays. Signal-to-noise ratio (S/N), signal-to-background ratio (S/B), and the Z-factor have been adopted to evaluate the quality of HTS assays through the comparison of two investigated types of wells. However, the S/B does not take into account any information on variability; and the S/N can capture the variability only in one group and hence cannot assess the quality of assay when the two groups have different variabilities.[1] Zhang JH et al. proposed the Z-factor.[4] The advantage of the Z-factor over the S/N and S/B is that it takes into account the variabilities in both compared groups. As a result, the Z-factor has been broadly used as a QC metric in HTS assays. The absolute sign in the Z-factor makes it inconvenient to derive its statistical inference mathematically.
To derive a better interpretable parameter for measuring the differentiation between two groups, Zhang XHD[1] proposed SSMD to evaluate the differentiation between a positive control and a negative control in HTS assays. SSMD has a probabilistic basis due to its strong link with d+-probability (i.e., the probability that the difference between two groups is positive).[2] To some extent, the d+-probability is equivalent to the well-established probabilistic index P(X > Y) which has been studied and applied in many areas.[5] [6] [7] [8] [9] Supported on its probabilistic basis, SSMD has been used for both quality control and hit selection in high-throughput screening.[1] [2] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21]
As a statistical parameter, SSMD (denoted as
\beta
\mu1
2 | |
\sigma | |
1 |
\mu2
2 | |
\sigma | |
2 |
\sigma12.
\beta=
\mu1-\mu2 | |||||||||||||||
|
If the two groups are independent,
\beta=
\mu1-\mu2 | |||||||||||||||
|
\sigma2
\beta=
\mu1-\mu2 | |
\sqrt{2 |
\sigma}.
In the situation where the two groups are correlated, a commonly used strategy to avoid the calculation of
\sigma12
D
\muD
2 | |
\sigma | |
D |
\beta=
\muD | |
\sigmaD |
.
\bar{X}1,\bar{X}2
2, | |
s | |
1 |
2 | |
s | |
2 |
\hat{\beta}=
\bar{X | |
1 |
-\bar{X}2}{\sqrt{s
2}}. | |
2 |
When the two groups have normal distributions with equal variance,the uniformly minimal variance unbiased estimate(UMVUE) of SSMD is,[10]
\hat{\beta}=
\bar{X | |
1 |
-
\bar{X} | ||||
|
((n1-1)
2 | |
s | |
1 |
+(n2-1)
2)}}, | |
s | |
2 |
n1,n2
K ≈ n1+n2-3.48
In the situation where the two groups are correlated, based on a paired difference with a sample size
n
\bar{D}
2 | |
s | |
D |
\hat{\beta}=
\bar{D | |
\hat{\beta}=
| \sqrt{ | |||||
|
2 | |
n-1 |
SSMD looks similar to t-statistic and Cohen's d, but they are different with one another as illustrated in.[3]
SSMD is the ratio of mean to the standard deviation of the difference between two groups. When the data is preprocessed using log-transformation as we normally do in HTS experiments, SSMD is the mean of log fold change divided by the standard deviation of log fold change with respect to a negative reference. In other words, SSMD is the average fold change (on the log scale) penalized by the variability of fold change (on the log scale)[23] . For quality control, one index for the quality of an HTS assay is the magnitude of difference between a positive control and a negative reference in an assay plate. For hit selection, the size of effects of a compound (i.e., a small molecule or an siRNA) is represented by the magnitude of difference between the compound and a negative reference. SSMD directly measures the magnitude of difference between two groups. Therefore, SSMD can be used for both quality control and hit selection in HTS experiments.
\bar{X}P,\bar{X}N
2, | |
s | |
P |
2 | |
s | |
N |
nP,nN
\hat{\beta}=
\bar{X | |
P |
-
\bar{X} | ||||
|
((nP-1)
2 | |
s | |
P |
+(nN-1)
2)}}, | |
s | |
N |
K ≈ nP+nN-3.48
\hat{\beta}=
\bar{X | |
P |
-\bar{X}N}{\sqrt{s
2}}. | |
N |
\hat{\beta}=
\tilde{X | |
P |
-\tilde{X}N}{1.4826
2 | |
\sqrt{\tilde{s} | |
P |
+
2}}, | |
\tilde{s} | |
N |
\tilde{X}P,\tilde{X}N,\tilde{s}P,\tilde{s}N
The Z-factor based QC criterion is popularly used in HTS assays. However, it has been demonstrated that this QC criterion is most suitable for an assay with very or extremely strong positive controls.[10] In an RNAi HTS assay, a strong or moderate positive control is usually more instructive than a very or extremely strong positive control because the effectiveness of this control is more similar to the hits of interest. In addition, the positive controls in the two HTS experiments theoretically have different sizes of effects. Consequently, the QC thresholds for the moderate control should be different from those for the strong control in these two experiments. Furthermore, it is common that two or more positive controls are adopted in a single experiment.[11] Applying the same Z-factor-based QC criteria to both controls leads to inconsistent results as illustrated in the literatures.[10] [11]
The SSMD-based QC criteria listed in the following table[20] take into account the effect size of a positive control in an HTS assay where the positive control (such as an inhibition control) theoretically has values less than the negative reference.
Quality Type | A: Moderate Control | B: Strong Control | C: Very Strong Control | D: Extremely Strong Control | |
---|---|---|---|---|---|
Excellent | \beta\le-2 | \beta\le-3 | \beta\le-5 | \beta\le-7 | |
Good | -2<\beta\le-1 | -3<\beta\le-2 | -5<\beta\le-3 | -7<\beta\le-5 | |
Inferior | -1<\beta\le-0.5 | -2<\beta\le-1 | -3<\beta\le-2 | -5<\beta\le-3 | |
Poor | \beta>-0.5 | \beta>-1 | \beta>-2 | \beta>-3 |
In application, if the effect size of a positive control is known biologically, adopt the corresponding criterion based on this table. Otherwise, the following strategy should help to determine which QC criterion should be applied: (i) in many small molecule HTS assay with one positive control, usually criterion D (and occasionally criterion C) should be adopted because this control usually has very or extremely strong effects; (ii) for RNAi HTS assays in which cell viability is the measured response, criterion D should be adopted for the controls without cells (namely, the wells with no cells added) or background controls; (iii) in a viral assay in which the amount of viruses in host cells is the interest, criterion C is usually used, and criterion D is occasionally used for the positive control consisting of siRNA from the virus.[20]
Similar SSMD-based QC criteria can be constructed for an HTS assay where the positive control (such as an activation control) theoretically has values greater than the negative reference. More details about how to apply SSMD-based QC criteria in HTS experiments can be found in a book.[20]
In an HTS assay, one primary goal is to select compounds with a desired size of inhibition or activation effect. The size of the compound effect is represented by the magnitude of difference between a test compound and a negative reference group with no specific inhibition/activation effects. A compound with a desired size of effects in an HTS screen is called a hit. The process of selecting hits is called hit selection. There are two main strategies of selecting hits with large effects.[20] One is to use certain metric(s) to rank and/or classify the compounds by their effects and then to select the largest number of potent compounds that is practical for validation assays.[17] [19] [22] The other strategy is to test whether a compound has effects strong enough to reach a pre-set level. In this strategy, false-negative rates (FNRs) and/or false-positive rates (FPRs) must be controlled.[14] [15] [16] [25] [26]
SSMD can not only rank the size of effects but also classify effects as shown in the following table based on the population value (
\beta
Effect subtype | Thresholds for negative SSMD | Thresholds for positive SSMD |
---|---|---|
Extremely strong | \beta\le-5 | \beta\ge5 |
Very strong | -5<\beta\le-3 | 5>\beta\ge3 |
Strong | -3<\beta\le-2 | 3>\beta\ge2 |
Fairly strong | -2<\beta\le-1.645 | 2>\beta\ge1.645 |
Moderate | -1.645<\beta\le-1.28 | 1.645>\beta\ge1.28 |
Fairly moderate | -1.28<\beta\le-1 | 1.28>\beta\ge1 |
Fairly weak | -1<\beta\le-0.75 | 1>\beta\ge0.75 |
Weak | -0.75<\beta<-0.5 | 0.75>\beta>0.5 |
Very weak | -0.5\le\beta<-0.25 | 0.5\ge\beta>0.25 |
Extremely weak | -0.25\le\beta<0 | 0.25\ge\beta>0 |
No effect | \beta=0 |
In a primary screen without replicates, assuming the measured value (usually on the log scale) in a well for a tested compound is
Xi
nN
\bar{X}N
\tilde{X}N
sN
\tilde{s}N
SSMD=
Xi-\bar{X | |
N}{s |
N\sqrt{2(nN-1)/K}},
K ≈ nN-2.48
SSMD*=
Xi-\tilde{X | |
N}{1.4826\tilde{s} |
N\sqrt{2(nN-1)/K}}
In a confirmatory or primary screen with replicates, for the i-th test compound with
n
\bar{d}i
2 | |
s | |
i |
SSMD=
| \sqrt{ | |||||
|
2 | |
n-1 |