Hazard ratio explained

In survival analysis, the hazard ratio (HR) is the ratio of the hazard rates corresponding to the conditions characterised by two distinct levels of a treatment variable of interest. For example, in a clinical study of a drug, the treated population may die at twice the rate of the control population. The hazard ratio would be 2, indicating a higher hazard of death from the treatment.

For example, a scientific paper might use an HR to state something such as: "Adequate COVID-19 vaccination status was associated with significantly decreased risk for the composite of severe COVID-19 or mortality with a[n] HR of 0.20 (95% CI, 0.17–0.22)."^[1] In essence, the hazard for the composite outcome was 80% lower among the vaccinated relative to those who were unvaccinated in the same study. So, for a hazardous outcome (e.g., severe disease or death), an HR below 1 indicates that the treatment (e.g., vaccination) is protective against the outcome of interest. In other cases, an HR greater than 1 indicates the treatment is favorable. For example, if the outcome is actually favorable (e.g., accepting a job offer to end a spell of unemployment), an HR greater than 1 indicates that seeking a job is favorable to not seeking one (if "treatment" is defined as seeking a job).^[2]

Hazard ratios differ from relative risks (RRs) and odds ratios (ORs) in that RRs and ORs are cumulative over an entire study, using a defined endpoint, while HRs represent instantaneous risk over the study time period, or some subset thereof. Hazard ratios suffer somewhat less from selection bias with respect to the endpoints chosen and can indicate risks that happen before the endpoint.

Definition and derivation

Regression models are used to obtain hazard ratios and their confidence intervals.^[3]

The instantaneous hazard rate is the limit of the number of events per unit time divided by the number at risk, as the time interval approaches 0:

$$h(t) = \lim_{\Delta t \to 0} \frac{\text{observed events in interval } [t, t+\Delta t] \,/\, N(t)}{\Delta t}$$

where $N(t)$ is the number at risk at the beginning of an interval. A hazard is the probability that a patient fails between $t$ and $t+\Delta t$, given that they have survived up to time $t$, divided by $\Delta t$, as $\Delta t$ approaches zero.
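The limit above can be approximated on finite data by choosing a small but nonzero interval width. The following is a minimal sketch, not a standard estimator: the function name `empirical_hazard`, the sample event times, and the assumption that every event time is observed (no censoring) are all illustrative. It also uses a half-open interval so that adjacent intervals do not count the same event twice.

```python
from typing import Sequence


def empirical_hazard(event_times: Sequence[float], t: float, dt: float) -> float:
    """Approximate h(t) as (events in [t, t + dt) / N(t)) / dt.

    Assumption: every subject's event time is observed (no censoring),
    so the number at risk N(t) is the count of event times >= t.
    """
    if dt <= 0:
        raise ValueError("dt must be positive")
    at_risk = sum(1 for x in event_times if x >= t)
    if at_risk == 0:
        raise ValueError("no subjects at risk at time t")
    events = sum(1 for x in event_times if t <= x < t + dt)
    return (events / at_risk) / dt


# Hypothetical event times (in months) for ten subjects.
times = [1.0, 2.0, 2.5, 3.0, 4.0, 4.5, 6.0, 7.0, 8.0, 9.0]

# Nine subjects are still at risk at t = 2, and two of them fail in [2, 3).
rate = empirical_hazard(times, t=2.0, dt=1.0)
print(f"approximate hazard at t=2: {rate:.4f} per month")
```

Real studies almost always involve censoring, in which case the at-risk count must also exclude subjects who left the study before time t.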

The hazard ratio is the effect on this hazard rate of a difference, such as group membership (for example, treatment or control, male or female), as estimated by regression models that treat the logarithm of the hazard rate as a function of a baseline hazard $h_0(t)$ and a linear combination of explanatory variables:

$$\log h(t) = f(h_0(t), \alpha + \beta_1 X_1 + \cdots + \beta_k X_k).$$

Such models are generally classed as proportional hazards regression models; the best known are the Cox proportional hazards model^[3] ^[4] and the exponential, Gompertz and Weibull parametric models.

For two groups that differ only in treatment condition, the ratio of the hazard functions is given by $e^\beta$, where $\beta$ is the estimate of treatment effect derived from the regression model. This hazard ratio, that is, the ratio between the predicted hazard for a member of one group and that for a member of the other group, is given by holding everything else constant, i.e. assuming proportionality of the hazard functions.
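To see where the exponential comes from, it may help to write out one concrete case. The sketch below assumes the Cox form of the model with a single binary treatment indicator $X$ (1 for treated, 0 for control); the indicator coding is an assumption made for illustration. The baseline hazard cancels in the ratio, which is why the result does not depend on $t$:

```latex
h(t \mid X) = h_0(t)\, e^{\beta X}
\quad\Longrightarrow\quad
\mathrm{HR}
  = \frac{h(t \mid X = 1)}{h(t \mid X = 0)}
  = \frac{h_0(t)\, e^{\beta}}{h_0(t)}
  = e^{\beta}
```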

For a continuous explanatory variable, the same interpretation applies to a unit difference. Other HR models have different formulations and the interpretation of the parameter estimates differs accordingly.

Interpretation

In its simplest form, the hazard ratio can be interpreted as the chance of an event occurring in the treatment arm divided by the chance of the event occurring in the control arm, or vice versa, of a study. The resolution of these endpoints is usually depicted using Kaplan–Meier survival curves. These curves show, over time, the proportion of each group in which the endpoint has not been reached. The endpoint could be any dependent variable associated with the covariate (independent variable), e.g. death, remission of disease or contraction of disease. The steepness with which a curve falls reflects how quickly the endpoint is occurring among those still at risk at each point in time (the hazard). The hazard ratio is simply the relationship between the instantaneous hazards in the two groups and represents, in a single number, the magnitude of distance between the Kaplan–Meier plots.^[5]

Hazard ratios do not reflect a time unit of the study. The difference between hazard-based and time-based measures is akin to the difference between the odds of winning a race and the margin of victory. When a study reports one hazard ratio per time period, it is assumed that the difference between groups was proportional. Hazard ratios become meaningless when this assumption of proportionality is not met.

If the proportional hazards assumption holds, a hazard ratio of one means equivalence in the hazard rate of the two groups, whereas a hazard ratio other than one indicates a difference in hazard rates between groups. The researcher indicates the probability of this sample difference being due to chance by reporting the probability associated with some test statistic.^[6] For instance, the $\beta$ from the Cox model or the log-rank test might then be used to assess the significance of any differences observed in these survival curves.^[7]

Conventionally, probabilities lower than 0.05 are considered significant, and researchers provide a 95% confidence interval for the hazard ratio, e.g. derived from the standard error of the Cox-model regression coefficient, i.e. $\beta$.^[8] The confidence interval of a statistically significant hazard ratio cannot include unity (one).
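One common way to build such an interval is the large-sample (Wald) construction: form the interval on the log scale around the coefficient and then exponentiate both ends. The sketch below assumes that construction. The function names and the example coefficient and standard error are hypothetical, not taken from any study cited here.

```python
import math
from typing import Tuple


def hazard_ratio_ci(beta: float, se: float, z: float = 1.959964) -> Tuple[float, float, float]:
    """Return (HR, lower, upper) using exp(beta +/- z * se).

    beta: estimated log hazard ratio (regression coefficient).
    se:   standard error of beta.
    z:    normal quantile; 1.959964 gives a two-sided 95% interval.
    """
    if se < 0:
        raise ValueError("standard error must be non-negative")
    hr = math.exp(beta)
    lower = math.exp(beta - z * se)
    upper = math.exp(beta + z * se)
    return hr, lower, upper


def excludes_unity(lower: float, upper: float) -> bool:
    """True when the interval does not contain 1, i.e. HR differs from 1 at that level."""
    return not (lower <= 1.0 <= upper)


# Hypothetical fit: coefficient log(0.5) with standard error 0.2.
hr, lo, hi = hazard_ratio_ci(beta=math.log(0.5), se=0.2)
print(f"HR = {hr:.2f}, 95% CI = ({lo:.2f}, {hi:.2f}), significant: {excludes_unity(lo, hi)}")
```

Because the interval is symmetric on the log scale, it is asymmetric around the hazard ratio itself, which is why published intervals such as 0.17–0.22 around 0.20 are often not centred on the point estimate.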

The proportional hazards assumption

The proportional hazards assumption for hazard ratio estimation is strong and often unreasonable.^[9] Complications, adverse effects and late effects are all possible causes of change in the hazard rate over time. For instance, a surgical procedure may have high early risk, but excellent long-term outcomes.

If the hazard ratio between groups remains constant, this is not a problem for interpretation. However, interpretation of hazard ratios becomes impossible when selection bias exists between groups. For instance, a particularly risky surgery might result in the survival of a systematically more robust group who would have fared better under any of the competing treatment conditions, making it look as if the risky procedure was better. Follow-up time is also important. A cancer treatment associated with better remission rates might on follow-up be associated with higher relapse rates. The researchers' decision about when to follow up is arbitrary and may lead to very different reported hazard ratios.^[10]

The hazard ratio and survival

Hazard ratios are often treated as a ratio of death probabilities. For example, a hazard ratio of 2 is thought to mean that a group has twice the chance of dying as a comparison group. In the Cox model, this can be shown to translate to the following relationship between group survival functions:

$$S_1(t) = S_0(t)^r$$

(where $r$ is the hazard ratio).^[11] Therefore, with a hazard ratio of 2, if $S_0(t) = 0.2$ (20% survived at time $t$), $S_1(t) = 0.2^2 = 0.04$ (4% survived at $t$). The corresponding death probabilities are 0.8 and 0.96. It should be clear that the hazard ratio is a relative measure of effect and tells us nothing about absolute risk.^[12]
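The arithmetic in this example can be reproduced in a few lines. This is a minimal sketch of the power relationship between the two survival functions under proportional hazards; the function name `survival_under_hr` is hypothetical.

```python
def survival_under_hr(s0: float, hr: float) -> float:
    """Survival in the comparison group, S1(t) = S0(t) ** hr.

    s0: baseline survival probability at time t, between 0 and 1.
    hr: hazard ratio of the comparison group relative to baseline.
    """
    if not 0.0 <= s0 <= 1.0:
        raise ValueError("s0 must be a probability")
    if hr <= 0:
        raise ValueError("hazard ratio must be positive")
    return s0 ** hr


s0 = 0.2                              # 20% of the baseline group survive to t
s1 = survival_under_hr(s0, hr=2.0)    # survival in the group with doubled hazard

death0 = 1.0 - s0
death1 = 1.0 - s1
print(f"survival: {s0:.2f} vs {s1:.2f}; death: {death0:.2f} vs {death1:.2f}")
print(f"ratio of death probabilities: {death1 / death0:.2f}")
```

The ratio of the death probabilities here is 0.96 / 0.8 = 1.2, not 2, which illustrates why a hazard ratio should not be read as a ratio of death probabilities.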

While hazard ratios allow for hypothesis testing, they should be considered alongside other measures for interpretation of the treatment effect, e.g. the ratio of median times (median ratio) at which treatment and control group participants are at some endpoint. If the analogy of a race is applied, the hazard ratio is equivalent to the odds that an individual in the group with the higher hazard reaches the end of the race first. The probability of being first can be derived from the odds, which is the probability of being first divided by the probability of not being first:

$$\mathrm{HR} = \frac{P}{1 - P};$$

conversely,

$$P = \frac{\mathrm{HR}}{1 + \mathrm{HR}}.$$

In the previous example, a hazard ratio of 2 corresponds to a 67% chance of an early death. The hazard ratio does not convey information about how soon the death will occur.
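The conversion between the hazard ratio, read as odds, and the probability of reaching the endpoint first is a one-line calculation in each direction. The helper names below are hypothetical and the snippet is only a sketch of the two formulas above.

```python
def prob_first_from_hr(hr: float) -> float:
    """P = HR / (1 + HR): probability that the higher-hazard individual fails first."""
    if hr <= 0:
        raise ValueError("hazard ratio must be positive")
    return hr / (1.0 + hr)


def hr_from_prob_first(p: float) -> float:
    """HR = P / (1 - P): the odds corresponding to a probability of being first."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must be strictly between 0 and 1")
    return p / (1.0 - p)


p = prob_first_from_hr(2.0)
print(f"HR = 2 corresponds to P = {p:.2%}")
print(f"round trip back to HR: {hr_from_prob_first(p):.2f}")
```

A hazard ratio of 1 maps to a probability of 0.5, the even-odds case in which neither group tends to reach the endpoint first.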

The hazard ratio, treatment effect and time-based endpoints

Treatment effect depends on the survival function of the underlying disease, not just on the hazard ratio. Since the hazard ratio does not give us direct time-to-event information, researchers have to report median endpoint times and calculate the median endpoint time ratio by dividing the control group median value by the treatment group median value.

While the median endpoint ratio is a relative speed measure, the hazard ratio is not. The relationship between treatment effect and the hazard ratio is given as $e^\beta$. A statistically significant but practically insignificant effect can produce a large hazard ratio, e.g. a treatment increasing the number of one-year survivors in a population from one in 10,000 to one in 1,000 has a hazard ratio of 10. It is unlikely that such a treatment would have had much impact on the median endpoint time ratio, which likely would have been close to unity, i.e. mortality was largely the same regardless of group membership and clinically insignificant.

By contrast, a treatment group in which 50% of infections are resolved after one week (versus 25% in the control) yields a hazard ratio of two. If it takes ten weeks for all cases in the treatment group and half of cases in the control group to resolve, the ten-week hazard ratio remains at two, but the median endpoint time ratio is ten, a clinically significant difference.

Notes and References

1. Najjar-Debbiny, R.; Gronich, N.; Weber, G.; Khoury, J.; Amar, M.; Stein, N.; Goldstein, L. H.; Saliba, W. (2022-06-02). "Effectiveness of Paxlovid in Reducing Severe COVID-19 and Mortality in High Risk Patients". Clinical Infectious Diseases. 76 (3): e342–e349. doi:10.1093/cid/ciac443. PMID 35653428. PMC 9214014.
2. Flinn, C.; Heckman, J. (1982). "New Methods for Analyzing Labor Force Structural Dynamics". Journal of Econometrics. 18 (1): 115–168. doi:10.1016/0304-4076(82)90097-5. S2CID 16100294.
3. Spruance, Spotswood; Reid, Julia E.; Grace, Michael; Samore, Matthew (August 2004). "Hazard Ratio in Clinical Trials". Antimicrobial Agents and Chemotherapy. 48 (8): 2787–2792. doi:10.1128/AAC.48.8.2787-2792.2004. PMID 15273082. PMC 478551.
4. Cox, D. R. (1972). "Regression-Models and Life-Tables". Journal of the Royal Statistical Society, Series B (Methodological). 34 (2): 187–220. Archived from the original on 20 June 2013: https://web.archive.org/web/20130620214140/http://hydra.usc.edu/pm518b/literature/cox-72.pdf. Retrieved 5 December 2012.
5. Brody, Tom (2011). Clinical Trials: Study Design, Endpoints and Biomarkers, Drug Safety, and FDA and ICH Guidelines. Academic Press. pp. 165–168. ISBN 9780123919137.
6. Motulsky, Harvey (2010). Intuitive Biostatistics: A Nonmathematical Guide to Statistical Thinking. Oxford University Press. pp. 210–218. ISBN 9780199730063.
7. Norman, Geoffrey R.; Streiner, David L. (2008). Biostatistics: The Bare Essentials. PMPH-USA. pp. 283–287. ISBN 9781550093476. Retrieved 7 December 2012.
8. Kleinbaum, David G.; Klein, Mitchel (2005). Survival Analysis: A Self-Learning Text (2nd ed.). Springer. ISBN 9780387239187. Retrieved 7 December 2012.
9. Cantor, Alan (2003). SAS Survival Analysis Techniques for Medical Research. SAS Institute. pp. 111–150. ISBN 9781590471357.
10. Hernán, Miguel (January 2010). "The Hazards of Hazard Ratios". Epidemiology. The Changing Face of Epidemiology. 21 (1): 13–15. doi:10.1097/EDE.0b013e3181c1ea43. PMID 20010207. PMC 3653612.
11. Case, L. Douglas; Kimmick, Gretchen; Paskett, Electra D.; Lohman, Kurt; Tucker, Robert (June 2002). "Interpreting Measures of Treatment Effect in Cancer Clinical Trials". The Oncologist. 7 (3): 181–187. doi:10.1634/theoncologist.7-3-181. PMID 12065789. S2CID 46520247. Archived from the original on 24 December 2019: https://web.archive.org/web/20191224133118/http://theoncologist.alphamedpress.org/content/7/3/181.full. Retrieved 7 December 2012.
12. Newman, Stephan (2003). Biostatistical Methods in Epidemiology. John Wiley & Sons. ISBN 9780471461609.
