The NEJM Special Article that introduced the concept of NNT in 1988 examined subgroups of a hypertension trial. The authors noted that patients with target organ damage appeared to derive more absolute benefit from antihypertensive drugs than those without target organ damage. They used these observations to argue that clinicians would get more “bang for their buck” when treating patients who already had target organ damage. They expressed “bang for your buck” in terms of NNT:
Laupacis A, Sackett D, Roberts R; An Assessment of Clinically Useful Measures of the Consequences of Treatment: N Engl J Med 1988;318:1728-1733
“The “number needed to be treated” is the number of patients who must be treated in order to prevent one adverse event. For example, in the Veterans Administration trial, if 100 control patients without target-organ damage had been followed for three years (risk of adverse event 0.098), 10 events should have been expected. If, however, 100 such patients had been treated with antihypertensive agents and followed for three years (risk of adverse event, 0.040), only four events would have been expected. Thus, on average, treating 100 such patients for three years would have prevented six (10-4) adverse events, meaning that 17 patients (100 divided by 6) would have had to be treated in order to prevent one event. However, similar calculations reveal that among patients with initial target-organ damage, only seven would have had to be treated for three years in order to prevent one event.
Mathematically, the number needed to be treated is equivalent to the reciprocal of the absolute risk reduction. The number needed to be treated has the same advantage over the relative risk reduction and odds ratio as the absolute risk reduction in that it expresses efficacy in a manner that incorporates both the baseline risk without therapy and the risk reduction with therapy…”
As a physician without expertise in statistics or epidemiology, I feel like I’ve never been able to internalize, deeply enough, statistical criticisms of NNT. The criticisms are multiple, but it feels like there must be a single “root” conceptual error, perpetrated by the developers of the technique, without which the technique would never have caught on.
Question: Is the most fundamental problem with NNT the fact that it relies on an assumption that the absolute event rate in a given RCT arm would remain constant from one trial to the next?
The second sentence of the first paragraph reads: “For example, in the Veterans Administration trial, if 100 control patients without target-organ damage had been followed for three years (risk of adverse event 0.098), 10 events should have been expected.” But later in their paper, the authors describe one of several “shortcomings” of the NNT as follows:
“…Fourth, any measure of the benefit of treatment may vary considerably in different trials of the same or similar therapy because of different patient populations, trial designs (e.g., whether the therapy is evaluated in a setting designed to maximize compliance or as part of routine patient care), or chance…”
Isn’t this “shortcoming” a fatal flaw of the NNT concept (??) Doesn’t this “shortcoming” completely undermine the purpose of calculating NNT (??) Am I missing something (?)
The authors clearly (but indirectly) acknowledge the fact that another trial, designed with the same eligibility criteria, and testing the same therapy, might have recorded quite a different event rate in the control arm. Yet, for some reason, they don’t consider the fact that the group-level ARR can vary from one trial to another to be a “deal-breaker” when they advise extrapolating it to decision-making at the level of an individual patient (?)…
Does a failure to recognize the profound implications of the non-transportability (?correct wording) of the risk difference also lie at the root of the RNCT phenomenon (?) If people believe that an event rate documented in a trial’s control arm is a “constant” (i.e., that the control arm event rate will be the same from one trial to the next), then they will fail to understand the purpose of concurrent control (?)