Cause-Agnostic RCT (CAR)

Over on X there is a debate relevant causal inference vs RCT design. But what is an RCT?

Since Bradford Hill’s landmark study we have seen the RCT morph into two species.

So can we no longer speak of or debate RCTs as if they comprise a unified entity… a thing. There are CIRs (causal-integrity RCTs) for example the original Bradford Hill causal design, and CARs (causa-agnostic RCTs).

  1. a CIR conditions on a cause AND a covariate vector.

  2. a CAR conditions one or more thresholds (S=1) and are agnostic to cause.

The RCTs of synthetic syndromes like sepsis and ARDS are CARs.

Slide showing the left side of the DAG of a CIR (top panel) in relation to a CAR with the emphasis that CARs are generally NOT on the causal path so they generally crash. .

CIRs are what we all thought RCTs are. They have morphed into the much easier CAR which use non-disease (and non-cause) specific threshold sets as triage thresholds as the gate making case finding and a high n easy but there is no causal unity so they are internally valid (as a function of randomization) but externally non transportable.

The first thing a clinician should ask is: is this study a CIR of a CAR?

This is not to say that CARs (cause-agnostic RCT) are always invalid. Covariate matching might be valid if the same matching can then be applied to the population external to the CAR but not if conditioned on a threshold with a few perceived covariates as in the classic Petty Bone RCTs, the invalid CARs (RCT mimics) which have devastated the image of the RCT in general in critical care.

This “covariate” matching is potentially accomplishable by computer based subphenotype relational time pattern recognition.

Success of such an advanced covariate based CAR technique is likely capturing the unknown cause or mediator in a sense converting it into a CIR equivalent.

Note that matching “covariate” vectors by prognosis (eg AUC-mortality) cannot accomplish this potential CAR to CIR conversion to equivalency.

So the critical care clinicians have to lead the way because these causal paths and covariate vectors and trajectories are complex.

this is a call for CI and design community “synergy,” This distinction of CIR and CAR complements Pearl’s formal identifiability conditions by applying them to the clinical gate itself and it honors the design focused axiom that; “the integrity of the design determines transportability.”

Yet If the CI school insists that the design world must BEGIN by accepting layered propensity-weighting assumptions as definitive, and the RCT design establishment continues to ignore the categorical difference between a Bradford Hill-style CIR and a modern Petty-Bone CAR, then we will remain trapped in the epistemic interface gap.

If this lack of collaboration persists, it is up to clinicians can break this impasse because trialists have been unable to engage in such antidogmatic discussions. Witness you saw all of them run away here. The present state is creating disciplinary dissonance because it challenges 5 decades of methodological consensus. Indeed this analysis questions the heart of the present and decades old hierarchical critical care task force controlled syndrome science itself.

Of course the intellectual antipathy is palpable between CI and RCT design communities. You can cut it with a knife and it has long been unrelenting. I don’t see any intellectual sympathy coming from either camp. So this time it’s up to the clinicians to take the initial steps to save their patients from the perils of intractably siloed epistemic orthodoxy.

Firstly: by telling the CI community we yet don’t know enough to START with layered assumptions and,

Secondly: by refusing to consume Petty-Bone CAR based research and rejecting the task-force-driven scaffolds upon which CAR “science” rests.

I look forward to further discussions.

1 Like

Notice in the above diagram I’ve provided a ghosted indication of covariates in the CAR diagram. Can anyone see why? If you do, then you already understand what a CAR is. If you don’t, go back to the DAGs and think more deeply before you read below.

———-

Did you get it? Yes, of course the covariates are ghosted because they’re phantoms. They are statistical placeholders without a causal parent. In a cause-agnostic RCT (CAR), there’s no true disease node upstream to anchor them, so their apparent relationships are artifacts of conditioning on the synthetic gate S = 1. They look like covariates, but in causal terms they don’t exist.

Now if you understand this then you can easily see why CAR are derived from an apical error. The apical error was the action of replacing upstream cause with a non-disease and non-cause threshold (S=1).

-Do you agree with the ghosting of the covariates?

-What do you think if the CIR and CAR distinction?Is there any value to a CAR such as hypothesis generation?

-What should be the disclosure to the participants if enrolled in a CAR?

-Is it true that we can’t generate a valid causal equivalent by matching signals to a prognosis alone?

-How would time series matrix components (relational time patterns) be encapsulated as part of a “causal equivalent vector” based design when the cause is unknown?

Here for example we can see that there are relational times series patterns of combined myeloid perturbation and hemostatic perturbation. How would such a relational time pattern be studied as a potential targetable cause equivalent?

Would exploration by CI assisted OS using EMR data be useful to find the cause equivalents for RCT?

Explaining further, suppose the myeloid/ hemostatic vector was identified by CI as a potential targetable cause for aspirin instead of using Sepsis -3 (SOFA 1.0 plus suspicion). This would still be a CAR but it might contain the unknown target and be a “CIR equivalent”. Covariates would then potentially be valid.

All of this research opens up when we publicly abandon the “blind CAR”. (The CAR with an international task force (central control) defined SOFA or P/F as a gate) and we return the design of the research to “the people”. This mean no more intellectual colonization of LMIC based researchers in the doctrine of the task force and promulgation of the error so they can move on quickly.

“The people”, the hard working researchers across the world, are many times more likely to find the “cause equivalents” and to generate better and newer ideas than the anchored task force based in the USA can possibly conceive.

Yet “the people” are presently told to follow the rules and one of those is a new rule published this month. ….SOFA 2.0 a new gate (S=1) for another decade of blind CARs.

Development and Validation of the SOFA-2 Score https://jamanetwork.com/journals/jama/fullarticle/2840822?utm_source=twitter&utm_campaign=content-shareicons&utm_content=article_engagement&utm_medium=social&utm_term=110125

We know then what’s coming next…Sepsis 4 replacing Sepsis 3 (which used SOFA 1.0 as the CAR gate). Sepsis 4 will use SOFA 2.0 for another decade of blind centrally controlled sepsis CARs.

Some will generate phenotypes after the gate of SOFA 2.0 creating blind CAR of “phenotypes” that met S=1 just as was performed with SOFA 1.0.

Musical gates, blind CARs, central control of S=1. Titrate the collider, rinse and repeat for 35 years of blind CARs, let’s fix this for the public.

I can’t stop this unless I can at least precipitate discourse.

1 Like

Illustrative Case: The Pulse Oximetry Trial as a Cause-Agnostic RCT (CAR)

Let us now consider the consequences, both beneficial and detrimental, of a cause-agnostic randomized controlled trial (CAR).

To illustrate, I will reference a landmark perioperative RCT and its subsequent Cochrane review of pulse oximetry in postoperative patients (n = 20,802):

Pedersen T, et al. Pulse oximetry for perioperative monitoring. Cochrane Database Syst Rev. 2014. PMID: 24638894.

The review concluded:

Routine continuous pulse oximetry monitoring did not reduce transfers to intensive care and did not decrease mortality, and it remains unclear whether any real benefit was derived from the application of this technology for patients recovering from cardiothoracic surgery in a general care area.”

The primary trial underpinning this conclusion was a classic CAR, a randomized design conditioned not on a defined causal pathway but on a generalized postoperative state (S = 1).

The Causal Structure

Consider the simplified DAG in which postoperative patients enter through the gate S = 1, representing the “postoperative” state. The target is the timely detection of early complications which may reduce the number of complications since complications have an apical component which cascade into a plurality of complications and, sometimes, lead to mortality and increase Length of Stay (LOS)

Upstream, multiple distinct complications (causal pathways) feed into that gate:

  • D₁: Pain-treatment overdosing (iatrogenic respiratory depression)

  • D₂: Congestive heart failure (CHF)

  • D₃: Pulmonary embolism (PE)

Each of these conditions interacts differently with pulse oximetry:

  • For D₁, early detection of hypoventilation may improve survival.

  • For D₂ and D₃, however, continuous oximetry may paradoxically delay recognition of clinical deterioration. Respiratory alkalosis can transiently elevate oxygen saturation and thus blunt the device’s sensitivity to adverse events.

The delay may be shorter in older patients (PE) and longer in younger patients who can generate more profound respiratory alkalosis; conversely, in CHF, the effect size is weighted toward older age strata.

Here, the covariate “age” appears influential, yet it is not grounded in a causal framework.

Implications for the Estimated Treatment Effect

Within this mixture, early detection in D₁ may be offset by delayed detection in D₂ and D₃. The global average treatment effect (ATE) on survival therefore trends negative, even though within one disease stratum (D₁) the treatment is beneficial.

Such aggregation across unrecognized causal heterogeneity produces an apparent null or harmful result and generates covariates that are statistical artifacts rather than causal entities.

Policy Consequences

This landmark CAR effectively halted adoption of routine pulse oximetry for decades.

Ironically, from a policy perspective, the conclusion was defensible as the technology’s global ATE was near zero.

Yet from a pathophysiological perspective, the lesson was misleading.

Because the design lacked causal grounding, the field learned nothing about the distinct mechanisms by which pulse oximetry could help or harm. The study was, in effect, a “numbers game in the blind.”

The Epistemic Lesson

This is the defining flaw of a blind CAR: it yields no pathophysiological insight.

Complications in critical care evolve dynamically, and salvage depends on recognizing those mechanisms, not merely on aggregate outcomes.

Thus, we can summarize:

  • The good: The policy outcome, cautious non-adoption was arguably correct for the aggregate population.

  • The bad: The nuanced causal lessons were lost; incorrect inferences were drawn about the potential value of the technology in specific causal contexts.

A CAR is, therefore, a blind instrument.

There is no justification for remaining blind when causal-symbolic modeling (cSM) can be applied both before trial design, to define valid gating criteria and strata, and after analysis, to interpret heterogeneous effects in the light of causal structure.

In short, cSM opens the eyes of the CAR, transforming blind randomization into informed causal inquiry.

1 Like

Keeping with the CAR theme, I am trying to determine the extent of shrinkage in the REMAP-CAP trial and how that might have effected (reduced) the difference in treatment effects across diseases between the corticosteroids in influenza strata and the corticosteroids with other pooled pneumonia types.

IMHO REMAP-CAP platform represents an evolved form of the Cause-Agnostic RCT (CAR) but the Bayesian hierarchical modeling introduces mathematical conflation of biological (causal) mechanistic differences of the means and the variances.

So IMHO statistical exchangeability and biological coherence may be conflated in this RCT design.

In a purely mathematical (non biological) sense, exchangeability means that the outcome distributions of treated and untreated subjects can be considered similar up to random noise.

Hierarchical or Bayesian models exploit this by borrowing strength across strata that appear empirically alike.

But biological coherence requires something deeper, a shared causal mechanism through which the intervention acts. In other words mathematical similarity does not assure the coherence that enables acceptable pooling.

Two strata may be exchangeable in a statistical model yet fundamentally incoherent in biological terms, as when pneumococcal pneumonia and influenza pneumonia are partially pooled within the same “CAP” gate.

Borrowing strength across such incoherent strata narrows uncertainty mathematically while widening the agnostic state causally. The means move together but thus us a mathematical not a biological induced similarity.

The apparent precision of the posterior becomes a false clarity, concealing the epistemic discontinuity between mathematics and mechanism. Perceived noise elimination is actually the narrowing of the true biological force separation of the means.

A valid cause-informed RCT restores alignment by restricting exchangeability to strata that are mechanistically coherent, where biological and statistical similarity coincide.

Its Bayesian hierarchical framework functions as a statistical patch for the structural weakness that defines all CARs. Specifically the absence of a unifying causal anchor within the enrollment gate S=1.

Because the platform randomizes across patients defined by a synthetic syndrome (CAP) rather than a verified disease, the population is causally incoherent.

The hierarchical model shrinks stratum effect estimates toward a common mean but in the presence of primary disease (target) mixtures this may treat biological heterogeneity as if it is statistical noise.

In this design, the hyperpriors on the global mean and the between-stratum variance are apparently enforcing partial exchangeability among non-exchangeable disease states but the gamma is unknown so I could be wrong about that. Please advise.

The shrinkage provides stability: the model smooths random variation across small, dispersed strata to maintain functional adaptive randomization and produce interpretable posteriors.

Yet the smoothing operates only at the statistical level and thus was the intent given the dispersion of the centers which creates noise to be removed. Yet this does not resolve or mitigate the underlying causal fragmentation but rather hides it.

What appears as precision in the posterior is thus a mathematical artifact of enforced similarity, not genuine transportable knowledge. Biological differences are mitigated, the treatment effects appear similar as a partial function of shrinkage alone.

So in causal-symbolic terms, the REMAP-CAP framework is a PettyBone CAR mimic:

it replaces missing causal structure with a mathematically unifying methodology. Borrowing is beneficial within mechanistically coherent strata but not within biologically incoherent mechanisms to generate a single marginal estimand as well as multiple conditional estimands which are pulled together mathematically despite the fact that their raw differences may have a causal (biological) origin which is now mitigated by the math.

One wonders how adaptive randomization would be effected by an early chance exposure to sequential exposure to different mixes, of different diseases, with different treatment effects.

In my paper I don’t go to this depth but rather present the DAGs and leave the discussion of pooling and adaptive randomization, both of which require sufficient biological coherence, to the statisticians. .

I hope there will be clarifications of these musings. I may be wrong on some of these details but not on the fundamental premise which I have researched for my paper.

REMAP-CAP is a major advance and everyone must be in awe (as I was) relevant the cool method applied to bring centers across the world into mathematical, noise reduced, harmony. Yet there may still be some bugs inherented from standard 50 year old critical care RCT methodology. Let’s find them.