Potential issues with risk differences from logistic regression models

Thank you. I realise using the GUSTO trial as the code example was misleading, as a treatment-trail situation is not where I intend to use this. A more detailed description of the specific circumstance I am targeting is here. In short, a claim has been made that a specific exposure is a severely underrecognized cause of a certain common condition and is the cause of a significant proportion of this condition in the general population. I have a large population-based sample in which each participant is prospectively screened for the exposure and condition. Assuming the exposure does not protect against other unrelated causes of the condition, I want to infer, for the population, what proportion of the condition could reasonably be assumed to be caused the exposure to either support or refute the claims that have been made. Importantly, the claims are made on the population level, i.e. not what would be expected to happen to the individual.

In this scenario, would my approach be reasonable?

1 Like