Logistic regression assumptions

Greetings:

This is my first post to this forum and I want to thank Prof Harrell for creating this resource for students of this subject like me.

On to my question:

RMS p. 221 states: “logistic model makes no distributional assumption whatsoever.” [emphasis added]

May I ask how to reconcile it with the work by economists in the context of choice models that asserts that logistic model assumes that the underlying utility must have an extreme value distribution. For instance here is a quote:

The relation of the logit formula to the distribution of unobserved utility (as opposed to the characteristics of choice probabilities) was developed by Marley, as cited by Luce and Suppes (1965), who showed that the extreme value distribution leads to the logit formula. McFadden (1974) completed the analysis by showing the converse: that the logit formula for the choice probabilities necessarily implies that unobserved utility is distributed extreme value.

Discrete Choice Methods with Simulation, by Kenneth Train, Cambridge University Press, 2002 (berkeley.edu), Chapter 3, p. 34

There are lots of ways to derive models. The fact that there is a link to utility theory doesn’t mean that the logistic model makes a distributional assumption. We are talking about the data distribution (and sometimes the sampling scheme) when we refer to model assumptions. The only distributional assumption made by the binary logistic model is that the observations are independent, i.e., you have one observation per subject. Other than that, any time you have a binary Y=0, 1 you automatically have a Bernoulli distribution no matter what (assuming independence holds) so in reality you are assuming nothing. It is the non-distributional regression assumptions (right hand side of the model) that matter in logistic regression.

Blockquote May I ask how to reconcile it with the work by economists in the context of choice models that asserts that logistic model assumes that the underlying utility must have an extreme value distribution

Not sure if it helps with your question but we use utilities differently in our group for clinical applications, including for ordinal outcomes: first we use whatever statistical model is more likely to fit the data best and is computationally tractable. Then we convert the relevant parameters of the model into outcomes that are clinically interpretable by clinicians (e.g., probability of response to therapy at 6 weeks) which facilitates eliciting relevant utility functions for these outcomes. This allows conversion of probabilities (including joint probabilities of outcomes of interest) into mean utilities that can be used to guide decisions that maximize mean utility. See an example here within a phase I-II trial modeling context. Note that we used there a probit link (similarly to what economists often use) instead of a logit, as probit often aligns with Juhee Lee’s modeling preferences in part because the effect of frailty vectors can be more clear under probit. But in theory logit could have been used under a different modeling philosophy.

Now, since whether to use logit or something else is itself a decision, the trade-offs can in theory be expressed as utilities as well within such a framework at a more meta- (or hierarchical) level. But I cannot see how logit would be incompatible with the utility-based framework described above (other than making aspects of modeling easier as mentioned)?

This is classic decision making. It is best to separate, as you did, the data model from the utility model. The data model is used to provide evidence for or against various assertions. Utilities measure only the consequences of acting on assertions.

1 Like