100% agree: the absence of any calibration assessment is a major omission. The editors/reviewers should have required adherence to TRIPOD-AI.
Also, given the hype about this, it is remarkable how marginal the increases in discrimination are over the baseline model, which, importantly, included only age, gender, BMI, and race/ethnicity.
Of course, datamethods readers don’t need reminding that comparing c-statistics is not how to look at added predictive value…
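
For anyone arriving from outside the forum, here is a rough sketch of one alternative to comparing c-statistics: a likelihood ratio test on nested logistic models, plus the share of the full model's LR chi-square that the baseline does not already capture. Everything here is an assumption for illustration. The data are simulated, and `x_base`, `x_new`, and the helper functions are hypothetical stand-ins, not the paper's variables or its analysis.

```python
import math
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x

def linear_predictor(beta, X):
    """Intercept plus weighted sum of predictors for each row of X."""
    return [beta[0] + sum(b * v for b, v in zip(beta[1:], r)) for r in X]

def fit_logistic(X, y, iters=25):
    """Maximum-likelihood logistic regression via Newton-Raphson.
    X holds predictor rows without an intercept. Returns (beta, log-likelihood)."""
    rows = [[1.0] + list(r) for r in X]
    k = len(rows[0])
    beta = [0.0] * k
    for _ in range(iters):
        grad = [0.0] * k
        hess = [[0.0] * k for _ in range(k)]
        for r, yi in zip(rows, y):
            p = 1.0 / (1.0 + math.exp(-sum(b * v for b, v in zip(beta, r))))
            w = p * (1.0 - p)
            for i in range(k):
                grad[i] += (yi - p) * r[i]
                for j in range(k):
                    hess[i][j] += w * r[i] * r[j]
        beta = [b + s for b, s in zip(beta, solve(hess, grad))]
    # Bernoulli log-likelihood: sum of y*eta - log(1 + exp(eta))
    loglik = sum(yi * e - math.log1p(math.exp(e))
                 for yi, e in zip(y, linear_predictor(beta, X)))
    return beta, loglik

def c_statistic(y, score):
    """Proportion of event/non-event pairs ranked concordantly (ties count 0.5)."""
    pos = [s for s, yi in zip(score, y) if yi == 1]
    neg = [s for s, yi in zip(score, y) if yi == 0]
    conc = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return conc / (len(pos) * len(neg))

# Simulated data (assumption): one baseline predictor and one new marker.
random.seed(1)
n = 400
x_base = [random.gauss(0, 1) for _ in range(n)]
x_new = [random.gauss(0, 1) for _ in range(n)]
y = [1 if random.random() < 1 / (1 + math.exp(-(-1 + a + 0.5 * b))) else 0
     for a, b in zip(x_base, x_new)]

X_null = [[] for _ in range(n)]              # intercept only
X_base = [[a] for a in x_base]               # baseline model
X_full = [[a, b] for a, b in zip(x_base, x_new)]  # baseline + new marker

_, ll_null = fit_logistic(X_null, y)
b_base, ll_base = fit_logistic(X_base, y)
b_full, ll_full = fit_logistic(X_full, y)

lr_base = 2 * (ll_base - ll_null)            # model LR chi-square, baseline
lr_full = 2 * (ll_full - ll_null)            # model LR chi-square, full
lr_added = 2 * (ll_full - ll_base)           # LR test for the new marker, 1 df
p_added = math.erfc(math.sqrt(max(lr_added, 0.0) / 2))  # chi-square(1) upper tail
frac_new = 1 - lr_base / lr_full             # share of LR chi-square not in baseline

c_base = c_statistic(y, linear_predictor(b_base, X_base))
c_full = c_statistic(y, linear_predictor(b_full, X_full))

print(f"c-statistic: base={c_base:.3f} full={c_full:.3f}")
print(f"LR chi-square for added marker={lr_added:.2f}, p={p_added:.4g}")
print(f"Fraction of new information={frac_new:.3f}")
```

The c-statistic is rank-based and tends to move little even when the likelihood-based measures show that a marker adds real information, which is why the latter are the more sensitive way to judge added value. In practice one would fit the models with an established package rather than the hand-rolled Newton-Raphson used here to keep the sketch dependency-free.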
