Hello Sudhi, indeed this is a fundamental concern with methods like ICC – a measurement in a heterogeneous population appears more reliable than the exact same measurement applied to a homogeneous population. I have essentially stopped using ICCs for this reason, and followed prof Harrell’s advice to rely on U-statistics. Related post: