RE: Q1. Frank is listed on these 2 recent papers that compare “statistical” vs “machine learning” in a health context. I don’t know of anything specific towards refereeing.
Austin, P. C., Harrell, F. E., Lee, D. S., & Steyerberg, E. W. (2022). Empirical analyses and simulations showed that different machine and statistical learning methods had differing performance for predicting blood pressure. Scientific Reports, 12(1), 1-11. link
Austin, P. C., Harrell Jr, F. E., & Steyerberg, E. W. (2021). Predictive performance of machine and statistical learning methods: Impact of data-generating processes on external validity in the “large N, small p” setting. Statistical methods in medical research, 30(6), 1465-1483. link
I’m reminded of this thread that was critical of using the term ML to make a paper seem more sophisticated.
RE Q3. This data methods thread has lots of relevant citations: