data analysis


data reduction data reduction (principal components, etc.), clustering, unsupervised learning accuracy accuracy and information measures, discriminaton, calibration probability Probability theory, meaning, and application exclusive of statistical tests, etc. generalizability Generalizability of studies and statistical inferences, sample representativeness, target population formal statistical tests and inference descriptive descriptive and exploratory data analysis, hypothesis generating more than confirmatory analysis bayes Bayesian data analysis, modeling, inference model validation model validation and interpretation comparative methods comparative performance of statistical analysis methods and predictive modeling approaches machine learning machine learning, exclusive of traditional statistical models variable selection Selection of predictive features in multivariable modeling, one-at-a-time screening of variables, and the cost of feature selection compared to using fuller models, possibly with penalization (shrinkage; regularization). models Formulation, parameter estimation, and interpretation of specific statistical models causal inference Methods and approaches to causal inference data problems statistical approaches dealing with missing data and measurement error modeling strategy General model specification issues, nonlinearities, interactions and heterogeneity of treatment effect, avoiding categorization, how to sequence multiple steps (which may involve multiple imputation and data reduction) uncertainty Quantifying uncertainty, displaying uncertainty, estimation of uncertainty, incorporation of uncertainty into decision making, etc. This includes but is not limited to confidence intervals, standard errors, Bayesian credible intervals, and sources of uncertainty. reporting This subcategory relates to how results of data analyses should be reported, for example which summary statistics should be reported for a logistic regression model.
Topic Replies Activity
About the data analysis category 2 January 21, 2019
Estimation of time to separation of survival curves 5 January 23, 2020
Comparing rates of a risk factor in 2 groups 6 January 21, 2020
Optimal Decision-making for Imputation/Predicted IVs 1 January 21, 2020
Bootstrap vs. cross-validation for model performance 7 January 19, 2020
Risk prediction in Cox regression using counting process style of input 1 January 17, 2020
Swopping exposed and unexposed groups in calculating OR 4 January 17, 2020
Meta-analysis with incomplete data 7 January 16, 2020
Multiplicity Adjustments in Bayesian Analysis 10 January 16, 2020
How to compare and evaluate models for a new feature? 2 January 11, 2020
Weibull AFT survival analysis - best method to present data when publising/presenting 4 January 11, 2020
Pre-specifying the distribution of AFT models according to AIC/BIC criteria and other exceptions to the use of observed effects 2 January 9, 2020
Dynamic model averaging (or similar) for pharmacokinetic curve fitting - looking for suggestions 1 January 9, 2020
Appropriate statistical analysis for predicting early death 1 January 9, 2020
Identifying patients through phenotype definition 8 January 5, 2020
Split + resampling, or just resampling? 3 January 4, 2020
Why significant variable doesn't improve model performance? 2 January 2, 2020
Seed for cross validation gives different results in L0 regularization for LASSO 2 December 23, 2019
Choosing a Classification probability threshold when using nested cross-validation 10 December 19, 2019
CART and Multiple Imputation 4 December 16, 2019
How to handle pre-post measures for a predictor 4 December 14, 2019
Precision and decision making 30 December 12, 2019
How to handle large number of observations below LLOQ 5 December 11, 2019
Conceptual question about survival analysis vs logistic regression: when is one more appropriate than the other? 15 December 2, 2019
Marginal vs conditional (Mixed) models used in Joint model analysis. The SPRINT trial controversy 10 December 2, 2019
Nomogram for Lasso-Cox regression 2 November 28, 2019
Creating contingency table when OR, p-value known 7 November 28, 2019
Reference Collection to push back against "Common Statistical Myths" 34 November 28, 2019
How to determine the p value cutoff for unvariate regression analysis to be included in multivariate analysis 3 November 17, 2019
Competing risk for AFT models in R 5 November 23, 2019