Background Dichotomisation of continuous variables before analysis has frequently been criticised but, nonetheless, remains a common approach. We were interested in the effects of dichotomisation of an outcome variable when two predictors are examined.
Methods Assuming a log-normally distributed continuous outcome, a three-level and a binary independent variable, we evaluated the results that would be obtained by logistic regression after dichotomisation. Different cut-offs, predictor effects and dispersions were examined, with a special focus on interaction terms.
Results Depending on the specific parameter combination, dichotomisation introduced sometimes substantial spurious interactions between the two predictor variables regarding their association with the outcome. These interactions could be assigned statistical significance even with modest sample sizes. Real-life data on sex×weight as determinants of γ-glutamyltransferase provided a practical example of these issues.
Conclusions The findings presented add a new aspect to the controversy surrounding dichotomisation of continuous variables. Researchers should critically examine whether the validity of their results might be hampered by such phenomena.
- spurious association
Statistics from Altmetric.com
Funding Deutsche Forschungsgemeinschaft, Germany.
Competing interests None.
Provenance and peer review Not commissioned; externally peer reviewed.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.