Over 60 years ago Ronald Fisher demonstrated a number of potential pitfalls with statistical analyses using ratio variables. Nonetheless, these pitfalls are largely overlooked in contemporary clinical and epidemiological research, which routinely uses ratio variables in statistical analyses. This article aims to demonstrate how very different findings can be generated as a result of less than perfect correlations among the data used to generate ratio variables. These imperfect correlations result from measurement error and random biological variation. While the former can often be reduced by improvements in measurement, random biological variation is difficult to estimate and eliminate in observational studies. Moreover, wherever the underlying biological relationships among epidemiological variables are unclear, and hence the choice of statistical model is also unclear, the different findings generated by different analytical strategies can lead to contradictory conclusions. Caution is therefore required when interpreting analyses of ratio variables whenever the underlying biological relationships among the variables involved are unspecified or unclear.
Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Leeds, Leeds, UK; Leeds Dental Institute, University of Leeds, Leeds, UK; London Metropolitan University Graduate School, London, UK
%0 Journal Article
%1 Tu2010
%A Tu, Yu-Kang
%A Law, Graham R
%A Ellison, George T H
%A Gilthorpe, Mark S
%D 2010
%J Pharmaceutical statistics
%K AnalysisofVariance Animals BodyWeight Cats Female Male OrganSize StatisticsasTopic StatisticsasTopic:methods
%N 1
%P 77-83
%R 10.1002/pst.377
%T Ratio index variables or ANCOVA? Fisher's cats revisited.
%U http://dx.doi.org/10.1002/pst.377 http://www.ncbi.nlm.nih.gov/pubmed/19337988
%V 9
%X Over 60 years ago Ronald Fisher demonstrated a number of potential pitfalls with statistical analyses using ratio variables. Nonetheless, these pitfalls are largely overlooked in contemporary clinical and epidemiological research, which routinely uses ratio variables in statistical analyses. This article aims to demonstrate how very different findings can be generated as a result of less than perfect correlations among the data used to generate ratio variables. These imperfect correlations result from measurement error and random biological variation. While the former can often be reduced by improvements in measurement, random biological variation is difficult to estimate and eliminate in observational studies. Moreover, wherever the underlying biological relationships among epidemiological variables are unclear, and hence the choice of statistical model is also unclear, the different findings generated by different analytical strategies can lead to contradictory conclusions. Caution is therefore required when interpreting analyses of ratio variables whenever the underlying biological relationships among the variables involved are unspecified or unclear.
%@ 1539-1612