Bruce Weaver
2014-03-28 22:54:07 UTC
Hello,
A recent article by O'Boyle & Aguinis (2012) demonstrated that most individual job performance follows a Pareto rather than a normal distribution. This is a VERY big deal, because most statistical techniques used by psychology and management researchers assume normally distributed data. If now tools like ANOVA, regression, and any variations on the general linear model cannot be used to correctly analyze job performance data, what alternatives (other than chi-square) are available in SPSS? What tools can be used to, for example, make between-group difference comparisons, when your data is Pareto rather than normally distributed? Any guidance would be appreciated.
Thank you!
Your question is not really about SPSS, so is probably more appropriateA recent article by O'Boyle & Aguinis (2012) demonstrated that most individual job performance follows a Pareto rather than a normal distribution. This is a VERY big deal, because most statistical techniques used by psychology and management researchers assume normally distributed data. If now tools like ANOVA, regression, and any variations on the general linear model cannot be used to correctly analyze job performance data, what alternatives (other than chi-square) are available in SPSS? What tools can be used to, for example, make between-group difference comparisons, when your data is Pareto rather than normally distributed? Any guidance would be appreciated.
Thank you!
for a group like sci.stat.consult (where I have cross-posted this reply).
I assume you are referring to this article:
http://hrprofessionalsmagazine.com/wp-content/uploads/2013/03/Normality-of-Performance-Paretian-Theory.pdf
I don't have time to read it right now, but here are a few quick comments.
1. As George Box observed, nothing in nature is normally distributed.
The normal distribution is an approximation that can serve as a useful
model. (He also said there's no such thing as a straight line in
nature, but that linear fits can provide useful models.)
2. Methods that assume normality typically assume normality of the
(model fitting) errors, or normality of the sampling distribution of
some statistic, not normality of the raw data.
3. As Herman Rubin and others have said many times, normality of the
errors is far less important than the independence of the errors and
homoscedasticity (i.e., the "identically distributed" part of
"independently and identically distributed").
HTH.
--
Bruce Weaver
***@lakeheadu.ca
http://sites.google.com/a/lakeheadu.ca/bweaver/Home
"When all else fails, RTFM."
Bruce Weaver
***@lakeheadu.ca
http://sites.google.com/a/lakeheadu.ca/bweaver/Home
"When all else fails, RTFM."