Efficiency Comparison of Parametric and Non-Parametric Tests for Testing the Difference Between Central Values of More Than Two Independent Populations Using Counting Data
Keywords:
Kruskal-Wallis test, Van der Waerden test, Welch’s test, power of a test, type I errorAbstract
This research aimed to compare the performance of the one-way analysis of variance (F test), Welch's test, Kruskal-Wallis test, Van der Waerden test, and Median test for testing the difference between central values of more than two independent populations using counting data with binomial and Poisson distributions. The criteria used to evaluate and compare the efficiency of the proposed tests are the ability to control the probability of type I error and power. The results showed that for the binomial distribution data, all tests could control the probability of type I error for most situations, except when the sample size of each group was 10, and the variance of each group was less than 4, and when the sample size of each group was 15 and (n, p) were (10, 0.1) and (10, 0.7). For the Poisson distribution data, all tests could also control the probability of type I error in almost situations. However, except for the Welch's Test, the Van der Waerden Test, and the Median Test, the probability of type 1 error could not be controlled in cases when the sample size of each group was ten and = 30. When considering the power, it was found that the analysis of variance provided the highest power than other tests in almost situations. The results of this study will guide data analysts in choosing an appropriate statistic for testing the difference between central values of more than two independent populations when data is count data.
References
Alexander, R. A., & Govern, D. M. (1994). A new and simpler approximation for ANOVA under variance heterogeneity. Journal of Educational and Behavioral Statistics, 19, 91-101.
Blanca, M. J., Alarcón, R., Arnau, J., Bono, R., & Bendayan, R. (2017). Non-normal data: Is ANOVA still a valid option?. Psicothema, 29(4), 552-557.
Bowarnkitiwong, S.,& Areekul, K. (2017). Robustness of f-test for heterogeneity of population variances. SDU Research Journal, 13(1), 1-16.
Büning, H. (1997). Robust analysis of variance. Journal of Applied Statistics, 24, 319-332.
Cochran, W.G. (1947). Some consequences when the assumptions for the analysis of variance are not satisfied. Biometrics, 3, 22-38.
Everitt, B. S. (2002). The Cambridge Dictionary of Statistics. Cambridge University Press.
Gamage, J., & Weerahandi, S. (1998). Size performance of some tests in one-way ANOVA. Communications in Statistics -Simulation and Computation, 27, 625-640.
Glass, G., Peckham, P., & Sanders, J.R. (1972). Consequences of failure to meet assumptions underlying the fixed effects analysis of variance and covariance. Review ofEducational Research, 42, 237-288.
Harrison, R. L. (2010). Introduction to monte carlo simulation.AIP conference proceedings, 1204(1), 17-21.
Harwell, M. R., Rubinstein, E. N., Hayes, W. S., & Olds, C. C. (1992). Summarizing Monte Carlo results in methodological research: The one-and two-factor fi xed effects ANOVA cases. Journal of Educational and Behavioral Statistics, 17, 315-339.
Hecke, T.V. (2012). Power Study of ANOVA Versus Kruskal-Wallis Test. Journal of Statistics and Management Systems, 15(2-3), 241-247.
Khomduean, J.,& Araveeporn, A. (2017). Efficiency Comparison of Statistic for Testing Three Population Means in Case of Homogeneity and Heterogeneity of Variance. Journal of Applied Statistics and Information Technology, 2(2), 61-77.
Khomduean, J.,& Araveeporn, A. (2017). Efficiency Comparison of Statistic for Testing Three Population Means in Case of Heterogeneity of Variance. Thai Science and Technology Journal, 25(6), 918-928.
Klubnual, P. (2018). The efficiency of parametricand non-parametric statistics on location testing with multiple population groups. RMUTSB Academic Journal, 6(1), 84-100.
Kruskal, W. H.,& Wallis, W. A. (1952). Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association 47(260), 583–621.
Lee, S., & Ahn, C. H. (2003). Modified ANOVA for unequal variances. Communications in Statistics -Simulation and Computation, 32, 987-1004.
Lesch, S. M.,& Jeske, D. R. (2009). Some Suggestions for Teaching About Normal Approximations to Poisson and Binomial Distribution Functions. The American Statistician, 63(3), 274-277.
Lix, L. M., Keselman, J. C., & Keselman, H. J. (1996). Consequences of assumption violations revisited: A quantitative review of alternatives to the one-way analysis of variance F test. Review of Educational Research, 66, 579-619.
Lüpsen, H. (2017). Comparison of nonparametric analysis of variance methods -A Vote for van derWaerden. Communications in Statistics -Simulation and Computation, 30, 1-30.
McDonald, J.H. (2014). Handbook of Biological Statistics(3rded.).Sparky House Publishing, Baltimore, Maryland.
Moder, K. (2007). How to keep the Type I Error Rate in ANOVA if Variances are Heteroscedastic. Austrian Journal of Statistics, 36(3), 179-188.
Moder, K. (2010). Alternatives to F-test in one way ANOVA in case of heterogeneity of variances (a simulation study). Psycho
logical Test and Assessment Modeling, 52, 343-353.
Patrick, J. D. (2007). Simulations to analyze Type I error and power in the ANOVA F test and nonparametric alternatives [Unpublished master’s thesis].University of West Florida. http://etd.fcla.edu/WF/WFE0000158/Patrick_Joshua_Daniel_200905_MS.pdf
Sangthong, M. (2018). Efficiency Comparison of Parametric and Nonparametric Statistics on Central Testing Between Population Groups When Using Five-level Likert-type Scales. Thai Science and Technology Journal,28(3), 403-418.
Scheffe, H. (1959). The analysis of variance(6thed.). New York: Jonh Wiley & Sons.
Sheskin, D.J. (2000). HandbookofParametric and Nonparametric Statistical Procedures(2nded.).Boca Raton, FL: Chapman and Hall/ CRC.
Sinsomboonthong, S. (2017). Experimental Designs in Agriculture. Bangkok, Thailand: Chamchuree Products Co., LTD.
Sinsomboonthong, S. (2020). Nonparametric Statistics. Bangkok, Thailand: Chamchuree Products Co., LTD.
Urawong, K., Chumnaul, J., Phaochamrun,C., & Singhabumrung, A. (2022). A comparing on performance of t-test and Wilcoxon rank sum test for two independent populations using counting data. The Journal of Applied Science, 21(1), 1-17.
Van der Waerden, B.L. (1952). Order tests for the two-sample problem and their power.Indagationes Mathematicae, 14, 453–458.
Van der Waerden, B.L. (1953). Order tests for the two-sample problem. II, III, Proceedings of the Koninklijke Nederlandse Akademie van Wetenschappen, Serie A, 564, 303–310.
Welch, B.L. (1951). On the comparison of several mean values: an alternative approach. Biometrika, 38(1), 330-336.
Yiğit, E., & Gökpınar, F. (2010). A Simulation study on tests for one-way ANOVA under the unequal variance assumption. Communications Faculty of Sciences University of Ankara, Series A1, 59, 15-34.
Zijlstra, W. (2004). Comparing the Student ́s t and the ANOVA contrast procedure with five alternative procedures [Unpublished master’s thesis].RijksuniversiteitGroningen.

Downloads
Published
License
Copyright (c) 2023 Journal of Applied Science and Emerging Technology

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.