Confidence Intervals for the Difference between Two Proportions by Sampling Importance Resampling Algorithm
Main Article Content
Abstract
Confidence intervals for the difference between two proportions ( p1 – p2 ) have gained interest by statisticians from past to present. Several researchers have proposed and compared the confidence intervals. Nowadays, computers play an important role in computational statistics. Methods of confidence interval constructions could employ algorithms instead of direct analytical method. In this research, Sampling Importance Resampling (SIR) algorithm frequently found in Bayesian statistics is applied to construct the confidence intervals which are then compared to those obtained from Bayes, hybrid, and Wald confidence intervals. The results show that the confidence intervals obtained from SIR give the coverage probabilities similar to those of Bayes and hybrid methods, and these 3 methods are superior to the Wald method in all population characteristics and sample sizes. Because SIR algorithm directly uses samples of p1 and p2, the confidence intervals of p1 – p2 are definitely in [–1, +1]. In case of , SIR algorithm is able to construct a confidence interval while the Wald interval is not when the standard error equals zero. The characteristic of population also affects the length of intervals. If p1 and p2 are close to .5 (symmetric), the interval length tends to be greater than that calculated from skewed population, for example p1 = p2 = .05. To conclude, in many situations, SIR algorithm is able to produce a good confidence interval similar to Bayes and hybrid method, one of the most efficient methods in literature review.
Article Details
The articles published are the opinion of the author only. The author is responsible for any legal consequences. That may arise from that article.
References
[2] S. E. Vollset, “Confidence intervals for a binomialproportion,” Statistics in Medicine, vol. 12, pp. 809–824, 1993.
[3] R. G. Newcombe, “Interval estimation for the difference between independent proportions: Comparison of eleven methods,” Statistics in Medicine, vol. 17, pp. 873–890, 1998.
[4] M. G. Haviland, “Yates's correction for continuity and the analysis of 2 × 2 contingency tables,” Statistics in Medicine, vol. 9, pp. 363–367, 1990.
[5] S. L. Beal, “Asymptotic confidence intervals for the difference between two binomial parameters for use with small samples,” Biometrics, vol. 43, pp. 941–950, 1987.
[6] J. B. S. Haldane, “On a Method of Estimating Frequencies,” Biometrika, vol. 33, pp. 222–225, 1945.
[7] I. J. Good, The Estimation of Probabilities: An Essay on Modern Bayesian Methods. Massachusetts: The MIT Press, 1965.
[8] R. Mee, “Confidence bounds for the difference between two probabilities,” Biometrics, vol. 40, pp. 1175–1176, 1984.
[9] O. Miettinen and M. Nurminen, “Comparative analysis of two rates,” Statistics in Medicine, vol. 4, pp. 213–226, 1984.
[10] G. Berry and P. Armitage, “Mid-P confidence intervals: A brief review,” The Statistician, vol. 44, no. 4, pp. 417–423, 1995.
[11] A. Agresti and B. A. Coull, “Approximate is better than ‘exact’ for interval estimation of binomial proportions,” The American Statistician, vol. 52, pp. 119–126, 1998.
[12] A. Agresti and B. Caffo, “Simple and effective confidence intervals for proportions result from adding two successes and two failures,” The American Statistician, vol. 54, pp. 280–288, 2000.
[13] B. P. Carlin and T. A. Louis, Bayesian Methods for Data Analysis. Florida: Chapman & Hall/CRC, 1996.
[14] J. F. Reed III, “Improved confidence intervals for the difference between two proportions,” Journal of Modern Applied Statistical Methods, vol. 8, no. 1, pp. 208–214, 2009.
[15] D. B. Rubin, “A noniterative sampling/importance resampling alternative to the data augmentation algorithm for creating a few imputations when fractions of missing information are modest: The SIR algorithm. Discussion of tanner and wong,” Journal of the American Statistical Association, vol. 85, pp. 543–546, 1987.
[16] D. B. Rubin, Using the SIR algorithm to simulate posterior distributions. In J. M. Bernardo, M. H. DeGroot, D. V. Lindley, and A. F. Smith. Bayesian Statistics, Clarendon, Oxford, 1988.
[17] C. A. Rohde, Introductory Statistical Inference with the Likelihood Function, 1st ed. London: Springer, 2014.