Power and sample size calculations for Poisson and zero-inflated Poisson regression models

Nabil Channouf, Marc Fredette, Brenda Macgibbon

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Although sample size calculations for testing a parameter in the Poisson regression model have been previously done, very little attention has been given to the effect of the correlation structure of the explanatory covariates on the sample size. A method to calculate the sample size for the Wald test in the Poisson regression model is proposed, assuming that the covariates may be correlated and have a multivariate normal distribution. Although this method of calculation works with any pre-specified correlation structure, the exchangeable and the AR(1) correlation matrices with different values for the correlation are used to illustrate the approach. The method used here to calculate the sample size is based on a modification of a methodology already proposed in the literature. Rather than using a discrete approximation to the normal distribution which may be much more problematic in higher dimensions, Monte Carlo simulations are used. It is observed that the sample size depends on the number of covariates for the exchangeable correlation matrix, but much more so on the correlation structure of the covariates. The sample size for the AR(1) correlation matrix changes less substantially as the dimension increases, and it also depends on the correlation structure of the covariates, but to a much lesser extent. The methodology is also extended to the case of the zero-inflated Poisson regression model in order to obtain analogous results.

Original languageEnglish
Pages (from-to)241-251
Number of pages11
JournalComputational Statistics and Data Analysis
Volume72
DOIs
Publication statusPublished - Apr 2014

Fingerprint

Sample Size Calculation
Poisson Regression
Poisson Model
Correlation Structure
Covariates
Regression Model
Siméon Denis Poisson
Sample Size
Normal distribution
Correlation Matrix
Zero
Calculate
Wald Test
Multivariate Normal Distribution
Methodology
Discrete Approximation
Higher Dimensions
Testing
Gaussian distribution
Monte Carlo Simulation

Keywords

  • AR(1)
  • Correlation structure
  • Exchangeable
  • Generalized linear models
  • Monte Carlo simulations
  • Wald test

ASJC Scopus subject areas

  • Computational Mathematics
  • Computational Theory and Mathematics
  • Statistics and Probability
  • Applied Mathematics

Cite this

Power and sample size calculations for Poisson and zero-inflated Poisson regression models. / Channouf, Nabil; Fredette, Marc; Macgibbon, Brenda.

In: Computational Statistics and Data Analysis, Vol. 72, 04.2014, p. 241-251.

Research output: Contribution to journalArticle

@article{5e75d451f0054f6e8b70a314d5d6b209,
title = "Power and sample size calculations for Poisson and zero-inflated Poisson regression models",
abstract = "Although sample size calculations for testing a parameter in the Poisson regression model have been previously done, very little attention has been given to the effect of the correlation structure of the explanatory covariates on the sample size. A method to calculate the sample size for the Wald test in the Poisson regression model is proposed, assuming that the covariates may be correlated and have a multivariate normal distribution. Although this method of calculation works with any pre-specified correlation structure, the exchangeable and the AR(1) correlation matrices with different values for the correlation are used to illustrate the approach. The method used here to calculate the sample size is based on a modification of a methodology already proposed in the literature. Rather than using a discrete approximation to the normal distribution which may be much more problematic in higher dimensions, Monte Carlo simulations are used. It is observed that the sample size depends on the number of covariates for the exchangeable correlation matrix, but much more so on the correlation structure of the covariates. The sample size for the AR(1) correlation matrix changes less substantially as the dimension increases, and it also depends on the correlation structure of the covariates, but to a much lesser extent. The methodology is also extended to the case of the zero-inflated Poisson regression model in order to obtain analogous results.",
keywords = "AR(1), Correlation structure, Exchangeable, Generalized linear models, Monte Carlo simulations, Wald test",
author = "Nabil Channouf and Marc Fredette and Brenda Macgibbon",
year = "2014",
month = "4",
doi = "10.1016/j.csda.2013.09.029",
language = "English",
volume = "72",
pages = "241--251",
journal = "Computational Statistics and Data Analysis",
issn = "0167-9473",
publisher = "Elsevier",

}

TY - JOUR

T1 - Power and sample size calculations for Poisson and zero-inflated Poisson regression models

AU - Channouf, Nabil

AU - Fredette, Marc

AU - Macgibbon, Brenda

PY - 2014/4

Y1 - 2014/4

N2 - Although sample size calculations for testing a parameter in the Poisson regression model have been previously done, very little attention has been given to the effect of the correlation structure of the explanatory covariates on the sample size. A method to calculate the sample size for the Wald test in the Poisson regression model is proposed, assuming that the covariates may be correlated and have a multivariate normal distribution. Although this method of calculation works with any pre-specified correlation structure, the exchangeable and the AR(1) correlation matrices with different values for the correlation are used to illustrate the approach. The method used here to calculate the sample size is based on a modification of a methodology already proposed in the literature. Rather than using a discrete approximation to the normal distribution which may be much more problematic in higher dimensions, Monte Carlo simulations are used. It is observed that the sample size depends on the number of covariates for the exchangeable correlation matrix, but much more so on the correlation structure of the covariates. The sample size for the AR(1) correlation matrix changes less substantially as the dimension increases, and it also depends on the correlation structure of the covariates, but to a much lesser extent. The methodology is also extended to the case of the zero-inflated Poisson regression model in order to obtain analogous results.

AB - Although sample size calculations for testing a parameter in the Poisson regression model have been previously done, very little attention has been given to the effect of the correlation structure of the explanatory covariates on the sample size. A method to calculate the sample size for the Wald test in the Poisson regression model is proposed, assuming that the covariates may be correlated and have a multivariate normal distribution. Although this method of calculation works with any pre-specified correlation structure, the exchangeable and the AR(1) correlation matrices with different values for the correlation are used to illustrate the approach. The method used here to calculate the sample size is based on a modification of a methodology already proposed in the literature. Rather than using a discrete approximation to the normal distribution which may be much more problematic in higher dimensions, Monte Carlo simulations are used. It is observed that the sample size depends on the number of covariates for the exchangeable correlation matrix, but much more so on the correlation structure of the covariates. The sample size for the AR(1) correlation matrix changes less substantially as the dimension increases, and it also depends on the correlation structure of the covariates, but to a much lesser extent. The methodology is also extended to the case of the zero-inflated Poisson regression model in order to obtain analogous results.

KW - AR(1)

KW - Correlation structure

KW - Exchangeable

KW - Generalized linear models

KW - Monte Carlo simulations

KW - Wald test

UR - http://www.scopus.com/inward/record.url?scp=84890560773&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84890560773&partnerID=8YFLogxK

U2 - 10.1016/j.csda.2013.09.029

DO - 10.1016/j.csda.2013.09.029

M3 - Article

VL - 72

SP - 241

EP - 251

JO - Computational Statistics and Data Analysis

JF - Computational Statistics and Data Analysis

SN - 0167-9473

ER -