Saturday, 2 August 2014

statistics - Variance stabilization for Poisson data

Intro

Let Z>0 be a random variable with the mean and variance defined as E{Z} and Var{Z}, respectively. The variance stabilization transform (VST) f(z) turns heteroskedastic data z to homoskedastic data f(z) with constant variance, e.g., variance equals 1.



Poisson distribution
For Poisson distributed data with E{Z}=Var{Z}=λ this VST, so-called Anscombe transformation, is given by [1,2]:



f(z)=2z+3/8



Based on the first order Taylor expansion we can write (this is called Delta method in the literature) [3]:



Var{f(z)}(dfdz|z=E{Z})2Var{Z}=Var{Z}E{Z}+3/8




Problem
I performed a Monte Carlo simulations to compare sample variance of the stabilized data f(z) and the variance obtained by the above equation, i.e., Var{f(z)} both numerically as a sample variance and theoretically as follows:



Var{f(z)}λλ+3/8



Moreover, I went further and derived second order approximation for Var{f(z)}.



There is a mismatch between variance of stabilized data f(z) (green curve) and those obtained theoretically and numerically by means of the Var{f(z)}. Can anyone explain me this inconsistency?




enter image description here




  1. Anscombe, F. J. (1948), "The transformation of Poisson, binomial and negative-binomial data", Biometrika 35 (3–4): 246–254, doi:10.1093/biomet/35.3-4.246, JSTOR 2332343

  2. http://en.wikipedia.org/wiki/Anscombe_transform

  3. Kendall's Advanced Theory of Statistics: Volume 1: Distribution Theory by Alan Stuart and Keith Ord (Apr 20, 2009), page. 351, eq. 10.14

No comments:

Post a Comment

real analysis - How to find limhrightarrow0fracsin(ha)h

How to find lim without lhopital rule? I know when I use lhopital I easy get $$ \lim_{h\rightarrow 0}...