Friday, 31 January 2014

numerical methods - Approximating the normal CDF for 0leqxleq7




In answer https://stackoverflow.com/a/23119456/2421256, an approximation of the complementary normal CDF (ie 12π+xet22dt) was given for 0x7 using a rational function as follows :
12πex22N0+N1x+...+N6x6M0+M1x+...+M7x7



How can one derive such an approximation,without relying on an already implemented approximation (ie no methods based on curve fitting), and a bound for the absolute error ?



EDIT : I added a bounty because I believe this question is not at all trivial.



EDIT 2 : I'm adding here what I've been able to do for now. Indeed, as Ian pointed out, for 0x2 a good approximation of the normal CDF can be derived easily, by using a Taylor series expansion of et22 and integrating it term by term between 0 and x.



For 2x7, I proved, using Mill's inequality and the fact that +xet22dtxex22 is increasing in x, that P(Z>x)xex2298π

, which yields an error smaller than 2.39×103, but it is far from the precision yielded by the rational approximation above (around 1016, checked numerically since I don't have a rigourous bound for it).




EDIT 3 : if it's of help, the target error is something smaller than 109.


Answer



This isn't a definitive answer, but hopefully points you in the right direction. The short of it is, I don't know how the coefficients were derived, but it is likely related to some Pade, Remez, interpolating, or least-squares approximant. Read on for details.



The function we're talking about is
Q(x)=12πxet2/2dt=12erfc(x2),


for x>0, where erfc is the complimentary error function.




The link you provide gives code for an approximation of the following form, slightly different than how it was stated in the original question (note the lack of normalization by 2π and a different form of the polynomials).
˜Q(x)=ex22(((((N6z+N5)z+N4)z+N3)z+N2)z+N1)z+N0((((((M7z+M6)z+M5)z+M4)z+M3)z+M2)z+M1)Z+M0


Why do we pick this form? Well, asymptotically, the Q function is well approximated by ex2/2x2π as x. See here for details on that. This justifies factoring out the asymptotic behavior and looking for a rational approximation of what remains. If we factored out the asymptotic form, we could approximate what remains with something that goes to 1 for large x, i.e., something with equal order in the numerator and denominator. If we leave that extra factor of x in the denominator from the asymptotic piece, we can account for it by absorbing it into the denominator of the rational function, giving one higher degree in the denominator polynomial. The same goes for the factor of 2π, it gets absorbed in to the approximation.



Now, how on earth did Hart come up with those Ni and the Mj coefficients? I thought it was by requiring that the Taylor series of ˜Q(x)ex2/2 matches that of Q(x)ex2/2 at some a point, up to a high enough order to uniquely determine all the coefficients. This is the Pade approximation technique, and you should be able to find more about the error bounds for approximations resulting from this technique. I used Mathematica to calculate these below.



I thought that surely, if you calculate that Pade approximant around x=0, and scale the coefficients to match this weird M0=440.413735824752 (Pade approximants are usually normalized so that the constant term in the denominator is 1) you'd get the formula. Instead, I get:




iNiMi0220.207440.4141152.909711.438295.568525.241323.168234.06644.633661.981350.3475210.266160.1544531.0479970.048886



It's a similar pattern to Hart's numbers, and the values are all in the same ballpark as Hart's numbers, but it is not a match. Then I thought that there is this weird breakpoint in the code that appears to be 1027.07. This lead me down the path of looking to see if the Pade approximant near 102 or at the halfway point of the interval, 1022, appropriately scaled, gave Hart's original coefficients. They don't.



So here are some possibilities.





  1. Hart's coefficients are for a Pade approximant around SOME point on the interval [0,10/2], and I don't have the patience to find which point.

  2. Hart used some other method to generate these coefficients, and they don't correspond to a Pade approximant at all. One candidate is the Remez algorithm for rational functions, which should guarantee the error oscillates equally across the domain of approximation. Another candidate is that the coefficients are derived from an interpolation between some fixed points with known values of the domain. Yet another is a least-squares fit between the rational approximant a a bunch of known values of the function.



This has been a long winded answer, and boils down to "I don't know". But I hope the techniques I mentioned can be fruitful avenues for you to continue looking into this. I would finally suggest getting the original "Algorithm 5666 for the error function" and seeing if there is an explanation of the methodology in book in which it appears.


No comments:

Post a Comment

real analysis - How to find limhrightarrow0fracsin(ha)h

How to find limh0sin(ha)h without lhopital rule? I know when I use lhopital I easy get $$ \lim_{h\rightarrow 0}...