Short version:
I would like to calculate the expected value if you apply the sigmoid function 11+e−x to a normal distribution with expected value μ and standard deviation σ.
If I'm correct this corresponds to the following integral:
∫∞−∞11+e−x1σ√2π e−(x−μ)22σ2dx
However, I can't solve this integral. I've tried manually, with Maple and with Wolfram|Alpha, but didn't get anywhere.
Some background info (why I want to do this):
Sigmoid functions are used in artificial neural networks as an activation function, mapping a value of (−∞,∞) to (0,1). Often this value is used directly in further calculations but sometimes (e.g. in RBM's) it's first stochastically rounded to a 0 or a 1, with the probabililty of a 1 being that value. The stochasticity helps the learning, but is sometimes not desired when you finally use the network. Just using the normal non-stochastic methods on a network that you trained stochastically doesn't work though. It changes the expected result, because (in short):
E[S(X)]≠S(E[X])
for most X. However, if you approximate X as a normal distribution and could somehow calculate this expected value, you could eliminate most of the bias. That's what I'm trying to do.
Answer
I doubt that there's a closed-form solution. However, here's a series in powers of σ:
(e−μ+1)−1+(e−μ−1)e−μ2(e−μ+1)3σ2+(e−3μ−11e−2μ+11e−μ−1)e−μ8(e−μ+1)5σ4+e−μ(e−5μ−57e−4μ+302e−3μ−302e−2μ+57e−μ−1)48(e−μ+1)7σ6+e−μ(e−7μ−247e−6μ+4293e−5μ−15619e−4μ+15619e−3μ−4293e−2μ+247e−μ−1)384(e−μ+1)9σ8+O(σ10)
EDIT: To obtain this, first do the change of variables x=μ+σt. The
integral becomes
1√2π∫∞−∞e−t2/21+e−μ−σt dt
Now take the Maclaurin series 11+e−μ−σt=11+e−μ+e−μσt(1+e−μ)2+e−μ(e−μ−1)σ2t2(1+e−μ)3+…
and integrate term by term.
No comments:
Post a Comment