Setting
Let (Xi)i≤N be a set of i.i.d. random variables, with Xi mapping to some interval [a,b].
Let Yk:N be the kth order statistic of this set and v∈[a,b].
Denote by fX,FX the continuous pdf and the continuous CDF of Xi and by fYk:N the pdf of Yk:N
Quantity of interest
I am interested in the truncated expectation of the order statistic E[Yk:N|Yk:N>v].
This can be written as E[Yk:N|Yk:N>v]=∫∞vyfYk:N(y)dy∫∞vfYk:N(y)dy.
Conjecture
Computing this quantity in MATLAB, suggests that
E[Yk:N|Yk:N>v]→N→∞v.
Also my intuition is in line with this conjecture: For growing N, the support of fYk:N shrinks to a small region and we can predict E[Yk:N|Yk:N>v] better. Furthermore, the probability of the next value being close to v is large.
However, I am missing a formal proof.
Any ideas?
Answer
We need to assume something. Assume E|X|<∞ and F(v) is increasing, such that for all u>v, F(u)>F(v)
For u>v we have,
P(Yk:n>u|Yk:n>v)=P(Yk:n>u)P(Yk:n>v).
Now P(Yk:n>x) is asking for the probability that out of n tries at most k−1 of the Xi is below or equal to x. So if Nn,x∈Bin(F(x),n) (binomial distributed) we have,
P(Yk:n>x)=P(Nn,x<k).
Now this probability is decreasing in x and it is not hard to see that we can write for a fixed k,
P(Nn,x<k)=C(x,n)nk−1(1−F(x))n−k, with $C(x,n)
\frac{P(Y_{k:n}>u)}{P(Y_{k:n}>v)} = \frac{P(N_{n,u}
$$ if F(v)>0, with 0≤p<1 due to the fact that F(v) is monotonically increasing at v. Hence, this goes to zero as n goes to infinity. If F(v)=0, then we note, E(Yk:n|Yk:n>v)=E(Yk:n) and it enough to observe that still we have $C(x,n)
P(Y_{k:n}>u) = P(N_{n,u}
This shows the most probability mass lies at v so expectation over any finite region above an u will have a value that goes to zero and because of E|X| is finite, the tail goes to zero and we are left with essentially a delta measure on v and the expectation is indeed v.
No comments:
Post a Comment