Suppose you have a fair N-sided die. You decide to roll it until M unique values have been produced (i.e. you re-roll all previously rolled values). How many times will you roll the die? (Given 2<=M<=N.)
I know that for the special case of M=2 it's simply a matter of how many times you have to re-roll your attempt for the second value, so the distribution is: PN2(X=u)=(1N)u−2(1−1N)=N−1Nu−1
And that for any M the probability of the lowest possible outcome X=M (i.e. no re-rolls): PNM(X=M)=M−1∏i=0(1−iN)=1NMM−1∏i=0(N−i)=N!NM(N−M)!
The final clue I've got is that the probability distributions for subsequent values of M can be defined using the probability distribution of the previous, like so:
PNM(X=u)=u−M+1∑i=1(PNM−1(X=u−i)(M−1N)i−1(1−M−1N))
With that I can determine the probability distribution for any value of M I want, for instance M=3:
PN3(X=u)=u−3+1∑i=1(PN2(X=u−i)(3−1N)i−1(1−3−1N))
=u−2∑i=1((N−1Nu−i−1)(2N)i−1(1−2N))
=u−2∑i=1((N−1Nu−1)Ni(N2)(2N)i(N−2N))
=(N−1)(N−2)2⋅Nu−1u−2∑i=1(2i)
=(N−1)(N−2)Nu−1u−1∑i=0(2i)
=((N−1)(N−2)Nu−1)(1−2u1−2)
=(N−1)(N−2)(2u−1)Nu−1
However, I have no idea how to turn this into a generic formula that will allow me to calculate the probability for any N, M, and u without going through the process of figuring out the PMF of every value of M leading up to the one I want.
No comments:
Post a Comment