Thursday, 22 May 2014

hypothesis testing - What is the probability that dice are rigged?

We are given a single six-sided die. We roll it $n$ times, and every roll comes up six. We do not know whether the die is fair or not.



How can we calculate the probability that the die is loaded or rigged given $n$ rolls of six?




In other words, what is the probability $q$ that the null hypothesis is false, given a series of $n$ successful Bernoulli trials?





I heard a non-mathematician say that the probability of one six on a fair die is $\frac{1}{6}$, and so the probability of rolling 4 sixes in a row on a fair die is $\frac{1}{6^4} = \frac{1}{1296}$. So far, so good.



But then, he said that the probability that the die is not loaded is $\frac{1}{1296}$, and so the probability that the die is loaded is $\frac{1295}{1296}$.



This does not add up to me. By the same logic, if I roll the die once and get six, the probability that the die is loaded is $\frac{5}{6}$, which cannot be true. You don't call a person a cheat for rolling a six once.
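The arithmetic behind this objection can be checked directly. The sketch below is only an illustration: the helper names `prob_all_sixes_if_fair` and `claimed_prob_loaded` are made up here, and the second one encodes the disputed rule from the conversation, not a valid calculation.

```python
from fractions import Fraction

P0 = Fraction(1, 6)  # probability of a six on a fair die


def prob_all_sixes_if_fair(n: int) -> Fraction:
    """P(n sixes in n rolls | die is fair)."""
    return P0 ** n


def claimed_prob_loaded(n: int) -> Fraction:
    """The disputed rule: 1 - P(data | fair), read as if it were P(loaded | data)."""
    return 1 - prob_all_sixes_if_fair(n)


for n in (1, 4):
    print(n, prob_all_sixes_if_fair(n), claimed_prob_loaded(n))
```

For $n = 1$ the disputed rule gives $\frac{5}{6}$, and for $n = 4$ it gives $\frac{1295}{1296}$, matching the figures above. The quantity being computed is the probability of the data given a fair die, which is not the same thing as the probability of a fair die given the data.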






I think that to answer this question, I have to use the binomial distribution somehow, since:




  • the probability of a six, fair or not, remains constant and equal to $p$

  • I am only interested in success/failure




At this point, I get lost. The problem is that I only know the probability for the null hypothesis $p_0 = \frac16$, and I don't know what the actual value for $p$ is. I don't know where to go from here.



Am I asking the wrong question? Must I set a significance level $\alpha$? If so, should I set $\alpha = 0.05$, or $\alpha = 0.01$? I apologize for any incorrect terminology. I am a computer programmer, not a statistician or mathematician.
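To make the role of $\alpha$ concrete, here is a minimal sketch of a one-sided exact test of $p = p_0$ against $p > p_0$. It assumes the observed outcome is $n$ sixes in $n$ rolls, which is the most extreme outcome possible, so the p-value is simply $p_0^n$. The function names are hypothetical, and the result says only when the fair-die hypothesis would be rejected at level $\alpha$, not how probable it is that the die is loaded.

```python
P0 = 1 / 6  # probability of a six under the null hypothesis (fair die)


def p_value_all_sixes(n: int) -> float:
    """One-sided p-value for n sixes in n rolls: P(X >= n | p = P0) = P0 ** n."""
    return P0 ** n


def min_rolls_to_reject(alpha: float) -> int:
    """Smallest n for which n straight sixes give a p-value below alpha."""
    n = 1
    while p_value_all_sixes(n) >= alpha:
        n += 1
    return n


for alpha in (0.05, 0.01):
    print(alpha, min_rolls_to_reject(alpha))
```

Under these assumptions, two straight sixes are already enough to reject at $\alpha = 0.05$ (p-value $\frac{1}{36} \approx 0.028$), and three are enough at $\alpha = 0.01$ (p-value $\frac{1}{216} \approx 0.0046$).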



Edit: It looks like I have to specify how badly the die must be loaded before I call it unfair. Suppose I say that rolling a six has to be at least $r = 10\%$ more likely than on a fair die (i.e. $p \ge p_0\cdot\left(1 + r\right) = \frac{11}{60}$) before I call it rigged?
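One way to turn this threshold into an actual probability that the die is loaded is Bayes' rule, sketched below under two loud assumptions that are not part of the question. First, a loaded die is taken to have exactly $p_1 = \frac{11}{60}$, the boundary of the range above, instead of any $p \ge \frac{11}{60}$. Second, a prior probability `prior_loaded` that the die is loaded is supplied by hand. The function name `posterior_loaded` is hypothetical.

```python
from fractions import Fraction

P0 = Fraction(1, 6)   # probability of a six on a fair die
R = Fraction(1, 10)   # "at least 10% more likely" threshold
P1 = P0 * (1 + R)     # assumed loaded-die probability: 11/60


def posterior_loaded(n: int, prior_loaded: Fraction) -> Fraction:
    """P(loaded | n sixes in n rolls) by Bayes' rule, for two simple hypotheses."""
    likelihood_fair = P0 ** n
    likelihood_loaded = P1 ** n
    numerator = prior_loaded * likelihood_loaded
    return numerator / (numerator + (1 - prior_loaded) * likelihood_fair)


# Assumed prior: loaded and fair are equally likely before any rolls.
print(P1, posterior_loaded(4, Fraction(1, 2)))
```

With an even prior and four straight sixes this gives $\frac{14641}{24641} \approx 0.594$, far from $\frac{1295}{1296}$. The answer depends heavily on both assumptions: a more extreme loaded probability or a different prior would change it substantially.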
