Suppose we have a fair coin. The event Heads will be marked as $H$ and Tails as $T$. So we have, for an arbitrary flip
$$P(H) = 1/2 \\
P(T) = 1/2$$
We wish to calculate the probability of flipping the coin twice and obtaining either
- both tails
- both heads
- one heads, one tails $(\star)$
Now, some of my confusion begins. I'll try to keep a numbered list of the concerns I have.
- The problem as stated is ambiguous. For the bullet marked by $(\star)$ it's unclear whether I mean the sequence $(H, T)$, with respect to time, or the unordered pair $\{T, H\}$. Clearly these things have different probabilities.
Regarding my confusion in (1), the author of the question could easily make precise what is meant (as I did in my statement of confusion). However, the original ambiguity leads me to what I think is my deeper confusion on the definition of the probability of independent events. For example—and by Wikipedia's definition—if events $A$ and $B$ are independent, then their joint probability is the product of their individual probabilities. In other words:
$$P\left(A\cap B\right) = P\left(A\right)P\left(B\right)$$
So, this definition seems to correspond to the probability of flipping our coin and getting one of the sequences: $TH$, $HT$. Or we could discard time considerations and consider $HT = TH$, more formally in set notation as $\{T, H\} = \{H, T\}$, then add $P(HT) + P(TH)$ to get $P(H\cap T)$...?
- I think I should normally interpret independent probability questions like this as irrespective of time or sequence. Except in the sense that the number of sequences that are "unordered equivalent" to the unordered pair in question, are the "weight" of the unordered pair.
Okay, that might have been a confusing but it's because I'm confused. Maybe what I mean can be made clear by building what I think is the full situation of the coin flipping, up to two flips.
\begin{align}
P(T) &= 0.5\\
P(H) &= 0.5 \\
P(TT) = P(T \cap T) &= 0.25 \\
P(HH) = P(H \cap H) &= 0.25 \\
P(HT) = P(H)P(T) = P(T)P(H) = P(TH) &=0.25 \\
P(T \cap H) = P(HT) + P(TH) &=0.5
\end{align}
Where concatenated letters refer to the sequence and the set operator refers to the unordered pair (or set).
- Is this notation consistent?
And lastly, I was motivated to ask these questions primarily by a problem I read in Data Science from Scratch, which essentially asks the follow up question: what is the probability of getting two Heads given that the first flip was Heads?
The answer is intuitively $0.5$—the problem becomes only dependent on the second, yet-to-be-flipped coin, but how does the notation work? To set it up, based on the very definition provided by the text, we have
\begin{align}
P(H \cap H | H) = \frac{P
\left((H \cap H)\cap H\right)}{P(H)} = \frac{P(H)P(H)P(H)}{P(H)}
\end{align}
Which doesn't make sense, because there is not third flip of the coin, and the second flip is independent of the first. But if we think of the coin flipping, and conditional probability in terms of a sequence we could write
\begin{align}
P(HH | H) = \frac{P
\left((HH)\cap H\right)}{P(H)} = \frac{P(H)P(H)P(H)}{P(H)}
\end{align}
But ends up being the same thing?
- I'm missing something here, whether it's notation, definition(s), or conceptually. Will you please guide me, in a rigorous way, to the intuitive result that: given one heads, the probability of getting two is $1/2$?
Thank you!
No comments:
Post a Comment