Hacker News new | past | comments | ask | show | jobs | submit login

The probability that he would make that statement given what his children are is a different question than the probability that the other child is also a boy given that he made that statement. Your conditional probabilities are correct, but your conclusion about what they mean for the original question is flawed.

    N = Total number of ways to have 2 children over 7 days = 14^2 = 196
    B = Two sons born on Tuesday.
    O = Exactly one son born on Tuesday.
    A = At least one son born on Tuesday. 
    T = Two sons.
    S = The statement.
Priors:

    P(B) = 1/N = 1/196 = 0.005
    P(O) = 26/N = 26/196 = 0.133
    P(A) = P(B) + P(O) = 27/N = 27/196 = 0.138 
Your conditional probabilities:

    P(S|O) = .5  
    P(S|B) = 1  
An interesting number we can infer from your conditionals is the probability that a father selected at random would make the statement:

    P(S) = P(S|B)P(B) + P(S|O)P(O) = 1.0 * 0.005 + 0.5 * 0.133 = 0.0715 
But the question we asked about the other child already takes into account the fact that he did make that statement, meaning we're back to only caring about those 27 cases:

    P(B|S) = 1/27 = 0.037
    P(O|S) = 26/27 = 0.963
    P(A|S) = 27/27 = 1.0
    P(T|S) = 13/27 = .481 
Another interesting number we can infer from your conditionals is the probability that a father would make the statement given that at least one of his children was a boy born on Tuesday:

    P(S|A) = P(S|B)P(B|A) + P(S|O)P(O|A) = 1.0 * 0.037 + 0.5 * 0.963 = 0.519
If you still think this is incorrect, can you point to exactly which number is wrong and explain why?



The problem I have is that you're assuming that each of the 27 outcomes has equal probability, but the chance that we received the information in the way we did makes that a flawed assumption.

The best analogous problem is the German Tank Problem:

http://en.wikipedia.org/wiki/German_tank_problem

If we have destroyed a single German tank with a serial number 100, we can at least to begin to make an estimate on the size of the German force, by basically asking the question:

"If they have 200 tanks, what was the chance one we randomly killed was this serial number? 500? 1000?"

And then combining n=100->infinity to form a probability distribution. You can then say that there is an x% chance that Germany has 500 tanks, and a y% chance that Germany has 10,000 tanks.

However - if instead, we asked 'does there exist a German tank with a serial number 100', and the answer is yes, this does NOT tell us anything past the fact that their tanks are >= 100 in number.

We have the exact same information, but how it was determined changes the outcome drastically.

Does that make sense?


No, it doesn't make sense, because it doesn't apply here. We aren't estimating the count of anything. We know he has two kids, we know there are two genders, and we know there are seven days. All other relevant counts can be calculated directly from these, no estimation required.

I already showed the probability that we would receive the message the way we did, assuming we're sampling fathers with two children, and its pretty low. If we drop the sampling assumption, it would go even lower. But that's irrelevant to the actual question, because we've already won that lottery. I've also shown the probability that we would get the statement we got given that the father had at least one son born on a Tuesday, but again, we already won that lottery.

If you still insist, can you please stop talking in hand-wavy fake math and show some actual concrete numbers? To start, if each of the 27 possibilities are not equally likely, what are the actual probabilities and why?




Consider applying for YC's W25 batch! Applications are open till Nov 12.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: