Saturday, 29 April 2006

Probably incorrect probability

A recent post on the blog Blackademic about the alleged rape of a black woman by white Duke university lacrosse players generated a flurry of comments. One of them anonymously argued:
"Unfortunately, statistically a black women is significantly more likely to make a false accusation of rape than to have been raped by a white man. According to the National Crime Victimization Survey ( http://www.ojp.usdoj.gov/bjs/pub/pdf/cvus/current/cv0342.pdf ), less than .0004% of black rape victims were raped by whites. (The NCVS reports the percentage as 0% because there were less than 10 reported cases. I assumed 9 cases, to come up with an actual percentage) Even with the most conservative figure of 2% of rape allegations being false, this means in the case of the Duke Rape Case, the victim is 5000 times more likely to have made a false accusation than to have actually been raped."
There were some perplexed responses to this dramatic claim:
  • "yeah, cuz stats and figures are ALWAYS correct--whatever. it depends on who did the survery and for whom."
  • "How did you come up with the 5000 times more likely figure? That makes no sense at all. Using the figures you cited, the victim regrdless of race is likely to be lying only 2% of the time."
  • "Even if this study were accurate, and even if it were ethical to invoke the laws of probability to determine whether someone is believable-- two outsized ifs-- one coin, landing heads up, doesn't determine the likelihood of the next coin landing heads up. Neither does one woman, 30 years ago, have any bearing on the likelihood that another woman is telling the truth."
  • "the statistics and logic are just that - excercises in probability that tell us nothing about the case in question, because they are not equal to evidence."
But I think these responses miss the point: as far as I can see, the claim is simply incorrect. I think my reasoning is correct, but if I've slipped up please leave a comment.

First, Anonymous claimed that "less than .0004% of black rape victims were raped by whites." I followed the link to the National Crime Victimization Survey to check on this. The total number of rapes or sexual assaults of blacks listed was 24,010 and based on "about 10 or fewer sample cases" the perceived offender was white 0.0% of the time. I'm not entirely sure what this means, but Anonymous reasoned that as many as 9 of the offenders might be white. Now 9 out of 24,010 is about 0.04%, not 0.0004%. Anonymous then introduces "the most conservative figure of 2% of rape allegations being false". Dividing 2% by the incorrect figure of 0.0004%, Anonymous claims that "the victim is 5000 times more likely to have made a false accusation than to have actually been raped". If we divide 2% by the correct figure of 0.04%, we get 50 not 5000!

But apart from this error, the interpretation of the ratio is wrong. For it to be right, we would have to know the probability that the woman was raped. But that's not what the 0.04% represents. Instead, it's an estimate of the probability that if a black person is raped, the offender is white.

So the whole thing is invalid. The problem isn't with probabilistic reasoning per se, it's with faulty probabilistic reasoning. And that's a shame when something so important is at stake.

No comments:

Post a Comment