Monday, April 08, 2013
DIPS and BABIP
Pizza gives it a once-over. For those new to the whole debate, you may find this piece interesting (solvingdips.pdf).
The issue with focusing on BABIP is that it's half of a bigger equation. If you do BABIP, then you MUST do SLGBIP (or, basically, just do wBABIP, which is the wOBA version of BABIP). On top of which, you must include DP (and reach on error... anyone who ignores reach on error is doing it wrong).
A GB pitcher has a higher BABIP than a FB pitcher. However, for each BIP, the FB pitcher will give up more extrabase hits. And for each BIP, the GB pitcher will get more DP. In the end, what we REALLY care about is the run value. And the run value is identical for GB and FB (excluding HR).
In order to get a balanced view, you need to look at run value per BIP. And the significance there will shrink. This is why something as simple as K minus BB+HB-IB per PA works. It's because whatever truth you do find in BABIP, there's a counter-truth that reduces its significance.
?
MGL asks a question in the comments, and Pizza responds:
As far as I can tell, Pizza IS talking about regression, just as MGL is positing.
Furthermore, his last paragraph I find incredibly hard to believe. Pizza is right out saying that given a pitcher with a .240 BABIP in his last 100 BIP, that given an either/or choice, it’s better to describe him as a .240 pitcher than a .300 pitcher.
In my best guess, I’d call such a pitcher a true .297 or .298 pitcher. Maybe I’m wrong, and we can say he’s a .295 pitcher. I don’t believe any answer below a .290. This would set the regression point of r=.50 at BIP=500.
And to set it so that Pizza would claim he’s more likely a true .240 than a true .300, that would set r=.50 at BIP=99 (or lower).
No way.
Either that, or I have no idea what Pizza did in his article to suggest his claim.