Wednesday, March 11, 2015
cFIP
?Jonathan Judge was kind enough to send me his original article several weeks back. He was also kind enough to include a couple of suggestions, notably kwERA. The whole research is very interesting. My takeaway though is a bit different from his: given 170 batters faced, just doing K minus BB per PA is about as good as anything you can create.
(I have to question some of the correlation results he got with the "all" pitchers insofar as kwERA is concerned. It just doesn't seem possible to get some of those results. However, I will concede that something like cFIP would be better for those pitchers who had fewer than 40 innings.)
I had a back and forth with Jonathan on this:
Thus, if we are looking for an accurate estimator of pitcher ability, what we should be considering is not how the estimator predicts future run expectancy, but how the estimator correlates with itself in consecutive seasons.
I told him he was wrong. He felt fairly strongly he was right. However, I do agree with him that a stat that does both well in terms of "descriptive" and "predictive" would make the overall point moot. Anyway, the point I was making is that you have to correlate to what you care about, which in this case is runs. It's (mostly) irrelevant that runs is a pitcher+fielder outcome.
Anyway, it's terrific research, and I'm glad that Jonathan spent as much time as he did in doing the research and presenting the work. He was very enjoyable to communicate with, even if my style could have easily turned him off. So, I thank him as well for continuing the dialogue.
This was the “pre-thread” I had on the topic of self-correlation:
http://tangotiger.com/index.php/site/comments/do-we-care-about-a-metric-correlating-with-itself-or-with-the-thing-were-ac