I’m going to introduce a few new stats for our sortable reports and cards in just a moment, but first I’d like to talk about what kinds of things we’re looking to add and what our decision process looks like.
Our readers really like baseball stats, but they like looking at ones that matter. It’s our job to make sure that our statistics matter, that they communicate something meaningful, and that they do it in a useful way.
For us to add a stat, it doesn’t have to be world-shaking. It can merely be interesting (and some people may not even agree on that). But it does have to meet a few criteria:
- It needs to be different enough from our existing metrics to provide added value. Perhaps not substantively—it can say the same thing, so long as the presentation is different enough that there’s some value in having both presentations.
- When a new metric disagrees with an existing metric, it’s important to make sure that we can clearly communicate why the two differ, and what the value is of having both answers. If we think one metric is provably better than the other, we use that one instead.
We plan to continue adding things to our sortable reports over the offseason, but when we do so it’ll be guided by those two considerations. Now, on to the fun stuff.
We’ve introduced two new metrics that should seem rather familiar to most of you, even if we’ve never presented them ourselves. They’re “plus” metrics, similar to OPS+ and ERA+ from Baseball Reference.
Let’s start with RPA+ (RPA_PLUS in the sortables), which is our new offensive rate stat along the lines of OPS+. What it isn’t is a departure from how we evaluate hitters currently; TAv and RPA+ will return the exact same rank order of batters. At the core of both of them is the exact same adjusted runs per plate appearance, figured using linear weights values derived from run expectancy tables. In this case, there is no right or wrong way to handle the scaling of adjusted R/PA for presentation—some people find it more intuitive to look at R/PA scaled to batting average, others to 100. (There are also technical reasons to prefer either, based upon what you’re doing with each.) Essentially, they’re two ways of looking at the same information—complementary, rather than redundant or competing.
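To illustrate the point about scaling, here is a minimal sketch. The adjusted R/PA values, the league average, and both scaling functions are assumptions made up for illustration; they are not our actual TAv or RPA+ formulas. All it shows is that two increasing linear rescalings of the same underlying rate sort batters identically.

```python
# Hypothetical adjusted runs per plate appearance for three made-up batters.
adj_rpa = {"Batter A": 0.150, "Batter B": 0.118, "Batter C": 0.095}
LEAGUE_RPA = 0.118  # assumed league-average adjusted R/PA


def to_plus_scale(rpa, league=LEAGUE_RPA):
    """Illustrative 'plus' scaling: league average maps to 100."""
    return 100.0 * rpa / league


def to_average_scale(rpa, league=LEAGUE_RPA, anchor=0.260, slope=0.9):
    """Illustrative batting-average-like scaling: league average maps to `anchor`.

    The anchor and slope are assumptions, not the published TAv formula.
    """
    return anchor + slope * (rpa - league)


def rank_order(values):
    """Return names sorted from best to worst by value."""
    return sorted(values, key=values.get, reverse=True)


plus_scale = {name: to_plus_scale(v) for name, v in adj_rpa.items()}
avg_scale = {name: to_average_scale(v) for name, v in adj_rpa.items()}

# Both presentations are increasing functions of the same input,
# so the rank order cannot differ between them.
same_order = rank_order(plus_scale) == rank_order(avg_scale)
print(rank_order(plus_scale), same_order)
```

Any pair of scalings with a positive slope would behave the same way; the choice between them is about presentation, not about who ranks ahead of whom.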
So why might you prefer RPA+ to OPS+? There are a few reasons:
- RPA+ includes a league quality adjustment. Batters in the AL face tougher pitchers, and so the same batting line in the AL means a better hitter. TAv and RPA+ capture this, while OPS+ ignores it.
- RPA+ is a better estimate of a batter’s production than OPS+, which undervalues high OBP, low SLG players and overvalues high SLG, low OBP players.
The complement to RPA+ is Fair RA+, again similar to ERA+. Unlike with RPA+, Fair RA+ will return a different order than Fair RA, because it includes park and league adjustments. Why favor Fair RA+ over ERA+?
- Fair RA separates what a pitcher has done from our estimate of his defensive support.
- Fair RA uses all runs, not just earned runs—ERA+ will overrate a groundball pitcher relative to a flyball pitcher with equivalent production.
- Fair RA+ uses a different construction to produce the scaling factor. ERA+ is the league average ERA divided by the player’s own ERA, which makes larger values better even though a lower ERA is better. We instead take Fair RA, divide by the league average, and subtract the result from two (both indexes are then scaled so that 100 is league average). This gives the same “bigger is better” property ERA+ has but keeps the units meaningful and makes it easier to work with mathematically. Patriot explains the reasons for this, but in summary, the ERA+ construction makes taking a weighted average across seasons unnecessarily complicated, and it makes the units of ERA+ essentially meaningless. A Fair RA+ of 110, for instance, means a pitcher's Fair RA is 10% better (that is to say, lower) than the league average. ERA+ doesn't behave linearly at all. Comparing the two methods, setting 4.5 as the league average:
What I've called "two-minus" is the method we are using; "inverse" is the method behind ERA+. What you find is that an additional 10 points of ERA+ means different things based upon where on the curve that additional 10 points occurs, while 10 points of Fair RA+ consistently means the same thing for any point of the scale.
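To make the comparison concrete, here is a small sketch that tabulates both constructions against a league average of 4.5. The function names are just labels for this illustration, and the sample run averages are arbitrary points on the scale, not real pitchers.

```python
LEAGUE_AVG = 4.5  # league-average runs allowed per nine, as in the text


def two_minus(ra, league=LEAGUE_AVG):
    """Fair RA+-style index: 100 * (2 - RA / league)."""
    return 100.0 * (2.0 - ra / league)


def inverse(ra, league=LEAGUE_AVG):
    """ERA+-style index: 100 * league / RA."""
    return 100.0 * league / ra


def ra_for_two_minus(index, league=LEAGUE_AVG):
    """Invert two_minus: the run average that produces a given index."""
    return league * (2.0 - index / 100.0)


def ra_for_inverse(index, league=LEAGUE_AVG):
    """Invert inverse: the run average that produces a given index."""
    return 100.0 * league / index


# Side-by-side table of the two indexes at a few run averages.
print(f"{'RA':>5} {'two-minus':>10} {'inverse':>8}")
for ra in (2.25, 3.00, 3.60, 4.05, 4.50, 4.95, 5.40):
    print(f"{ra:>5.2f} {two_minus(ra):>10.1f} {inverse(ra):>8.1f}")

# How many runs per nine does a 10-point gain represent at different spots?
for start in (100, 150):
    tm_gap = ra_for_two_minus(start) - ra_for_two_minus(start + 10)
    inv_gap = ra_for_inverse(start) - ra_for_inverse(start + 10)
    print(f"{start}->{start + 10}: two-minus {tm_gap:.3f}, inverse {inv_gap:.3f}")
```

Under these definitions, each 10 points of the two-minus index is worth 0.45 runs per nine wherever it occurs, while 10 points of the inverse index is worth about 0.41 runs going from 100 to 110 and about 0.19 runs going from 150 to 160.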
Since I’ve taken the time to point out yet again how OPS falls short of linear weights-based measures of offense, I’d also like to announce that we’ve included TAv and RPA+ on the batter and pitcher opponent quality reports. We’ve retained the slash lines and OPS as well, but this is one place where we can put our money where our mouth is as far as putting our preferred offensive metrics out there for consumption. We’ve also added TAv against for pitchers, for those who want to see a pitcher’s performance in those terms. (Yes, TAv against attributes all fielding performance to the pitcher. Under consideration is a derivation of TAv against that is more DIPSy.)
We’re also introducing a new breakdown for batting WARP, which you can find in this custom sortable report. We’ve broken WARP apart so you have a better idea of what goes into each player’s numbers. Included are:
- Batting Runs Above Average, or BRAA
- REP_LEVEL, which is the amount added to a player’s BRAA to give us runs above replacement.
- POS_ADJ, which tells you how many runs a player is credited based on his fielding position.
- And TOT_DEF, which is POS_ADJ plus a player’s FRAA—in other words, everything in WARP that measures a player’s defensive abilities.
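As a rough sketch of how these pieces fit together, the snippet below combines the components for one made-up player. The run values are invented for illustration, and the `WarpBreakdown` class and its property names are hypothetical; only the two relationships (BRAA plus REP_LEVEL gives batting runs above replacement, and TOT_DEF is POS_ADJ plus FRAA) come from the definitions in the list. It stops short of converting runs to wins, since that step isn't described here.

```python
from dataclasses import dataclass


@dataclass
class WarpBreakdown:
    """Run components from the report; field names mirror the column labels."""

    braa: float       # Batting Runs Above Average
    rep_level: float  # runs added to BRAA to reach runs above replacement
    pos_adj: float    # positional adjustment, in runs
    fraa: float       # Fielding Runs Above Average

    @property
    def batting_runs_above_replacement(self) -> float:
        """BRAA plus the replacement-level adjustment."""
        return self.braa + self.rep_level

    @property
    def tot_def(self) -> float:
        """Total defensive value: positional adjustment plus FRAA."""
        return self.pos_adj + self.fraa


# Invented numbers for a hypothetical player, not a real stat line.
player = WarpBreakdown(braa=12.0, rep_level=20.0, pos_adj=6.5, fraa=-3.0)
print(player.batting_runs_above_replacement)
print(player.tot_def)
```

Keeping the components separate like this makes it easy to see whether a player's value comes from the bat, from the position he plays, or from how well he plays it.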
This is only the start—we’ve got some other things cooking, and we’re really excited to continue to improve these sorts of offerings. (Of particular interest is defense for catchers, including Mike Fast's work on catcher framing.)
And as a reminder, while the default sortables are available to everyone, custom sortables are reserved to subscribers only. Right now, you can save six bucks on a yearly subscription with our Big September coupon code.
This is a free article. If you enjoyed it, consider subscribing to Baseball Prospectus. Subscriptions support ongoing public baseball research and analysis in an increasingly proprietary environment.
Thanks.
I want to see pitcher stolen bases and pitcher stolen base percentage. I'm talking about pitchers as offensive players. Heck, throw in pitcher home runs as well. Break down each pitcher's offensive stats. (I know a lot of this is available already.)
The more, the merrier.
Is this true? I guess it may be, but I'd never thought of it this way. I'd always thought the AL/NL difference was that pitchers faced 12.5% more "real" batters due to the DH, plus the AL East spending machines consolidate some quality there overall, making winning harder for the rest of the AL. But in focusing just on pitchers, I'd have guessed that the Phillies, Braves, and Giants match up pretty well with any three in the AL, and the rest of the pack in either league doesn't strike me as that imbalanced.
Just caught my eye and made me wonder...
Thanks for all this, though.
Playing baseball in the American League is harder than playing baseball in the National League, something we get reminded about every time interleague play rolls around and then we promptly forget about, because for the vast majority of the schedule the AL plays the AL and so nobody really notices the disparity. But it's real, it's not just a consequence of the DH and it shows up in the stats of individual hitters and pitchers.
1. Colin, please help me understand how OPS+ does not include a league adjustment. Isn't it based on the league average?
2. You don't mention a park adjustment in RPA+ (or I missed it). Does it have one? (If it does not, then FRA+ is not a precise complement.)
3. How does ERA+ overrate a groundball pitcher relative to a fly ball pitcher?
4. I can see ERA+ overrating pitchers with a good defense or good relievers to back them up. Does FRA+ get around those issues? Assuming not, are we getting a defense-independent measuring stick in the + format?
3) I believe it's because errors are more likely on groundballs than fly balls, so GB pitchers get a little "boost" to their Earned RA relative to FB pitchers.
4) From the article: "Fair RA separates what a pitcher has done from our estimate of his defensive support."
As for ERA+ and groundball pitchers - an error is more likely to occur on a groundball than a flyball. So if you take two pitchers of otherwise equal production (measured by RA), the groundball pitcher will tend to have more errors behind him and thus more unearned runs.
And yes, Fair RA both accounts for defensive support and inherited/bequeathed runners, and so Fair RA+ accounts for them as well.
On another note, I'm not quite clear how TAv and RPA+ are different besides one being formatted/evaluated like batting average and the other formatted with 100 as a baseline for league average. Does that really make RPA+ a new metric? That'd be like saying a .300 hitter gets a base hit 30% of the time and calling that 30% a new metric.
***
For Fangraphs readers, it's akin to wRC+ on their site. All the metrics at Fangraphs, and now at BPro courtesy of Colin, have Linear Weights as their basis (long live Pete Palmer). They differ basically in their park adjustments.
You see that at the very top with Bautista at 181 at Fangraphs and 199 at BPro. Otherwise, the two lists are reasonably similar.
Also, any idea when advanced stats will be available for pre-60's players again? Again it's a small thing and I'm sure it's being worked on, but it would be cool to have again.
Anyway, thanks for the recent updates and additions, and keep up the great work.