Fielding
Fielding
Wednesday, June 24, 2020
Roughly speaking, first basemen convert 75% of the plays they are responsible for into outs.
Roughly speaking, center fielders convert 75% of the plays they are responsible for into outs.
Does this mean the average 1B is equal to the average CF in converting plays into outs? No. Obviously we have a degree of difficulty. If this was Darin Erstad, it would be closer to 85% if he played 1B and 80% if he played CF. What we need is a common baseline. We are NOT talking about “what if Keith Hernandez played CF”. That’s not what we are talking about. We are taking one player, or a small group of players, taking those players, and moving them around the field. We are creating a common baseline. Let’s call this non-existant player Willie Kelly. Willie Kelly is not even an average fielder. He’s a below average fielder. He makes for a terrible shortstop, and acceptable fielder at 1B.
What we do is ask: how many plays would Willie Kelly make at each position.
- The average 1B makes 75% of the plays. Willie Kelly makes 70%
- The average 3B makes 70%, while Willie Kelly makes 60%
- The average 2B makes 65%, while Willie Kelly makes 55%
- The average SS makes 65%, while Willie Kelly makes 53%
- The average CF makes 75%, while Willie Kelly makes 65%
- The average RF makes 67%, while Willie Kelly makes 59%
- The average LF makes 65%, while Willie Kelly makes 59%
Now, don’t hold me to all those numbers. I rounded some here and there, as I’m just going to make a point. And the point is that each fielder has his own context, his own degree of difficulty. And you can’t just look at outs per play. You can’t look at the average for that position. You need a common baseline. You can make the argument that you want a fairly low common baseline, someone like Willie Kelly. Or maybe you want something a bit higher than that. Maybe a better baseline is William Kelliman. Maybe this is what we want:
- The average 1B makes 75% of the plays. William Kelliman makes 73%
- The average 3B makes 70%, while William Kelliman makes 65%
- The average 2B makes 65%, while William Kelliman makes 60%
- The average SS makes 65%, while William Kelliman makes 59%
- The average CF makes 75%, while William Kelliman makes 70%
- The average RF makes 67%, while William Kelliman makes 63%
- The average LF makes 65%, while William Kelliman makes 63%
Still, the point remains: common baseline. And I think this is what Bill James is after in his article. And if so, he’s right. If that’s not what he’s after, that’s fine, because the above is how I approach it. That’s how I evaluate fielding: against a common baseline.
What you CANNOT do is look at the BATTING production for each fielder and use that. Pete Palmer was wrong about very few things. This was one of them. But as Bill noted in his article:
When Pete Palmer first explained this idea to me, in a letter probably in 1976, I thought, “OK, well, that’s clever; we can use that until we figure out some way to determine the ACTUAL defensive value of a shortstop.” The problem is, we never did. As Sabermetrics grew, it just skipped over that problem. Pete had offered a way to work around the absence of knowledge, so people said, “OK, that’s good; let’s go with that.”
Bill is right. It was a useful stand-in, a placeholder until we can figure it out. And Bill actually had a better approach, one based on the Fielding Spectrum. And that is basically the approach I follow, somewhat roughing out those edges. And my approach is basically what the others have refined on their side for WAR. What Bill does is pretty close to what I do. What Fangraphs does is pretty close to what I do. And what Reference does is pretty close to what I do. We have a pretty close agreement. It just hasn’t been really laid down too strongly on paper, if only because there are still some rough edges, notably with catchers.
Anyway, that’s where we are.
UPDATE (for clarification):
We are leaving Keith Hernandez at 1B and comparing him to Willie Kelly.
We are leaving Ozzie Smith at SS and comparing him to Willie Kelly.
We are never, EVER comparing Ozzie and Keith both at SS or both at 1B. That’s what we are NOT doing.
This is no different than “common opponents” comparison of two teams that never face each other. That’s what we are doing here.
(4)
Comments
• 2020/06/25
•
Fielding
Sunday, May 24, 2020
In 1979, in games started by Gary Carter, the Expos allowed 463 runs and made 3705 outs, for a runs per 27 outs of 3.37. That season, in games started by his backups, the Expos allowed 5.08 runs per 9 IP. That difference is a whopping 1.71 runs per 9 IP, or 66% as much as his mates.
However, Carter had 136 starts compared to the 24 from his mates. In terms of the WEIGHT of that 1.71 difference, we would use the harmonic mean, which is 41 games (or 1073 outs).
We can go through every season of his career and repeat this process. And in his career, his weighted runs allowed is 86% of that of his mates (3.66 RA / 9IP with Carter starting compared to 4.28 when his mates start). In other words, his team has allowed 0.62 fewer runs to score per 9 IP, when Gary Carter started the game behind the plate, compared to his mates that season.
Gary Carter started 1954 games (or if you use outs, the equivalent of 1964 9-inning games). And 0.62 runs per game times 1964 games (unrounded) is 1204 fewer runs allowed. That is the best figure in the last 100 years.
Using this method, here are the top 11:
- 1204 runs reduced: Gary Carter
- 1004: Mickey Cochrane
- 961: Tony Pena
- 871: Yadi Molina
- 807: Brad Ausmus
- 757: Andy Seminick (who??)
- 750: Rick Ferrell (who??)
- 701: Johnny Bench
- 682: Jason Varitek
- 652: Al López (who?? he did have MVP votes in 7 seasons)
- 647: Russell Martin
Fans of WOWY will recognize this as...well, WOWY.
The main problems, at least for THIS iteration, are as follows:
- Are the backups of Gary Carter disproportionately worse than the backups of Cochrane and Pena and so on?
- Was Gary Carter paired disproportionately with the best pitchers on his staff, compared to his mates?
- How much can Random Variation affect these results?
The Random Variation one is a big one. In the Gary Carter starts, his teams allowed 7246 runs. And since this is 1204 lower than his mates, his mates would come in at a pro-rated 8450 runs. Just because we've OBSERVED 8450 (pro-rated) runs doesn't mean that's the true rate. How much Random Variation is there in runs allowed? I should figure it out at this point, but let's say it's one standard deviation is 3 runs per game. With 1964 starts, you take the square root and multiply by 3 runs and that gives you 133 runs. Since we have observations on both Carter and his mates, the Random Variation of the DIFFERENCE is 133 times root 2, or 188 runs. So, based on that number 3 that I totally made up, we can reduce the 1204 by twice 188 runs due to Random Variation (for 2 standard deviations). That still leaves us with a whopping 828 runs lowered. So, whatever you can say RAndom Variation contributes to the noise in that 1204 runs, it won't be able to wipe away even half of it. Most of it is real.
Anyway, that's all I've got for Iteration 1. If an Aspiring Saberist wants to take it from here, go for it.
***
Note: reason I started this was because of Ryan Doumit, since he was getting hurt by the framing numbers. Using this method, his teams allowed 188 more runs. And if you go all the way to the bottom of his Fangraphs page:
And look right under the Defense column, all the way to the last line, you will see this number: -178 runs. In other words, this WOWY method (188 runs) supports the evaluation of Doumit and his framing numbers (178 runs)!
***
Note 2: since someone will ask, Mike Piazza was better than his mates by 322 runs. Make of it what you will. I just know whatever number this method spit out, someone is going to complain.
(12)
Comments
• 2020/07/26
•
Fielding
Saturday, April 18, 2020
If you haven’t read it, I recommend reading Part 1 first.
Let’s walk through this play (link on Savant, go watch that first please), and the ten paths we can follow in assigning ownership of this batted ball to the fielder.
First we’ll take the five paths: If you know the location of all the fielders.
A: We know everything about this batted ball, so this base hit belongs 100% to the 3B
B: We know it’s a groundball, and it was hit in the traditional-SS slice, so this base hit belongs 100% to the 3B
C: We know it’s a groundball, but we don’t know its slice. Therefore, we assign responsibility to the four infielders based on the bat-side of the batter. I won’t run the numbers, but for illustration purposes, it’ll be something like:
- 0.20 1B
- 0.40 2B
- 0.30 SS
- 0.10 3B
In other words, without knowing where the ball is hit, other than it’s a groundball, but we do know where the fielders are standing, we are taking an educated guess.
D: We know its slice is the SS/LF-gap, but we don’t know whether it’s hit on the ground or in the air. For illustration purposes, it’ll be something like:
In other words, another educated guess as to the ownership of this basehit.
E: We know nothing about this batted ball, other than it’s a base hit. So, ownership will be something like:
- 0.10 1B
- 0.15 2B
- 0.15 SS
- 0.10 3B
- 0.10 LF
- 0.15 CF
- 0.25 RF
So, all that was knowing the location of the fielders, and knowing various degrees of information of the batted ball.
Now, let’s look at the next five paths based on not knowing where any of the fielders are.
And therefore, what we SHOULD do is take all known seven-fielder alignments (from the standard alignment that we all know to the overshift that has become prevalent, and everything in between), and run through our scenarios for all those hundreds of possible alignments. And then weight our results based on how frequent those alignments occur, for our given batter, park and game situation.
But for Blog Purposes Only, we’ll just choose ONE fielder alignment, the standard fielding alignment for LHH. And we do this especially because many fielding systems (implicitly) operate under this assumption.
A+F: We know everything about this batted ball, but we don’t know anything about the fielders. So, we assign ownership of this ball 100% to the SS. That’s right, the SS. That’s because that’s where the SS normally would be. Remember, we don’t know where the fielders are, so we have to place them somewhere.
B+F: We know it’s a groundball, and hit toward the SS slice: we assign ownership of this ball 100% to the SS. You are seeing the problem, right?
C+F: We know it’s a groundball, but that’s it. Since it’s a LHH, we’ll split along these lines:
- 0.25 1B
- 0.55 2B
- 0.15 SS
- 0.05 3B
D+F: We know its slice is the SS/LF-gap, but we don’t know whether it’s hit on the ground or in the air. For illustration purposes, it’ll be something like:
In this case, the 3B goes away, and the SS takes his place.
E+F: We know nothing about the ball, or the fielders. Just that it’s a basehit. So, we split fairly evenly:
- 0.10 1B
- 0.15 2B
- 0.15 SS
- 0.10 3B
- 0.15 LF
- 0.20 CF
- 0.15 RF
***
So, what does all this mean? Well, it means that the less you know, the more you will shift ownership of the base hit from the 3B to one or more of the other fielders. And therefore, it automatically means that you are going to be wrong, to some degree or other.
How wrong? Well, when we know everything about everything on this play, it should be 100% ownership to the 3B. So as you go through the ten paths above, you can see how wrong you are, by seeing how much away from that 100% 3B you are.
For this particular play, it looks like this, in terms of ownership of the play going to the 3B:
- 100%: A
- 100%: B
- 30%: D
- 10%: C
- 10%: E
- 10%: E+F
- 5%: C+F
- 0%: A+F
- 0%: B+F
- 0%: D+F
So, the only time you are 100% right requires you to know the location of the fielders, and as much as the trajectory as you can.
And when you know nothing about the location of the fielders, the BEST you can hope is to be wrong 90% of the time, with a good chance of being wrong 100% of the time, ESPECIALLY if you know too much about that batted ball!
Tuesday, April 14, 2020
It starts with identifying for every batted ball the assignment of those plays based on whether they are outs, errors, or base hits.
- For outs, it is relatively straightforward: whoever is the first touch fielder is assigned the play. It's not 100% true, but it's close to that. (An example of where it would not be is when a 1B pulls down an errant throw. So in those cases, we transfer ownership from first-touch to last-touch.)
- For errors, it is relatively straightforward: whoever is awarded the error is assigned the play.
- For hits, it's a bit more complicated. I started writing it out, but it started to become complicated. So then I had the idea to do a flow chart. Hopefully, the chart below makes it easy enough to follow, without it being overwhelming.
Next time, I'll come up with examples. And then after that, show you how the expected out rates are established.
Click to embiggen
(2)
Comments
• 2020/04/17
•
Fielding
Sunday, April 05, 2020
Batters
From 2015-2019, the spread in BACON (batting average on contact) is one standard deviation of 40 points. This is for all batters with at least 100 balls hit into play (HIP). The average number of HIP was 294, and therefore we can establish that Random Variation would account for a spread of 28 points.
If we observe a 40 point spread, and we know that Random Variation accounts for a 28 point spread, then the remaining difference is 29 point spread. (That’s 29^2 = 40^2 - 28^2.) Of that 29 point spread, 13 points can be accounted for by the park. And therefore, the difference of a 29 point spread and a 13 point spread is a 26 point spread.
In other words, the TRUE TALENT spread in BACON for batters is one standard deviation = 0.026.
Thanks to a suggestion by Straight Arrow Reader GuyM, I repeated the above for xBACON. In other words, rather than taking the observed batting average on contact, I instead rely on the Statcast speed+angle quality of contact equivalency. The observed spread was much lower, at 31 points.
Random Variation accounts for 17 points.
A little sidenote. For binary results of BACON, one standard deviation for one batted ball is root of .343*(1-.343), whereby the .343 represents the average BACON. And so, one standard deviation is 0.47. However, for xBACON, it is not a binary outcome. Taking the standard deviation of every xBACON value, we get 0.29. (Which also happens to be the standard deviation of a uniform distribution.)
Park variation on xBACON accounts for 6 points.
So, with a quality of contact 31 point spread, a random 17 point spread, a park 6 point spread, that leaves us with a true 26 point spread. Which is the same number that we got from the outcome method.
Pitchers and Fielders
I repeated all this for pitchers, but this time I removed all HR. So rather than all contacted balls, it is only balls in the field of play (BIP). The true spread based on the observed outcome is 17 points, while the true spread based on the quality of contact is 13 points.
Now, why would these be so different, while the batter ones were identical? Fielders(*). While we accounted for the parks, we did not account for the fielders. And how can we account for the fielders? Well, the quality of contact establishes the spread for the pitchers. That’s 13 points. And therefore, the missing variable, what gets us from a spread of 13 points (using quality of contact) to a spread of 17 points (using observed outcomes) is the fielders. And since 11^2 = 17^2 - 13^2, then we can say that the spread in fielding is one standard deviation = 0.011.
(*) I may say Fielders here, but I actually mean Fielding. That's because the way I handle Fielding, it's a combination of Fielder skill and Fielding Alignment (which I treat as a team influence).
In other words we have this:
- 36^2 (Observed Spread)
- = 29^2 (Spread from Random Variation, given 250 BIP)
- + 13^2 (Spread from Pitching)
- + 11^2 (Spread from Fielding)
- + 13^2 (Spread from Parks)
In other words, we can establish that each of the pitcher, fielding, and park are roughly similar in impact to the outcome of a ball in play. But more than all of them combined is good ole Random Variation.
And so, if you tried to partition responsibility based on a “left over” approach, whereby you account for two of Pitching, Fielding, Parks, and assign the remaining result, that last variable will absorb the entirety of the Random Variation. Which is almost certainly what you don’t want to do.
My preference is to just leave Random Variation unassigned. But if you must assign it to something, you may as well do it proportionally to the other three variables (Pitching, Fielding, Park).
Friday, April 03, 2020
Thought exercise
Suppose you have 10 teams that ALWAYS shift their infielders. Not half the time like the Dodgers,who lead the league. I mean all the time. Against LHH, the 3B plays in short RF, and the SS plays on the right side of the bag. Against RHH, the 2B plays on the left side of the bag. I’ll call these teams The Shifters.
And suppose you have 20 teams that NEVER shift their infielders. Not 13% like the Cubs, lowest in the league. I mean NEVER. In other words, all the infielders are where tradition would dictate. I’ll call these teams The Traditionalists.
Now, since The Shifters and The Traditionalists are playing against the same teams, they all face a similar distribution of batted balls. Suppose that The Shifters are getting a few more outs out of their infielders against LHH, but they’ve overshifted and getting a few less outs against RHH. In other words, overall, there’s the same number of outs.
Let’s further suppose that the 3B on The Shifters are converting 3 outs per game, and the 3B on The Traditionalists are ALSO converting 3 outs per game. It’s just that the 3B on The Shifters, against a LHH, are getting them all from short RF, while against RHH they are all closer to the 3B line. The 3B outs against The Traditionalists are all where you’d expect them.
Enter The Zone
The way “zone” systems work is that they assign a “zone of responsibility”. They decide which zone each OFFICIAL POSITION is responsible for. The “official position” is technically the fielding position on the batting lineup. And they figure that out based on those zones that each position is converting plays into outs. Since the short RF position has plays being converted into outs not-often (only 10 of the 30 teams have their 3B there against LHH, and none against RHH), that zone does not belong to any infielder.
So, these zone systems have a numerator (outs) and denominator (plays) for all the zones where an OFFICIAL POSITION owns. What happens to outs made “out of zone”. In some systems, it gets added to the numerator only. In other systems, it gets added to both numerator and denominator. What happens to hits “out of zone” of the 3B? Well, those either go to the “in zone” of one of the infielders, or they just go away altogether. In other words, a basehit by a LHH to short RF against The Shifters that is not in-zone to the 2B is ALSO NOT in-zone to the 3B. This basehit disappears. The denominator for The Shifters just got lower.
For a zone-based system with a league of 10 The Shifters and 20 The Traditionalists, this is a systematic bias. And the more data you have, the worse it is. That’s because the more data you have, the less Random Variation there is, and the more the Systematic Bias will expose itself.
Exit The Zone
An old-school system doesn’t have this issue. That’s because EVERY batted ball is assigned to a fielder. All basehits are, effectively, shared among all seven fielders, regardless of where the fielders are, or the ball went. You may think this is a problem. And, in the short-run, it is. But given 5 or 10 or 15 years of data, that Random Variation will get reduced substantially. And so, if you have 3 outs per game being made by 3B among The Shifters and 3 outs per game by 3B among The Traditionalists, they will all look the same. That’s because the OFFICIAL POSITION is what is being held responsible not where they are standing. This makes no sense on a single play level. This is still a systematic bias. It’s just a different kind of systematic bias than a zone-system.
Where are we today?
Now, all of that is theoretical. The point of all that is to understand how things work. Where are we in 2019? I don’t know. That’s where The Aspiring Saberist comes into play. Hopefully, that’s one of you reading all this.
With regards to the way Statcast works: we remove positioning altogether. And so, the area of responsibility is where you are actually standing for that particular play. If a 3B is in short RF, he is not a 3B. He is “fielder in short RF”. We don’t care about his official fielding position on the batting lineup card. Just saying that should make it clear why we can’t use his official fielding position on the batting lineup card. That’s not how responsibility works. Where you stand IS what you are responsible for. So, whether your official fielding position on the batting lineup card is 2B, SS, or 3B, and you are standing in short RF, then we use that Fielder Role to establish responsibility. Your Role establishes your Responsibility.
Wednesday, March 25, 2020
One of the things that we’ve done in the long past is to give a different run value for 1B/3B, compared to 2B/SS. The idea was simple enough to understand: if a 2B or SS allowed a hit, it was likely a single. And if it was a 1B/3B, there’s a chance that it could be an extra base hit down the line.
Seems reasonable enough. So, what we ended up doing, in the long past, was to give .75 runs per play for 2B/SS and .80 runs for 1B/3B. Again, seems reasonable enough.
I looked at the Outs Above Average (for infielders only; I’ll do outfielders later today or tomorrow). And while the direction of that theory holds, the magnitude does not hold quite as much. For the 2B/SS roles, the impact of their play is -.005 runs, compared to the average infield play. While for the 1B/3B roles, the impact of their play is +.010 runs, compared to the average infield play. (The overall WEIGHTED average is 0, and you get there because there’s about 2X the plays at 2B/SS compared to 1B/3B).
So, the end result is that the gap in runs between the middle infielders and the corner infielders is about .015 runs, not the presumed long past value of .050 runs.
Why would that be? It’s probably easiest to say that 5% of the “assigned hits” are extrabase hits. But as we know, there’s alot more than just 5% hits that are extrabase hits, even if we limit it to the infield. For example, almost 10% of groundballs are extra base hits. So why the discrepancy? Well, half of those groundball extra base hits are “automatic hits”. In other words, they are hits not because the fielder wasn’t good enough to get there, but rather, his POSITIONING didn’t allow him for a chance to get there. And since Outs Above Average takes as an assumption of fact that the positioning of the player is not a skill of the player (easier to believe these days with shifting), then those auto-hits are not opportunities for the player. They end up being noise.
When we get to Layered Hit Probability (and by extension Layered wOBA), we will recover those “lost” hits, and be able to properly assign them to “team fielding alignment”. But, for the Outs Above Average metric, those aren’t in play (no pun intended).
Ok, so you may be thinking,we lost half, so maybe instead of the long past value of .050 runs, maybe it should be .025 runs? That is a good thought. Except, alot of those remaining extra base hits that are assigned to the fielder are “really difficult”. In other words, they remain in the pool for the player, but the hit probability is so low that they have limited damage to the fielder.
So, if you want a quick summary: the kind of hit that an infielder is responsible for is almost always a single. And because of that, when you look at outs saved, the translation to runs saved will be almost identical for middle infielders as for corner infielders.
Next time, I’ll compare IF to OF.
(2)
Comments
• 2020/03/27
•
Fielding
Tuesday, January 14, 2020
?The theory would be that by being out of position, a fielder will have less familiarity with a situation and so will perform worse than his "natural" location. If this is true, we should see it in Outs Above Average. I did something fairly simple: what is the OAA for a TEAM INFIELD if we have the shift on? And what is the OAA for a TEAM INFIELD if all the fielders are in their standard location?
Note: A shift is any formation where you have 3 or more infielders to one side of the bag.
So, there is an effect and in the direction you'd expect. But not the magnitude. When an infield is playing in its standard formation, they convert 0.0003 more outs per play. With about 2000 plays a season, that works out to 0.6 more outs per season. Hardly a number to worry over, even if you can make the case that it is "true" that they perform "worse".
HOWEVER. However, because shifts are disproportionately set with a LHH, we can break down the OAA of the team infield between LHH and RHH. And when we do that, well, things start to change. With a RHH, the infield does perform better in its standard formation, by a whopping 0.006 outs per play, which is 12 plays per season. Since a shift with RHH essentially means moving the second baseman from the right side to the left side, it is that positioning that we can narrow down as the culprit. This is also consistent with other research I've shown in the past where the performance of RHH on shifts is noticeably worse for the fielding team.
As for LHH, the OAA is slightly BETTER when the infield is in a shift formation, by 0.0026 outs per play, or 5 outs per season. At this point, the "familiarity" issue likely no longer applies, given that one-third of LHH plate appearances are being shifted. This may also explain why LHH on shifts is somewhat better for the fielding team: in addition to getting the fielders in a better spot, they perform slightly better when in those spots.
This is all preliminary, so it'll be interesting to break this down in the coming weeks and months.
Update: I should note that I did not control for the quality of fielders. So, if a team that shifts more happens to do so with better fielders with LHH, then that would explain the results we see. And if a team that shifts more happens to do so with WORSE fielders with RHH, that would ALSO explains the results we see. As I said, this is the first step.
Thursday, January 09, 2020
This is what has been perplexing me for months. Is this a bias we want to remove, or retain? That shows the OAA (Outs Above Average) for balls hit in the hole. The black line is pretty sweet at 0. But the bLue (for left, or 3B) is above average and the Red (for right or SS) is below average. And this IS what we'd expect: for balls that both players can reach, the 3B will have a better shot at getting the runner than the SS, simply based on the running direction. The 3B is closing the distance as he gets the ball, while the SS is enlarging it by the time he releases it.
Therefore, do we want to adjust this "bias"? When Fred Lynn, LHH, faces a RHP, he has an advantage over Jim Rice, RHH. But, we don't adjust that away, since being a LHH is part of Lynn.
We could make the argument that since the Rockies are positioning Story and Arenado, that Story has no choice. So, we can't penalize him. But, all fielders have some leeway, pitch to pitch to move around, as they respond to expected pitch types and locations and runner leads.
So, a ball may be hit halfway between Story and Arenado, but that's only AFTER Story and Arendo take their spot on the field. Because of the nuanced nature of being able to move two or three steps in response to the pitch call and runner movement, what we'd actually want is the pre-nuanced, club-controlled spot for each fielder. Which is unknown.
If we remove the bias altogether, and make SS and 3B equal on these plays, we are saying that SS and 3B are equal in getting to balls in the hole. Which we can only say if we add the condition of "compared to other players at their locations, not to each other".
And that chart above is going to look kinda confusing if you see the black, red, and blue lines all sitting one right on top of the other.
What would YOU like to see?
(12)
Comments
• 2020/01/25
•
Fielding
On 95%+ plays, with an average 97% out rate, Baez made all 147 plays (100%, compared to an expected 142 outs). That's +5
Tatis on similar plays: 111 plays made only 101 outs (91%) compared to an expected 107, or -6.
In the outfield, the vast majority of the OAA is based on 2+ star plays.
In the infield, HALF the value is on 90%+ plays. This is because the chance of a misplay is so much greater on groundballs than airballs.
Making the routine play has tremendous value.
In case you missed it, you can slice/dice right here.
?Lots of excellent question everywhere on the newly released Infield Defense metric. Twitter, Reddit, and at BTF among the places I've seen so far.
What I'll do is create a Q&A based on the questions or issues being raised. I'll start with BTF, and go as far as I can before I go to bed, then pick it up in the morning. I'll create one Q&A per comment. So, check it out below in a few minutes.
(73)
Comments
• 2020/01/10
•
Fielding
Wednesday, January 08, 2020
?Primer article by Mike on MLB.com
Savant main page by Daren, along with drill down and player pages, with Jason bringing the data together.
My tech blog post: a slimmed down web version, and the expanded downloadable PDF.
Sunday, November 24, 2019
?In an excellent article on Catcher Framing, Mike created this image at the team level, which shows the percentage of called strikes in The Shadow Zone.
He further pointed out:
The top team, Arizona, and the bottom team, Chicago, each had a nearly identical amount of takes in that area, 4,819 for the D-backs and 4,803 for the White Sox. Yet the D-backs, led by good framing from Carson Kelly and Alex Avila, had over 400 more called strikes there.
This puts the impact in stark terms. Looking at the called strike rate in The Shadow Zone, one catching team can get 200 more strikes than the average team, while another catching team can get 200 fewer strikes. How much value CAN a strike have? I can tell you the answer is 0.125 runs per called strike, and so, we're talking about +/- 25 runs.
But, let's describe it in something a bit cruder, but with more relevance. If you think of 3 strikes being a strike out, and 9 strikes being an inning, then 200 called strikes would be about 22 perfect innings. Each inning generates an average of 0.5 runs, and so, a clean inning saves you 0.5 runs. If you have 22 of those, then you've saved 11 runs. That's the crude way. The better way is 0.125 runs per called strike.
As for simply relying on the called strike rate in The Shadow Zone, we can compare that to the runs saved on the strike calls per 100 pitches. As you can see, an extremely strong relationship. Indeed, an r of close to 0.95. So, if you are having a tough time buying into Catcher Framing and runs and how all that is derived, you can take the first step and simply look at its most basic: percentage of pitches called strike in The Shadow Zone. If you can do that, you'll be 90% of the way there.
?
(Click to embiggen)
Monday, October 21, 2019
On Sept 15, 2008, at PNC Park, Dodgers catcher Russell Martin caught 19 called pitches in the inside part of the Shadow Zone. That would be zones 11 through 19, within the green dotted line.
While today, those are called strikes almost 80% of the time, it wasn't the case back in 2008. That could be any combination of the umpires improving over time and the tracking system improving over time. So, it would be more accurate to say that he caught those 19 pitches in the reported region noted above. Of those 19, 14 were called strikes.
In that same ballpark on that same day, his teammate A.J. Ellis was also catcher, as was opposing catcher Ryan Doumit. Those catchers caught 18 pitches in the same reported region, but got only 4 pitches called strikes (or 22.2%). Had Martin got the same calls, he would have gotten 19 x 22.2% = 4.2 strikes, instead of his actual 14. In other words, he got 9.8 more called strikes than the other catchers that day in that park.
On April 2nd against the Giants at Dodger Stadium, in the outside part of the Shadow Zone, with Bengie Molina as his opposing catcher, he got 3 strikes out of 20 pitches compared to Molina of 7 for 13. That made Martin MINUS 7.8 strikes that day.
And so we can go through every single game in the same way, and tally up the results. In the Heart of the Plate, he was +63 strikes (+35 at Dodger Stadium, +28 away). However, we would NOT expect any venue bias because of the way we are directly comparing Martin to the other catchers in the same venue on the same day.
- In the inside part of the Shadow Zone, he was +43 at home, +45 away, for a total of +88.
- In the outside part of the Shadow Zone: +29 home, +7 away.
- In the Chase Zone: +39 home, -6 away.
- In the Waste Area: +1 home, 0 away.
All tallied up: +147 home, +74 away, +221 total. Each strike is about 1/8th of a run, and so those +221 strikes translates to +28 runs.
In a more elaborate process that considers more variables and the zone in a more granular fashion, Fangraphs shows +30 runs.
When I repeat this for every year, Martin's career comes out to +171 runs. Fangraphs has a very similar +166 runs.
As much as it strains the credulity to think that Martin's framing could have led to +28 runs, I also can't reject that conclusion. I can reduce that number somewhat for the uncertainty level of the measurement. But given the way I controlled for the metric, by directly comparing Martin to the other catchers in the same park on the same day, that's a tough call as well.
I could repeat the above by focusing on each individual bin and controlling for the pitcher, and potentially the batter. But that basically will put me on a path to replicate Fangraphs. And given that without doing any of that I ALREADY match Fangraphs, all I'd be doing is further matching Fangraphs.
So, I don't want to agree with the numbers, but I am forced to.
I should note that we don't see these wide numbers in the past few years. That could be any combination of the umpires improving and the tracking system improving. It could also be that teams are now very aware not to have a Ryan Doumit behind the plate, so it could be improvement in catcher selection and coaching of catchers. In other words, whatever inefficiencies exist, it's being slowly closed on all sides.
Wednesday, October 02, 2019
This is the point at which Cain got the ball.
?
Runner is about 75 feet from 3B. Taylor Sprint Speed is 29 ft/s, meaning he needs 75/29 = 2.6 seconds
Cain will have to make an almost 200 foot throw. He has a somewhat below average arm at 85 mph. Here's where we need to leave the world of mph and enter the world of feet / sec. 85mph is 125 ft/s. That's at release. The ball will slow down in flight. Roughly speaking, it'll lose 10% every 60 feet.
In this case, we'd do 200/60 = 3.33, and 0.9^3.33 = 70%. So at arrival, the speed of the ball is 70% of 125 ft/s or 88 ft/s. So the average speed of the ball in flight is about 106 ft/s. And so, a 200 foot throw will get there in about 200/106 = 1.9 seconds. (It's not this straightforward, but it's close enough.)
The exchange time (pickup to release) for a throw is about 0.5 to 0.75 seconds, which means that the ball would have reached the VICINITY of 3B in 2.4 to 2.65 seconds. It would have been close if the throw was on target. Which of course, it might not be.
How successful would Cain have been? Probably 60% if the throw is on target. And maybe it's on target 70% of the time? So, about 40% of the time he gets the runner maybe?
In the meantime, it would allow the batter to reach second base as the tying run. But, there were two outs! Making the third out at thirdbase is a cardinal sin for baserunners. Which makes it very appealing for the defense.
Let's work some MORE numbers.
http://tangotiger.net/we.html
Bottom of the 8th, 2 outs, down by 2 runs. Our choices are:
- runners on 1B and 3B (our baseline)
or
- runner on 2B and 3B
- end of inning
So, our baseline is a win expectancy for the Nationals of 15.8%.
- If Cain went for it and missed, then the win expectancy is 19.2%.
- If Cain got the out, then the win expectancy for the Nats is 7.1%.
In other words, the tradeoff is that the Nats gets +3.4% if Cain doesn't hit the target in time, or the Nats are -8.7% if Cain gets Taylor to end the inning.
All Cain has to do is make the play 28% of the time. That is:
- 28% of the time, the Nats lose 8.7%
- 72% of the time, the Nats gain 3.4%
And that's breakeven.
Remember, we guessed that Cain would have gotten Taylor about 40% of the time, and he only needed to get him 30% of the time.
Cain should have gone to third.
Monday, August 12, 2019
?Statcast Intern Kristen Austin had a sensational presentation at Saber Seminar and you can see her presentation in its entirety here. One of the topics she briefly discussed was Abandoned Rates. I asked her to expand upon it in a blog post, and so she has this PDF that she is sharing with us.
(8)
Comments
• 2019/08/16
•
Fielding
Monday, June 17, 2019
?How do I create a metric, and more specifically, how did the Jump metric come about? There is alot of art and science to the process of metric creation. For the pure artists, sorry, but we need some science. For the pure scientists, sorry, but we need some art.
What I am always trying to do, is organize, classify, categorize the data. We do this so we can actually speak the data. For example, we can create a function of exit speed to create a "hardest hitter". That function would likely be a quadratic function of some sort. Or, we can say "batted ball hit at least 95 mph". As much as the scientists want that function, and as much as I want it as well which you can see it here, it's too hard to speak that. 95+ is ubiquitous. And, just as important, it's an excellent proxy for that function. If we can speak it, with little loss of accuracy, then speak it. In other occasions, I can't do it, and so, I go all-in on creating a function (or series of functions), which is what Catch Probability is. Though even there, I try to come up with a shorthand, such as each foot affects the Catch Probability by 4%.
For Jump specifically, the primary decision is whether to represent the unit in time or distance. Do we want to show that Kiermaier is a certain number of seconds quicker than average, or a certain number of feet quicker than average. And by seconds, I mean, tenths of seconds. As I tried both ways, it become clear, I had to represent it in feet. No one can appreciate what 0.1 or 0.2 seconds means. Everyone can appreciate what 3 feet means. If a player JUST misses a catch, we don't say "he missed it by 0.1 seconds". We DO say "he missed it by a step" (or 3 feet). We can freeze a play and see that distance, but not that time. Anyway, so it become clear that the result had to be in feet.
Once that decision is made, then the other choice is a given: the selection must be made in seconds. In other words, if the unit you create is expressed by time, then the data must be partitioned by distance. And if your unit is expressed in distance, then partition the data by time. This is critical. If you don't see it, you will when you create your own metrics.
Knowing that time is the partition, now we need to select thresholds. We do this because we need to organize, classify, categorize the data. Virtually all catches are made with 3+ seconds from pitch release to catch. This becomes my first point of reference: let's focus on Jump solely based on performance in the first (up to) 3 seconds. It might have been 2 or 2.5 or 2.8. As I tried different ways, 3 seconds became the threshold.
The next thing is what we mean by Jump. And we actually had a few components. After many discussions with the rest of the Statcast team, principally Mike, Jason, Travis, Cory, Matt, we finally settled on three: Reaction, Route, Burst.
It was especially with discussions with Mike that cemented the process. We had a few discussions on whether going "the right way" is needed for Reaction and Burst. Once we decided that Route would encapsulate going "the right way", the other two pieces fell into place quickly.
Burst was interesting because at the same time, Travis was working on speed components for batter-runners, other than Sprint Speed. And since Sprint Speed uses the same scale, and can be compared between batter-runner, runners-on-base, fielders, it was highly desirable, if not necessary, that the same applies for Burst. We quickly settled on 1.5 seconds as the time window for Burst, for batter-runner. And given that I had already established 3 seconds for the Jump window, chopping that into two windows, of 0 to 1.5 (Reaction) and 1.5 to 3.0 (Burst), came into being very quickly. In addition, the Burst Distances for fielders at 1.5 to 3.0 is similar to the Burst Distances for time threshold for batter-runner that we chose. It all came into place.
Reaction was purely distance travelled in the first 1.5 seconds, regardless of direction. Burst was the next 1.5 seconds, also regardless of direction. Route was the bridging metric that was the difference between distance travelled and distance covered. And therefore, Jump is the total distance covered (not travelled) in the first 3 seconds, in the correct direction.
Now, just because all of this came into place and seemed to make sense wasn't enough. We need the metric to actually represent something about the fielder. Once we saw Jackie Bradley Jr being on the leaderboards with both quick reaction and indirect route, year after year, we knew we had it. And then seeing the results of other players, the very strong correlation year to year, it all came into place.
The last step was actually the longest: productionize the metric. We had to get this into the pipeline for our various endpoints. We had to get Daren to add his magic with Savant to take what is essentially tabular data and make it resonate with the fans. Mike had to do all the research to come up with a sabermetric staple of an article, one that is both relevant and timeless.
Anyway, so that's the process for metric creation in general, and for Jump in particular.
Wednesday, May 15, 2019
?One of the team members was asking me how is it possible that the wall and/or going back can have such a dramatic effect on Catch Probability. And he showed me an example, which was a pretty dramatically different number. There are four main variables for Catch Probability:
- How far does the fielder have to run from his starting point to the (eventual) landing point?
- How much time does he have to get there?
- Does he need to run back?
- Is the wall an impediment to making the play?
For this illustration, I will show you the actual results, as well as the estimated catch probability, for plays where the fielder has to run 80 to 90 feet, with an opportunity time (pitch release to landing) of 4.5 to 5.0 seconds, with the 4 combinations of wall and/or back.
??
To read the first line: we have 1101 plays since 2016 where the fielder had to cover 80 to 90 feet in 4.5 to 5.0 seconds, where he did not have to run back, nor was the wall an impediment. The Estimated Catch Probability was 54%, while the actual catch rate under those conditions was 55%. The last line shows that the outfielder had to run back and that the wall was an impediment. Under those conditions, they caught the ball 3% of the time, compared to an estimated 4%.
I used the above example because that was the test case that I was asked. The results were pretty good. Almost as good if I check similar conditions, like so:
??
This one is an extra 0.5 seconds of opportunity time to make the play. Not nearly as good, but still pretty good. Also note that those 0.5 seconds adds 30% to 60% of making the out.
The rough rule of thumb is that for plays in the sweetspot, 1 foot = 4% and 0.1 seconds = 10%. It obviously tapers off when the catch probability is closer to 0% and 100%.
Below you will find all the data plotted out.
Read More
(13)
Comments
• 2019/05/21
•
Fielding
•
Statcast
Monday, January 28, 2019
This blog post will just be about Justin Verlander, and the focus is only on the outfield. We have 217 batted balls that we either assigned to an outfielder, or was unplayable by an outfielder (either it was too far for even the best outfielder to make a play, or it hit the wall high). Of those, 167 were caught, or 77%. He benefited very slightly from his outfielders, who combined were +2 outs above average. In other words, we estimate, based on the batted balls he allowed, relative to the fielding alignment and parks they were in, for 165 of those balls to be caught. So, his xOuts (among outfield plays only, or xOutsOF if you will) is 76%.
In part 2, we'll look at all the pitchers. (I have no idea where he ranks, since I have yet to run it for anyone else.)
Here's the breakdown of the 217 batted balls:
- 34: 34 hits, all impossible to catch
- 13: 12 hits, 1 out, at under 50% catch probability (average of 15%; in other words, we would have expected two to be caught, but only one was)
- 10: 1 hit, 9 outs, at 50-75% catch probability (average of 66%; expect 7 to be caught but 9 were)
- 15: 2 hits, 13 outs, at 75-90% catch prob (average of 86%; expect 13, and 13 caught )
- 18: 1 hit, 17 outs, at 90-99% catch prob (average of 95%, expect 17, and 17 caught)
- 127: 0 hits, 127 outs, at 99%+ (average of 99.5%, expect 126, and 127 caught)
All in all, we see that 127 were pure gimmes, and 34 were pure auto-hits (gimme outs and takey hits). In other words, Verlander managed to get 161 of 217, or 74%, of the batted balls to not involve any fielder skill (other than possibly positioning). It's the other 56 batted balls where there's some kind of fielder skill involved.
More to come...
Friday, January 18, 2019
?I highlighted this terrific research on Twitter two weeks ago, but my comments there are ephemeral, and this research really should get the exposure it deserves.
Also when you look at pitchers you should control for that too, similar to batters. Verlander for example is not shifted the same amount as the other Astros pitchers.And limiting it to bases empty is a good idea. It's very controlled environment, and 57% of PA occur with bases empty, so we won't suffer from lack of sample size.
Every layer you peel, you will find two more layers underneath. Lots of good stuff to uncover, keep going!
If ever you get 2 more questions for each answer you get, then you are on the right path. Once you get to the point that you have no more questions, then that's a sign you hit a dead end.
Recent comments
Older comments
Page 2 of 151 pages < 1 2 3 4 > Last ›Complete Archive – By Category
Complete Archive – By Date
FORUM TOPICS
Jul 12 15:22 MarcelsApr 16 14:31 Pitch Count Estimators
Mar 12 16:30 Appendix to THE BOOK - THE GORY DETAILS
Jan 29 09:41 NFL Overtime Idea
Jan 22 14:48 Weighting Years for NFL Player Projections
Jan 21 09:18 positional runs in pythagenpat
Oct 20 15:57 DRS: FG vs. BB-Ref
Apr 12 09:43 What if baseball was like survivor? You are eliminated ...
Nov 24 09:57 Win Attribution to offense, pitching, and fielding at the game level (prototype method)
Jul 13 10:20 How to watch great past games without spoilers