Tangotiger Blog

Wednesday, February 06, 2013

Why doesn’t USPS increase its stamp to 63 cents and stop Saturday delivery?

By Tangotiger

Canada should often be considered a "sandbox" for USA, a development environment to see how things work on a small scale. The mail system is one such comparison point. USA's population is about ten times that of Canada. As luck would have it, USPS has about 80 billion$ in expenses, while Canada Post has 8 billion$. Per-capita expenses are therefore the same.

On the other hand, while Canada Post is close to breakeven (8 billion$ in revenue), USPS only collects 60 billion$ in revenue. What is the main difference between Canada Post and USPS? Well, Canada does not have Saturday delivery. And, it costs 63 cents to mail a letter in Canada, while it costs only 46 cents in USA.

Canada Post collects one dollar in revenue for every one dollar in expense. USPS collects 75 cents in revenue for every one dollar in expense. Therefore, USPS needs to increase their revenue by +33%. (That is, they want to collect 80 billion$ in revenue, not 60 billion$, so an extra 20 billion on their base of 60 billion$).

And 46 cents plus 33.33% is 61 cents. That's very comparable to Canada's 63 cents.
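As a quick check of that arithmetic, here is a minimal sketch using only the figures quoted above. The one assumption, which the post does not test, is that mail volume stays the same when the stamp price rises.

```python
# Figures quoted in the post (billions of dollars, cents per stamp).
usps_expenses = 80.0
usps_revenue = 60.0
stamp_cents = 46.0

# Revenue must grow from 60 to 80, i.e. by 20 on the current base of 60.
required_increase = (usps_expenses - usps_revenue) / usps_revenue

# Assumption (not from the post): mail volume is unchanged by the price hike,
# so the stamp price has to rise by the same percentage as revenue.
breakeven_stamp = stamp_cents * (1 + required_increase)

print(f"required revenue increase: {required_increase:.1%}")
print(f"breakeven stamp price: {breakeven_stamp:.1f} cents")
```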

FedEx and UPS also operate in Canada, so the competition is also there.

So, what is stopping Congress from authorizing the 63 cent stamp? (In which case, there's going to be a MASSIVE run on the Forever Stamp, making it one of the best investments ever, and almost certainly creating a secondary market of businesses whose sole function will be to sell the Forever Stamp at somewhere between 46 and 63 cents.)

(17) Comments • 2013/02/08 • Blogging

Should we prefer a spread in forecasts?

By Tangotiger

All other things equal? Glenn talks about something that Colin brought up.

Let's take a step back first. Marcel, as we all know, gives an identical forecast for any player who has never played in MLB. And, as we learned, Marcel has a smaller margin of error than virtually every forecasting system.

Hold on to your hats here. Ready? These are the players that are Pure Rookies. They had no prior MLB history for any system to draw from. Marcel decided to give a blanket .335 forecast for each player, while the other four systems relied on their minor league stats.

System	wOBA	Error
Actual	0.319	0.0000
Chone	0.306	0.0436
Marcel	0.335	0.0416
Oliver	0.320	0.0414
Pecota	0.313	0.0430
Zips	0.307	0.0439

First off, we see forecasts all over the place. While the group of Pure Rookies hit .319 wOBA, the other four systems forecasted .306 to .320. Marcel of course was exactly .335.

But, look at the error term: Marcel nearly won! And Chone, which was leading in each sub-category, took a bit of a hit here. Chone, along with Zips, forecasted the overall mean too low, and their error terms were the highest. Not that any of the systems really redeemed themselves here.

I go on and discuss at length the selection bias issue. Go over there to read about it. Anyway, let's set aside the selection bias issue. Let's presume there isn't one. And we don't even have to talk about wOBA.

We can simply talk about a team's pre-season W/L forecast. And we can do it for NFL. If I predict 8-8 for every single team, I'll be off by, I dunno, say an average of 3 wins per team. But, if someone else starts forecasting 11-5 and 7-9 and 3-13, and they ALSO end up with an average error of 3 wins per team, is that better? What if they forecast one team for 4-12 and they end up 9-7. Should we be happy that that's the price we pay for predicting a team to go 13-3 and they in fact go 14-2?

Let's say it is.

Let's take another case where I forecast 8-8 for every team, and I'm off by 3 wins per team. And someone else has forecasts all over the place from 3-13 to 13-3, and they are off by 3.5 wins per team. Who made the better forecast?

In my case, the standard deviation of my forecasts was exactly 0, while the other guy had a standard deviation in his forecasts of say 2.3 wins. And let's say we actually observe 2.8 wins as a standard deviation. So, yes, he was better able to forecast the league-wide spread. But, was the price too high? That he was off by 3.5 wins per team in making that kind of spread in forecast, is that necessarily better than me being off by 3 wins by having zero spread in team forecasts?
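To make the two yardsticks concrete, here is a small sketch that scores a flat forecast and a spread-out forecast on both average error and spread. The eight-team league and both sets of forecasts are invented for illustration; they are not real NFL results, and the spreads they produce are not the 2.3 and 2.8 quoted above.

```python
from statistics import mean, pstdev

def mean_abs_error(forecasts, actuals):
    """Average number of wins the forecast missed by, per team."""
    return mean([abs(f - a) for f, a in zip(forecasts, actuals)])

# Hypothetical 8-team league (made-up numbers, wins out of 16).
actual = [14, 11, 10, 9, 7, 6, 4, 3]
flat = [8] * len(actual)             # 8-8 for everyone
spread = [9, 5, 11, 6, 10, 10, 5, 8] # forecasts all over the place

for name, forecast in (("flat", flat), ("spread", spread)):
    print(
        f"{name:>6}: avg error = {mean_abs_error(forecast, actual):.2f} wins, "
        f"forecast SD = {pstdev(forecast):.2f} wins"
    )
print(f"actual SD = {pstdev(actual):.2f} wins")
```

In this toy league the flat forecast wins on average error while the spread forecast comes closer to the observed standard deviation, which is exactly the tension the question is about.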

Where's the tradeoff here?


(20) Comments • 2013/02/06 • Forecasting • Statistical_Theory • Football

Tuesday, February 05, 2013

Example of Replacement-Level players

By Tangotiger

Dave does a good job at identifying replacement-level players. In order to (partially) combat the selection bias that Dave correctly noted, he should present the 2011 season as separate from the 2012 season.

To combat selection bias, we don’t want to just focus on what these players did last year, as the fact that we’re identifying players who were forced to sign minor league deals means that we’re starting with a group that likely underachieved last year. A replacement level player who overperformed in 2012 likely secured his place on a 40 man roster for the winter, removing him from the pool of players available to sign minor league deals or get passed through waivers. So, we need to adjust for the fact that the 2012 performances are likely a bit below their actual talent levels, and we can simply adjust by looking at a larger pool of data. We don’t want to go back too far, of course, as many of these guys are aging players who aren’t what they used to be, so to try and come up with a balance of a larger but still relevant sample, we’ll simply focus on how these 24 players did over the last two years.

So, everything he said was correct, except the last three words. He should have focused on 2011, as that (mostly) represents an unbiased estimator of the group's talent in 2013.

I'd also like to see the study repeated for pitchers. I think we might get some "weirder" results.

(9) Comments • 2013/02/05 • Talent_Distribution

Method of Baseball Reference’s madness

By Tangotiger

In response to discussion of Paul Abbott on Bill James' site:

Paul Abbott: his Wins Above Replacement (WAR) as calculated by Baseball Reference (which I note as rWAR) in 2000 was 2.3 wins, while in 2001 was 1.1. That 1.2 wins difference is explained in part because Forman has calculated that his opponents in those years were stronger in 2000 than 2001 (by 0.45 runs per 9IP), as well as his fielders were better in 2001 than 2000 (by coincidentally also 0.45 runs per 9IP). After also considering park factors, Forman figured an average pitcher in Abbott's context in 2000 would allow 5.00 runs per 9IP and in 2001 would allow 4.28. Since he gave up 4.47 in 2000 (0.53 runs per 9IP better than average) and 4.36 in 2001 (0.08 runs worse than average), we have a 0.61 run per 9IP gap. Which for the seasons in question is about a 12 run gap, or 1.2 wins. This is not to suggest that I agree or can support all of Forman's assumptions in those calculations, but simply that there was a method to what seemed like madness.
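The chain from runs per 9IP to wins can be sketched as below. The run averages are the ones quoted above; the innings total and the runs-per-win rate are assumptions chosen to be consistent with "about a 12 run gap, or 1.2 wins", not Baseball Reference's actual inputs.

```python
# Runs allowed per 9 innings, as quoted in the paragraph above.
avg_2000, abbott_2000 = 5.00, 4.47
avg_2001, abbott_2001 = 4.28, 4.36

above_avg_2000 = avg_2000 - abbott_2000   # positive: better than average
above_avg_2001 = avg_2001 - abbott_2001   # negative: worse than average
gap_per_9 = above_avg_2000 - above_avg_2001

# Assumptions (not stated in the post): roughly 175 innings in each season
# and about 10 runs per win.
INNINGS = 175
RUNS_PER_WIN = 10

run_gap = gap_per_9 * INNINGS / 9
win_gap = run_gap / RUNS_PER_WIN

print(f"gap per 9IP: {gap_per_9:.2f} runs")
print(f"run gap: {run_gap:.1f} runs, win gap: {win_gap:.1f} wins")
```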

***

Paul Abbott was assigned a record of 17-4 for those historical 2001 Mariners, while a 9-7 record for the 2000 Mariners. For whatever it's worth, Sean's neutralizing function gives Abbott a 9-10 record in 2000 and 7-10 in 2001.

Note that Sean's neutralizing function actually assumes that the pitcher will receive a normal distribution of runs, and recasts his performance as if it were randomly set against that distribution. This is a bit hard to explain, but:

If a pitcher did in fact face a normal distribution of runs, in a normal park, under a normal environment, his neutralized W-L record can STILL be massively affected, because Sean ALSO presumes that the pitcher's performance (that is, his runs allowed) should have been randomly distributed.

So, he's not only neutralizing the player's environment, but he's also neutralizing any "timing" component of his runs allowed. I don't know if I necessarily agree (or disagree) with that approach.

What's weird is that he doesn't do that for hitters, with respect to runs scored and RBIs. In that case, a leadoff hitter like Rickey will maintain his massive gap in R and RBI, thereby preserving some part of that context. I didn't check to see if a guy's "clutch" performance (i.e., driving in far more runners than his seasonal lines would suggest) is preserved.

Anyway, no one ever talks about it, so I'm putting it out there.

(16) Comments • 2013/02/11 • Linear_Weights

Discussion of the stats landscape

By Tangotiger

A decent effort to lay out the discussion. First the old school, then some of the basic new school stats.

There are some errors, for example:

Lots of people like to use the statistic OPS (OBP+SLG) as a quick, shorthand way of combing all of these stats. The caveat to this is thus; is a “point” of on-base percentage equal to a “point” of slugging? No, it is not; the slugging point is worth more because of what it represents.

Unfortunately, that's not true. As I've shown in the past, the best weighting of OBP and SLG is roughly 1.7 OBP for every 1 SLG.
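A minimal sketch of what that weighting does in practice, using the 1.7 figure above. The two batting lines are hypothetical, and `weighted_ops` is just an illustrative name, not an established stat.

```python
OBP_WEIGHT = 1.7  # from the post: roughly 1.7 OBP for every 1 SLG

def ops(obp, slg):
    """Plain OPS: a point of OBP counts the same as a point of SLG."""
    return obp + slg

def weighted_ops(obp, slg, obp_weight=OBP_WEIGHT):
    """OPS with OBP weighted more heavily (illustrative helper)."""
    return obp_weight * obp + slg

# Two hypothetical hitters with the same plain OPS of .800.
on_base_guy = (0.400, 0.400)
slugger = (0.300, 0.500)

for name, line in (("on-base guy", on_base_guy), ("slugger", slugger)):
    print(f"{name}: OPS = {ops(*line):.3f}, weighted = {weighted_ops(*line):.3f}")
```

Plain OPS calls the two hitters equal; with the heavier OBP weight, the on-base hitter comes out ahead.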

(3) Comments • 2013/02/05 • Linear_Weights

PITCHf/x park effects

By Tangotiger

Good job by Jon to present a basic version of park effects.

You can of course refine it further, but I think his basic version provides enough information to show that there is some park-to-park variance in the data.

(13) Comments • 2013/02/05 • Ball_Tracking

Ubermodels are…

By Tangotiger

models that capture the involvement of various entities in games to estimate each of their impact toward winning

Involvement: participation, without necessarily attribution of responsibility, skill, or luck

Entity: player, manager, umpire, park, weather, loving hand of god, cruel hand of fate

Estimate: approximate calculation whose rough value can be derived in multiple ways

***

Can we all agree on this?

(3) Comments • 2013/02/05 • Linear_Weights

Monday, February 04, 2013

Match-fixing in Europe

By Tangotiger

The story.

(10) Comments • 2013/02/12 • Soccer

Fangraphs Plus

By Tangotiger

Make your annual five dollar donation to Fangraphs to say "thank you", and get plenty of "you are welcome" gifts from the good group of analysts there. I think it's obvious that David has done more than anyone to make wOBA, FIP and other things I've dabbled in as ubiquitous as possible, not to mention the other 99% of the things that make his site great, and you should do your best to show your appreciation for what David is doing over there. It's really insane that he does all that he does without charging subscription fees, while still paying his writers.

() Comments • • Books

Spread of offense v defense

By Tangotiger

Phil makes the point that while the spread in offense and defense is roughly the same in MLB, the NHL has a larger spread in defense than offense. And the reason should be clear: goalies. I agree with basically everything Phil said in there.

(15) Comments • 2013/02/06 • Talent_Distribution • Hockey

When do QB metrics stabilize?

By Tangotiger

Great stuff here! Love to see the techniques I use on baseball being applied to other sports. So, yards per attempt has an r=.50 after about 800 attempts. Basically, that means you need nearly two years for half the metric to be considered signal and half to be noise. He goes through some metrics, and determines:

Stat	Formula	Stabilizes	Seasons
Sack%	Sack / Dropback	around 400 dropbacks	0.75
Comp%	Comp / Att	around 500 attempts	1.00
YPA	Yards / Att	around 800 attempts	1.60
YPC	Yards / Comp	around 650 completions	2.15
TD%	Pass TD / Att	around 2250 attempts	4.50
INT%	INT / Att	around 5000 attempts	10.00
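One common way to use a table like this is to treat the "stabilizes" figure as the sample size k at which r = .50, assume r = n / (n + k) at other sample sizes, and regress the observed rate toward the league mean by 1 - r. That functional form is an assumption here, not something the linked study is quoted as using, and the QB and league figures in the example are made up.

```python
# Sample sizes at which r is about .50, copied from the table above.
# (YPC is counted in completions, the rest in attempts or dropbacks.)
STABILIZATION = {
    "Sack%": 400,
    "Comp%": 500,
    "YPA": 800,
    "YPC": 650,
    "TD%": 2250,
    "INT%": 5000,
}

def reliability(n, k):
    """Assumed share of signal after n trials, given r = .50 at k trials."""
    return n / (n + k)

def regress(observed, league_mean, n, k):
    """Shrink an observed rate toward the league mean by the noise share."""
    r = reliability(n, k)
    return r * observed + (1 - r) * league_mean

# Hypothetical QB: 8.0 yards per attempt over 400 attempts, league mean 7.0.
estimate = regress(8.0, 7.0, n=400, k=STABILIZATION["YPA"])
print(f"reliability: {reliability(400, STABILIZATION['YPA']):.2f}")
print(f"regressed YPA estimate: {estimate:.2f}")
```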

Note that because he compared a QB's numbers within the same year, something like TD rate does not necessarily reflect the QB's skill; it may instead be biased by his receivers.

Interception rate being so low is also interesting. That may be because in order to survive, you can't throw a lot of interceptions to begin with. In order to find a signal in something, you need to have samples that have a large range in talent to begin with. If everyone has low interception rates, then it's harder to find out who is really really low in interception rates, and who is really low, and who is low.

This is why something like save percentage for goalies has low correlation: if you aren't saving pucks, you aren't going to be in the league long enough to be part of the sample. And this is why strikeout rates have high correlation: you CAN survive in the league if you have a very low or a very high K rate. So, with a large range in talent, you need less sample for the signal to get through.
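The link between talent spread and required sample can be sketched for a simple yes/no rate. For a binomial rate with league mean p and a true-talent standard deviation s, the random noise variance over n trials is p(1-p)/n, so signal and noise are equal (r = .50) when n = p(1-p)/s². The rate and the two talent spreads below are hypothetical, chosen only to show the direction of the effect.

```python
def trials_for_half_signal(p, talent_sd):
    """Trials needed for r = .50, treating each trial as a binomial event.

    Noise variance over n trials is p * (1 - p) / n; setting it equal to
    the true-talent variance and solving for n gives the result.
    """
    return p * (1 - p) / talent_sd ** 2

# Hypothetical league where the event happens 20% of the time.
wide_talent = trials_for_half_signal(0.20, talent_sd=0.04)
narrow_talent = trials_for_half_signal(0.20, talent_sd=0.01)

print(f"wide talent spread:   about {wide_talent:.0f} trials")
print(f"narrow talent spread: about {narrow_talent:.0f} trials")
```

Cutting the talent spread to a quarter multiplies the required sample by sixteen, which is why a league that filters out the bad performers ends up with stats that take much longer to show a signal.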

Anyway, so that explains the sack rate stabilizing so fast: it's highly biased by the team's line, and staying away from sacks is not a primary requirement to being a QB (though it is a secondary one).

(6) Comments • 2013/02/04 • Statistical_Theory • Football

WAR hammer for nail question

By Tangotiger

Dave does a good job of contrasting WAR to the other metrics, and why it has particular uses. My favorite part though is when he dismisses the Pitcher Wins as an answer, since the question is one that no one asks to begin with:

How many times did that pitcher complete at least five innings, leave the game with his team having outscored the opponent through the point at which he was removed, and then watch his relievers finish the game for him without surrendering the lead that his teammates helped create in the first place?

(40) Comments • 2013/02/05 • Linear_Weights

Sabermetric Super Bowl

By Tangotiger

I don't watch enough football to remember to have Brian's live win probability chart up. One thing that I like to know is if a team that is trailing has more than a 50% chance of winning.

It's more obvious to see in baseball, where if you are down by 1 in the bottom of the ninth, no outs, and bases loaded, it's the defense that's sweating bullets, not the offense. But, when does it flip over to the offense? Well, it just tips over with runners on first and second, no outs, down by 1, or runner on third, down by 1. (It's often the case that having runners on first and second is equivalent to just having a runner on third.)

Brian's site is (now) blocked at the office. Can someone tell me if there was ever a point where the trailing team had more than a 50% chance of winning?

The other fascinating play was at the end, with the safety. I think the broadcast team did a great job to bring it up at all, and it was interesting to hear their off-the-cuff unprepared analysis for it, saying they wouldn't do it. But, they totally didn't consider that they could run eight seconds off the clock. Even without the clock-running, it would seem it might have been more than breakeven to go for it. I'll wait for Brian's analysis on that too. But, I thought the director blew it by not showing us the defense formation from a high view, and showing what the defense was going to do about it. I think in that case, it demanded a bird's eye view.

Any other high-leverage strategy plays?

(29) Comments • 2013/02/06 • Football

Mozeliak talks the numbers game

By Tangotiger

Blogger Anna gives us his insight:

"There is no perfect stat, but when you look at trying to define Wins Above Replacement, it is a very simple place to grab information and get a feel for it," he said.

There are several versions of WAR out there, and while he looks at all of them, he does have a favorite.

"I use our own internal system, because that's what I'm most familiar with and also in the past five, six years I feel like we've made very good decisions based on it," Mozeliak said. "My confidence in it is very strong."


() Comments • • MLB_Management

Baseball Bucket List

By Tangotiger

I don't know that this blogger has the right idea about what a bucket list is. There should be some level of certainty and planning possible. Being at a ballpark to witness an inside-the-parker is a freak occurrence (though, I'll raise those stakes by saying I watched an inside-the-parker walk-off in extra innings... thank you Marquis Grissom!). And easily my most memorable moment is when (deaf) Curtis Pride got his first MLB hit, and as he's standing on second base, we banged and stomped as loud as we could, on the idea that we could somehow make him hear. He said that he felt the vibrations of the stadium beneath his feet. Still, not bucket list.

Anyway, baseball bucket list? I don't really have a desire to go to every single ballpark, nor to meet any players. I was at the Cup-clinching game in Montreal 1993. So, maybe go to a Game 7? But, the reason that 1993 Cup game was memorable was because we won, and I don't know that I'd want on my bucket list to attend a game on the chance they'd win the Cup, only to not win. Again, I don't think that's bucket list material.

What do you got?

(2) Comments • 2013/02/04 • History

Sunday, February 03, 2013

Lachemann Brothers

By Tangotiger

A very nice story on the Lachemann brothers. I didn't realize this:

“It was the highest up and lowest down in my entire life,” Lachemann said. “Ten times we had two strikes and couldn’t put the game away.”

() Comments • • History

Saturday, February 02, 2013

Cross-era comparisons

By Tangotiger

I know everyone loves to do it. You have to choose whether you are comparing two players on their own, or whether you are comparing how the two players would do, if they had access to identical environments (nutritional, equipment, training, etc). That is, are we transplanting the 1936 version of Jesse Owens, or are we transplanting his grandparents so that Owens would be born at the same time and place as Usain Bolt?

Rally also brought up the following point, which I posted to Bill James:

In football, basketball, hockey, we wouldn't think of comparing the best 2001 team to the best 1954 team, and think that the 1954 team could beat the 2001 team. Someone at my site suggested it's because baseball had an earlier start historically (say 30 years before basketball and hockey), and so, we need to shift our perspective by 30 years, so that we get to a plateau like we might with baseball. I don't buy that argument in the least. What do you think?

Asked by: tangotiger

Answered: 2/1/2013

Oh, I certainly buy that argument. MOST of the improvement in baseball skills occurred before the NFL was organized in 1920 or whenever it was. Baseball gets better, but the pace at which baseball is improving has certainly been cut down by the passage of time.

() Comments • • History • Talent_Distribution

Pre-response to anti-WAR

By Tangotiger

Crashburn linked to Caple's sentiments on WAR.

I thought Mark Simon's objection from two years ago is sufficient. I added a couple of my thoughts at the time.

As far as I can tell, all the anti-WAR sentiment is really missing the forest for the trees.

(1) Comments • 2013/02/22 • Linear_Weights

Friday, February 01, 2013

Flashback: Pedro hits Reggie Sanders, breaks up perfect game

By Tangotiger

Weirdly, the Reds announcers talked about "no hitter" and "gem". The MLB person who shows the caption talks about a "no hitter intact". But it was a perfect game! Pedro was 22-up, 22-down at that point. I don't know who those announcers were 19 years ago, but defending Reggie Sanders?

(7) Comments • 2013/02/02 • History

Pitching rotation

By Tangotiger

Bill James made a proposal on his site about how to change the rotation setup. Instead of having gone from a 4-man to 5-man, Bill proposed going the other way, down to a 3-man rotation. Don goes through Bill's comment about PAP in a fair way.

Anyway, Bill proposed a three-man rotation of 54 starts each, but a ceiling on pitches, much like we saw with the Rockies for three weeks.

When it comes down to it, you can probably make as good a case for a three-man as you can for a 4-man, 5-man, and 6-man rotation. Or even a mix, so that some guys start every three days and others start every five days, and others every seven days. Naturally, the guys starting every three days are going to get pulled far earlier than the other pitchers.

Since every pitcher is different, it's obvious that every pitcher needs to be treated differently.

What will stop any change however is the Won/Loss rule. You can only get a win as a starting pitcher if you pitch at least five innings, but you can get a loss any time. Which is of course a silly rule. So, if you have a starting pitcher scheduled to go for say 60-80 pitches, there's really no reason to make him start the game. He can just as easily come in relief in the third inning, after the first starting pitcher gets pulled for a pinch hitter.

Anyway, lots of potential crazy setups.

(4) Comments • 2013/02/02 • In-game_Strategy

