[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
THE BOOK cover
The Unwritten Book
is Finally Written!

Read Excerpts & Reviews
E-Book available
as Amazon Kindle or
at iTunes for $9.99.

Hardcopy available at Amazon
SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
Shop Amazon & Support This Blog
RECENT FORUM TOPICS
Jul 12 15:22 Marcels
Apr 16 14:31 Pitch Count Estimators
Mar 12 16:30 Appendix to THE BOOK - THE GORY DETAILS
Jan 29 09:41 NFL Overtime Idea
Jan 22 14:48 Weighting Years for NFL Player Projections
Jan 21 09:18 positional runs in pythagenpat
Oct 20 15:57 DRS: FG vs. BB-Ref

Advanced

Tangotiger Blog

A blog about baseball, hockey, life, and whatever else there is.

Wednesday, February 08, 2017

Do Markov chains work in baseball?

?Something interesting came as a result of the Patriots comeback, where the "models" had them at 99.5%+ of losing at their peak, and 95%+ at the half, while the bettors had them at under 90% at the half.  

The way the BASIC models work is very simple math: just use a transition matrix for various situations that do NOT consider the score, nor the time remaining.  This is the way I do it for baseball.  But I must admit, I never actually tested it.  

In football, we may consider that if a team has a 21+ point lead that the two teams are going to play radically different.  I know this is somewhat true in hockey, where the team that is DOWN by 2 goals is more likely to score the next goal then the team that is up by 2.  This happens because the leading team giving up a goal (so they are only up by 1) costs more than the leading team gaining a third goal.  So, what they basically want to do is reduce the goals they allow by say 20%, at the cost of themselves scoring by say 30%.  It's a "small ball" kind of tactic.  At the same time, the team that is behind want to increase the goals they score by 20% even if it means increasing the goals they allow by 30%.  The net effect is that it does NOT cancel out.

So, we know hockey teams play differently.  We suspect that maybe football teams play differently, hence the idea that the Patriots had a 99.5% chance of winning is probably wrong.  Indeed, someone tweeted out that when you look at teams where the model said they had a greater than 95% chance of winning, they actually ended up winning less than 90% of the time.  The models, therefore, were too basic.

How about baseball?  Well, that was a great idea, so I applied it to baseball, 2010-2016.  I looked for all games where the home team FIRST had a 95%+ chance of winning, prior to the 5th inning.  Remember,  my Markov chain is based just on the run expectancy, and so, is unaware of the change of strategy.  Does it matter?  So, there were 1122 games that met the criteria.  The average estimated win probability was 0.958.  The actual number of wins was 1082 and actual losses was 40, for a win% of 0.964.  So, that one works.

How about a 99%+ chance of winning, in the 8th or later innings?  The average estimated was 0.994  The actual was 4268 wins and 11 losses, for an actual of 0.997.

It seems therefore that in baseball, when we say that the chance of a comeback is 99%, we actually do mean it is 99%.

(11) Comments • 2017/02/09 • Run_Win_Expectancy

Latest...

COMMENTS

Feb 07 15:38
Aging Curve - Swing Speed

Feb 06 11:55
Batting Average as a proxy for fun!  Batting Average as a proxy for fun?

Feb 03 20:21
Valuation implication of straying from the .300 win% replacement level

Jan 31 13:35
Breaking into the Sports Industry WITHOUT learning to code

Jan 26 16:27
Statcast: Update to Catcher Framing

Jan 19 15:02
Young players don’t like the MLB pay scale, while veteran stars love it

Jan 14 23:32
Statcast Lab: Distance/Time Model to Catcher Throwing Out Runners

Jan 07 13:54
How can you measure pitch speed by counting frames?

Jan 02 17:43
Run Value with runners on base v bases empty

Dec 28 13:56
Run Values of Pitches: Final v Intermediate

Dec 27 13:56
Hall of Fame voting structure problem

Dec 23 19:24
What does Andre Pallante know about the platoon disadvantage that everyone else does not?

Dec 21 14:02
Run Values by Movement and Arm Angles

Dec 18 20:45
Should a batter have a steeper or flatter swing (part 2)?

Dec 18 16:19
Art and Science of WAR: Deriving the zero-baseline, historically

THREADS

February 08, 2017
Do Markov chains work in baseball?