[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
THE BOOK cover
The Unwritten Book
is Finally Written!

Read Excerpts & Reviews
E-Book available
as Amazon Kindle or
at iTunes for $9.99.

Hardcopy available at Amazon
SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
Shop Amazon & Support This Blog
RECENT FORUM TOPICS
Jul 12 15:22 Marcels
Apr 16 14:31 Pitch Count Estimators
Mar 12 16:30 Appendix to THE BOOK - THE GORY DETAILS
Jan 29 09:41 NFL Overtime Idea
Jan 22 14:48 Weighting Years for NFL Player Projections
Jan 21 09:18 positional runs in pythagenpat
Oct 20 15:57 DRS: FG vs. BB-Ref

Advanced

Tangotiger Blog

A blog about baseball, hockey, life, and whatever else there is.

Tuesday, July 29, 2014

Regressions run wild

I like the lead-in here.  But then when it gets to the regression... well, you know how I feel about using regressions as a final step.  According to the results, the difference between a triple and a double is 0.62 runs, more than double the actual difference. 

It's also not clear how the outs were handled.  Was all the data done at the per-game level, so that (basically) removes the out as a variable??

The author has some good ideas when he talked about OBP, focusing on the idea of the 3+ runners reaching base in an inning.  That's one of the things that BaseRuns for example did NOT control, the fact that you cannot average more than 3 runners left on base per inning.  Obviously, at the levels we're talking about, it's not going to happen in MLB, or really any league.  But I've wanted to see a model that adhered to the constraint that there are three bases (and how you can make that look nice, like BaseRuns).

So, I don't know what to make of it all.  The author is motivated and willing and has good ideas.  But darn that regression upsets me.  It's stuff we were talking about ten years ago and moved on from.  Regressions are useful first steps, but it can't override logic.

(9) Comments • 2014/08/04 • Statistical_Theory

Latest...

COMMENTS

Feb 19 11:05
Bat-Tracking: Timing Early/Late

Feb 07 15:38
Aging Curve - Swing Speed

Feb 06 11:55
Batting Average as a proxy for fun!  Batting Average as a proxy for fun?

Feb 03 20:21
Valuation implication of straying from the .300 win% replacement level

Jan 31 13:35
Breaking into the Sports Industry WITHOUT learning to code

Jan 26 16:27
Statcast: Update to Catcher Framing

Jan 19 15:02
Young players don’t like the MLB pay scale, while veteran stars love it

Jan 14 23:32
Statcast Lab: Distance/Time Model to Catcher Throwing Out Runners

Jan 07 13:54
How can you measure pitch speed by counting frames?

Jan 02 17:43
Run Value with runners on base v bases empty

Dec 28 13:56
Run Values of Pitches: Final v Intermediate

Dec 27 13:56
Hall of Fame voting structure problem

Dec 23 19:24
What does Andre Pallante know about the platoon disadvantage that everyone else does not?

Dec 21 14:02
Run Values by Movement and Arm Angles

Dec 18 20:45
Should a batter have a steeper or flatter swing (part 2)?

THREADS

July 29, 2014
Regressions run wild