[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
THE BOOK cover
The Unwritten Book
is Finally Written!

Read Excerpts & Reviews
E-Book available
as Amazon Kindle or
at iTunes for $9.99.

Hardcopy available at Amazon
SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
Shop Amazon & Support This Blog
RECENT FORUM TOPICS
Jul 12 15:22 Marcels
Apr 16 14:31 Pitch Count Estimators
Mar 12 16:30 Appendix to THE BOOK - THE GORY DETAILS
Jan 29 09:41 NFL Overtime Idea
Jan 22 14:48 Weighting Years for NFL Player Projections
Jan 21 09:18 positional runs in pythagenpat
Oct 20 15:57 DRS: FG vs. BB-Ref

Advanced

Tangotiger Blog

A blog about baseball, hockey, life, and whatever else there is.

Tuesday, January 12, 2016

Timeline adjustments

?This one is based on hockey, but the similar issues applies in baseball.  And the author proposes something similar to what I've done in the past.

The main issue is "average", that it treats average as a constant throughout time.  By forcing that as the centering point, everything revolves around it.  However, once you've got average, you have to worry about the spread around the average.  In baseball, Michael Shuckers I believe gets around that by transforming all the performances so that the spreads for each season are the same.  (I think that's what he does.)  So, now we've standardized the mean and the spread.  Which of course means that we're forcing a constant across time.

Whether that is a time period with six teams or thirty teams.  As you can see, you have to be careful with standardizing data, just because you want to standardize the data.  It has to make sense.  As Bill James noted 30 years ago: if you are going to multiply A by B, there better damn be a good reason to multiply A by B. 

We talk about Sandy Koufax and his innings and Pedro and his innings.  Koufax for example had the same number of pitches thrown in his 6 best year stretch as Randy Johnson did.  Except Koufax did NOT lead his league.  Randy Johnson led his league easily.  As the blogger notes in discussing hockey in his link, the focus, instead of being the average and the spread of all players, that maybe it should be the top-end players.

I agree.  This will give you a better baseline.  Of course, you have to be careful as to how to select the quantity of players.  This is what I did when Sosa reached 500 HR, I guess some 15 years ago.  I compared Sosa to the 10th place in HR.  I did that for every hitter.  And I just baselined everyone so that the 10th place in HR in every year was the constant.

See, the issue always comes back to the centering point.  Whether I chose the exact 10th place, or the 5th through 15th place, or as the blogger did 3rd through 12th, we're still taking a position as to what is the constant.

In any case, I do agree that the best way to do these comparisons is to use a much higher baseline than "average".  The key though is figuring out this baseline, and how even that will change as the playing context changes.

Latest...

COMMENTS

Nov 23 14:15
Layered wOBAcon

Nov 22 22:15
Cy Young Predictor 2024

Oct 28 17:25
Layered Hit Probability breakdown

Oct 15 13:42
Binomial fun: Best-of-3-all-home is equivalent to traditional Best-of-X where X is

Oct 14 14:31
NaiveWAR and VictoryShares

Oct 02 21:23
Component Run Values: TTO and BIP

Oct 02 11:06
FRV v DRS

Sep 28 22:34
Runs Above Average

Sep 16 16:46
Skenes v Webb: Illustrating Replacement Level in WAR

Sep 16 16:43
Sacrifice Steal Attempt

Sep 09 14:47
Can Wheeler win the Cy Young in 2024?

Sep 08 13:39
Small choices, big implications, in WAR

Sep 07 09:00
Why does Baseball Reference love Erick Fedde?

Sep 03 19:42
Re-Leveraging Aaron Judge

Aug 24 14:10
Science of baseball in 1957

THREADS

January 12, 2016
Timeline adjustments