APBRmetrics

DSMok1 · Last edited by DSMok1 on Tue Oct 26, 2010 12:11 pm; edited 1 time in total

Advanced Statistical Plus/Minus

I've been working on deriving a new SPM regression based purely upon "advanced" stats (like TS% and OR%) for some time now. I feel comfortable enough with the results thus far to release the first iteration of this SPM.

The data used: Neil Paine's collection of 1-Yr APM's (unfortunately without std err's; I estimated the standard errors for weighting purposes), Joe Sill's 4 Year RAPM's, with the regression toward 0 backed out, and finally (and most importantly) Steve Ilardi's 6-Year APM's posted on this forum. These 6 Year APM's had quite low errors, and provided the groundwork for this regression. I weighted each player in the regression by 1/stderr^2, where stderr is their APM standard error.

I then compiled the advanced metrics from the Basketball Reference Play Index for each player, and weighted-averaged the multi-year data (including playoffs, for the APM's that included those). Thus I have 3 APM data sets and the associated advanced statistics.

I experimented with a number of constructions for the rebounding and especially the scoring parts of this regression. Finding a good way to relate turnovers, shooting, usage, and assists proved illusive for some time. I finally now have a construction I am comfortable with, though (like with any construction) there are a few holes.

To avoid over-weighting steals and blocks for defense, I also included offensive rating and defensive rating of the teams. This is not included in the final SPM, because the team adjustment (to make the teams sum to their efficiency differentials) already accounts for this.

Here are the factors in this regression:

Ilardi · Joined: 15 May 2008 Posts: 265 Location: Lawrence, KS

Nice work, DSM: this looks like an important contribution.

A couple of quick questions:

a) Can you provide standard error (se) estimates for the SPM values?

b) Did you consider using any of the advanced metrics from 82games? I've always thought eFG% Allowed would be quite useful in an SPM model . . .

c) What is the correlation between your SPM values for each player and his corresponding APM value? (i.e., the zero-order correlation for the entire league)

d) Any plans for "out-of-sample testing" on this new SPM metric (a la Joe Sill)?

DSMok1 · Posted: Wed Jul 14, 2010 5:38 pm Post subject:

Ilardi · Joined: 15 May 2008 Posts: 265 Location: Lawrence, KS

Thanks: and most guys on the forum call me 'Steve'.

I'd have to get a consult to figure out how to calculate se's on a nonlinear metric like that, but I know it must be do-able. Perhaps someone on this forum can point the way to a workable approach?

As for the correlation between SPM and APM, I might suggest using the 08-09 season, for which you have my 6-season estimates (weighted heavily toward 08-09), as well as your own SPM values.

On the out-of-sample test: presumably it would be possible to calculate SPM values for each player based on games through, say, the first 4 months of last season, and then use those estimates to predict results of the final 2 months. (Same basic approach Joe used with his ridge regression APM numbers.) It would be a fair amount of work, but should be easily do-able, at least in principle.

DSMok1 · Posted: Wed Jul 14, 2010 11:47 pm Post subject:

Ilardi · Joined: 15 May 2008 Posts: 265 Location: Lawrence, KS

DSMok1 · Posted: Thu Jul 15, 2010 1:00 pm Post subject:

Ilardi · Joined: 15 May 2008 Posts: 265 Location: Lawrence, KS

I've also had Varajao rated highly using a more traditional APM approach . . .

DSMok1 · Posted: Sat Jul 17, 2010 12:27 pm Post subject:

Neil Paine · Joined: 13 Oct 2005 Posts: 774 Location: Atlanta, GA

Great work, DSMok!! I'm trying to replicate your work, and I had a question: how are you doing the team adjustment? What I always did was to find the minute-weighted average of each team's SPM and multiply by 5, then subtract that from the team's actual efficiency differential and divide the result by 5. But when I do that, my team adjustments don't match yours (ORL is +1.26, GSW is -1.70). Is it a rounding issue (I'm using the full, calculated versions of the BBR stats, while you used rounded versions), or is my team adjustment method incorrect?
_________________
http://www.basketball-reference.com/blog/

DSMok1 · Posted: Sat Jul 17, 2010 4:02 pm Post subject:

Ilardi · Joined: 15 May 2008 Posts: 265 Location: Lawrence, KS

[quote="DSMok1"]

DSMok1 · Posted: Sat Jul 17, 2010 5:28 pm Post subject:

DSMok1 · Posted: Mon Jul 19, 2010 9:53 am Post subject:

Mike G · Posted: Mon Jul 19, 2010 10:19 am Post subject: