APBRmetrics

DSMok1

Toward an Adjusted Plus/Minus Projection System

I have not seen a comprehensive discussion of the issues related to adjusted plus/minus projection, and wished to begin here.

There are several things to consider:
1) What current measure/past measures of APM should be used?
2) To what mean should the APM measure be regressed to approximate true talent level?
3) How should the regression to the mean be conducted?
4) What aging curve should be used?
5) How do you account for rookies and players who played few minutes?
6) How do you best project minutes?
7) Assembling a roster/minute projections for next season
8 ) Combining team APM to provide a Pythag Win%

-------------------------------------

1) After looking around at various measures, it seemed best to me to use a 1 year stabilized APM, that provided by Ilardi here: http://sonicscentral.com/apbrmetrics/viewtopic.php?t=2294. The results are recent and well stabilized with past results.

2) Does one use the total NBA population to regress to? That seems rather simplistic. That's rather like regressing BABIP to the league mean in MLB--wrong. So how should the population be estimated? Comparable players? Or how about players with similar minutes played (because if a player is played a large number of minutes, a certain expectation on the part of the coach is implied--and he knows more about the situation). This is my initial system: regress toward a mean of similar players in minutes played...

To do this, I binned the players in small clusters and regressed an APM mean curve and standard deviation curve onto the minutes played. The APM to minutes played was a linear with r^2 of nearly 0.7. The standard deviation / minutes curve was parabolic with r^2 of about 0.27 (standard deviation isn't so stable...). This curve indicates that the players near average are relatively well-understood, while those on either end are less predictable (which is a reasonable conclusion). What this allows me to do is to apply the Bayesian Inference system.

The exact relations, based on players with over 400 minutes played:

Neil Paine · Joined: 13 Oct 2005 Posts: 774 Location: Atlanta, GA

We've actually done some work like this at BBR, except with Statistical +/- (not pure Adj. +/-). The same basic tenets should hold true for both, though, since both are attempting to measure the same thing.

Ryan J. Parker · Joined: 23 Mar 2007 Posts: 711 Location: Raleigh, NC

I would like to see an emphasis on predicting team efficiency where we know the following information for each "shift":

The general idea is that players get injured or don't get played for whatever reason. I think a more interesting analysis comes in not actually trying to predict this sort of thing, and rather use previous season data to forecast what would happen if we knew which players would be on the court, and what situations these players would be in.

I'm not suggesting future season predictions aren't important, but there will be a lot of extra noise in these predictions when we really want to focus on a model strictly for predicting points scored by competing lineups.

In any event, I believe this would allow us to properly gauge how well a given model performs in predicting team efficiency. Thus even though we go in having measured (guessed?) what we think are the best predictors of future lineup efficiency, we can now actually test and see where these models do and do not perform well. We can examine low minute players, or situations where a player was given a new home, etc.
_________________
I am a basketball geek.

Crow · Joined: 20 Jan 2009 Posts: 825

Instead of just comparable players by minutes what about by minutes and whether PG. wing or big; or one of the existing similarity systems based on discrete stats and demographics or statistical +/- or a hybrid system of all?

DSMok1 · Posted: Sat Sep 12, 2009 12:07 pm Post subject:

I have delved further into the world of Bayesian statistics.

The basic Bayesian model is here:

Crow · Joined: 20 Jan 2009 Posts: 825

I hope you get even more feedback from those experienced with the calculations. But best wishes / full speed ahead either way.

"1) What current measure/past measures of APM should be used?

This is important and tough question. 1 year stabilized is what I am mainly using but 6 year average in good too. I don't know... what about something like 2/3 thirds 1 year stabilized, 1/3rd 6 year average (or just do the math to find the resulting year weights)? That might be "better".

2) To what mean should the APM measure be regressed to approximate true talent level?

I think this should be position specific at least to the level of PG, wing, big. Whether it should be done for starters/big minutes and subs I don't know but I'd at least consider it for a moment.

4) What aging curve should be used?

I think it should be specific to at least the 3 size/role groups I just suggested above.

5) How do you account for rookies and players who played few minutes?

For rookies I think some combination of draft position and college stats might be better than a 1 assumption fits all approach. For players who played few minutes I might adjust the values by team by quality of that team and system to some degree rather than treat all across the league exactly the same.

6) How do you best project minutes?

I think subjective assessment directly or after a experienced based algorithm could get closest but haven't explicitly tried it.

7) Assembling a roster/minute projections for next season

Ideally I'd think you'd want to try to get at time for 5 man lineups though this will be very difficult given how little obvious logic is being applied.

8 ) Combining team APM to provide a Pythag Win%

I haven't seen anyone report much detail about the covariances. Is it time? If you did, what do you do next? Can you usefully adjust further or just offer a blanket qualifier about the meaning of what you have?

I think the step from list of players on the court to unit production is key. If you had player pair (with and without) and perimeter and interior sub-unit adjusteds from last season maybe you could better estimate how player APM will actually combine into lineup APM and roll-up into team APM. At least better than essentially treating every context as the same for players when lots agree they are not. I don't know if you want to do this or share it but it would be great if you did. Maybe in an ultimate implementation you run all the possibilities thru a Monte Carlo simulator to guide the final choice.