APBRmetrics

Dan Rosenbaum · Posted: Fri Nov 17, 2006 2:35 pm

Dave Berri has a very nice discussion of PER, using a "model" to construct player ratings, and the use (or lack thereof) of statistical analysis in NBA decision-making.

http://dberri.wordpress.com/2006/11/17/a-comment-on-the-player-efficiency-rating/

WizardsKev · Joined: 03 Jan 2005 Posts: 460 Location: Washington, DC

Berri & company have raised an interesting point, and I'm kinda torn on it. Yes, shooting efficiency is extremely important -- there's no question. But, someone has to take the shots. Does treating all shot attempts as negative (which I think their metric does) reflect reality, though?

Like I say, I'm torn on the subject. I know that shooting efficiency is valuable, but at the same time, does it really hurt a team when Player A misses and his team gets the offensive rebound?

Dan, didn't you have a number showing a very low shooting percentage as a sort of benchmark for positive adjusted +/-?
_________________
If you can't explain it simply, you don't understand it well enough.

-- Albert Einstein

Dan Rosenbaum · Posted: Fri Nov 17, 2006 3:37 pm

The problem is that Wins Produced doesn't really use regression to answer this question. It assumes that the average points per possession throughout the league tells us all we need to know about the breakeven shooting percentage for the marginal shot.

It assumes players convert shots on the margin at the same rate as their average shot. And so if we remove those players who shoot a lot (and draw a lot of the focus of the defense), the shooting percentages of remaining players will not change. Wins Produced is very similar to Bob Chaikin's simulator along those lines. Coming from a sports economics literature dominated by baseball, it is very natural to make this assumption, since this issue does not come up in a sport where every player gets his turn at bat.

When I look at this issue empirically using adjusted plus/minus statistics, I tend to find that the breakeven shooting percentage is somewhere between what PER/NBA Efficiency uses and what Wins Produced uses. Wins Produced rightly addresses a problem with PER/NBA Efficiency, but in my opinion it so overcorrects that the resulting metric does a poor job explaining how players impact winning.
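To make the "breakeven shooting percentage" idea concrete, here is a minimal sketch. It is not Berri's, Hollinger's, or anyone's published formula. It assumes a missed shot costs the team the whole possession and ignores free throws, turnovers, and offensive rebounds; the function names and the sample numbers (1.0 points per possession, a 45% marginal conversion rate) are hypothetical.

```python
def breakeven_fg_pct(points_per_possession, shot_value=2):
    """FG% at which one more shot is worth exactly an average possession.

    Simplifying assumption: a miss ends the possession, so a shot taken at
    rate p is worth p * shot_value points and must match the points an
    average possession would have produced.
    """
    return points_per_possession / shot_value


def marginal_shot_gain(marginal_fg_pct, points_per_possession, shot_value=2):
    """Expected points gained (or lost) by taking one extra shot.

    The conversion rate that matters is the rate on the *extra* shot,
    which need not equal the player's average FG%.
    """
    return marginal_fg_pct * shot_value - points_per_possession


# Hypothetical league average of 1.0 points per possession.
ppp = 1.0
print(breakeven_fg_pct(ppp))                  # two-point shots
print(breakeven_fg_pct(ppp, shot_value=3))    # three-point shots
print(marginal_shot_gain(0.45, ppp))          # extra shot made 45% of the time
```

Under these assumptions the breakeven moves one-for-one with the points-per-possession figure plugged in, which is why the choice of that figure, and whether marginal shots convert at the average rate, drives so much of the disagreement between metrics.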

davis21wylie2121 · Joined: 13 Oct 2005 Posts: 183 Location: Atlanta, GA

I found this to be one of Berri's most well-reasoned posts at the WoW Journal. I've said it before -- PER doesn't have the "power of language", because it lacks units. You can say that it's kind of like Points Produced per 40 minutes, but it really isn't, because it mixes apples and oranges (defensive and offensive stats) right and left, the weights are sometimes totally subjective (Why is an assist worth .67 "points"? And frankly, saying that, "on an assist, the passer does one thing and the scorer does two things, so the passer must get 1/3 credit," just doesn't cut it), the league average is forced to be 15, etc. So, yeah, Berri has some legit gripes about NBAEff and PER, because Wins Produced is at least derived pseudo-scientifically.

If he's lurking, I wonder how John would respond to Berri's post (probably wishful thinking on my part, but still)...

Ben · Joined: 13 Jan 2005 Posts: 168 Location: Iowa City

bchaikin · Joined: 27 Jan 2005 Posts: 423 Location: cleveland, ohio

Dan Rosenbaum wrote: "It assumes players convert shots on the margin at the same rate as their average shot. And so if we remove those players who shoot a lot (and draw a lot of the focus of the defense), the shooting percentages of remaining players will not change. Wins Produced is very similar to Bob Chaikin's simulator along those lines... it is very natural to make this assumption..."

might you care to expound on what this assumption is that the simulator uses?...

asimpkins · Joined: 30 Apr 2006 Posts: 67

deepak_e · Joined: 26 Apr 2006 Posts: 200

Another thing that bothered me about crediting assists:

Does it really make sense for the passer and shooter to share credit on an assisted score, but for the shooter to be exclusively penalized on the miss?

davis21wylie2121 · Joined: 13 Oct 2005 Posts: 183 Location: Atlanta, GA

Basketball is troublesome in the way that it fails to lend itself to regression. In baseball, if I wanted to find the various weights (in runs) for all of the offensive events that lead to run scoring, I would simply regress team singles, doubles, walks, home runs, etc. against team runs scored, and come out with something like this. If I regressed the relevant offensive stats in basketball against points, however, I would always come out with this formula: Pts = (2*FG) + 3FG + FT, where 3FG is three-pointers made (each already counted once in FG). That identity holds exactly, so the regression leaves no points over for assists or offensive rebounds. In other words, assigning partial "points" credit to assists and offensive rebounds is simply not intuitive, and pretty much any method that does this is going to have to fudge on the "value" of an assist.

But if you regress on wins, you don't have the problem of points being totally dependent on a few of the variables that you want to regress. I presume that this is why Berri based his research on "wins produced" and not "points produced" -- linear regression is feasible.
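The point about regressing on points can be checked with a small sketch. The team totals below are randomly generated and purely hypothetical, and the `least_squares` helper is an illustrative stand-in for a stats package. It solves the normal equations with exact fractions so that rounding does not blur the result. Points are built from the scoring identity, and assists are included as an extra regressor to see what weight they receive.

```python
from fractions import Fraction
import random


def least_squares(X, y):
    """Solve the normal equations (X'X) b = X'y exactly (no intercept)."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y], built with exact rational arithmetic.
    A = [[Fraction(sum(row[i] * row[j] for row in X)) for j in range(k)]
         + [Fraction(sum(row[i] * yi for row, yi in zip(X, y)))]
         for i in range(k)]
    # Gauss-Jordan elimination. X'X is positive definite when the columns
    # of X are linearly independent, so each pivot is nonzero.
    for c in range(k):
        pivot = A[c][c]
        A[c] = [v / pivot for v in A[c]]
        for r in range(k):
            if r != c:
                factor = A[r][c]
                A[r] = [vr - factor * vc for vr, vc in zip(A[r], A[c])]
    return [A[i][k] for i in range(k)]


# Hypothetical season totals for 30 teams (made-up ranges).
rng = random.Random(0)
X, pts = [], []
for _ in range(30):
    fg = rng.randint(2800, 3400)      # all field goals made
    threes = rng.randint(300, 700)    # three-pointers made (also in fg)
    ft = rng.randint(1300, 1900)      # free throws made
    ast = rng.randint(1600, 2200)     # assists
    X.append([fg, threes, ft, ast])
    pts.append(2 * fg + threes + ft)  # scoring identity; assists play no part

coef = least_squares(X, pts)
print([float(c) for c in coef])       # weights on FG, 3FG, FT, assists
```

Because points are an exact function of makes, the fit returns the identity's weights and assigns assists a weight of zero, whatever assists actually contribute on the court.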

Dan Rosenbaum · Posted: Sat Nov 18, 2006 9:53 pm

First, Wins Produced is designed to predict net team efficiency (offensive minus defensive efficiency), and if only one year were used in the prediction, it would perfectly predict net team efficiency. Their results tell us nothing more than that net team efficiency does a pretty good job predicting wins.

Second, Berri often argues that he uses regression to come up with the relative value of points, rebounds, field goal attempts, etc., but in general that is not true.

For points, field goal and free throw attempts, rebounds, steals, and turnovers, Berri simply uses the leaguewide average of points per possession to arrive at the relative value of the linear weights. I guess that is empirical, but since it is so far removed from anything at the individual level, I would argue that it is only marginally more empirical than PER or other linear weights methods. Berri runs a regression, but he did not need to, so I think it is misleading to argue that this is really regression-based.

For blocks and personal fouls, he does use empirical work, but not regression, to arrive at the appropriate weights. Only for assists does he really use regression analysis.

At the end of the day, what we really want to know is how much better a team does, on average, with a guy who is more of an assister or a scorer or a rebounder. And it is only for the value of the assister that Berri really uses empirical work to ask the appropriate question.

So I guess in sum I would argue that Wins Produced is much closer to just "picking a number" than it really appears.

admin · Posted: Sun Nov 19, 2006 4:40 am

Maybe this is ultimately a trivial point, but I strongly disagree with Berri's disdain for "the laugh test". Confirmation bias is selectively looking for evidence to fit the test hypothesis, but conventional wisdom is not the test hypothesis; it is (or at least should be, in my mind) the null hypothesis. The null hypothesis is supposed to get the benefit of the doubt.

Surprising results shouldn't cause us to reject the method behind them, but they should cause us to scrutinize it and understand why they occur. In the case of Berri's previous effort, the rating for Dennis Rodman revealed some serious flaws - those flaws, in this case, being tied to the effort to determine the value of individual statistics through regression at the team level. If we literally just laughed, that would not have been useful, but neither would have been accepting the results because they were "scientific."

There's also a pragmatic argument. For the most part, we aren't dealing with scientists, so the scientific method is not always appropriate. There is no question that fans and front office personnel are using the laugh test. Should we just ignore that?

Mike G · Joined: 14 Jan 2005 Posts: 971 Location: Delphi, Indiana

tomverve · Joined: 24 Feb 2005 Posts: 17

Dan Rosenbaum · Posted: Sun Nov 19, 2006 3:59 pm

Berri and co-authors in their previous work came up with a metric that did not include assists, blocks, or fouls. None of those statistics are directly tied to points scored or possessions, so it is possible (with a team adjustment) to come up with a metric that perfectly predicts team efficiency even without using assists, blocks, or fouls.

So why did Wages of Wins move away from that metric? It passed through the peer-review process, and if they had used a team adjustment (I am not sure they did), it would have predicted team wins just as well as the current Wins Produced.

I think they moved away from that metric because of lots and lots of e-mails from an APBRmetrics member (not me) that convinced them that this older metric did not pass the laugh test. That old metric has almost no correlation with adjusted plus/minus statistics (not that they knew this or would have cared). It led to perverse results in several published papers.

So how can the scientific method and peer-review process fail so badly? Models can be wildly off the mark, and if they are judged only on some form of internal consistency, they can produce perverse results. Calibrating a model to perfectly predict subjective evaluations may not be scientific, but completely ignoring (and belittling, as they do in the book) all non-statistical analysis is NOT how good empirical work is done in other areas of economics.

Good empirical work is always a combination of theory and evidence that incorporates as much subjective knowledge as is practically possible. We may risk being non-scientific if we overfit the data to predict our subjective evaluations. But we also run the risk of being irrelevant if we completely ignore subjective evaluations.