APBRmetrics

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

Over the first three games of the 2002 season, Shaq hit 34 of 48 from the free throw line (71%), including a game where he went 16 for 18. What did the coaching staff think? Did they rejoice, believing that Shaq had finally mastered the art of free throw shooting after years of indifference? Or did they think "fluke" and wait for the inevitable return to 50% form? What would you have thought?

At that point in his career, Shaq had been 3393-6390 (53%), having shot a high of 59% as a rookie and a low of 48% in his fifth season.

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

In my earlier post, I described a method for regressing binomial stats, like FT% or win%. Here I will describe how to do the same thing for multinomials, which are stats that can have more than two outcomes, like EFG% (miss, two points, or three points).

The first thing you need to do is to break the formula down into its constituent parts. EFG% is normally calculated as (2ptM + 3ptM*1.5)/FGA, but we need to express it as the probability weighted mean of the likelihood of each event. For EFG%, this is

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

Using the methods I described in the previous posts, I calculated the variance of the distribution for 4 shooting stats amoung the population of NBA players. My dataset included all player-seasons since 1977-78 in which the players had more than 50 attempts. To calculate the variances, I first subrtracted that season's league-wide mean from each players %, and then added 50% (except for 3-pt%, where I added 33%, and FT%, where I added 75%). This centered each season around the same mean, making for for useful comparisons.

Distribution of true shooting ability:

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

Why don't I just go ahead and use this thread to dump numbers? Yes, ed, why don't you? You have my permission.

Using the methods outlined above, I looked at individual player rebounding percentages. I wanted to see what the distribution of rebounding ability looked like, within each position.

I used the data from 82games, showing the number of offensive/defensive rebounds and number of offensive/defensive rebounding opportunities for three seasons (02-03 to 04-05). I limited my search to the top 1000 player-team-seasons in rebounding opportunities. My database has every player listed under one of the following positions: PG, G, SG, GF, SF, F, PF, FC, and C. I don't really trust these positions too well, but it's a start. Here's how each position rebounded (average reb%, weighted by opportunities):

That's fairly obvious, and not very interesting. What I was getting at is the distribution of rebounding ability, which I will summarise with the standard deviation (derived by calculating the variance using that long complicated method above):

cherokee_ACB · Joined: 22 Mar 2006 Posts: 111

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

cherokee_ACB · Joined: 22 Mar 2006 Posts: 111

By absolute vs relative I meant whether

POS OR% sdOR%
PG 2.1% 0.9%

means 2.1 +- 0.9 (absolute) or 2.1 +- 0.009*2.1 (relative to the mean). I assume the former, but the % symbol can be confusing. If that's the case, I'm not sure I'd say that the rebounding ability of small guys is more homogeneous. Rebounding numbers of guards seem to be more spread around their mean (they have a higher stdev/mean ratio) than those of inside players. Anyway, it's just semantics.

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

mgl · Joined: 27 Mar 2006 Posts: 2

Ed, great stuff! I am one of the authors (Lichtman) of the aforementioned book (The Book), although Dolphin, who is a brilliant statistician as well as sabermetrician, wrote the appendix.

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

Using the same A B C method as for TS% from above, I calculated the variances of team Offensive and Defensive Ratings. (Slight change: I added a category J, equivelant to the number of team turnovers, and from the p0 category I subtracted the number of team Offiensive Rebounds. This made sure that p0 + p1 + p2 + p3 = Total Possessions = FGA + 0.44 * FTA - OR + TO.) I found something odd: the variation in ORTG among teams is about 20% higher than the variation of DRTG. This difference is consistent no matter what era you look at. Go figure. Anyway, the standard deviation of ORTG is 3.0, and for DRTG is 2.6.
_________________
ed

TexasEx · Joined: 12 May 2006 Posts: 28 Location: Houston, TX

Ed - In your first post, you base your regression off Shaq's first 3 games of the 2002 season. What if you don't have any data for the first few games of the season you want to forecast? Do you just take the info from the previous year and then do the remainder of the steps you lay out?

Eli W · Joined: 01 Feb 2005 Posts: 369

Ed Küpfer · Joined: 30 Dec 2004 Posts: 647 Location: Toronto

At this point Eli knows more about this stuff than I do, so you should probably be listening to what he has to say. But it's worth keeping in mind what RTM is all about: you are combining two numbers (based on the relative confidence you have in each number) to forecast future performance. Eli is trying to get at the nuts and bolts of the combination method, and it's lucky that someone is – I don't have the patience for that anymore. I take a more results-oreiented approach, and use whatever works. "Whatever works" depends on how you define "works" – I'm looking forward to seeing what Eli comes up with.
_________________
ed

Harold Almonte · Joined: 04 Aug 2006 Posts: 430

An out of topic observation: