Joined: 13 Oct 2005 Posts: 373 Location: Atlanta, GA
Posted: Sat Sep 15, 2007 12:29 pm Post subject: Technical DRtg Issues
Hey guys,
I was playing around with my BoP spreadsheet and realized that my DRtg numbers are slightly off (by a fraction of a point) when compared to Justin's numbers at B-R...
Code:
Name Season Tm Stops Stop% DRtg
Michael Jordan 1995-96 CHI 672.2 0.570 99.1
Scottie Pippen 1995-96 CHI 586.5 0.544 100.2
Toni Kukoc 1995-96 CHI 379.0 0.472 103.3
Dennis Rodman 1995-96 CHI 480.1 0.602 97.8
Steve Kerr 1995-96 CHI 303.6 0.415 105.7
Ron Harper 1995-96 CHI 374.9 0.521 101.2
Luc Longley 1995-96 CHI 317.4 0.507 101.8
Bill Wennington 1995-96 CHI 188.2 0.463 103.6
Jud Buechler 1995-96 CHI 141.7 0.502 102.0
Dickey Simpkins 1995-96 CHI 119.5 0.457 103.9
Randy Brown 1995-96 CHI 154.5 0.603 97.7
Jason Caffey 1995-96 CHI 97.4 0.468 103.4
James Edwards 1995-96 CHI 44.8 0.428 105.1
John Salley 1995-96 CHI 44.6 0.611 97.4
Jack Haley 1995-96 CHI 1.2 0.435 104.8
My Stops totals check out perfectly with B-R, so the problem is in one of the steps after Stops are calculated. Here are the equations I'm working with now:
Does anyone who is familiar with BoP see where my problem is? Keep in mind that I made sure it isn't a data problem by inputting Justin's data precisely, so the formulas must be to blame. I'd appreciate any help!
Using my own spreadsheet (which calculates things in a slightly different order), my results match yours exactly and fail to match those on B-R.com. I'll try to take a closer look at this.
I don't what raw data source Dean used in BOP, but his Table 17.19 (page 215) seems to match B-R in DRtg. And in comparing our results to BOP, it seems things go wrong as soon as the transition from Stops to Stop% (BOP has Pippen at 0.55, we have 0.544). We're getting systematically lower Stop%'s than BOP. But strangely, we're also getting systematically lower DRtg's than BOP (BOP has Pippen at 101, we have 100.2). This suggests there could be multiple problems (simply correcting the Stop% won't fix the DRtg, it will only make the discrepancies greater). But it could just be one problem - if we're overestimating team defensive possessions, decreasing them could both raise Stop% and raise DRtg. That's my preliminary look, but I have to go watch football now.
Joined: 05 Jan 2005 Posts: 508 Location: Columbus, OH
Posted: Sat Sep 15, 2007 2:55 pm Post subject:
Guys, I vaguely recall having this problem too (i.e., I couldn't get my figures to match Dean's), and I think the "trick" is to calculate team defensive ratings this way:
Code:
tmPoss = FGA - (ORB/(ORB+(oppTRB-oppORB)))*(FGA-FG)*1.07 + TO + 0.4*FTA
Actually, this reminds me that I should probably change the way I calculate the team offensive and defensive ratings that are seen on the team and league pages. _________________ Regards,
Justin Kubatko
Basketball Stats!
Joined: 13 Oct 2005 Posts: 373 Location: Atlanta, GA
Posted: Sat Sep 15, 2007 3:22 pm Post subject:
Okay, Justin, that trick worked -- provided I don't change TmOppPoss to gmPoss in the Stop% formula. But shouldn't that change be made as well? If we're considering (for all intents and purposes) gmPoss to be the number of possessions that the team faces on defense, then I think we should amend the Stop% formula to reflect this as well.
Anyway, thanks JQ and Justin, I'm glad we got to the bottom of this! Now it's time to watch football...
Joined: 13 Oct 2005 Posts: 373 Location: Atlanta, GA
Posted: Sun Sep 16, 2007 2:09 pm Post subject:
Okay, I finally had the chance to update my database to reflect these changes, including using gmPoss instead of tmPoss/tmOppPoss in %Poss and Stop%. Everyone will probably be getting an e-mail about a new file on the APBRmetrics group, because I got rid of those (now-obsolete) BoP files and added an updated version that has complete stats since 1973-74. For pre-1978 seasons, I estimated turnovers by regressing 1978-2007 TO on height, age, and per-minute stat rates, and forced the team estimated TO totals to match the actual team totals. Which is a really long way of saying that I estimated ORtg/%Poss/DRtg for 1974-1977. Enjoy!
Joined: 30 Dec 2004 Posts: 533 Location: Near Philadelphia, PA
Posted: Sun Sep 16, 2007 4:14 pm Post subject:
davis21wylie2121 wrote:
Okay, I finally had the chance to update my database to reflect these changes, including using gmPoss instead of tmPoss/tmOppPoss in %Poss and Stop%. Everyone will probably be getting an e-mail about a new file on the APBRmetrics group, because I got rid of those (now-obsolete) BoP files and added an updated version that has complete stats since 1973-74. For pre-1978 seasons, I estimated turnovers by regressing 1978-2007 TO on height, age, and per-minute stat rates, and forced the team estimated TO totals to match the actual team totals. Which is a really long way of saying that I estimated ORtg/%Poss/DRtg for 1974-1977. Enjoy!
This is good stuff. As I scroll through some of your estimates from pre-individual turnover times, I see the potential for so many stories about teams, players, and the league as a whole. So many guys I really don't know much about, never saw, but now start to get a feel for. There were some stories I put together from my historical survey in BoP, but these individual numbers could fill in many gaps and suggest some good new work.
It sounds like you took a reasonable approach to estimating individual turnovers, too. Can I ask what stat rates seemed significant in making those projections? _________________ Dean Oliver
Author, Basketball on Paper
http://www.basketballonpaper.com
So the most significant predictors of turnovers per minute seem to be assists and personal fouls, followed by field goal attempts and free throw attempts. This makes sense, because turnovers usually result from people trying to make plays (assists, FGA, FTA) or big men setting illegal screens (PF). What made this a lot more accurate for 74-77 was the fact that the NBA tracked team TO during this period; all I had to do was sum team estimated TO, compare that to .97*actual team TO (because, traditionally, 97% of team turnovers are individual turnovers), and adjust indiv. est. TO to be higher or lower based on the discrepancy between the two totals.
I really want to run your stats this way for pre-1974 seasons as well, but the NBA's lack of team TO, ORb, Stl, & Blk tracking is going to make it very hard. I have estimated individual TO, ORb, Stl, & Blk for every player since 1952 using the same regression method, but the results won't hold water if the team totals don't make sense (for instance, w/o the team adjustment I was consistently getting low team TO totals because of individual regression to the mean). My initial thought is to see if league size had any bearing on the standard deviations of the team-totals in those missing categories, and if so, apply this predicted league standard deviation to the sum-totals of the estimated players for each team. If anyone has a better tactic, though, I'd really love to hear it.
I also should note that I forgot to add the ABA to the spreadsheet, since they tracked all individual stats through 1973-74 as well. I'll hopefully get around to that at some point in the near future.
Joined: 30 Dec 2004 Posts: 533 Location: Near Philadelphia, PA
Posted: Sun Sep 16, 2007 5:46 pm Post subject:
davis21wylie2121 wrote:
...(for instance, w/o the team adjustment I was consistently getting low team TO totals because of individual regression to the mean)...
I was going to ask this, but I'd guess that you were low because turnovers have been trending down for so long. Players have been reducing their mistakes...
The lack of a good estimate for Wilt, Oscar, and Russell always bugs me. I did my best with Wilt and Russ for the one chapter, but I think it can be done better... Just not by me these days... _________________ Dean Oliver
Author, Basketball on Paper
http://www.basketballonpaper.com
DW how do your numbers compare or fit with or do other things distinctive of the "starting point for basketball analysis" regression or the table for statistical +/-? I'd be interested in hearing you walk thru that.
I see 3pt made with a negative coefficient? Isnt that it contrast with assertions and the statiscal +/- regression results that 3pt shooting is a positive? Is it a eFG% treatment issue?
Joined: 03 Jan 2005 Posts: 665 Location: Washington, DC
Posted: Mon Sep 24, 2007 1:25 pm Post subject:
Are there any video archives available of some of these older games? One way of generating an estimate might be to actually sit down and track the stats for some of these older games and see how many blocks, steals, turnovers, etc. were actually taking place. _________________ My blog
Joined: 30 Dec 2004 Posts: 533 Location: Near Philadelphia, PA
Posted: Mon Sep 24, 2007 1:43 pm Post subject:
kjb wrote:
Are there any video archives available of some of these older games? One way of generating an estimate might be to actually sit down and track the stats for some of these older games and see how many blocks, steals, turnovers, etc. were actually taking place.
It is a definite possibility in the not too distant future. _________________ Dean Oliver
Author, Basketball on Paper
http://www.basketballonpaper.com
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum