APBRmetrics

Dream- · Joined: 26 Jan 2008 Posts: 13

I am looking for some kind of database or webpage where I can find:

For the current season, date, teams, score, Field goal attempts, for each game played in the season. Additionally a general point spread at the time of the game.

Any ideas where I can find this information?

So far the best I have been able to get is date, teams and score for each game.

Dream- · Joined: 26 Jan 2008 Posts: 13

I have been able to get some of the information.

For games the most useful has been:

http://www.basketball-reference.com/leagues/NBA_2008_games.html

For point spread and O/U info, I can get the games per team (not the best way, but at least I can compile a table from the individual team charts):

http://stats.therx.com/NBA/gamelogs/GameLogs.aspx?TeamId=1

But I am still missing some source where I can find the field goal attempts (other shot data would be very beneficial).

The way Offensive and Defensive efficiencies are computed everywhere is flawed because it does not take into consideration the opposing team's efficiencies (and pace) which is crucial in my opinion.

By using the opposing team efficiencies we can get a normalized efficiency which can then be multiplied by a normalized pace factor to obtain a better predictor.

hoopseng · Joined: 13 Oct 2006 Posts: 54 Location: Basketball Research

Dream- · Joined: 26 Jan 2008 Posts: 13

I could be wrong but the sites I have seen before seem to compute efficiency as a simple average of points made per 100 posessions.

But wouldn't points scored against a weak defensive team count less than points scored against a strong defensive team?

This is why I think you cannot just do a simple average.

Assume 3 idealized teams:

Team A has a real offensive efficiency of 100 points per 100 possesions.
Team B has a real Defensive efficiency of 90 points per 100 possesions.
Team C has a real Defensive efficiency of 110 points per 100 possesions.

What would be the expected points scored by A vs. B in 100 possesions? (surely less than 100, right?)
What would be the expected points scored by A vs. C in 100 possesions? (surely more than 100, right?)

If A scores the expected points against B or C, then its Off Eff should stay the same even when the points scored are above or below 100. But with the usual method, the Off Eff for A will change unless A scores 100 points in both cases.

So I think we need to normalize the Off/Def efficiencies.

PS. Very nice website you have!

gabefarkas · Joined: 31 Dec 2004 Posts: 972 Location: Durham, NC

Dream- · Joined: 26 Jan 2008 Posts: 13

Mountain · Joined: 13 Mar 2007 Posts: 437

Would pages like this do?
http://hosted.stats.com/nba/teamstats.asp?teamno=02&btnGo=Go&type=ologs
using text to columns at the hyphen?

Dream- · Joined: 26 Jan 2008 Posts: 13

DLew · Joined: 13 Nov 2006 Posts: 72

You seem to be arguing that strength of schedule is important. I think everyone agrees, but generally it more or less averages out over many games. Even if a team's schedule was non-average the efficiency numbers are still descriptively accurate, they tell you what the team's points divided possessions for the season was. I don't think anyone is arguing that points divided possessions tells you exactly a team's true offensive strength, but it tends to be in the ball park.

gabefarkas · Joined: 31 Dec 2004 Posts: 972 Location: Durham, NC

Dream- · Joined: 26 Jan 2008 Posts: 13

I have thought of bayesian approaches, and also have a neural net approach in mind.

But I think I will try this first. I think a converging system (similar to ELO perhaps) could be used to bring the efficiencies to their stable value.

Once I have the appropriate data I can run some tests.

Perhaps I am just naive in thinking that this approach will be substantially better than just using averaging... but I'll try anyway.

Mountain · Joined: 13 Mar 2007 Posts: 437

With these team pages you don't have to manually break FGs from FGAs into separate columns.

http://www.basketball-reference.com/fc/tgl.cgi?team=SEA&year=2008

30 cut and pastes isn't that bad.

Dream- · Joined: 26 Jan 2008 Posts: 13

Chicago76 · Joined: 06 Nov 2005 Posts: 77

I may be wrong, but it sounds to me like you're trying to develop a predictive model comprised of SOS-adjusted O and D ratings of teams. Voros McCracken of DIPS fame did this a few years back for international soccer. If this is the case, why not look at the last few years of data to develop the system and test first?

You can get the last years of data using a link similar to the one provided above: http://www.basketball-reference.com/fc/tgl.cgi?team=SEA&year=2007

You could calculate your adjusted O and D Rtg chronologically to determine the predictive ability of your model on the next day vs. the traditional non-SOS adjusted O and D ratings. The ultimate indicator of whether SOS significantly influences the ratings would be in the predictive power of your metric vs. the traditional one.

I suspect adjusting for opponent quality would be an improvement. How much is the question.

The biggest problem may be that the effect of injuries may dwarf any meaningful difference between the two.

This is very interesting. Keep us posted.

Dream- · Joined: 26 Jan 2008 Posts: 13