View previous topic :: View next topic |
Author |
Message |
back2newbelf
Joined: 21 Jun 2005 Posts: 275
|
Posted: Fri Mar 25, 2011 10:03 am Post subject: |
|
|
I can now easily update most files by the press of a button, so there might be almost daily updates from now on. Keep in mind that most ratings, especially the multiyear ones, don't change much from day to day. Time of update is now listed at the top of each ranking. The one year ranking will mostly be updated through the playoffs
I'll probably add another rating soon where I compute ratings of coaches on multiyear data (going back to 2002), then add those ratings to the computation of 1 year player ratings while leaving the coach ratings fixed. That might give a better estimation of players that play on super good/bad defensive teams. Of course, prediction performance of this method will have to be tested first.
Comments on the ratings:
-Thibodeau looks awesome, I didn't think it was possible to jump to the #1 spot after less than a full season
-Completely disagrees with the firing of O'Brian, but this method only measures the influence on the players being on the court. Did he have really weird substitution patterns? _________________ http://stats-for-the-nba.appspot.com/ |
|
Back to top |
|
|
EvanZ
Joined: 22 Nov 2010 Posts: 303
|
|
Back to top |
|
|
EvanZ
Joined: 22 Nov 2010 Posts: 303
|
|
Back to top |
|
|
back2newbelf
Joined: 21 Jun 2005 Posts: 275
|
Posted: Fri Mar 25, 2011 11:59 am Post subject: |
|
|
EvanZ wrote: | Someone asked before, but are there standard errors somewhere? Even a ballpark would be nice for the 1-yr vs. 3.x-yr. |
I tried computing those via bootstrapping, but the way I did it wasn't correct. As of right now, I do not know how to compute them _________________ http://stats-for-the-nba.appspot.com/ |
|
Back to top |
|
|
Ilardi
Joined: 15 May 2008 Posts: 265 Location: Lawrence, KS
|
Posted: Fri Mar 25, 2011 12:19 pm Post subject: |
|
|
back2newbelf wrote: | EvanZ wrote: | Someone asked before, but are there standard errors somewhere? Even a ballpark would be nice for the 1-yr vs. 3.x-yr. |
I tried computing those via bootstrapping, but the way I did it wasn't correct. As of right now, I do not know how to compute them |
Joe Sill has said he has a method, but I believe even there he's not completely sure it's correct. SAS (my stat package of choice) doesn't provide any error estimates with ridge regression, but I'm assuming the error terms have to be lower than those generated by a corresponding regression model without the ridge correction (since the ridge technique helps reign in the coefficient variance inflation that arises due to excessive collinearity among predictors). So, at least we have an upper bound on the errors.
If someone can help elucidate the issue further, I'm sure we'd all be grateful! |
|
Back to top |
|
|
xkonk
Joined: 26 Jan 2011 Posts: 5
|
|
Back to top |
|
|
DSMok1
Joined: 05 Aug 2009 Posts: 611 Location: Where the wind comes sweeping down the plains
|
Posted: Fri Mar 25, 2011 3:04 pm Post subject: |
|
|
I agree with xkonk. I don't think the standard errors can be found that are meaningful. However, you could perhaps come up with some rough estimates based on out-of-sample predictions. Not sure how to do it, though.
The reason stderrs aren't going to be meaningful in ridge regression is this:
You're taking a highly correlated problem. You throw in a bunch of observations of 0 (this is the effect of ridge regression). Then if you get the standard errors of the regression, they'll be quite low using bootstrap or anything. The 0s cause that effect. But the 0s aren't actual data...in fact, they should have an "error" associated with them. _________________ GodismyJudgeOK.com/DStats
Twitter.com/DSMok1 |
|
Back to top |
|
|
Crow
Joined: 20 Jan 2009 Posts: 825
|
Posted: Fri Mar 25, 2011 3:54 pm Post subject: |
|
|
The Sloan research papers are available.
http://www.sloansportsconference.com/research-papers/2011-2/posters/
Anyone care to review and comment on them?
Piette used ridge regression and player pairs in the model. Omidiran used player pairs and boxscore stats in a single regression model I believe (the feasibility and desirability of which I have asked about before) but I don't think he used ridge regression.
Neither used multi-season to further reduce errors or made any other adjustments to my quick read beyond home-court.
Other things that might improve signal or reduce noise include: use of a performance "prior" or minutes or both, aging curves, consideration of the affects of rest & elevation, coaches, possession type and adjustment for clutch & garbage time.
Both cite improvement over their comparison base model. The changes are helpful but in the big picture might still be called modest, evolutionary ones and insufficient to win responsible high reliance on this model alone, though it could still be considered helpful as information from one tool among several with the weight still leaning, perhaps, towards direct boxscore based approaches.
Anyone here plan to incorporate either method for addressing player pairs into their APM version? Any reservations about how they did it?
Anyone plan to add more of the other adjustments listed above to try to reduce noise?
By the way, back2newbelf does the RAPM available at your site use your possession type sensitive approach? I assume it does, but want to be sure.
Last edited by Crow on Sat Mar 26, 2011 1:01 pm; edited 1 time in total |
|
Back to top |
|
|
Ryan J. Parker
Joined: 23 Mar 2007 Posts: 711 Location: Raleigh, NC
|
Posted: Fri Mar 25, 2011 4:25 pm Post subject: |
|
|
Why can't you just use the closed form solution? Is it because of the way lambda is chosen? _________________ I am a basketball geek. |
|
Back to top |
|
|
DSMok1
Joined: 05 Aug 2009 Posts: 611 Location: Where the wind comes sweeping down the plains
|
Posted: Fri Mar 25, 2011 4:31 pm Post subject: |
|
|
Ryan J. Parker wrote: | Why can't you just use the closed form solution? Is it because of the way lambda is chosen? |
How? Got an example of the calculation somewhere? _________________ GodismyJudgeOK.com/DStats
Twitter.com/DSMok1 |
|
Back to top |
|
|
Ryan J. Parker
Joined: 23 Mar 2007 Posts: 711 Location: Raleigh, NC
|
|
Back to top |
|
|
bbstats
Joined: 25 Apr 2010 Posts: 46
|
|
Back to top |
|
|
EvanZ
Joined: 22 Nov 2010 Posts: 303
|
Posted: Wed Mar 30, 2011 4:16 pm Post subject: |
|
|
Crow wrote: | The Sloan research papers are available.
http://www.sloansportsconference.com/research-papers/2011-2/posters/
Anyone care to review and comment on them?
Piette used ridge regression and player pairs in the model. Omidiran used player pairs and boxscore stats in a single regression model I believe (the feasibility and desirability of which I have asked about before) but I don't think he used ridge regression.
|
I was a little disappointed that none of the three player metric papers listed any players. I'd like to see some actual numbers. (The network model showed a couple of lists with a few players, but nothing comprehensive.) Also, do any of these guys make their data public?
The paper that mixed APM and box score stats (Omidiran), one of the goals was to determine weights for the box score stats. So...did I miss something, or are those never given? No fun in that. _________________ http://www.thecity2.com
http://www.ibb.gatech.edu/evan-zamir |
|
Back to top |
|
|
DSMok1
Joined: 05 Aug 2009 Posts: 611 Location: Where the wind comes sweeping down the plains
|
Posted: Wed Mar 30, 2011 4:40 pm Post subject: |
|
|
Note that Jeremias Engelmann is back2newbelf, EvanZ... I wonder if any of the other researchers post here or at least read this forum? _________________ GodismyJudgeOK.com/DStats
Twitter.com/DSMok1 |
|
Back to top |
|
|
EvanZ
Joined: 22 Nov 2010 Posts: 303
|
|
Back to top |
|
|
|