thought id post some of it home it copies right
Can prior season WL and ATS records predict future season performance?
Theory One on the NFL is that the league changes dramatically from one year to the next, with free agency, a new crop of draft picks, coaching changes, and adjustments to the strategies that worked successfully in the prior season taking their toll on the predictive value of past season performance.
Theory Two is that there are always biases in how people respond to the events of the prior season (note for instance the poor historical record of the Super Bowl losers in the next year), and capitalizing on this should be something possible with enough data and the right tools.
We built a database with the straight-up and spread records for each team during the regular season from 1990 on. Then we ran some correlations and regressions on the data to see how well you can predict the next year by the previous ones.
Simple Correlations between previous records
and the current regular season (1990-2003)
Variable Correlate to Correlation
Last Season Wins This Year Wins .49
Last Season Wins This Year ATS .19
Last Season ATS This Year ATS .36
Last Season ATS This Yr Wins .41
So on the surface it looks like there is a fair amount of correlation. Ah, but the problem, as Bob Dylan has sung, is times have changed. If we change the data set to only reflect the 1999 to 2003 timeframe (using 1998 history for '99 projections) we get some notably different results:
Simple Correlations between previous records
and the current regular season (1999-2003)
Variable Correlate to Correlation
Last Season Wins This Year Wins .26
Last Season Wins This Year ATS -.20
Last Season ATS This Year ATS -.15
Last Season ATS This Yr Wins .14
All the correlations drop, and indeed two of them switch signs! A good prior season WL record or ATS record has been a negative indicator for the ensuing season's ATS performance. Some would say the year the Rams came out of nowhere to win it all in 1999 was the line in the sand between the old NFL and the new game...
All right, you may say, last year is not going to tell us very much in itself about this season, but what about using some kind of rolling number of years?
Well, we're glad you asked and we'll move next to regressions:
Regression to predict current season regular season Wins
Factor Coefficient Std Error
Intercept 4.791 .560
Wins, 4 yrs Ago .078 .055
Wins, 3 yrs Ago .027 .060
Wins, 2 yrs Ago .021 .060
Wins, Last Yr .287 .059
R^2 = 0.12
Ugh. Even with a fair amount of data points (n=305), the R squared is very small, and the standard errors are bigger than the coefficients in some cases! This suggests that you're not going to fare very well trying to predict the current season Wins using the last four WL records.
The next track then is to up the ante, let's use both the past four seasons Win records and Spread records to see if we might not find some tidbits of value -- for instance a team that posted a nice number of wins but a lousy spread record might be deemed an underachiever which could have a trend one way or the other in the following year.
Regression to predict current season
Factor Coefficient for WINS Coefficient for ATS
Intercept 4.658 8.531
Wins, 4 yrs Ago .136 .120
Wins, 3 yrs Ago .031 -.030
Wins, 2 yrs Ago -.071 -.060
Wins, Last Yr .325 -.079
Spread Wins, 4 yrs Ago -.110 -.130
Spread Wins, 3 yrs Ago .044 .043
Spread Wins, 2 yrs Ago .160 .119
Spread Wins, 1 yr Ago -.087 -.049
R^2 = 0.13 / 0.05
Even with eight factors in the regression, the R squared values are terrible. Basically we are being told that we should look elsewhere if we're intent on trying to predict next year performance.
Given this, the following projected win tallies should be taken very lightly as they will likely have little predictive value. Not to worry though, we have better regression tools in store!
Wins Spread Wins Projection
Team '00 '01 '02 '03 '00 '01 '02 '03 Wins ATS
TEN 13 7 11 12 7 5.5 10 8.5 10.1 8.4
KAN 7 6 8 13 7 7.5 9.5 10 9.7 7.7
PHI 11 11 12 12 10 10.5 10 11 9.5 7.7
BAL 12 10 7 10 10.5 8 10 9.5 9.3 8.2
STL 10 14 7 12 7 9.5 4 9.5 9.3 7.5
CAR 7 1 7 11 8 7.5 10 6 9.2 8.2
MIA 11 11 9 10 10.5 10.5 9 7 9.2 8.0
NWE 5 11 9 14 7 11 6 13 9.1 6.8
IND 10 6 10 12 9 6 6 9.5 8.8 7.3
DEN 11 8 9 10 9.5 7 7.5 8 8.8 7.8
SEA 6 9 7 10 7 6.5 9 8 8.8 7.8
MIN 11 5 6 9 8 5.5 8 8 8.8 8.4
GNB 9 12 12 10 8 9.5 8 10 8.6 7.6
DAL 5 5 5 10 7.5 8 8 9.5 8.4 7.7
NOR 10 7 9 8 10 6 9 8.5 8.1 8.0
NYJ 9 10 9 6 8 8 10 6 8.1 8.5
TAM 10 9 12 7 9 6.5 10 7 8.0 8.1
OAK 12 10 11 4 10.5 7.5 10 3.5 7.6 8.7
CHI 5 13 4 7 7.5 11.5 5 8 7.5 7.7
BUF 8 3 8 6 7 6 8.5 7 7.5 8.5
NYG 12 7 10 4 9 5.5 8.5 4 7.4 8.7
CLE 3 7 9 5 5 8.5 10.5 6 7.3 8.4
SFO 6 12 10 7 7 9 5.5 8 7.2 7.5
WAS 8 8 7 5 6.5 8.5 7 8.5 7.2 8.4
JAC 7 6 6 5 7.5 8 8 8 7.1 8.4
CIN 4 6 2 8 8 8 4 10 7.1 7.4
DET 9 2 3 5 9 9 6.5 8.5 7.1 8.6
ATL 4 7 9.5 5 6 7 10 6.5 7.1 8.2
PIT 9 13 10.5 6 11 11.5 5.5 8 7.0 7.4
ARI 3 7 5 4 4 9.5 6 6 6.6 8.4
HOU 4 5 8 9 6.5 8.4
SDG 1 5 8 4 7 5.5 9 6 6.1 7.8
So according to this admittedly dubious formula we've concocted from the regression outputs, the top three spread value teams for the NFL 2004 season are projected to be the New York Giants, Oakland Raiders, and Detroit Lions, whereas by far the worst spread value for the coming season is predicted to be the New England Patriots. Hmmm, there may be some hint of truth to that after all...
--------------------------------------------------------------------------------
Can prior season WL and ATS records predict future season performance?
Theory One on the NFL is that the league changes dramatically from one year to the next, with free agency, a new crop of draft picks, coaching changes, and adjustments to the strategies that worked successfully in the prior season taking their toll on the predictive value of past season performance.
Theory Two is that there are always biases in how people respond to the events of the prior season (note for instance the poor historical record of the Super Bowl losers in the next year), and capitalizing on this should be something possible with enough data and the right tools.
We built a database with the straight-up and spread records for each team during the regular season from 1990 on. Then we ran some correlations and regressions on the data to see how well you can predict the next year by the previous ones.
Simple Correlations between previous records
and the current regular season (1990-2003)
Variable Correlate to Correlation
Last Season Wins This Year Wins .49
Last Season Wins This Year ATS .19
Last Season ATS This Year ATS .36
Last Season ATS This Yr Wins .41
So on the surface it looks like there is a fair amount of correlation. Ah, but the problem, as Bob Dylan has sung, is times have changed. If we change the data set to only reflect the 1999 to 2003 timeframe (using 1998 history for '99 projections) we get some notably different results:
Simple Correlations between previous records
and the current regular season (1999-2003)
Variable Correlate to Correlation
Last Season Wins This Year Wins .26
Last Season Wins This Year ATS -.20
Last Season ATS This Year ATS -.15
Last Season ATS This Yr Wins .14
All the correlations drop, and indeed two of them switch signs! A good prior season WL record or ATS record has been a negative indicator for the ensuing season's ATS performance. Some would say the year the Rams came out of nowhere to win it all in 1999 was the line in the sand between the old NFL and the new game...
All right, you may say, last year is not going to tell us very much in itself about this season, but what about using some kind of rolling number of years?
Well, we're glad you asked and we'll move next to regressions:
Regression to predict current season regular season Wins
Factor Coefficient Std Error
Intercept 4.791 .560
Wins, 4 yrs Ago .078 .055
Wins, 3 yrs Ago .027 .060
Wins, 2 yrs Ago .021 .060
Wins, Last Yr .287 .059
R^2 = 0.12
Ugh. Even with a fair amount of data points (n=305), the R squared is very small, and the standard errors are bigger than the coefficients in some cases! This suggests that you're not going to fare very well trying to predict the current season Wins using the last four WL records.
The next track then is to up the ante, let's use both the past four seasons Win records and Spread records to see if we might not find some tidbits of value -- for instance a team that posted a nice number of wins but a lousy spread record might be deemed an underachiever which could have a trend one way or the other in the following year.
Regression to predict current season
Factor Coefficient for WINS Coefficient for ATS
Intercept 4.658 8.531
Wins, 4 yrs Ago .136 .120
Wins, 3 yrs Ago .031 -.030
Wins, 2 yrs Ago -.071 -.060
Wins, Last Yr .325 -.079
Spread Wins, 4 yrs Ago -.110 -.130
Spread Wins, 3 yrs Ago .044 .043
Spread Wins, 2 yrs Ago .160 .119
Spread Wins, 1 yr Ago -.087 -.049
R^2 = 0.13 / 0.05
Even with eight factors in the regression, the R squared values are terrible. Basically we are being told that we should look elsewhere if we're intent on trying to predict next year performance.
Given this, the following projected win tallies should be taken very lightly as they will likely have little predictive value. Not to worry though, we have better regression tools in store!
Wins Spread Wins Projection
Team '00 '01 '02 '03 '00 '01 '02 '03 Wins ATS
TEN 13 7 11 12 7 5.5 10 8.5 10.1 8.4
KAN 7 6 8 13 7 7.5 9.5 10 9.7 7.7
PHI 11 11 12 12 10 10.5 10 11 9.5 7.7
BAL 12 10 7 10 10.5 8 10 9.5 9.3 8.2
STL 10 14 7 12 7 9.5 4 9.5 9.3 7.5
CAR 7 1 7 11 8 7.5 10 6 9.2 8.2
MIA 11 11 9 10 10.5 10.5 9 7 9.2 8.0
NWE 5 11 9 14 7 11 6 13 9.1 6.8
IND 10 6 10 12 9 6 6 9.5 8.8 7.3
DEN 11 8 9 10 9.5 7 7.5 8 8.8 7.8
SEA 6 9 7 10 7 6.5 9 8 8.8 7.8
MIN 11 5 6 9 8 5.5 8 8 8.8 8.4
GNB 9 12 12 10 8 9.5 8 10 8.6 7.6
DAL 5 5 5 10 7.5 8 8 9.5 8.4 7.7
NOR 10 7 9 8 10 6 9 8.5 8.1 8.0
NYJ 9 10 9 6 8 8 10 6 8.1 8.5
TAM 10 9 12 7 9 6.5 10 7 8.0 8.1
OAK 12 10 11 4 10.5 7.5 10 3.5 7.6 8.7
CHI 5 13 4 7 7.5 11.5 5 8 7.5 7.7
BUF 8 3 8 6 7 6 8.5 7 7.5 8.5
NYG 12 7 10 4 9 5.5 8.5 4 7.4 8.7
CLE 3 7 9 5 5 8.5 10.5 6 7.3 8.4
SFO 6 12 10 7 7 9 5.5 8 7.2 7.5
WAS 8 8 7 5 6.5 8.5 7 8.5 7.2 8.4
JAC 7 6 6 5 7.5 8 8 8 7.1 8.4
CIN 4 6 2 8 8 8 4 10 7.1 7.4
DET 9 2 3 5 9 9 6.5 8.5 7.1 8.6
ATL 4 7 9.5 5 6 7 10 6.5 7.1 8.2
PIT 9 13 10.5 6 11 11.5 5.5 8 7.0 7.4
ARI 3 7 5 4 4 9.5 6 6 6.6 8.4
HOU 4 5 8 9 6.5 8.4
SDG 1 5 8 4 7 5.5 9 6 6.1 7.8
So according to this admittedly dubious formula we've concocted from the regression outputs, the top three spread value teams for the NFL 2004 season are projected to be the New York Giants, Oakland Raiders, and Detroit Lions, whereas by far the worst spread value for the coming season is predicted to be the New England Patriots. Hmmm, there may be some hint of truth to that after all...
--------------------------------------------------------------------------------
