HATCH MATHEMATICAL COLLEGE FOOTBALL RANKINGS

 

RANKING PROCEDURES

Last Updated: 1/6/13

 

A number of factors have caused me to believe that it is time to tweak the formula that I have used to rank teams for the past 9 years. Although I have previously avoided editorializing on my website, I believe that I should explain my reasons for making these changes. My commentary will be found in red throughout.

 

RULE 1

DEFINITIONS

1.             Initial Ranking (IR): The final results from the previous season's poll. In the event that new teams join Division 1-FBS, new teams will be assigned a preseason ranking of a position immediately lower than the lowest ranked team from the previous season.

 

a.             The Initial Ranking is used only in the first iteration of each season's results. For subsequent iterations, the final ranking of the previous iteration will be used as the Initial Ranking.

 

b.             As a result, the Initial Ranking's effect will be filtered out as each successive iteration is conducted.

 

I have seen on at least one message board that my system was criticized for having an Initial Ranking. I would answer that (1) all systems have some initial input (even if the initial input is to assign the same value to all teams) and (2) that there is virtually zero impact (probably around 1/1000th of a teams actual Rating) from the Initial Ranking by Week 3, if not sooner.

 

2.             Stable Ranking Set: achieved when one set of final rankings is used as the Initial Rankings for a further iteration which produces the same set of final rankings that are identical to the Initial Rankings.

 

What this means is that I keep iterating the system until the initial and final rankings are identical. This usually takes several iterations.

 

3.             Rank (RK): the rank of a given team is its position in the order of teams' Rating scores, sorted descending. (E.g., a team that has the highest Rating scoreÕs Rank is 1.)

 

4.             Points (PT): Each Team is assigned a ÒPoints Value.Ó

 

a.             For seasons in which less than 129 teams are ranked, the Points total for a given team is the result of the following formula:

 

130 - RK = PT

(E.g., a team that has a Rank of 1 has a Points total of 129.) Points should not be confused with Rating score or Rank.

b.             For some seasons in which 130 or more teams are ranked, the Points total for a given team is the result of the following formula:

 

130 – (RK/#) = PT, where # = 130/the number of ranked teams.

 

I use this same approach in my basketball ratings. Some historical ratings had more than 129 teams eligible for ranking and I didnÕt want to arbitrarily exclude teams. The ever-expanding Division 1-FBS is also likely to wind up with more than 129 teams within the next few years, so this calculation may wind up being used in contemporary ratings before too long.

RULE 2

CALCULATION OF THE PERFORMANCE RATING

1.             Part One.

 

For a game in which Team A defeats Team B by a margin of M,

 

Where M < 11,

             IPRA = PTB x 1.05

            IPRB = (RKA +10) x -1.05

 

Where 10 < M < 20,

                    IPRA = PTB x 1.1

                    IPRB = (RKA + 10) x -1.25

 

Where M > 20,

            IPRA = PTB x 1.2

            IPRB = (RKA + 10) x -1.6

 

Where a tie,

IPRA = (.3 x (PTB x 1.05)) + ((RKB +10) x -1.05))

IPRB = (.3 x (PTA x 1.05)) + ((RKA +10) x -1.05))

 

After an extensive review of the 100 previous seasons I have ranked, I came to two conclusions:

(1)   I had not sufficiently punished teams for losing, especially by large margins. Simply because a team shouldnÕt be rewarded for winning by a big margin doesnÕt mean that a team that loses by a big margin shouldnÕt be punished. Losses by less than 21 points were also assigned more weight.

(2)   My system failed to adequately deal with tied games. This system was born in the Era of Overtime and I have never been satisfied with the historical rankings that treated ties as generally positive results. This change sees a tie as a mostly negative result. The object of the game is to win. Obviously, ties are still decidedly more favorable than losses.

 

2.             Part Two.

 

Where A is at home:

                        PRA = IPRA x 1.00

PRB = IPRB x 0.95

 

Where the Away Team wins:

            PRA = IPRA x 1.05

            PRB = IPRB x 1.05

 

Where the game is at a neutral site:

            PRA = IPRA

                        PRB = IPRB

 

Where the game is a Bowl Game (regardless of site):

            PRA = IPRA x 1.2

                        PRB = IPRB x 1.2

 

 

These values remain unchanged, except for the weighting of Bowl Game results.

 

3.         Part Three.

Ratings are Calculated.

 

RATING = (PRGame 1 + PRGame 2 + PR... + PRFinal Game + (130-IR)) Ö (Games Played)

 

a.             Rating scores will be rounded to one thousandth of a point. Decimal values beyond this value will not be considered, except in case of a tie. If teams are tied at the level of one-ten thousandth of a point, they will be treated as tied.

 

b.             Only teams considered part of Division 1-A/FBS, or the equivalent for the year in question will be ranked. Exception: Teams participating in the Rocky Mountain Conference from 1912-1937 will not be ranked.

 

The RMCC was, for all intents and purposes, hermetically sealed from playing other teams during this period. Inclusion of this conference skews the results badly because there are no external forces (i.e. inter-conference games) to help properly ascertain the strength of RMCC teams. This is an editorial decision, but obviously itÕs an exception dealing with teams that played 75 years ago or more. I donÕt think this decision is substantively problematic.

 

c.             All teams outside of those ranked will be represented by the Division 1-AA Placeholder. The Division 1-AA Placeholder will always be ranked below all rated teams in all rankings and will not be moved from the last place position for any reason.

 

In the past, I had treated the Division 1-AA Placeholder as a regular team that could move up or down in the system. That solution may have worked in, for example, 1993 where only 47 games were played against 1-AA opposition (about 6 games to every 13 teams), or as late as 2005 with only 53 games (a similar ratio as to 1993). In 2011, In 2011, 97 games were played against 1-AA teams (approaching 3 games to every 4 teams). In 2012 that number was 108 (in comparison to 124 teams).

 

I donÕt know about you, but I am sick of seeing allegedly strong teams play 1-AA opponents. My system should not reward Rutgers in 2011 for playing North Carolina Central (2-9 in the MEAC, with a loss to Savannah State(!))  more than if Rutgers had played Tulane or Akron. The idea of abstracting 1-AA teams is a necessary fiction (or else IÕd quickly find myself having to rate every team), but I will not allow teams to profit by playing 1-AA teams when there are 1-A teams available.

 

In my view, the trend towards playing increasing numbers of smaller division opponents will undermine various rating systemsÕ (including the future Playoff Selection CommitteeÕs) abilities to properly value the relative strengths of different leagues and will, resultantly, skew their final results.

 

d.             In the event that the results of a given season would create an irresolvable infinite loop for two or more different teams, teams will be rated based on the average of the results produced. (e.g., when team A is ranked 3rd and team B is ranked 4th in the initial ratings it produces a final rating having team B in 3rd and team A in 4th. Further iterations keep all other ranked teams in identical positions except for A and B that alternatively switch positions without any possibility of a conclusion. The dilemma is resolved by averaging team AÕs final rating when it begins in 3rd and 4th position and doing the same for team B. Whichever teamÕs average rating is higher will be ranked 3rd and the other 4th.)

 

1.             In the event that multiple infinite loops occur within the same season, the final rankings for all teams not caught such a loop will be determined following all averaging under Rule 2.3.d. Ratings for all other teams will reflect the results of the 2.3.d resolution of the loop involving the highest ranked teams.

 

I donÕt think I wrote this very well. If I have two loops, each time I iterate the system the looping teams will be in different positions which impacts many other teamÕs ratings. Where this happens (I think it only happened 2 times in the historical results), every one elseÕs rank will be determined by the ratings when the highest-ranked loop is resolved under 2.3.d. (In my example of A and B above, if A finished 3rd and B finished 4th, every other team not involved in a loopÕs ratings will be calculated as if A finished 3rd and B finished 4th).

 

e.             For the first week of a season, the system will be iterated a maximum of five times. Thereafter, the system will be iterated until it becomes a Stable Ranking Set as defined in Rule 1.2, and subject to the procedures in Rule 2.d. and 2.d.1.

 

RULE 3

DETERMINATION OF THE NATIONAL CHAMPION

Teams will be ranked according to their IR. Scores from each game will be assigned Ratings as discussed in 2.3. The final rankings of each team based on their overall Rating will then become the IR for a next iteration. This process will continue until all rankings have become a Stable Ranking Set, and any teams that are caught in an irresolvable infinite loop have their ratings determined by Rule 2.3.d as described above. The team with the highest Rating after the bowl games will be declared the national champion.

 

It is my view that ultimately, only three factors are germane to ranking teams. (1) Who did you play? (2) Where did you play? (3) How did you do? (Margin of Victory or Defeat – capped at 21 points, to limit the effects of running up the score.)

The touchstone of this system is that the quality of your opponent is the most important of the 3 factors. Although these changes have no impact on (2) and increase the effect of (3), the overriding consideration in this system rightly remains strength of schedule.

Having spent the holidays reworking the past 100 seasons, the effects that I have noticed are that teams are punished more for losing games (which was intentional), allowing for some teams with better records to be pushed up past where they used to be. To be honest, I am not sure if this will increase or decrease accuracy in predicting future games. But this system is not a predictive indicator. It is an indicator of past results. If past results now put more of a premium on winning, then that canÕt be a bad thing.

Thanks for visiting the site and please feel free to send me your hate mail.

-Ben