By Andrew Cohen
As a devoted Harvard basketball fan, I was disappointed on Selection Sunday. By now, you have probably seen the shot. Harvard had an NCAA tournament berth in its hands, and Princeton stripped it away. My mourning period was interrupted an hour later when Harvard’s name entered the conversation for an at-large tournament berth. While it was obviously a long shot, I relentlessly investigated the tournament resumes of the other schools on this year’s bubble. For the most part, Harvard was stacked up against high-profile power conference schools with decent records and plenty of opportunities to play and defeat tournament-caliber teams.
Now I know that Harvard was never a serious contender for an at-large berth. But after watching ESPN College Game Day for 24 straight hours after the loss to Princeton, I can guarantee that Harvard received consideration from the selection committee. The committee had to choose between a power conference team like USC, with a measly record of 19-14 but 10 wins this season against Pac-10 teams, and a mid-major team like Harvard, with a 23-6 record but a chance to play only 5 tournament-ready teams. In this study, I attempt to bring a quantitative analysis to the mid-major/power conference debate for teams on the bubble.
March Madness is all about upsets. The underdogs are the lifeblood of the tournament. According to some (experts included), if a power conference school and a mid-major school are judged completely equal in tournament qualification, the berth should go to the mid-major, the perennial collegiate underdog, the George Masons and the Davidsons of the world. After all, these schools are generally untested, haven’t been given the opportunity to prove themselves, and have performed close to flawlessly throughout the season. But who actually performs better in the tournament: the mid-major or the power conference schools?
To assess whether power conference teams on the bubble outperform their mid-major counterparts, I built a dataset of the “last four” at-large selections in every tournament field over the last ten years (2001-2010). Seeding was used to identify this last four because, prior to this year, the final four teams added to the tournament were not publicly disclosed. Any at-large selections holding the same seed as any of the last four were also included in the dataset. Over the ten years, these criteria yielded 29 power conference teams and 24 mid-major teams.
Tournament results were quantified by teams’ Performance Against Seed Expectations (PASE). As defined by Pete Tiernan of ESPN:
PASE measures the average number of wins a team attains above or below the number its seed position would dictate that it achieves. PASE is calculated by tallying the positive or negative differences between actual and expected wins at each seed position. The total of these differences is then divided by the number of appearances to arrive at an average number of games the team either over-performs or under-performs per tournament.
To further our understanding of the PASE metric, let’s look at an example from the data set. The Arizona Wildcats of 2009 were seeded 12th and advanced to the Sweet Sixteen. The Wildcats won 2 tournament games that year, compared to the 0.48 wins a 12 seed accumulates on average. Thus, their PASE was 2 - 0.48 = 1.52. To learn more about PASE, see the short ESPN article here.
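For readers who want to see the arithmetic spelled out, here is a minimal Python sketch of the PASE calculation as described in the definition above. The function name `pase` and the `EXPECTED_WINS` table are my own illustrative choices, not Tiernan's code. Only the 12-seed average of 0.48 comes from this article, so any other seed would need its own historical average filled in.

```python
# Average tournament wins by seed. Only the 12-seed value (0.48) is taken
# from the article; other seeds are left out rather than guessed.
EXPECTED_WINS = {12: 0.48}


def pase(appearances, expected_wins=EXPECTED_WINS):
    """Average (actual wins - expected wins) over a list of appearances.

    Each appearance is a (seed, wins) pair for one tournament.
    """
    diffs = [wins - expected_wins[seed] for seed, wins in appearances]
    return sum(diffs) / len(diffs)


# Arizona 2009: a 12 seed that won 2 games.
arizona_2009 = pase([(12, 2)])
print(round(arizona_2009, 2))  # 1.52

# Hypothetical program with two 12-seed appearances (2 wins, then 0 wins),
# included only to show the averaging step.
two_trips = pase([(12, 2), (12, 0)])
print(round(two_trips, 2))  # 0.52
```

A single appearance gives the team's PASE for that year, while a longer list gives the per-tournament average that the definition describes.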
The average power conference PASE was 0.166, and the average mid-major PASE was -0.053. A two-sided unpaired t-test showed that this small difference in bubble performance between the power conference and mid-major schools was not statistically significant at the α = 0.1 level.
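To make the test concrete, here is a rough sketch of a two-sided unpaired t-test using only the Python standard library. A few assumptions to flag: the article does not say whether equal variances were assumed, so this sketch uses Welch's version, which does not pool the variances. The function names are mine, and the two short lists of PASE values are made-up placeholders, not the 29 and 24 values in the real data set.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t statistic, degrees of freedom) for Welch's unpaired t-test."""
    va, vb = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_c = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    c = math.exp(log_c) / math.sqrt(df * math.pi)
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=2000):
    """Two-sided p-value: 1 - 2 * (area under the t density on [0, |t|]).

    The area is approximated with Simpson's rule (steps must be even).
    """
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps
    total = t_pdf(0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return max(0.0, 1 - 2 * total * h / 3)


# Placeholder PASE values for illustration only (NOT the article's data).
power = [1.52, -0.48, 0.52, -0.48, -0.48]
mid_major = [-0.48, 0.52, -0.48, -0.48, -0.48]

t, df = welch_t(power, mid_major)
p = two_sided_p(t, df)
print("significant at alpha = 0.1:", p < 0.1)
```

With the real PASE values in place of the placeholders, a p-value above 0.1 would correspond to the "not statistically significant" result reported above.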
A slight outlier in this data is the 2002 Missouri team that went to the Elite Eight as a 12 seed. The only team in the data set to win three games, the Missouri Tigers boasted a PASE of 2.52 that year. Led by current NBA journeyman Kareem Rush, the Tigers may inflate the power conference average enough to lift it slightly above the mid-major average. It is also worth noting that George Mason does not appear in this data set. While George Mason received an at-large bid in 2006, when it made its Final Four run as an 11 seed, the last four at-large teams in 2006 were seeded 12th and 13th.
We can interpret this data to mean that power conference bubble teams tend not to outperform the mid-major squads. As discussed earlier, a constant debate among ESPN College Game Day analysts over the weekend was whether or not to give preferential treatment to underexposed mid-major teams on the bubble, if it could be determined that a given power conference team and mid-major team had equal tournament resumes. While this TV debate gave Harvard fans false hope (and UAB and VCU fans real hope), history shows no indication of a difference in tournament performance. With the four-team expansion in effect this season, the bubble was regarded as overpopulated with equally qualified teams. According to the predictions of ESPN’s Joe Lunardi, the Selection Committee contemplated doling out the last four spots among 11 teams. (The list: Saint Mary’s, Clemson, Virginia Tech, USC, Alabama, Georgia, Boston College, UAB, Harvard, Missouri State, and VCU. Clemson, USC, UAB, and VCU received the last four bids.) When forced to choose among 6 power conference schools and 5 mid-major schools, the Committee chose two of each. While this simple study cannot evaluate the correctness of the committee’s last four picks, it can conclude that the exclusion of a team based solely on conference status is unjust.