Find a Coach by the Numbers, Part I

Some of you may recall RedmondLonghorn's fascinating two-part series on Football Study Hall before the season:

Apparent Talent, Team Quality and Coaching Effects

Apparent Talent, Team Quality & Coaching Effects: Part Deux

Since we're now on the cusp of a new coaching search, and since Big(g)Ern started the discussion in earnest here, I thought I would re-introduce RL's math to the party, with my own twist.

This will be a two-part series. Today I'll dig into RL's numbers a bit, give them a few basic tweaks, and present some interesting observations from the resulting dataset. In Part II I'll update the data to include this year's stats-to-date and present the complete results with some additional analysis.

The Source

For those of you too lazy to click the links above, RL has put together a quietly amazing dataset containing coach names, 4-year recruiting star averages for each team compiled from both Scout and Rivals, Football Outsider's F/+ and S&P+, and Sagarin Predictor scores for each individual season at every school in a BCS conference from 2006 to 2012. That's 466 observations (66 schools x 7 years, plus 2 for Utah, 1 for TCU and 1 for Temple) times 6 categories of data = 2796 datapoints.

(That's a lot.)

Then RL plays with the data to suss out coaching effects. Basically, he's trying to provide evidence for the general notion that by accounting for player talent separately, good coaching can be identified and its impact measured with statistics.

But I'm not interested in general notions. I want to know which specific coaches might be an adequate replacement for Mack Brown. Fortunately RL has blessed us with the vast majority of data entry needed to get such an analysis off the ground, in a format easily cut-n-pasteable into Excel. If you're out there, RL: holy crap, man. Thank you.

If you don't want to understand how it all works in detail, that's fine. Might want to skip this section in blockquote:

The Problem

Conceptually, success in any team sport (hence "quality") can be explained as the intersection of two factors: the ability of the players ("talent") and the value added by (or subtracted from) their environment, including stuff like strategy, game technique, strength and agility training, cooperative team relationships, etc. We'll call that "coaching", since the coach is accountable for most of those factors.

The fundamental relationship between talent and coaching appears to be best described as multiplicative, not additive. Meaning, the best teams almost invariably have both. You can't win FBS championships with South Dakota State talent nor can you win championships with a drunk idiot mule at the helm, Barry Switzer notwithstanding.

So far we have this basic equation: C(oaching) x T(alent) = Q(uality).

RichmondLonghorn has given us good proxy data for T (Rivals, Scout data) and Q (F/+, S&P+, Sagarin). What I'd like to do is plug that data into the equation and solve for C.

Problem is, we can't do it THAT simply, because our proxy for Talent is denominated in "stars" while our proxies for Quality are denominated in...uh..."points" I guess.

We can still solve for Coaching but it will take a bit of simple calculus. First, we run a regression of Talent on Quality across the entire dataset and produce an equation that can be used to predict Quality based solely on Talent score:

via i882.photobucket.com

(Wonk note: I went with an exponential regression analysis because I want the line to be slightly curved upward, i.e., to set the bar disproportionately higher for coaches who already have good recruits. Call it an Un-Kiffening of the data. The R-squared isn't quite as good of a fit as a linear approach but it's still fine. Also: this analysis is performed after the data adjustments noted below.)

Then for each school-year, we can compare predicted Quality scores to actual Quality scores, and the difference for each is our Coaching effect for that school-year.

However, that Coaching effect isn't exactly what I'm looking for. C will tell me that Bill Snyder was a fantastic coach in 2012. But I know he's a fantastic coach because he runs a coaching system that perfectly suits a school that doesn't have any of Texas' major advantages.

We don't want a coach who can deliver decent Quality from low Talent. Instead we want someone whose high Coaching skills can credibly be applied to high Talent and thus produce elite, championship-contending team Quality.

So ultimately we want to give each coach in the database a score that is based on our measured Coaching impact but we need to weight that impact by team Talent. Coaches with good talent who succeed should get extra credit for proving they can do the job, and coaches with great talent who suddenly find themselves getting mudholed by average teams should get penalized more.

The simplest way to accomplish this weighting is (C times T), so that will be our raw score (hence "CxT"). To make the score easy to understand we'll convert it into an index (hence "CxT Index") where 100 is the top ranked CxT performance (Saban, Alabama 2011) and 1 is the lowest ranked CxT performance (Embree, Colorado 2012).

The Data

Before starting, I needed to make two major adjustments to RL's dataset.

First, after running a few tests on the data I noticed that RL's proxy for Talent - the four-year rolling average of Rivals/Scout stars - has actually been inflating over the last few years. The reason is simple: Scout and Rivals have been scouting more kids every year. That means they're handing out more 3, 4, and 5 stars rankings every year, which means there's a lot of kids out there getting three stars today who would've only gotten two in 2006.

This is problematic because in recent years talent-poor teams have seen almost uniformly rising talent scores, peaking circa 2008. Without adjusting the data, these leaps and bounds would soil the analysis, scattering average coaches at crappy schools across the top half of the results, all because scouting services finally got around to grading their crappy recruits. Here are the mean star averages for BCS conference teams from 2006-2012:

via i882.photobucket.com

Fortunately most of the grade inflation is on the bottom end of the Talent scores and the effect appears to be very linear. So I used a formula to subtract that from the raw Talent scores. Hard to explain but it's a simple fix, and without this adjustment you get Jeff Jagodzinski and Steve Kragthorpe in your overall Top 20, so it's worthwhile.

Second, I didn't want to include S&P+ in my Quality Composite because S&P+ is already part of F/+. Plus, the S&P+ index is skewed and pear-shaped, while both Sagarin Predictor and F/+ are much more normally distributed. The resulting Quality composite is a damn fine dataset, as these things go.

Here's the descriptive statistics:

via i882.photobucket.com

The Coaching results also come in very normal-ish. All of the indexes are somewhat fat-tailed. The Talent scale is also somewhat fat-bottomed and therefore so is the CxT Index, which is just C times T and indexed to 100.

The Results

Some tips when looking at the following numbers:

For both Quality and Coaching, stats the middle third of the range (33-67) will hold about two-third of the scores. 70 is a Top 10 score in a typical year, 80+ is a Top 5 score, and 90 is championship-level performance.

For the Talent rank, a 3.00 will typically put you on the cusp of the top 25 in Talent score in an average year, and a 3.5 puts you around the top 10. The highest Talent score in a year tends to be very close to 4.

For the CxT Index, scores over 50 are respectable. Scores over 60 typically produce Top Ten results, 70+ is a Top 5 score, and 85 is championship-level performance.

You'll notice that some coach names have an exclamation point in front of the name (e.g., "!Saban"). This is my method of marking seasons in which the coach on his "honeymoon" - i.e., in his first or second year, and thus the majority players on the team were not recruited by him.

This will allow me to distinguish "system coaches" - that is, coaches who need certain kinds of players to succeed and/or time to implement their system - and "fundamentals coaches" who can quickly coach up a sloppy roster someone else cobbled together. "System coaches" who recruit downward to fill particular needs and "fundamentals coaches" with no system are not well-positioned to take advantage of Texas' strengths.

Incidentally, the arrows signify the quartile in which a result resides. From top to bottom: green, yellow, grey, then red.

FanPost

The Source

The Problem

The Data

The Results

Most Talented

Least Talented

Highest Quality

Lowest Quality

Best Coaching

Worst Coaching

Best CxT Scores

Worst CxT Scores

Cumulative CxT Averages

Best "Honeymoon" Performers

Best Established Performers

Team Analysis

What's Next

More from Barking Carnival

Recent FanPosts

In This FanPost

Teams

Trending Discussions