The (model dependent) value of a turnover

The value of a turnover is a topic addressed in The Hidden Game of Football, noting that the turnover value consists of the loss of value by the team that lost the ball and the gain of value by the team that recovered the ball. To think in these terms, a scoring model is necessary, one that gives a value to field position. With such a model then, the value is

Turnover = Value gained by team with the ball + Value lost by team without the ball

In the case of the classic models of THGF, that value is 4 points, and it is 4 points no matter what part of the field the ball is recovered.

That invariance is a product of the invariant slope of the scoring model. The model in THGF is linear, the derivative of a line is a constant, and the slopes, because this model doesn’t take into account any differences between teams, cancel. That’s not true in models such as the Markov chain model of Keith Goldner, the cubic fit to a “nearly linear” model of Aaron Schatz in 2003, and the college expected points model (he calls his model equivalent points, but it’s clearly the same thing as an expected points model) of Bill Connelly on the site Football Study Hall. Interestingly, Bill’s model and Keith’s model have a quadratic appearance, which guarantees better than constant slope throughout their curves. Aaron’s cubic fit has a clear “better than constant” slope beyond the 50 yard line or so.

Formula with slopes exceeding a constant result in turnover values that maximize at the end zones and minimize in the middle of the field, giving plots that Aaron calls the “Happy Turnover Smile Time Hour”. As an example, this is the value of a turnover on first and ten (ball lost at the LOS) for Keith Goldner’s model

First and ten turnover value from Keith Goldner’s Markov chain model

And this is the piece of code you can use to calculate this curve yourself.

Note also, the models of Bill Connelly and Keith have no negative expected points values. This is unlike the David Romer model and also unlike Brian Burke’s expected points model. I suspect this is a consequence of how drives are scored. Keith is pretty explicit about his extinction “events” for drives in his model, none of which inherit any subsequent scoring by the opposition. In contrast, Brian suggests that a drive for a team that stalls inherits some “responsibility” for points subsequently scored.

A 1st down on an opponent’s 20 is worth 3.7 EP. But a 1st down on an offense’s own 5 yd line (95 yards to the end zone) is worth -0.5 EP. The team on defense is actually more likely to eventually score next.

This is interesting because this “inherited responsibility” tends to linearize the data set except inside the 10 yard line on either end. A pretty good approximation to the first and ten data of the Brian Burke link above can be had with a line that is valued 5 points at one end, -1 points at the other. The value of the slope becomes 0.06 points, and the value of the turnover becomes 4 points in this linearization of the Advanced Football Stats model. The value of the touchdown is 7.0 points minus subsequent field position, which is often assumed to be 27 yards. That yields

27*0.06 – 1.0 = 1.62 – 1.0 = 0.62 points, or approximately 6.4 points for a TD.

This would yield, for a “Brianized” new passer rating formula, a surplus yardage value for the touchdown of 1.4 points / 0.06 = 23.3 yards.

The plot is below:

Eyeball linearization of BB’s EP plots yield this simplified linear scoring model. The surplus value of a TD = 23.3 yards, and a turnover is valued 66.7 yards.

Update 9/29/2011: No matter how much I want to turn the turnover equation into a difference, it’s better represented as a sum. You add the value lost to the value gained.

Issues with Pro Football Focus’s new passer rating formula « Code and Football Says:

September 26, 2011 at 9:15 am

[…] better yet, since Brian Burke’s expected points formulas linearize to a surplus value for TDs of 23.3 yards, and the value of a turnover in yards is about 67 yards, use […]

From adjusted yards per attempt to a NFL style passer rating « Code and Football Says:

September 28, 2011 at 9:13 am

[…] of argument, we’re going to set alpha to 25, pretty close to the 23.3 that we get from a linearized Brian Burke model, and beta we’ll set to 60, 6.7 yards less than the 66.7 yards we calculated from the […]

From the Live Blog: Drive Outcomes » Skeptical Sports Analysis Says:

October 4, 2011 at 9:55 pm

[…] Keith Goldner and Bill Connelly and Romer/Burke are different. I’ve speculated on the difference here. Your plot suggests that different drive scoring has to be at the root of those differences, as […]

Nascent 2012 analytics questions « Code and Football Says:

August 21, 2012 at 1:02 pm

[…] data set, EP models are a class of related models which can be quite different in value (discussed here, here, here). If you need independent verification, please note that Keith Goldner now has […]

Dave Says:

July 10, 2013 at 10:50 am

But this doesn’t answer the most important question. Do “adjusted” turnovers predict future turnovers better than unadjusted turnovers?

foodnearsnellville Says:

July 11, 2013 at 8:54 am
Well, the article makes no attempt to predict turnovers at all, makes no attempt to define any concept like adjusted turnover or unadjusted turnovers, nor was there any analysis of any kind that I can see that would have led to any ability to predict turnovers. So I’m not sure what you’re trying to do here, other than inject a concept that has nothing to do with the article in the first place.

In this article, I’m largely taking a concept from THGF, noting Aaron Schatz’s groundbreaking 2003 paper, and then pointing out that those results aren’t fixed constants but instead inherently model dependent. As the scoring models have no sense of time, they have no sense of rate, and time-free models are, really, the worst kinds of things to attempt things gamblers would like to do, such as predict turnovers.

We’re after the value of a turnover, not turnover rates.

Code and Football