Friday, 19 June 2015

Is 400 the new 300 in ODIs?

The current ODI series between England and NZ has been quite extraordinary. England has scored more than 300 in every game, but has lost twice. Yesterday morning, New Zealand’s total of 349 was not only chased down by England, it was chased down with ease, with England losing only three wickets and having 6 overs to spare. 

My sense from my Twitter feed is that the conventional wisdom is as follows:
  1. The rule changes dating from October 2012 (that saw two new balls in each innings and a reduction in the number of fielders allowed outside the circle in non-powerplays) allowed for larger scores, as the outfield gaps and still-hard balls allow batsmen to score at will in the final overs.
  2. These rule changes coincided with new batting skills honed in 20-20 competitions like the IPL.
  3.  New Zealand has been leading the way in showcasing an aggressive approach to cricket; England  prior to now has continued to play with an outdated conservative style, but has now belatedly accepted the new approach, in which “400 is the new 300”.
There is probably much right with this version of events, but I’m not fully convinced. Here are some raw numbers. Since the October 2012 rule changes prior to the current series between England and New Zealand, there were 177 ODI games played involving two teams from the top 8 (defined as the test-playing nations excluding Zimbabwe and Bangladesh), excluding games with a Duckworth-Lewis-Stern reduction in overs. Of those, 56 (or 31%) saw the team batting first score 300 or more. This is certainly a higher rate than we would have seen in past eras, but not as high as conventional wisdom seems to be suggesting. Moreover, getting to 300 still made the team batting first the overwhelming favourite. Of those 56 games with a first-innings score in excess of 300, the team batting first won 48 (86%). More pertinently, of the 18 games where the first innings score only just got to 300 (defined as a score between 300 and 310), the team batting first won 14, which is still a %77 success rate. If 400 is the new 300, it really should be easier to chase down 300 than these data suggest.

I wrote last year about how, once you adjust for mismatches between teams and where the game has been played, there wasn’t much evidence in the data for a general trend towards increasing first-innings scores. Taking all games from the start of the English 2002 season through to the end of the World Cup, controlling for team ability, home-field advantage, and the ground being used, first innings scores since Oct 2012 are only 12 runs higher on average than in the 10 years before Oct 2012. (For data geeks, I describe the exact model at the bottom.)

The following graph illustrates the lack of a trend. The small red dots are the difference between the first-innings score and a prediction based on the team batting, the team bowling, which team (if any) was playing at home, the ground at which the game was played, and allowing for a 12-run premium for the current rules. The solid red dots are a 25-game smoothed moving average, to take out some of the random variation and make any trends clearer. Although there appears to be a bit of an upward trend over the period since October 2012, scores by the end of this period were still only 28 runs higher than in the 2002-2012 period, suggesting that 328 is the new 300!


But now look at the four blue dots. These are the out-of-sample prediction errors for the first four ODIs between England and NZ in the current series. These predictions take into account England’s and New Zealand’s recent (since Oct 2012) batting and bowling form, England’s home-field advantage, and how high scores typically are at those four grounds. The prediction, actual score, and prediction error are as follows:


Ground
Predicted Score
Actual Score
Prediction Error
Edgbaston
235
408
173
The Oval
256
398
142
The Rose Bowl
269
302
33
Trent Bridge
232
349
117
Chester-le-Street
225 (Eng) 228 (NZ)
?
?

The point here is that rather than there having been a world-wide trend in the past few years that England have only now come to grips with; the current series has been extraordinary in every respect even in comparison to recent history. So what is going on? I can think of four hypotheses:
  1. I have made a massive coding error in my database.
  2. There has been a structural break in conditions: The four English groundsmen have produced very different pitches than in the past, ones much more favourable to high scores.
  3. There has been a structural break in team quality: Both NZ and England have better batting and/or worse bowling in this series than they had in the recent past.
  4. These four games have been black-swan events; and things will return to normal soon.
  5. There has been a strategic mindset shift in both New Zealand and England.

When things look extraordinary, coding errors are always a good bet, and I wouldn’t rule this out, but the raw predictions don’t look too far out from my own intuition, so I  don’t think this is the problem here.

I can’t comment on whether conditions were very different from usual in the four games so far, but I haven’t seen any commentary from England suggesting that the groundsmen have been producing untypical pitches, so I suspect hypothesis 2 is not the right one.

There is probably some truth to hypothesis 3. I don’t think the batting is too much different, but the bowling is quite possibly weaker. New Zealand have lost Vettori, Anderson and Milne from their World Cup bowling line-up, and have had Southee and Boult together for only one of the four matches. England have rested Anderson and Broad. Even so, I would put my money on the final two hypotheses explaining most of the data. 

The idea that teams are not aggressive enough when batting is something Scott Brooker and I have been saying for a long time. Back when he was writing his thesis, Scott experimented with what an average team would be able to achieve if they applied the optimal level of aggression. Based on the strike rates and dismissal rates that we can observe batsmen having in different game situations (e.g. conservative batting in the middle overs versus aggression in the death overs), he constructed a set of frontiers describing the trade-off between risk and return for typical batsmen in positions #1-#11, and then simulated optimal behaviour. He found that scores could be roughly 30 runs higher if batting teams were more aggressive, but that there would be more variance in scores and a higher probability of not batting out the overs. This was based on data from before 2007. It is likely that under the new rules, the value of extra aggression is even higher. 

What I think we are seeing in the current series is two teams who keep pushing the boundaries of this approach, forcing the other team to react in kind, and so all kinds of previously unrealised potential that has existed for a while is now being revealed. Contrary to the conventional wisdom, I don’t think this has been New Zealand’s approach before now. Rather, I think they have emphasised retaining wickets during the middle overs in preparation for an all-out assault in the final 10. New Zealand’s famous aggression in the World Cup was mostly seen in its approach to bowling, putting an emphasis on wicket taking rather than containment.

If I am right, that there has been a mind-set change for both teams in the current series, I am mindful that Scott’s conclusion was that the additional 30 runs on average would come alongside a big increase in variance. This brings me back to the black-swan hypothesis. We have seen scores more than 100 runs in excess of what recent form would have predicted on average. It is likely that in each game, there was a degree of luck. Batsmen got away with taking risks on these occasions, but we could just as easily have seen scores that were quite low. Even allowing for the fact that dropped catches are more likely when batsmen are hitting the ball hard, the catching in the current series does seem to have been below par. Realistically, 370 might be the new 300, but equally 180 the new 200.

Method:

My prediction model was based on a database of all non-rain-affected ODIs involving the top-8 countries since May 2002, using only games played on grounds where there were at least 10 matches played (but also including Chester le Street, as that is the venue for the final ODI between England and NZ).

The model was an OLS regression of first innings score on a dummy variables for the batting-team country, dummy variables for the bowling-team country, a dummy variable for each of the 53 grounds, a dummy variable for when the batting team was playing at home, and for when the bowling team was playing at home. Finally, I added a dummy variable for matches played since Oct 2012, and more dummy variables for this recent-era interacted with the batting and bowling team dummies. 

These interaction teams mean that only post-Oct-2012 data is used to determine the effect of team ability on scores; the only reason for pooling the data with the pre-Oct-2012 era is to provide enough data to estimate ground effects. Essentially, the model is assuming that the relative impact on scores of being at a particular ground and the relative impact of home-field advantage has not changed from before the Oct 2012 to after.


In the 25-game  moving average shown in the chart above, the data is broken around the structural break of Oct 2012, so that the smoothed line before the break is not influenced by games played after the break and vice versa. 

6 comments:

  1. Very interesting analysis.

    I am no statistician, but I would like to point out that the statistic of 300+ not being chased down is not a reliable falsifier of the hypothesis, because optimal amounts of aggression are more probable in pressure-less conditions (i.e. batting first). While chasing, scoreboard pressure acts act as a natural dampener of aggression levels and thus we see that 300+ scores batting second are less frequent than the same batting first.



    It would be interesting to model the probability of 300+ scores while chasing with a mathematical model of scoreboard pressure, perhaps linearly proportional to the target, with the factor of proportionality dependent on the success rate as a function of target.

    ReplyDelete
  2. Himanish. The impact of pressure is an interesting question, but I don't think it affects the interpretation here. First, you would expect to see far fewer 300+ 2nd innings scores than 1st innings, because of the nature of the game. When a strong team bats first against a weaker team, they could well get over 300. When they bat second, 300 is much less likely as they are almost certainly chasing a score less than that and the game ends as soon as they have reached target.

    What we do know from the data is that, at least before the 2012 rule changes, the probability of chasing down a total above 300 was more than the probability that such a total would be scored in the first place. This is because high first innings scores are correlated with how easy the conditions are for batting. If there is an effect of pressure, it is being masked by this effect.

    But as to the interpretation of the data, if 400 is the new 300, and (as we also often hear), par is now 300, then there should be know pressure chasing 300-310; you are only chasing par. But, prior to ENGvNZ, we were not seeing teams able to consistently chase down this supposed par.

    ReplyDelete
  3. The UK commentators made a couple of points regarding high scores. 1) the cricket ball, Ian Botham doesn't think much of the Duke balls they are using in this series, as it doesn't help the bowlers as much as other brands. 2) The boundaries have been brought in from the fence, and implying that they had been brought in more than normal.

    ReplyDelete
  4. I heard that comment from Botham. Certainly, the lack of swing in this series has been a marked contrast to what we saw in NZ last summer, but I thought he was saying that they had changed the ball a couple of years back. I didn't hear the comment about boundaries. That might be relevant.

    ReplyDelete
  5. If Dube's analysis cannot be extended beyond the US, then those here citing him in favour of raising minimum wages, when our minimum wage are already around 65% of the median, are on pretty shaky ground.


    Sorry for the delayed reply; just caught these.

    ReplyDelete
  6. That is a very fair point, it does cut both ways. Like I said, I am a bit out of my depth in the minimum wage debate, but I would be cautious about relying on US data. With a fragmented state and Federal minimum wage system (and income tax system too), they are very different from us. Not to mention the probable impact on wages from undocumented migrants and other factors.

    ReplyDelete