Wednesday 4 March 2015

Batting out your overs


The mantra that "the biggest sin a team batting first in an ODI can commit is to not bat our its overs" has long been a bugbear of mine. As Dan Liebke noted in a rant about net-run-rate the other day, 
We've had Duckworth Lewis for decades now and, even if the mathematics of it is beyond most casual fans, the basic concept that wickets remaining are a resource that need to be considered along with overs remaining is pretty well established. 
Yes, a team has two resources. If it is a sin to not use one of those two resources to the max, why is not also a sin to bat out 50 overs leaving capable batsmen in the pavilion with their pads on? A batting team has to manage both declining resources with no certainty as to the effect that its actions will have on either the rate of scoring or the loss of wickets. 

So I was very happy to see Chris Smith take on this mantra in his Declaration Game blog, and also to see him quote a former player, Geoff Lawson, who was prepared to take a contrarian view. 
`Why?' asked Geoff Lawson, who went on to rationalise that if all the batting side attempted was to survie the 50 overs, they were very unlikely to set a winning total. `Wouldn't it be better', Lawson argued, `to hit out wiht the aim setting a challenging targe, accepting the risk that they could be bowled out, than to crawl to an unsatisfactory total?'
Lawson is right, although maybe not quite. In this quote, he seems to be suggesting that a team that is heading towards a very low score might as well start taking more risks to get to a competitive total. This is a manifestation of a mathematical theorem known as Jensen's inequality, when optimising over a relationship that is not linear, but actually, the relationship between the total score and the probability of winning is pretty much linear over the range of possibilities that can occur on any particular ball. That means, that a batting team should always ignore the current score, accept bygones as bygones, and base their level of aggression on how many balls and wickets they have remaining.

As it happens, we can quantify this decision reasonably precisely. The graph below gives a measure of what I like to call "deathness" for the first innings. The particular metric I use is the payoff to a risky single. Imagine that the batsmen have to choose between trying for a run or not. If they choose not to run, they will score 0 runs but not lose a wicket. If they try for the run, there is some probability that attempt will fail and one batsman will be run out, or they might succeed. What probability of being run out would be too high to make the risk not worth the cost. The graph shows that cross-over probability as a function of the number of overs bowled, for each possible number of wickets lost. The higher is the probability, the greater is the risk that it is worth taking and so the greater is the level of deathness (so called, because the final overs in an innings where batsmen start to take higher levels of risk is often termed "the death"). The actual numbers aren't particularly interesting (most decisions on aggression are about striking the ball, not about whether to attempt a run), but the comparison across different lines in the graph is revealing. So, for example, the graph reveals that if a particular level of aggression is warranted after 40 overs when a team is 5 wickets down, then the same level can be justified at 23 overs if no wickets have been lost.  



Before getting to batting out your overs, a few things to note about this graph:
  1. It is based on WASP data that predates the rule change to two new balls and only four outside the circle. That said, the basic story would not change using more recent data or some other estimate of the cost of a wicket such as the Duckworth-Lewis tables. 
  2. This table indicates what the expected payoffs are to different levels of risk and return in different game situations; it does not show what different risk-return combinations are possible. So, for 0-7 wickets down, the graphs indicate that the cost of risk is high at the start of the innings (the probability of a run-out has to be very low to justify attempting a run). With the fielding restrictions in the first 10 overs, however, it can be that the return to batsmen from a particular level of risk is much higher than in the middle overs, so that a high-risk strategy is still worthwhile, despite the costs. 
  3. The graphs all hit 100% for the final ball of the innings. That makes sense. It is simply saying that as long as there is any probability whatsoever of not being run out, you might as well keep running until you lose your wicket on the final ball.  
  4. Interestingly, though, for 1, 3 and 6 wickets lost, the graphs hit 100% before getting to the final ball of the innings. Remember that this is based on average-team versus average-team data. What is going on here is that on average the batters deeper in the batting order, are better at power slugging than those further up the order. So, for example, it is common for a batting order to have two aggressive openers followed by an accumulating #3 to take the team through the middle overs. If a team gets to 43 overs with only one wicket down, it might be better to go for a suicidal run (with the #3 coming to the danger end) and bring in a power hitter than to play out a dot ball. 
  5. The graph for 9 wickets down slopes down for most of the graph. This is mostly reflects out-of-sample extrapolation (there is no actual data for games where a team is 9 wickets down after 2 overs), and also the fact that when a team is 9 wickets down very early, there is almost no chance they will bat out their overs, and are likely to lose their last wicket any time so its worthwhile the batters taking risky singles while they are still there to do so. The longer the innings progresses, the less reason there is to think that the next wicket is imminent and so more need for caution. 
  6. While there is a general tendency for the graph to be lower the more wickets that have been lost, this tendency is not absolute. This is because, while losing a wicket will reduce the expected number of runs the team will score, the cost of the next wicket is not necessarily greater. For example, after about 46 overs, the incremental cost to a team of losing its 7th wicket is less than losing its 5th or 6th at that stage, so a team being 6 wickets down should be more aggressive than one that has lost only 4 or 5 wickets. 
So let's now think about batting out your overs. In the World Cup game between New Zealand and England, England batting first lost their 6th wicket at 28.1 overs, their 7th later in the same over, and their 8th at 30.4 overs. Looking at the purple, yellow and pink lines, the deathness measures at 28-30 overs, are all pretty much the same. Yes, a lot more caution was called for than if they had only been 2 wickets down at that point (and so Broad's approach at that point was probably not beyond reproach), but also the optimal strategy was not for the team to go into its shell. Rather the situation called for playing in much the same style as any team should do in the middle overs (10-30) but delaying all-out aggression for a bit longer than if they had more wickets in hand. This pretty much describes any situation where the "make sure you bat out your overs" comment is likely to arise. A team should probably delay its all-out assault for a bit if it loses too many wickets, but at no point should it bat more conservatively than in a normal middle-orders situation.  

7 comments:

  1. Seamus, seminal stuff. You've even coined a new term.
    Contrary to your (and my) position, I do see some justification for 'batting out the overs' in the graph. Between overs 42 and 47 there is a significant diverging of the level of acceptable risk for teams 4 or fewer wickets down and those 7 or more. Am I reading it correctly?
    Chris

    ReplyDelete
  2. ballsintherightareasThu Mar 05, 10:44:00 am GMT+13

    Regarding the sin of leaving unused batters in the pavilion, this should be extended to the sin of not letting your fastest scoring batters have as much time as possible at the crease.

    ReplyDelete
  3. You are right, and I see this as a rebuttal of Geoff Lawson's position but not of yours and mine. This is saying that you should delay the all-out assault (for a given quality of bowling, etc.) but there is no reason to behave more conservatively than one would in the middle overs. And given the typical regularity of wickets in the middle overs, playing normally if you are 6-8 wickets down at 30 overs (as in the case of the England versus NZ game) will almost certainly see a team not survive till 50. To keep using that game as an example, Broad's approach was probably too aggressive, but having got to 30 overs 7 wickets down, batting ultra-conservatively to try to survive to 50 would also have been a mistake.

    ReplyDelete
  4. It is certainly a case for a flexible batting order (such as promoting Maxwell in Aus v Afg, assuming winning by a huge margin is more important than letting your captain have much-needed batting practice). You want your big hitters when the cost of a wicket gets low and they can hit out with impunity. If the 40 over mark is the start of the death when 4 wickets down, then you should send in the big hitters if you lose your first wicket at around 26 overs. Of course, the data above comes from games with one new ball per match. With the new rules, it might be the case that you save your big hitters till a bit more shine is off the balls, knowing that they will still be hard through to the 50th over.

    ReplyDelete
  5. Of course bowling sides well versed in Game Theory may be able to figure out strategies to counter this approach. Although it was easier when underarm bowling was still possible.

    ReplyDelete
  6. ballsintherightareasThu Mar 05, 11:50:00 am GMT+13

    I would think there is a good case for sending in the faster scoring players right from the start, to make sure they face as many balls as possible, and save the slower scoring, low risk style batsmen for later, to ensure all overs are used if the top order get out quickly.

    ReplyDelete