We thank Teh and co-authors for a stimulating paper. The comparison of the different performance characteristics of Rt estimators is to be applauded. The lack of any more general systematic comparison has been a crucial gap in the literature. At the same time, the chosen approach highlights some difficulties in addressing this gap.

  • 1.

    Reproduction numbers are generally not observable and the ability of a model to reconstruct their trajectories can only be assessed in simulation studies. These may not yield sufficient information to accurately assess the ability of a model to estimate Rt in the real world. While simulation studies can be useful to assess the self-consistency of a model and its sensitivity to potential misspecification, their use for comparing estimates between models is more limited unless the data generating process is not one of the models used for estimating.

  • 2.

    It is challenging to make a fair comparison between tools that each have much flexibility and can be used and parameterised in a variety of ways that affect the resulting reconstructed R trajectories; identifying whether any resulting performance differences are due to differences in the methods themselves or specific user choices can be difficult.

  • 3.

    By definition of the instantaneous reproduction number Rt is the average number of new infections caused by infectious individuals at time t (weighted by their infectiousness). However, this leaves much room for interpretation. It can be defined over any time frame such as daily or weekly, and this choice affects the estimates. Moreover, even within the same model there may be several quantities which could be interpreted as Rt. For example, consider the following discrete-time branching process model, initialised with some starting number of infectious individuals I0, a mean reproduction number R and over-dispersion k:

    It ∼ Poisson (λt).

    λt = Xt − 1It − 1.

    Xt ∼ gamma (mean = R, var = R/k).

    There are at least three ways one could define an effective reproduction number in this model:

    (a) Rt = It/It − 1.

    (b) Rt = Xt − 1.

    (c) Rt = R.

Any assessment of the quality of Rt estimates would thus involve a decision on the timescale at which the reproduction number is considered and what exactly Rt represents and whether models agree that this is being estimated.

We believe that evaluating the performance of published Rt estimates or methods for generating them is valuable, and reiterate our appreciation of the authors’ approach to doing so. Further comparisons should be encouraged by both those developing methods and those consuming them. Ideally, this could be led by an independent arbiter who defines the parameters of the exercise and, in collaboration with the developers, applies the methods to be assessed.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The moral rights of the named author(s) have been asserted.