_{0}?

_{0}?

^{*}

Wrote the paper: SB RB RV. Other: Performed computer simulations: RB RV.

The authors have declared that no competing interests exist.

To predict the potential severity of outbreaks of infectious diseases such as SARS, HIV, TB and smallpox, a summary parameter, the basic reproduction number R_{0}, is generally calculated from a population-level model. R_{0} specifies the average number of secondary infections caused by one infected individual during his/her entire infectious period at the start of an outbreak. R_{0} is used to assess the severity of the outbreak, as well as the strength of the medical and/or behavioral interventions necessary for control. Conventionally, it is assumed that if R_{0}>1 the outbreak generates an epidemic, and if R_{0}<1 the outbreak becomes extinct. Here, we use computational and analytical methods to calculate the average number of secondary infections and to show that it does not necessarily represent an epidemic threshold parameter (as it has been generally assumed). Previously we have constructed a new type of individual-level model (ILM) and linked it with a population-level model. Our ILM generates the same temporal incidence and prevalence patterns as the population-level model; we use our ILM to directly calculate the average number of secondary infections (i.e., R_{0}). Surprisingly, we find that this value of R_{0} calculated from the ILM is very different from the epidemic threshold calculated from the population-level model. This occurs because many different individual-level processes can generate the same incidence and prevalence patterns. We show that obtaining R_{0} from empirical contact tracing data collected by epidemiologists and using this R_{0} as a threshold parameter for a population-level model could produce extremely misleading estimates of the infectiousness of the pathogen, the severity of an outbreak, and the strength of the medical and/or behavioral interventions necessary for control.

In Epidemiology, it is essential to quantify the severity of actual (or potential) outbreaks of infectious diseases such as SARS _{0}) that characterizes the potential of an outbreak to cause an epidemic. R_{0} has been extensively used to assess transmissibility of pathogens, severity of outbreaks, and epidemiological control _{0}, as phrased by Anderson and May _{0} is greater than one then the outbreak will lead to an epidemic, and if R_{0} is less than one then the outbreak will become extinct_{0} is a threshold parameter that establishes whether an outbreak yields an epidemic or not. Here we establish that the average number of secondary infections (i.e., R_{0}) is not always an epidemic threshold parameter.

Epidemiologists calculate R_{0} using individual-level contact tracing data obtained at the onset of the epidemic. Once an individual is diagnosed, his/her contacts are traced and tested. R_{0} is then computed by averaging over the number of secondary cases of many diagnosed individuals. This approach is based upon the definition of R_{0}, but it does not ensure that the calculated R_{0} is also an epidemic threshold parameter.

Another approach (which is more commonly used) is to obtain R_{0} from population-level data, namely cumulative incidence data. Making certain individual-level modeling assumptions (e.g., the mass-action principle of infectious spread, time independent infection rates, etc.), theorists construct models (typically) based on Ordinary Differential Equations (ODEs) which describe the dynamics of the expected population size in different disease stages without tracking individuals. It is very important to note that the individual-level modeling assumptions cannot be verified using population-level data (i.e., they remain hypothetical). ODE models are formulated in terms of disease transmissibility and progression rates at the population level. These parameters are obtained by fitting the model to population-level data; their relation to the individual-level processes may be quite complex and is generally unknown. Bifurcation analysis of the ODE model yields a threshold parameter _{0} that has been calculated from contact tracing data.

The individual-level and the population-level approaches may produce very different numbers as the first calculates the value of R_{0}, whilst the second calculates the value of a threshold parameter. The question of whether the R_{0} obtained by calculating the average number of secondary infections matches the threshold parameter obtained from fitting the epidemiological model to population-level data has been previously studied _{0} values obtained from different individual-level models (ILMs) do not necessarily agree with those obtained from mean-field ODE models. However, in order to make this point, the modelers consider that the individual-level transmission dynamics occurs on a _{0} mismatch can be attributed to the model mismatch. In contrast, in our ILMs, we preserve the assumption that the contact network is all-to-all. However, our research focuses on the _{0}. We thus discuss two distinct ILMs whose prevalence and incidence can be described by an ODE model with an established threshold parameter. We calculate their R_{0} values through the definition and then compare these values with the epidemic threshold parameter. Our results address the question of whether or not an R_{0} (i.e., an average number of secondary infections) can be assigned to an ODE model (which only provides a population-level description of disease propagation) without having any knowledge of the underlying disease transmission network.

A simple ODE model is the Susceptible-Infected (_{0} of this ILM. In this case, the average number of secondary infections R_{0} =

Two generations of infections are shown. Every horizontal bar segment represents the time interval that a specified individual remains infectious; these time intervals follow a negative exponential distribution with average ^{−1}^{−1}

However, the branching process is not the only possible ILM that is compatible with the ODE model. Recently, we have shown that other plausible ILMs can be constructed _{0}. Starting from one infected individual, our simulations integrated the ILM and kept track of the number of secondary infections caused by each individual in the infectious and in the recovered pools. The dynamics were integrated to a certain final time and the collected data were stratified over the date of infection. R_{0} was calculated using the average number of secondary cases generated by infectious individuals, according to the standard definition of Anderson and May _{0} is no longer infectious and that there is no right censoring (See _{0} value by real-world contact tracing data.

The model tracks individuals through growing a transmission network by using infection and removal rules

A The average prevalence versus time. The open circles represent results for the branching process while the dots represent results for our ILM. We used _{0} of the branching process is 1.5, while the R_{0} of our ILM is approximately 1.4.

The results (black dots) of the simulation are presented in _{0} yields the expected value that agrees with the threshold parameter of the _{0} versus the date of infection plateaus at a lower value than that for the branching model. It is thus evident, as supported by our numerics, that two individual-level models having exactly the same expectations of the corresponding population-level variables (i.e., incidence and prevalence) may yield different R_{0} values (as given by the definition). In the case of our second ILM (see _{0} is not the threshold parameter of the

Our results have significant consequences for understanding the concept of R_{0}. We explicitly show that certain population-level dynamics, theoretically specified by an ODE model, can be the result of many distinct ILMs. We further demonstrate that the R_{0} obtained from the ILM, by applying the definition of Anderson and May _{0} value found by contact tracing as a threshold parameter may be inaccurate.

Our novel results have significant implications for understanding the dynamics of outbreaks of infectious diseases, particularly for the biological understanding of the transmission dynamics of the pathogen, estimating the severity of outbreaks, making health policy decisions, and designing epidemic control strategies. We have shown that the value of R_{0} may not be an accurate measure of the severity of an outbreak since R_{0} may fail to represent an epidemic threshold parameter. Thus, measuring R_{0} through contact tracing (as generally occurs during an outbreak investigation), may not help in predicting the severity of the outbreak and may not be a useful measure for determining the strength of the necessary control interventions. Only an epidemic threshold parameter can be used to design control strategies. This parameter can be obtained through fitting an ODE model to population-level data as mentioned above and will signal epidemic growth whether or not it is equal to the average number of secondary infections. However, obtaining an R_{0} value via contact tracing can be very useful in conjunction with population-level epidemic data to understand the possible transmission mechanisms of the epidemic at the individual level. We thus suggest that the role of R_{0} should be more carefully considered, and that a reevaluation of the role of R_{0} may lead to the development of more effective control strategies.

Here we give more details and references about the individual-level models presented in the main text. We also briefly discuss how the concept of right censoring manifests in our simulations.

(0.05 MB PDF)

We thank Virginie Supervie and Justin Okano for stimulating and helpful discussions during the course of this research. We also thank Tiffany Head for assistance with the figures.

_{0}.