Models of epidemics: when contact repetition and clustering should be included
- Timo Smieszek^{1}Email author,
- Lena Fiebig^{2} and
- Roland W Scholz^{1}
https://doi.org/10.1186/1742-4682-6-11
© Smieszek et al; licensee BioMed Central Ltd. 2009
Received: 05 March 2009
Accepted: 29 June 2009
Published: 29 June 2009
Abstract
Background
The spread of infectious disease is determined by biological factors, e.g. the duration of the infectious period, and social factors, e.g. the arrangement of potentially contagious contacts. Repetitiveness and clustering of contacts are known to be relevant factors influencing the transmission of droplet or contact transmitted diseases. However, we do not yet completely know under what conditions repetitiveness and clustering should be included for realistically modelling disease spread.
Methods
We compare two different types of individual-based models: One assumes random mixing without repetition of contacts, whereas the other assumes that the same contacts repeat day-by-day. The latter exists in two variants, with and without clustering. We systematically test and compare how the total size of an outbreak differs between these model types depending on the key parameters transmission probability, number of contacts per day, duration of the infectious period, different levels of clustering and varying proportions of repetitive contacts.
Results
The simulation runs under different parameter constellations provide the following results: The difference between both model types is highest for low numbers of contacts per day and low transmission probabilities. The number of contacts and the transmission probability have a higher influence on this difference than the duration of the infectious period. Even when only minor parts of the daily contacts are repetitive and clustered can there be relevant differences compared to a purely random mixing model.
Conclusion
We show that random mixing models provide acceptable estimates of the total outbreak size if the number of contacts per day is high or if the per-contact transmission probability is high, as seen in typical childhood diseases such as measles. In the case of very short infectious periods, for instance, as in Norovirus, models assuming repeating contacts will also behave similarly as random mixing models. If the number of daily contacts or the transmission probability is low, as assumed for MRSA or Ebola, particular consideration should be given to the actual structure of potentially contagious contacts when designing the model.
Keywords
Background
The spread of infectious disease is determined by an interplay of biological and social factors [1]. Biological factors are, among others, the virulence of an infectious agent, pre-existing immunity and the pathways of transmission. A major social factor influencing disease spread is the arrangement of potentially contagious contacts between hosts. For instance, the distribution of contacts among the members of a population (degree distribution) strongly impacts population spread patterns: Highly connected individuals become infected very early in the course of an epidemic, while those that are nearly isolated become infected very late, if at all [2, 3]. For a high dispersion of the degree distribution, the transmission probability above which diseases spread is lower than for a low dispersion [2–4]. If the degree distribution follows a power law, the transmission probability necessary to sustain a disease even tends to zero [5–7].
Another important structural property influencing the spread of diseases is the clustering of contacts. Clustering deals with how many of an individual's contacts also have contact among each other. High clustering of contacts means more local spread (within cliques) and thus a rapid local depletion of susceptible individuals. In extreme cases, infections get trapped within highly cohesive clusters. Random mixing is known to overestimate the size of an outbreak [8], whereas the local depletion caused by clustering remarkably lowers the rates of disease spread [9, 10]: Clustering results in polynomial instead of exponential growth, which can be expected for unclustered contact structures [11].
For most of the diseases transmitted by droplet particles or through close physical contact, the number of contacts that can be realistically made within the infectious period has a clear upper limit. The mean value of potentially contagious contacts can be interpreted in a meaningful way, since the distribution of daily contacts is unimodal with a clear "typical" number of contacts [12–15]. Potentially dominant properties of the underlying contact structure are the clustering of such contacts and their repetitiveness, i.e. whether contacts repeat within the infectious period or not.
A recent study combining a survey and modelling showed that the repetition of contacts plays a relevant role in the spread of diseases transmitted via close physical contact. Contrarily, the impact of repetitiveness seems to be negligible in case of conversational contacts [16]. However, the generality of these findings is limited, as they are based on a small, unrepresentative sample and as the specific patterns of such contacts vary depending on the national and cultural context [12]. A more theoretical work showed that the dampening effect of contact repetition is further increased by contact clustering and is more pronounced if the number of contacts per day is low [10].
The aim of this paper is to better understand the conditions under which the inclusion of contact repetition and clustering is relevant in models of disease spread compared to a reference case assuming random mixing. This is pertinent, as many researchers still use the random mixing assumption without thoroughly discussing its adequacy for the respective case study [17–21]. In particular, we test and discuss the influence of transmission probability, number of contacts per day, duration of the infectious period, clustering and proportion of repetitive contacts on the total outbreak size of a disease. This helps modellers and epidemiologists make informed decisions on whether the simplifying random mixing assumption provides adequate results for a particular public health problem.
Methods
Stochastic SIR models
Under the random mixing assumption (in mathematical terms denoted by index ran), n contacts are randomly chosen out of the whole population (including susceptible, infectious and recovered individuals) for every individual and every day. There is neither contact repetition nor clustering, as our algorithm ensures, that no contact partner is picked twice by the same individual.
In fact, clustering is neither properly defined nor is it a reasonable concept under the random mixing assumption for theoretical and practical reasons: In this paper we refer to the common definition that the clustering coefficient CC is the ratio of closed triplets to possible triplets [23], where a closed triplet is defined as three individuals with mutual contact. This definition is based on static networks. As in random mixing models contacts change daily, different clustering coefficients could be calculated for every single simulation time step. However, no epidemiologically relevant effect of such clusters could be observed, because any new infection comes into effect only in the following time step when contacts are already rearranged. As a consequence, there is no local depletion of susceptible individuals observable under this definition, even for high clustering coefficients. If clustering would be defined for an extended time interval (e.g., the infectious period), an enormous amount of closed triplets would be necessary to attain only slight clustering coefficients as the total number of contacts over such a long time is very high. For such huge cliques, there is no meaningful interpretation and no analogy in the real world.
Repetitive contacts (in mathematical terms denoted by index rep) are implemented by generating a static network with n links for every individual. The links of this network represent stable, mutual, daily contacts between individuals. As mentioned, the model type assuming repetitive contacts exists in two variants. For the variant without clustering, individuals are linked completely at random. Nonetheless, for repetitive contacts, clustering is a meaningful concept as contacts are static and as clusters correspond to observable entities in the real world: Family or work contacts, for instance, are usually clustered and tend to be highly repetitive. In this paper, predefined average clustering coefficients are achieved by alternately generating random links and triplet closures, as suggested by Eames [10], until the clustering aim is achieved in average for the whole population. When the target value of closed triplets is reached, the network is filled up with random contacts until all individuals have n contacts.
This paper compares most parameter settings for a model assuming either full random mixing or perfect repetitiveness of contacts. This comparison allows for estimating the maximal possible difference between both antipodal simplifications of reality. However, real world dynamics of networks are far more complicated; therein some contacts are repeated daily, others on certain days of the week and others only once in a while. In order to investigate the effect of different proportions of repetitive contacts, we vary the fractions of repetitive contacts.
Parameter space to be tested
In the following section, we describe some important factors in the spread of infectious diseases that will be systematically tested for their influence on the difference between the random mixing model and the model assuming repetitiveness (with and without clustering).
Important biological factors influencing the spread of infectious diseases are the duration of the infectious period τ and the per-contact transmission probability β.
Key transmission parameters of selected diseases
Disease | R _{0} | τ[d] | Transmission pathways[32] |
---|---|---|---|
Chickenpox (Varicella) | 7–12[3] | 10–11[3] | Direct contact, airborne, droplet, contact with infectious material |
Ebola | 1.34[42]^{a} 1.79[43] 1.83[42]^{b} 2.13[43]^{c, a} 3.07[43]^{c, b} | 14[43] | Direct contact, contact with infectious material, monkey-to-person |
Influenza | 1.3; 1.8; 3.1[17]^{d} 1.39[51] 1.58; 2.52; 3.41[52]^{e} 1.7–2.0[53] 2–3[54]^{f} 3.77[55] | 2–3[3] 2.27[55] 3–7[56] | Direct contact, airborne, droplet [57] |
Measles | 5–18[3] 7.17–45.41[33]^{g, h} 7.7[34] 15–17[32] 16.32[33] g | 6–7[3] | Direct contact, airborne, droplet, contact with infectious secretions |
MRSA^{i} | 1.2[41]^{j} | as long as purulent lesions continue to drain[40] | Direct contact, contact with infectious material[40] |
Mumps | 7–14[3] 4.4[35]^{h} 10–12[32] | 4–8[3] | Direct contact, airborne, droplet, contact with infectious secretions |
Norovirus | 3.74[37]^{j} | 1.8[37]^{j} | Direct contact, droplet (vomiting), contaminated food[38, 39]^{k} |
SARS^{k} | 1.43[43]^{l} 1.5[43]^{m} 1.6[47] 2.2–3.7[48] >2.37[49] | 4[49] 5[43] | Close direct contact |
Whooping cough (Pertussis) | 10–18[3] 15–17[32] | 7–10 [3] | Direct contact, airborne, droplet, contact with infectious secretion |
The transmission probability β is defined as the probability that an infectious-susceptible pair results in disease transmission within one single time step of the simulation. β is equal for every infectious-susceptible pair. The effect of β on the impact of repetitive contacts compared to the reference case (without repetitive contacts) is analyzed via systematic variation.
In the results section, we show all results for β·n·τ values instead of pure β values to assure comparability of the outcomes: β·n·τ equals the basic reproduction number R_{0} for the random mixing model and thus models with the same β·n·τ result in a similar total outbreak size. Referring to β·n·τ values assures that model comparisons are always made for a relevant range of β. The effect of repetitive contacts is tested for β·n·τ values between 1.2 and 4.0 in increments of 0.2. The epidemic threshold of random mixing models is β·n·τ = 1.0. As we are only interested in diseases that can cause an epidemic, we set the lower boundary to 1.2. The upper boundary is chosen arbitrarily.
Social factors considered in this paper are the number of contacts per day n, the proportion of repetitive contacts and the clustering coefficient.
For every single simulation run, the number of contacts per day n is constant and equal for all individuals. n counts every contact an individual has within one simulation step, regardless of the alter's infection status (susceptible, infectious or recovered) and regardless of whether the contact is repetitive. The effect of repetitive contacts on the simulation outcome is tested for n values between 4 and 20 with a step width of 2 (mean values for conversational contacts lie in this range [12]).
In order to investigate the effect of varying fractions of repetitive contacts, we simulate the total outbreak size for 0%, 25%, 50%, 75% and 100% repetitive contacts. Thereby, 25% repetitive contacts means that one fourth of all contacts on a given day repeat daily but that three fourth of the contacts on a given day are unique.
In the case of repetitive contacts, clustering coefficients between CC = 0.0 and 0.6 with a step width of 0.2 are accounted for. This span covers a wide range of existing transmission systems from highly infectious diseases with a high number of contacts per day and with clustering coefficients close to zero to highly structured settings with a considerable proportion of clustered contacts like in hospitals [24].
For all runs of the simulation model, the total population N was fixed to 20000 individuals. As initial seed 15 randomly chosen individuals are set to infectious every simulation run. For each combination of model parameters 350 runs were performed to achieve stable mean values of the outcome variables. A simulation run was terminated when no infectious individual was left.
Overview on performed analyses
Parameter settings of the analyses
n | τ[d] | β·n·τ | CC | Proportion repetitive contacts | |
---|---|---|---|---|---|
Analysis 1 | |||||
a | 4 – 20; 2 | 2 – 14; 1 | 1.6 | .0 | .0 vs. 1.0 |
b | 4 – 20; 2 | 14 | 1.2 – 4.0; .2 | .0 | .0 vs. 1.0 |
c | 4 | 2 – 14; 1 | 1.2 – 4.0; .2 | .0 | .0 vs. 1.0 |
Analysis 2 | 4 – 20; 2 | 14 | 1.2 – 4.0; .2 | .0 – .6; .2 | .0 vs. 1.0 |
Analysis 3 | 8 – 20; 4 | 14 | 1.2 – 3.0; .6 | .0 – .6; .2 | .0 – 1.0; .25 |
In addition to the total outbreak size, we present further epidemiologically relevant indicators in the additional files. Epidemic curves can be found in additional file 2, findings on the model differences regarding the average peak size of the outbreaks and the average time to peak are given in additional file 3.
Results and discussion
Analysis 1: The effect of contact repetition depending on τ, n and β
Effect of contact number
The increasing difference between and with decreasing n can be explained by two lines of reasoning.
First, in the case of contact repetition, there is always at least one out of the n contacts per day that is already infected (and thus not available for new infection): As contacts are stable over time, the infector of a susceptible individual is included in the subsequent contact list of that individual even when said individual has changed to the infectious state. Thus, at the least, the contact that originally transmitted the infection is not susceptible. In contrast, contacts change in every time step under the random mixing assumption: Hence, the infector is not more likely to appear in the contact set than any other individual. This difference between and is more pronounced for small n because one non-susceptible individual out of a small set of contacts means a relatively higher decrease in local resources than does one out of a large set of contacts.
Secondly, any new infection means that the infector will have one susceptible contact less for all subsequent time steps. This local depletion of resources is more pronounced for small n for the same reason as in the first argument. Further, stochasticity acts stronger in small local environments than in large ones [25].
In this equation the number of susceptible individuals in the local environment is reduced by 1 compared to the random mixing case, as we assume that every contact except the one that originally transmitted the infection is susceptible. This number of susceptible individuals (n - 1) is multiplied by the probability that such an individual becomes infected during the infectious period τ. As (n - 1) is smaller than n and [1 - (1 - β)^{ τ }] is smaller (or equal for τ = 1) than β·τ, the expected number of secondary cases caused by an infectious individual in a population with a huge number of susceptible and few infected ones is always smaller in the repetitive case.
Effect of the per-contact transmission probability
The difference between and decreases rapidly with increasing β. The reason is that practically every individual will be reached and infected in case of large transmission probabilities, regardless of the underlying contact structure. Differences between both models may appear in the shape of the outbreak curve (cf. to additional files 2 and 3), but in terms of I_{ tot }both models are equivalent. In case of small transmission probabilities, differences in the effective number of secondary cases generated by an infectious individual can become visible, as only a fraction of the whole population will be infected under both assumptions.
Effect of the infectious period
As expected, the difference between and increases with increasing τ. However, the change in difference is largest for Δτ in a range of low τ values, but is almost irrelevant for high values of τ. This observation is explained by the τ-dependence of R_{0,rep}(equation 1, see also figure 3b): The longer the infectious period, the smaller the chances for a specific contact to remain uninfected. However, this increase in individual infection probability is partly compensated by a lower per-day transmission probability, which is needed to achieve constant R_{0,ran}. The interaction of these antagonistic effects results in a stabilization of R_{0,rep}/R_{0,ran}for a large τ.
Analysis 2: The effect of contact repetition combined with clustering depending on n and β
The further dampening of disease spread by clustering can be explained by increased locality of resources: While repetition limits the number of available susceptible individuals by keeping previously infected ones in the set of contacts, clustering reduces the number of susceptible contacts because there is a higher likelihood that contacts of an infector have already become infected by others during the infectious period, as infections spread rapidly within cliques. The reason why this effect is more pronounced for small n rather than for large n is the same as in the case of unclustered, pure contact repetition: Any reduction of susceptible individuals in the set of contacts weights relatively stronger in the case of few contacts than in the case of many. The right shift of the peak of can be explained by the increased transmission probability β needed to pass the epidemic threshold under increased clustering compared to the constantly low levels of β necessary under the random mixing assumption [26].
Analysis 3: Varying proportions of contact repetition, clustering and β
One mechanism driving this non-linear relation when clustering is present is the local depletion of resources. Repetitive contacts of an infector have a much higher chance of becoming infected than do non-repetitive contacts. Moreover, if these repetitive contacts are also highly clustered, it is likely that the disease will become trapped in those cohesive social subgroups. However, if only a few non-repetitive, non-clustered contacts are added per day, the chances of spreading the disease between otherwise unrelated regions of the social network greatly increase.
Limitations
This paper systematically investigates a variety of epidemiologically relevant parameters needed to describe real-world transmission systems of diseases spread by droplet particles or direct physical contact. However, real-world social and biological processes involved in the transmission of infectious diseases are far more complex than captured by the archetypical model structures presented. Conceptual decisions and simplifications which could have potentially influenced the results are critically discussed in the following:
Model structure
We designed our two model types as SIR models, assuming that every individual is either susceptible, infectious or immune with respect to a certain disease. Transitions are only allowed from susceptible to infectious or from infectious to immune. The SIR structure is a fairly good representation for many diseases which lead to full immunity after recovery (e.g., measles). However, many diseases require other representations, as relevant intermediate states need to be covered, e.g., as with a long latency period in SEIR (Susceptible-Exposed-Infectious-Recovered) models. Another common deviation from the SIR structure arises, when recovery confers only partial or no immunity. In such cases, SIS (Susceptible-Infectious-Susceptible) representations are often chosen. In SIR or SEIR models, a total outbreak size can be defined (because the disease fades out at the end of an epidemic), whereas SIS models typically achieve an equilibrium I(t) in the long run, but the disease does not die out. Despite all the differences in model behaviour, we expect the rough picture to be the same for SIR, SEIR and SIS models, as the mechanisms behind the observed differences for SIR models that we discussed also apply to SIS and SEIR models. Thus, the general conclusions derived in this paper should also hold true for these model types.
Degree distribution
The number of daily contacts n is fixed and equal for the entire population in both modelling approaches presented. This is a reasonable simplification for the purpose of this paper, as it keeps the investigated number of interactions manageable. However, in real world systems, the number of daily contacts appears to follow a negative binomial distribution [12, 14] with some people having a relatively high number of contacts and others being almost isolated. It is known that the variance of the degree distribution impacts the spread of infectious disease, for instance, by decreasing the transmission probability needed to cause an epidemic [27]. Particularly relevant for the difference between random mixing models and models accounting for contact repetition and clustering are the correlations between the number of contacts per day and contact repetition and clustering, respectively. It is plausible to assume that individuals with many contacts tend to also have many unrepeated contacts, whereas individuals with few contacts tend to have disproportionately high levels of repetitive contacts. If the proportion of repetitive contacts and clustering is correlated with the number of contacts, individuals with few contacts are likely to be dead-end streets for infectious diseases. In contrast, highly connected individuals could be structurally more important than expected, as they bridge distinct cliques.
Occasional contact repetition
In our simulations, contacts repeat either daily or never. Intermediate states between both extremes of complete random mixing and complete contact repetition have been investigated by combining both models in defined proportions. However, in reality, specific persons can be met at any frequency between never and daily. It is plausible to assume that intermediate frequencies reduce the effect of repetitiveness depending on the duration of the infectious period τ: For short infectious periods, those with low contact frequencies might appear as unrepeated contacts whereas they unfold their full dampening potential for long infectious periods.
Contact intensity and duration
In our models all contacts between an infector and a susceptible individual are equally likely to result in the transmission of the infectious disease. This simplification is not a good representation of the real world: The transmission probability depends on the amount of infectious material ingested by a susceptible person [28, 29]. The uptake correlates with contact duration and intensity. Contact duration is long for highly repetitive contacts, while unrepeated contacts tend to have short duration (unpublished data). Accordingly, it can be expected that the interaction of clustering, contact repetitiveness and contact duration leads to a rapid infection of all closely tied clusters (primarily families, then workgroups and cliques at school and childcare institutions), leaving behind the people connected via mainly short, unclustered, occasional contacts.
Distribution of infectious period
The infectious period τ is fixed in our model, which contrasts to the design of classical mean-field models assuming exponentially distributed infectious periods [3, 22]. Keeling and Grenfell argue that R_{0} is smaller for exponential period models than for fixed period models under otherwise identical conditions, because individuals with a long τ rapidly exhaust the susceptible in their local neighbourhood and, therefore, cannot compensate for the large majority of individuals with extremely short infectious periods [25, 30]. However, the often assumed exponential distribution is highly unrealistic, as observed infectious periods tend to be closely centred around a mean period and are thus less dispersed [31]. Thus, assuming a fixed infectious period is a reasonable simplification of the reality that is not likely to have a major influence on as only very few individuals will use up their local susceptible resources during the infectious period in most cases. Moreover, if the infection probability is high enough to exploit almost the entire local environment (such that deviations of τ could affect the individual reproduction ratio), will reach the order of magnitude of the population size in either the fixed or the exponential case.
Implications for some exemplar diseases
Information on the per-contact transmission rate β and the number of potentially contagious contacts n is often not easily accessible or available and has to be measured (or fitted) if included in models of disease spread. However, rough estimates of both variables can be obtained when R_{0} estimates are available and when the possible pathways of transmission are known, because β and n are linked to the basic reproduction number by R_{0,ran}= β·n·τ and the possible pathways reveal information on the possible number and structure of contacts at risk: At one extreme there is transmission via close physical contacts, which correlate mostly with intense social relations and are typically rare, repetitive and highly clustered. The other extreme is airborne transmission via tiny droplet nuclei that remain suspended indoors for a long time. In this case, vast numbers of persons can potentially be exposed, and such casual contacts are neither highly repetitive nor strongly clustered.
Table 1 provides information about the infectious period τ, R_{0} estimates and the possible pathways of transmission for a variety of infectious diseases. The implications of clustering and contact repetition for models of the diseases listed in this table are discussed below.
Typical childhood diseases like mumps, measles, pertussis (whopping cough) or chickenpox have comparatively high R_{0} estimates [3, 32–35], which means that one infector generates many secondary cases if a sufficient number of susceptible contact partners are available. These diseases are highly communicable – in fact, measles is one of the most highly communicable diseases in the world [36] – and thus, very short and non-intense contacts have the potential to confer infection. Accordingly, both the number of contacts per day n and the per-contact transmission probability β are very high. We further assume that a high proportion of the contacts are casual contacts, because the threshold for a contact to be potentially contagious is very low with respect to duration and intensity. Consequently, the levels of repetitiveness and clustering are low, which means that the contact patterns for such childhood diseases are structurally similar to random mixing. Considering that high numbers of daily contacts n make both types of models that we discussed behave similarly and considering that under high transmission probabilities β almost every individual will be reached, random mixing models achieve almost the same results as more elaborate models including a certain amount of contact repetition and clustering. Also in case of Norovirus, the difference is probably small, as the infectious period of this infectious agent is very short [37] and as at the same time the basic reproduction number is comparatively high [37] (because the disease is easily communicable [38, 39]).
On the other side, there are diseases with comparatively low R_{0} estimates and typically low numbers of contacts that still qualify for potential transmission. Methicillin-resistant Staphylococcus aureus (MRSA), for instance, is an infectious agent mostly transmitted in health care and nursing institutions. It needs close physical contact for transmission [40] and R_{0} estimates given in the literature are close to the epidemic threshold [41]. Accordingly, both β and n are low. At the same time, health care settings tend to be highly structured regarding who cares for whom and who shares a room with whom. Hence, high levels of contact repetitiveness and clustering can be assumed [24]. Modelling MRSA under the random mixing assumption is likely to overestimate the total number of cases for given n, β and τ. If, in contrast, a random mixing model is fitted to measured data from an outbreak, either the infectivity or the number of potentially infectious contacts will be underestimated to meet the measured outbreak size. A similar argumentation applies to Ebola, which is transmitted via direct contact with infected blood, secretions, organs or semen (thus, n is rather low) and seems to be only moderately infectious [42–45]. As a consequence, random mixing models of Ebola [46] are of limited validity.
Finally, there are some diseases not easily attributable to one or the other class. Severe Acute Respiratory Syndrome (SARS) and Influenza, for instance, have a range of R_{0} estimates between 1.43 and 3.7 [43, 47–50] and between 1.3 and 3.77 [17, 51–56], respectively. No definite consensus has been reached on whether Influenza is transmitted predominantly by large droplets and close contact or by very small droplets that disseminate quickly and stay suspended in indoor air for a long time [57]. In the latter case, a large amount of people would be at risk of infection, so random mixing would be a reasonable approximation of the real contact patterns. In the case of transmission by close contact and large droplets (that fall out quickly), the mean number of potentially contagious contacts per day lies between 8 and 18, depending on the national and cultural context [12]. Considering that not all contacts are equally likely to transmit influenza, but that long and intense contacts (such as household contacts [58]) are more prone to do so and that such contacts also tend to be more repetitive and clustered, it is likely that random mixing models also overestimate the outbreak size for given n, β and τ. However, problems will definitely arise when the impact of social distancing measures (decrease of n) or of antiviral treatment (decrease of β) are estimated under the random mixing assumption: Both interventions will be much more effective in a more elaborate model than in a random mixing model when n, β and τ are the same for both model types. This argumentation is consistent with recent findings on the impact of other network properties on influenza spread: Heterogeneity in degree distribution does not influence the outbreak size in case of highly contagious influenza strains, but does so for moderately contagious strains; however, it does influence the total outbreak size when interventions are simulated – even in case of highly contagious strains [4].
Conclusion
Real-world contact patterns are complex. They typically show all kinds of intermediate states ranging from contacts repeating on a daily basis to and never again. There are various clearly defined, cohesive groups with typically high intra-group clustering coefficients (e.g. households, workgroups, peer groups at school) and, at the same time, random contacts, e.g., in a leisure setting. Moreover, contacts differ in intensity and duration, which further complicates the dynamics of disease spread in such settings. This paper simplifies these complex patterns to a manageable model and parameter space that can be investigated systematically. Our research applies to diseases transmitted via conversational or direct contact, for which a typical number of contacts per day can be defined. For such diseases, our findings can help modellers judge whether a specific transmission system consisting of a specific infectious agent and a specific human system at risk can be represented by a simple random mixing model or if more elaborate models are necessary.
Random mixing models result in acceptable estimates of the total outbreak size even if the real world contacts are highly repetitive and clustered
• if the number of potentially infectious contacts per day is high and
• if the transmission probability for a single infectious-susceptible pair is high and
• particularly, if the infectious period is just one to three days.
If the number of contacts per day or the transmission probability is low, particular consideration should be given to the actual structure of potentially contagious contacts in designing the model.
Declarations
Acknowledgements
TS is funded by the Swiss National Science Foundation (project 320000-114122), LF is funded by the Swiss Federal Veterinary Office (project 1.07.05). We thank Jan Hattendorf, Esther Schelling and four anonymous reviewers, who made valuable comments that helped to improve the quality of this paper. Further we thank Devon D. Brewer, Istvan Z. Kiss, Peter de Haan, Fadri Gottschalk and Philippe Peter, whose support and comments in earlier stages of this research is greatly acknowledged. Sandro Bösch created the final layout of the figures. Stephanie Keller revised the language of this paper.
Authors’ Affiliations
References
- Koopman JS: Infection transmission science and models. Jpn J Infect Dis. 2005, 58: S3-8.PubMedGoogle Scholar
- Hethcote HW, Yorke JA: Gonorrhea transmission dynamics and control. 1984, Berlin: SpringerView ArticleGoogle Scholar
- Anderson RM, May RM: Infectious diseases of humans: dynamics and control. 1991, Oxford, UK: Oxford University PressGoogle Scholar
- Duerr HP, Schwehm M, Leary CC, De Vlas SJ, Eichner M: The impact of contact structure on infectious disease control: influenza and antiviral agents. Epidemiol Infect. 2007, 135: 1124-1132. 10.1017/S0950268807007959.PubMed CentralView ArticlePubMedGoogle Scholar
- Pastor-Satorras R, Vespignani A: Epidemic spreading in scale-free networks. Phys Rev Lett. 2001, 86: 3200-3203. 10.1103/PhysRevLett.86.3200.View ArticlePubMedGoogle Scholar
- Kiss IZ, Green DM, Kao RR: Infectious disease control using contact tracing in random and scale-free networks. J R Soc Interface. 2006, 3: 55-62. 10.1098/rsif.2005.0079.PubMed CentralView ArticlePubMedGoogle Scholar
- Keeling MJ, Eames KTD: Networks and epidemic models. J R Soc Interface. 2005, 2: 295-307. 10.1098/rsif.2005.0051.PubMed CentralView ArticlePubMedGoogle Scholar
- Zaric GS: Random vs. nonrandom mixing in network epidemic models. Health Care Manag Sci. 2002, 5: 147-155. 10.1023/A:1014489218178.View ArticlePubMedGoogle Scholar
- Keeling MJ: The effects of local spatial structure on epidemiological invasions. Proc R Soc Lond B. 1999, 266: 859-869. 10.1098/rspb.1999.0716.View ArticleGoogle Scholar
- Eames KTD: Modelling disease spread through random and regular contacts in clustered populations. Theor Popul Biol. 2008, 73: 104-111. 10.1016/j.tpb.2007.09.007.View ArticlePubMedGoogle Scholar
- Szendrói B, Csányi G: Polynomial epidemics and clustering in contact networks. Proc Biol Sci. 2004, 271 (Suppl 5): S364-S366. 10.1098/rsbl.2004.0188.PubMed CentralView ArticlePubMedGoogle Scholar
- Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, Massari M, Salmaso S, Tomba GS, Wallinga J, Heijne J, Sadkowska-Todys M, Rosinska M, Edmunds WJ: Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 2008, 5 (3): e74-10.1371/journal.pmed.0050074.PubMed CentralView ArticlePubMedGoogle Scholar
- Beutels P, Shkedy Z, Aerts M, Van Damme P: Social mixing patterns for transmission models of close contact infections: exploring self-evaluation and diary-based data collection through a web-based interface. Epidemiol Infect. 2006, 134: 1158-1166. 10.1017/S0950268806006418.PubMed CentralView ArticlePubMedGoogle Scholar
- Mikolajczyk RT, Akmatov MK, Rastin S, Kretzschmar M: Social contacts of school children and the transmission of respiratory-spread pathogens. Epidemiol Infect. 2008, 136: 813-822. 10.1017/S0950268807009181.PubMed CentralView ArticlePubMedGoogle Scholar
- Edmunds WJ, O'Callaghan CJ, Nokes DJ: Who mixes with whom? A method to determine the contact patterns of adults that may lead to the spread of airborne infections. Proc Biol Sci. 1997, 264 (1384): 949-957. 10.1098/rspb.1997.0131.PubMed CentralView ArticlePubMedGoogle Scholar
- Read JM, Eames KTD, Edmunds WJ: Dynamic social networks and the implications for the spread of infectious disease. J R Soc Interface. 2008, 5: 1001-1007. 10.1098/rsif.2008.0013.PubMed CentralView ArticlePubMedGoogle Scholar
- Sertsou G, Wilson N, Baker M, Nelson P, Roberts MG: Key transmission parameters of an institutional outbreak during the 1918 influenza pandemic estimated by mathematical modelling. Theor Biol Med Model. 2006, 3: 38-10.1186/1742-4682-3-38.PubMed CentralView ArticlePubMedGoogle Scholar
- Nishiura H, Brockmann SO, Eichner M: Extracting key information from historical data to quantify the transmission dynamics of smallpox. Theor Biol Med Model. 2008, 5: 20-10.1186/1742-4682-5-20.PubMed CentralView ArticlePubMedGoogle Scholar
- Ray KJ, Porco TC, Hong KC, Lee DC, Alemayehu W, Melese M, Lakew T, Yi E, House J, Chidambaram JD, Whitcher JP, Gaynor BD, Lietman TM: A rationale for continuing mass antibiotic distributions for trachoma. BMC Infect Dis. 2007, 7: 91-10.1186/1471-2334-7-91.PubMed CentralView ArticlePubMedGoogle Scholar
- Gani R, Leach S: Transmission potential of smallpox in contemporary populations. Nature. 2001, 414: 748-751. 10.1038/414748a.View ArticlePubMedGoogle Scholar
- Nagelkerke NJD, Moses S, de Vlas SJ, Bailey RC: Modelling the public health impact of male circumcision for HIV prevention in high prevalence areas in Africa. BMC Infect Dis. 2007, 7: 16-10.1186/1471-2334-7-16.PubMed CentralView ArticlePubMedGoogle Scholar
- Kermack WO, McKendrick AG: A contribution to the mathematical theory of epidemics. Proc R Soc Lond A. 1927, 115: 700-721. 10.1098/rspa.1927.0118.View ArticleGoogle Scholar
- Watts DJ, Strogatz SH: Collective dynamics of 'small-world' networks. Nature. 1998, 393: 440-442. 10.1038/30918.View ArticlePubMedGoogle Scholar
- Liljeros F, Giesecke J, Holme P: The contact network of inpatients in a regional healthcare system. A longitudinal case study. Math Popul Stud. 2007, 14: 269-284. 10.1080/08898480701612899.View ArticleGoogle Scholar
- Keeling MJ, Grenfell BT: Individual-based perspectives on R-0. J Theor Biol. 2000, 203: 51-61. 10.1006/jtbi.1999.1064.View ArticlePubMedGoogle Scholar
- Aparicio JP, Pascual M: Building epidemiological models from R-0: an implicit treatment of transmission in networks. Proc R Soc Lond B. 2007, 274: 505-512. 10.1098/rspb.2006.0057.View ArticleGoogle Scholar
- Bansal S, Grenfell BT, Meyers LA: When individual behaviour matters: homogeneous and network models in epidemiology. J R Soc Interface. 2007, 4: 879-891. 10.1098/rsif.2007.1100.PubMed CentralView ArticlePubMedGoogle Scholar
- Wells WF: Airborne contagion and air hygiene: an ecological study of droplet infections. 1955, Cambridge, MA: Harvard University PressGoogle Scholar
- Haas CN, Rose JB, Gerba CP: Quantitative microbial risk assessment. 1999, New York: John Wiley & SonsGoogle Scholar
- Keeling MJ, Grenfell BT: Disease extinction and community size: Modeling the persistence of measles. Science. 1997, 275: 65-67. 10.1126/science.275.5296.65.View ArticlePubMedGoogle Scholar
- Lloyd AL: Realistic Distributions of Infectious Periods in Epidemic Models: Changing Patterns of Persistence and Dynamics. Theor Popul Biol. 2001, 60: 59-71. 10.1006/tpbi.2001.1525.View ArticlePubMedGoogle Scholar
- Heymann DL: Control of communicable disease manual. 2004, Washington, D.C.: American Public Health Association, 18Google Scholar
- Wallinga J, Levy-Bruhl D, Gay NJ, Wachmann CH: Estimation of measles reproduction ratios and prospects for elimination of measles by vaccination in some Western European countries. Epidemiol Infect. 2001, 127: 281-295. 10.1017/S095026880100601X.PubMed CentralView ArticlePubMedGoogle Scholar
- Mossong J, Muller CP: Estimation of the basic reproduction number of measles during an outbreak in a partially vaccinated population. Epidemiol Infect. 2000, 124: 273-278. 10.1017/S0950268899003672.PubMed CentralView ArticlePubMedGoogle Scholar
- Edmunds WJ, Gay NJ, Kretzschmar M, Pebody RG, Wachmann H: The pre-vaccination epidemiology of measles, mumps and rubella in Europe: implications for modelling studies. Epidemiol Infect. 2000, 125: 635-650. 10.1017/S0950268800004672.PubMed CentralView ArticlePubMedGoogle Scholar
- Moss WJ, Griffin DE: Global measles elimination. Nat Rev Microbiol. 2006, 4: 900-908. 10.1038/nrmicro1550.View ArticlePubMedGoogle Scholar
- Vanderpasa J, Louisa J, Reynders M, Mascarta G, Vandenberg O: Mathematical model for the control of nosocomial norovirus. J Hosp Inf. 2009, 71: 214-222. 10.1016/j.jhin.2008.11.024.View ArticleGoogle Scholar
- Duizer E, Koopmans M: Tracking foodborne viruses: lessons from noroviruses. Emerging foodborne pathogens. Edited by: Motarjemi Y, Adams M. 2006, Boca Raton (FL): CRC Press, 77-110.View ArticleGoogle Scholar
- Evans MR, Meldrum R, Lane W, Gardner D, Ribeiro CD, Gallimore CI, Westmoreland D: An outbreak of viral gastroenteritis following environmental contamination at a concert hall. Epidemiol Infect. 2002, 129: 355-360. 10.1017/S0950268802007446.PubMed CentralView ArticlePubMedGoogle Scholar
- Material safety data sheet: Staphylococcus aureus.http://www.phac-aspc.gc.ca/msds-ftss/msds143e-eng.phphttp://www.phac-aspc.gc.ca/msds-ftss/msds143e-eng.php
- Bootsma MC, Diekmann O, Bonten MJ: Controlling methicillin-resistant Staphylococcus aureus: quantifying the effects of interventions and rapid diagnostic testing. Proc Natl Acad Sci USA. 2006, 103: 5620-5625. 10.1073/pnas.0510077103.PubMed CentralView ArticlePubMedGoogle Scholar
- Chowell G, Hengartner NW, Castillo-Chavez C, Fenimore PW, Hyman JM: The basic reproductive number of Ebola and the effects of public health measures: the cases of Congo and Uganda. J Theor Biol. 2004, 229: 119-126. 10.1016/j.jtbi.2004.03.006.View ArticlePubMedGoogle Scholar
- Ferrari MJ, Bjornstad ON, Dobson AP: Estimation and inference of R0 of an infectious pathogen by a removal method. Math Biosci. 2005, 198: 14-26. 10.1016/j.mbs.2005.08.002.View ArticlePubMedGoogle Scholar
- Oyok T, Odonga C, Mulwani E, Abur J, Kaducu F, Akech M, Olango J, Onek P, Turyanika J, Mutyaba I, Luwaga HRS, Bisoborwa G, Kaguna A, Omaswa FG, Zaramba S, Okware S, Opio A, Amandua J, Kamugisha J, Mukoyo E, Wanyana J, Mugero C, Lamunu M, Mugaga M, Kiyonga C: Outbreak of ebola Hemorrhagic Fever – Uganda, Augsut 2000–January 2001. MMWR. 2001, 50: 73-77.Google Scholar
- Khan AS, Tshioko FK, Heymann DL, Le Guenno B, Nabeth P, Kerstiens B, Fleerackers Y, Kilmarx PH, Rodier GR, Nkuku O, Rollin PE, Sanchez A, Zaki SR, Swanepoel R, Tomori O, Nichol ST, Peters CJ, Muyembe-Tamfum JJ, Ksiazek TG: The reemergence of Ebola hemorrhagic fever, Democratic Republic of the Congo, 1995. Commission de Lutte contre les Epidemies a Kikwit. J Infect Dis. 1999, 179 (Suppl 1): S76-86. 10.1086/514306.View ArticlePubMedGoogle Scholar
- Legrand J, Grais RF, Boelle PY, Valleron AJ, Flahault A: Understanding the dynamics of Ebola epidemics. Epidemiol Infect. 2007, 135: 610-621. 10.1017/S0950268806007217.PubMed CentralView ArticlePubMedGoogle Scholar
- Meyers LA: Contact network epidemiology: Bond percolation applied to infectious disease prediction and control. Bull Am Math Soc. 2007, 44: 6-Google Scholar
- Riley S, Fraser C, Donnelly CA, Ghani AC, Abu-Raddad LJ, Hedley AJ, Leung GM, Ho LM, Lam TH, Thach TQ, Chau P, Chan KP, Leung PY, Tsang T, Ho W, Lee KH, Lau EMC, Ferguson NM, Anderson RM: Transmission dynamics of the etiological agent of SARS in Hong Kong: Impact of public health interventions. Science. 2003, 300: 1961-1966. 10.1126/science.1086478.View ArticlePubMedGoogle Scholar
- Wang J, McMichael AJ, Meng B, Becker NG, Han W, Glass K, Wu J, Liu X, Liu J, Li X, Zheng X: Spatial dynamics of an epidemic of severe acute respiratory syndrome in an urban area. Bull World Health Organ. 2006, 84: 965-968. 10.2471/BLT.06.030247.PubMed CentralView ArticlePubMedGoogle Scholar
- Disease Outbreak News.http://www.who.int/csr/don/archive/disease/severe_acute_respiratory_syndrome/en/http://www.who.int/csr/don/archive/disease/severe_acute_respiratory_syndrome/en/
- Gani R, Hughes H, Fleming D, Griffin T, Medlock J, Leach S: Potential impact of antiviral drug use during influenza pandemic. Emerg Infect Dis. 2005, 11: 1355-1362.PubMed CentralView ArticlePubMedGoogle Scholar
- Nishiura H: Time variations in the transmissibility of pandemic influenza in Prussia, Germany, from 1918–19. Theor Biol Med Model. 2007, 4: 20-10.1186/1742-4682-4-20.PubMed CentralView ArticlePubMedGoogle Scholar
- Ferguson NM, Cummings DA, Fraser C, Cajka JC, Cooley PC, Burke DS: Strategies for mitigating an influenza pandemic. Nature. 2006, 442: 448-452. 10.1038/nature04795.View ArticlePubMedGoogle Scholar
- Mills CE, Robins JM, Lipsitch M: Transmissibility of 1918 pandemic influenza. Nature. 2004, 432: 904-906. 10.1038/nature03063.View ArticlePubMedGoogle Scholar
- Wearing HJ, Rohani P, Keeling MJ: Appropriate models for the management of infectious diseases. PLoS Med. 2005, 2: e174-10.1371/journal.pmed.0020174.PubMed CentralView ArticlePubMedGoogle Scholar
- Davis BD, Dulbecco R, Eisen HN, Ginsberg HS: Microbiology. 1980, New York: Harper & RowGoogle Scholar
- Brankston G, Gitterman L, Hirji Z, Lemieux C, Gardam M: Transmission of influenza A in human beings. Lancet Infect Dis. 2007, 7: 257-265. 10.1016/S1473-3099(07)70029-4.View ArticlePubMedGoogle Scholar
- Ferguson NM, Cummings DAT, Cauchemez S, Fraser C, Riley S, Meeyai A, Iamsirithaworn S, Burke DS: Strategies for containing an emerging influenza pandemic in Southeast Asia. Nature. 2005, 437: 209-214. 10.1038/nature04017.View ArticlePubMedGoogle Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.