1. Main points
Instances of reinfection with coronavirus (COVID-19) were rare in the survey.
There is evidence that COVID-19 reinfections are milder than initial infections.
Viral load was lower in reinfections than initial infections.
Self-reported symptoms were less common in episodes of reinfection than in initial infections.
2. Overview
This article presents analysis on the number of COVID-19 reinfections identified among participants of the Coronavirus (COVID-19) Infection Survey and the characteristics of reinfections in relation to viral load (as inferred from a cycle threshold (Ct) value) and presence of self-reported symptoms.
To identify reinfections, we have included all data from across the UK collected since the survey commenced on 26 April 2020 up to 5 June 2021.
The risk of reinfection varies from person to person, depending on when they were first infected. People who were first infected in the early part of the survey have had more opportunity to become reinfected compared with someone who has experienced their first infection more recently. Therefore, this analysis uses "participant days at risk" to determine the number of reinfections.
Overall, there were an estimated 15.2 reinfections for every 100,000 participant days at risk (95% confidence interval: 12.7 to 18.0), averaged for the entire at-risk period. This means that once participants had accumulated 100,000 days at risk of reinfection between them, we would expect there to have been around 15 reinfections in that time.
The estimated rate for reinfections with a strong positive test (with Ct less than 30) was 2.7 per 100,000 participant days at risk (95% confidence interval: 1.8 to 4.1) up to 5 June 2021. This means that once participants had accumulated 100,000 days at risk of reinfection between them, we would expect there to have been around three reinfections with a strong positive test (indicating a high viral load).
The risk of reinfection peaked around the first month after becoming at risk for reinfection and declined afterwards. The estimated rate for all reinfections was 5.0 per 100,000 participant days at risk (95% confidence interval: 3.1 to 8.1) 90 days after becoming at risk for a reinfection.
Reinfection episodes, on average, had a lower viral load (median: 32.8, interquartile range: 30.6 to 34.0) than initial infection episodes (median: 24.5, interquartile range: 19.7 to 31.2).
More individuals reported symptoms in their initial infection episode within 35 days after the first observed positive test (57.1%, 95% confidence interval: 48.6% to 65.2%) than in their reinfection episode (21.8%, 95% confidence interval: 15.6% to 29.6%).
Back to table of contents3. Number of reinfections identified
Key definitions relating to reinfection, at-risk period and initial episodes
The sample for this analysis is individuals in the survey who have had at least one positive test recorded in the survey and meet our criteria for being "at risk" of reinfection where:
90 days has elapsed since an individual's first positive test in the survey and their most recent test result was negative
if 90 days has not passed since their first positive test in the survey, the individual's last positive test has been followed by four consecutive negative tests
For the purposes of this analysis, we refer to these individuals as "at risk" of reinfection. An individual being classified as "at risk" reflects that it is possible for a positive test of theirs to be considered a reinfection. The "at-risk period" refers to the period following the first time we could have defined a reinfection.
A reinfection is therefore defined as when an individual who meets these criteria has a positive test. The 90-day threshold is arbitrary but follows recommendations from Public Health England and the Centre for Disease Control and Prevention. Our definition of reinfection accounts for intermittent reverse transcriptase polymerase chain reaction (RT-PCR) positivity and reduces the likelihood that we are misinterpreting a longer initial infection as a reinfection, whilst also attempting to capture earlier reinfections.
Standard definitions of reinfection rely on whole genome sequencing of both initial and subsequent positive swab tests, or on testing positive for antibodies at a previous blood test, neither of which are currently available for most participants in the survey. It is important to note that no definition of reinfection using positive swab tests alone can differentiate reinfections as distinct from ongoing detection of virus, including after intermittent negative tests. SARS-CoV-2 RT-PCR positivity may persist for prolonged periods (up to 90 days in a small minority of individuals) following initial infection without indicating a viable virus. RT-PCR tests may also be intermittently negative despite continued detection of the virus.
Participants will have had their first positive swab in the survey at varying times and will subsequently have become at risk of reinfection at varying times.
For the purposes of this analysis, "initial episode" refers to an individual's first episode of COVID-19 recorded in the survey and "reinfection episode" refers to a subsequent infection episode meeting our reinfection criteria. Further definitions can be found in the glossary.
Reinfection rates
Reinfections, as identified in the survey, have been rare up to 5 June 2021. The rate of reinfections is provided per 100,000 participant days of risk. This is calculated by dividing the number of reinfections identified by the cumulative number of participants days from the first time a participant became at risk for reinfection to their last test in the survey to date. The estimated rate for all reinfections was 15.2 per 100,000 participant days at risk (95% confidence interval: 12.7 to 18.0) over the entire at-risk period. The estimated rate for reinfections with a strong positive test (with Ct less than 30) was 2.7 per 100,000 participant days at risk (95% confidence interval: 1.8 to 4.1) over the entire at-risk period (accompanying dataset 1a).
Definition | Number of participants at risk | Number of identified reinfections | Estimated rate of reinfections (per 100,000 participant days at risk) | Lower 95% confidence interval | Upper 95% confidence interval |
---|---|---|---|---|---|
All reinfections definition | 17,434 | 133 | 15.2 | 12.7 | 18.0 |
Reinfections with Ct less than 30 | 17,434 | 24 | 2.7 | 1.8 | 4.1 |
Download this table Table 1: Rate of reinfections per 100,000 participant days at risk
.xls .csvNotes:
For the purposes of this analysis, we define reinfection as a new positive test 90 days or more after an initial first positive test which was preceded by at least one negative test, or where an individual has had a subsequent positive test following four consecutive negative tests regardless of the time since the first positive.
A confidence interval gives an indication of the degree of uncertainty of an estimate, showing the precision of a sample estimate. The 95% confidence intervals are calculated so that if we repeated the study many times, 95% of the time the true unknown value would lie between the lower and upper confidence limits. A wider interval indicates more uncertainty in the estimate.
Flexible parametric models were used to investigate how the overall rate of reinfection varied over time after an individual first became at risk. The rate of reinfections was highest in the month after individuals first became at risk for reinfection (accompanying dataset 1b). Increased risk at the start of the at-risk period may reflect the fact that some early reinfections could still be the same initial infection in individuals who had intermittent positive and negative RT-PCR tests over a long period of time. After the first month, the reinfection rate declined. After 90 days at risk, the estimated rate for all reinfections was 5.0 per 100,000 participant days at risk (95% confidence interval: 3.1 to 8.1). There were not enough strong positive tests (Ct less than 30) after 90 days of risk to provide an estimated rate of infections for this sub-group.
Figure 1: Risk of reinfection peaked around the first month after becoming "at risk" for reinfection and declined afterwards
Estimated rate of COVID-19 reinfection according to time at risk per 100,000 participant days, 26 April 2020 to 5 June 2021, UK
Embed code
Notes:
For the purposes of this analysis, we define reinfection as a new positive test 90 days or more after an initial first positive test which was preceded by at least one negative test, or where an individual has had a subsequent positive test following four consecutive negative tests regardless of the time since the first positive test.
A confidence interval gives an indication of the degree of uncertainty of an estimate, showing the precision of a sample estimate. The 95% confidence intervals are calculated so that if we repeated the study many times, 95% of the time the true unknown value would lie between the lower and upper confidence limits. A wider interval indicates more uncertainty in the estimate.
Time between infection episodes
In those with reinfections, the median time between the first positive test in the initial episode and first positive test in the reinfection episode was 115 days (interquartile range: 108 to 139) (accompanying dataset 1c).
Back to table of contents5. Symptoms
This analysis considers whether any symptoms were reported at visits within 35 days following the first observed positive test corresponding to the episode of initial infection or reinfection. At each visit we ask about symptoms in the last seven days, including any specific self-reported symptoms (cough, fever, shortness of breath, loss of taste, loss of smell, myalgia, fatigue, sore throat, headache, abdominal pain, diarrhoea, nausea or vomiting) and any general symptoms participants have that they think may be related to COVID-19.
In unadjusted analyses, individuals were less likely to self-report symptoms within 35 days following the first observed positive test in their reinfection episode than in their initial episode. Figure 3 shows the proportion of individuals who reported symptoms within 35 days following the first observed positive test associated with the initial infection episode and the reinfection episode. People were more likely to report symptoms within 35 days following the first observed positive test in their initial episode (57.1%, 95% confidence interval: 48.6% to 65.2%) than in their reinfection episode (21.8%, 95% confidence interval: 15.6% to 29.6%), suggesting that reinfections may be more likely to be asymptomatic.
Figure 3: Fewer people reported symptoms within 35 days following the first observed positive test in their reinfection episode than in their initial infection episode, suggesting reinfections are more likely to be asymptomatic
Percentage of people reporting symptoms within 35 days following their first observed positive test by episode, 26 April 2020 to 5 June 2021, UK
Embed code
Notes:
A confidence interval gives an indication of the degree of uncertainty of an estimate, showing the precision of a sample estimate. The 95% confidence intervals are calculated so that if we repeated the study many times, 95% of the time the true unknown value would lie between the lower and upper confidence limits. A wider interval indicates more uncertainty in the estimate. Overlapping confidence intervals indicate that there may not be a true difference between two estimates.
Symptoms are self-reported and were not clinically diagnosed.
This data shows unweighted percentages of individuals who met our definition of COVID-19 reinfection, split by whether the individual reported symptoms at any visit within 35 days following the first observed positive test of their initial or reinfection episode respectively.
Differences between those reporting any symptoms within 35 days of initial infection and reinfection were statistically significant (McNemar test, p < 0.001).
Further symptom analysis can be found in the latest edition of Coronavirus (COVID-19) Infection Survey: characteristics of people testing positive for COVID-19 in countries of the UK.
Back to table of contents6. Methodology update
This section outlines the updated methodology for reinfections analysis that will be published in our characteristics of people testing positive for COVID-19 bulletin from 6 October 2021.
We have updated our definition of a possible reinfection event.
Since our publication of reinfections in our characteristics bulletin on 25 August 2021, we define a possible reinfection event as:
a new positive test that is 120 days or more after an initial first positive test, with the most recent test result (before the new positive test) being negative
if the new positive test is fewer than 120 days after an initial first positive test, the new positive test has to be subsequent to four consecutive negative tests after the initial first positive test
This can be used to define the time a participant is "at risk". We refer to individuals as being "at risk" of reinfection when it is possible that a positive test of theirs can be considered a reinfection. The duration a participant is at risk (time at risk) is the time in days from the date they become susceptible and the date of the first positive test in the second episode or, (for those not testing positive) to the date of their last negative test.
We have made this change to ensure that our definition of a reinfection, which is necessarily based on swab tests alone, produces the most accurate possible estimates.
As we have collected more data and expanded our reinfections analysis, we have improved our understanding of the duration when people test positive. There is evidence that a 120-day cut-off, rather than a 90-day cut-off, is necessary to reduce the risk of long-term intermittent positives being captured as reinfections. By moving to a 120-day cut-off, we minimise the risk that the reinfections we identify are part of a single ongoing infection episode and not reinfections.
Improvements to our modelling approach apply to reinfections analysis based on the Coronavirus COVID-19 Infection Survey, published from the 6 October 2021 onwards. Previously, when the number of reinfections was low, we estimated the number of reinfections over time and were unable to consider factors, such as vaccination status, that can influence an individual’s risk of experiencing a possible reinfection event.
Since we first began monitoring cases, the number of reinfections has increased slightly allowing us to estimate reinfection rates in the context of vaccination status, patient-facing occupations, and different variant-dominant periods of time.
A flexible parametric model was used to investigate the overall rate of reinfection after an initial positive test result. In the model, vaccination status and the observed "Delta" and "Alpha" periods were fitted as time-dependent covariates. A time-dependent covariate is a variable that changes over the follow-up period.
In the model, the Alpha period is classified as prior to 17 May 2021 and the Delta period is classified from 17 May 2021.
The addition of vaccination status as a time-dependent covariate allows us to compare the risk of reinfection between individuals being unvaccinated or vaccinated at each time period, and to re-evaluate which risk group each person would belong in based on their vaccination status.
The new model was also fitted with time-independent covariates such as sex, ethnicity (White or Other), household size, and age. This allows us to compare the risk of reinfection between individuals while taking into account static factors that can influence an individual's risk of experiencing a possible reinfection event. Time-independent covariates do not change over the study period.
Notes for Methodology update
Factors adjusted for in flexible parametric model include: age (linear); sex; ethnicity (White or Other); household size (linear); ever patient-facing health care worker; symptoms within 35 days of the first positive in the first episode; Index of Multiple Deprivation; ever worked in person-facing health care; ever reported long-term health conditions; vaccination status; "Delta" or "Alpha" period (Alpha period is prior to 17th May 2021, Delta is after); first episode minimum Ct value (the first Ct value is taken as the minimum Ct value across the entire first episode).
From 6 October 2021, reinfections analysis, published in our characteristics of people testing positive for COVID-19 bulletins will include data that start from 26 April 2020, with the date from which any participant is at risk of reinfection with COVID-19 being 2 July 2020.
8. Collaboration
This analysis was produced by Owen Gethings - Office for National Statistics (ONS) Senior Statistical Officer, in collaboration with our research partners at the University of Oxford, the University of Manchester, Public Health England (PHE) and Wellcome Trust. Of particular note are:
- Sarah Walker - University of Oxford, Nuffield Department for Medicine: Professor of Medical Statistics and Epidemiology and Study Chief Investigator
- Koen Pouwels - University of Oxford, Health Economics Research Centre, Nuffield Department of Population Health: Senior Researcher in Biostatistics and Health Economics
- Thomas House - University of Manchester, Department of Mathematics: Reader in mathematical statistics
9. Glossary
At-risk period
The "at-risk" period consists of any individual in the sample that meets either of the following criteria: 90 days has elapsed since an individual's first positive test in the survey and their most recent test result was negative or, if 90 days has not passed since their first positive test in the survey, the individual's last positive has been followed by four consecutive negative tests.
COVID-19 infection
We define a coronavirus (COVID-19) infection as an individual testing positive for SARS-CoV-2, with or without symptoms, on a swab taken from the nose and throat using reverse transcriptase polymerase chain reaction (RT-PCR).
Confidence interval
A confidence interval gives an indication of the degree of uncertainty of an estimate, showing the precision of a sample estimate. The 95% confidence intervals are calculated so that if we repeated the study multiple times, 95% of the time the true unknown value would lie between the lower and upper confidence limits. A wider interval indicates more uncertainty in the estimate. Overlapping confidence intervals indicate that there may not be a true difference between two estimates. For more information, see our methodology page on statistical uncertainty.
Cycle threshold (Ct) value
A cycle threshold (Ct) value is the cycle number at which the PCR product crosses a threshold of detection and can give an indication of the viral load, or strength of a positive SARS-CoV-2 test. The lower the Ct value, the higher the viral load and stronger the positive test.
Participant days at risk
The rate per 100,000 participant days at risk allows us to understand the rate of reinfection depending on how long participants have been "at risk" of reinfection and accounts for participants having initial infection episodes and entering the at-risk period at different times. It also accounts for individuals having different amounts of time until their last test in the survey to date. The rate is calculated by dividing the number of reinfections identified by the cumulative number of participants days from the first time a participant became at risk to their last test in the survey to date. For example, if there were 1,000 participants who had each spent 100 days in the at-risk period, and we identified one reinfection in that time, the rate would be one reinfection per every 100,000 at-risk days.
Viral load
Viral load refers to the quantity of virus in a bodily fluid such as saliva or blood. High viral load is associated with lower Ct values (indicating a strong positive test), and low viral load is associated with higher Ct values.
Embed code
10. Data sources and quality
In this article, we refer to the number of coronavirus (COVID-19) reinfections within the Coronavirus (COVID-19) Infection Survey. The survey includes individuals from the community population; community in this instance refers to private residential households, and it excludes those in hospitals, care homes and/or other institutional settings in the UK. For the purposes of this analysis, all estimates of COVID-19 reinfections are unweighted.
More information on measuring the data and its strengths and limitations is available in the Coronavirus (COVID-19) Infection Survey statistical bulletin.
Our methodology article provides further information around the survey design, how we process data and how data are analysed.
Methods and technical information
All estimates of COVID-19 reinfections presented in this article are unweighted, as the sample for the analysis includes only those who meet our criteria for reinfection and as such there is not a nationally representative population to weight this to. As we currently have low counts of participants reinfected with COVID-19, confidence intervals associated with this analysis may be wide, and where applicable measures of uncertainty are presented.
The strength of a positive test is determined by how quickly the virus is detected by reverse transcriptase polymerase chain reaction (RT-PCR) and is measured by a cycle threshold (Ct) value. Positive tests with a Ct value of less than 30 are considered a strong positive test and indicate a higher viral load, with higher Ct values indicating a weaker positive test. High Ct value can be seen in the early stages of infection when virus levels are rising, or late in the infection when the risk of transmission is low.
This analysis of reinfections considers individuals with a new positive test and within this, a subgroup is considered where any positive in the reinfection episode has a Ct value less than 30. We do not restrict to this group alone as one question addressed by the analysis was whether reinfections tended to have higher Ct values indicating a lower viral load. You can find more information on Ct values in a paper written by our academic partners at the University of Oxford.
Time-to-event data is commonly analysed using the Cox proportional hazards model. The Cox model makes no assumptions about the shape of the underlying hazard function and does not produce smoothly varying estimates of rates over time. In contrast, a flexible parametric model uses restricted natural cubic spline functions to model the baseline cumulative hazard (used here), baseline cumulative odds of survival, or some more general baseline distribution in survival analysis models. This provides a smoothly varying estimate of the rates of events over time, as shown in Figure 1.
For this article, we present the percentage of people reporting symptoms at any visit within 35 days following a first observed positive test corresponding to an episode of COVID-19. Previous analysis of symptoms has considered only those individuals with a strong positive test (Ct value less than 30) to exclude the possibility that symptoms are not identified because we pick up individuals very early or late in their infection. However, this analysis of symptoms considers symptoms reported by individuals regardless of the Ct value as this allowed us to examine whether reinfections were milder in terms of both symptom burden and viral load.
Back to table of contentsContact details for this Article
infection.survey.analysis@ons.gov.uk
Telephone: +44 (0)1329 447683