# Estimating the Prevalence of Asymptomatic COVID-19 Cases and Their Contribution in Transmission - Using Henan Province, China, as an Example

^{1}Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, China^{2}Department of Applied Mathematics, Hong Kong Polytechnic University, Hong Kong, China^{3}Department of Biostatistics and Data Science, School of Public Health, University of Texas Health Science Center at Houston, Houston, TX, United States

**Background:** Novel coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-COV-2), is now sweeping across the world. A substantial proportion of infections only lead to mild symptoms or are asymptomatic, but the proportion and infectivity of asymptomatic infections remains unknown. In this paper, we proposed a model to estimate the proportion and infectivity of asymptomatic cases, using COVID-19 in Henan Province, China, as an example.

**Methods:** We extended the conventional susceptible-exposed-infectious-recovered model by including asymptomatic, unconfirmed symptomatic, and quarantined cases. Based on this model, we used daily reported COVID-19 cases from January 21 to February 26, 2020, in Henan Province to estimate the proportion and infectivity of asymptomatic cases, as well as the change of effective reproductive number, *R*_{t}.

**Results:** The proportion of asymptomatic cases among COVID-19 infected individuals was 42% and the infectivity was 10% that of symptomatic ones. The basic reproductive number *R*_{0} = 2.73, and *R*_{t} dropped below 1 on January 31 under a series of measures.

**Conclusion:** The spread of the COVID-19 epidemic was rapid in the early stage, with a large number of asymptomatic infected individuals having relatively low infectivity. However, it was quickly brought under control with national measures.

## Introduction

In December 2019, cases of pneumonia with an unknown cause were reported. The disease was later named as novel coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus-2 (SARS-COV-2) (1, 2). The rapid increase in confirmed cases and subsequent secondary outbreaks in many countries caused concern on an international scale. As a result, the World Health Organization declared the COVID-19 outbreak a Public Health Emergency of International Concern on January 31, 2020 and eventually classified it as a pandemic on March 11, 2020 (3). As of July 19, 2020, 14 million COVID-19 cases and 597,583 deaths have been confirmed globally, including 85,937 confirmed cases in China (4). Although the number of confirmed cases was staggering, only the sicker part of those infected were being reported. Li et al. used a metapopulation model to estimate that 86% of the infections (presumably of mild symptoms or asymptomatic) before January 23, 2020 were undetected in Wuhan, China (5); Chinazzi et al. used a GLEAM model to estimate that only one out of four cases were confirmed in Mainland China by February 1, 2020 (6, 7). Hao et al. used a SAPHIRE model to estimate that 87% of the infections before March 8, 2020 were unascertained in Wuhan, China (8). And some even suggested that most infections were caused by undetected cases (5, 9). A significant proportion of these undetected infected individuals were asymptomatic (8). In one documented case, a patient who disclaimed all symptoms and showed a normal chest radiography had multiple PCR cycle counts consistent with that of symptomatic patients (10), suggesting such patients are somewhat infectious (11).

The proportion of asymptomatic cases is a critical epidemiological characteristic that modulates the pandemic potential of the emergent respiratory virus, and is an important parameter in estimating the disease burden (5, 12–14). Estimating the proportion of asymptomatic cases will improve the understanding of COVID-19 transmission and spectrum of presentation, thereby providing insight into the spread of epidemics (14). But the estimated proportion of asymptomatic infected individuals varied widely from place to place. A recent analysis of 21 retrieved reports by the Centre for Evidence-Based Medicine in Oxford found that estimates of asymptomatic COVID-19 cases ranged from 5 to 80% (15). Meanwhile, most studies only showed that asymptomatic infected individuals are less contagious than symptomatic ones (16, 17). Only one previous study clearly showed that the asymptomatic cases could be one quarter as infectious as symptomatic cases in Ningbo, China (18). Therefore, it is important to estimate the proportion and infectivity of asymptomatic cases in various regions. Taking Henan Province as an example, we used a model-inference framework to explore the proportion and infectivity of asymptomatic cases, so as to estimate the prevalence of COVID-19.

## Methods

### Study Area

The study area is located in east-central China (31°23′ to 36°22′ north latitude, 110°21′ to 116°39′ east longitude, Figure 1), with a population of more than 96 million and an area of 167,000 km^{2}. Most of Henan is located in the warm temperature zone and has the characteristics of climate transition from plains to hills and mountains from east to west.

### Source of Data

All data were obtained from the official websites of Provincial and Municipal Health Commissions (Supplementary Table 1), which published COVID-19 case data and information. The case data included the number of newly confirmed cases, cured cases, and deaths per day. The case information included age, gender, exposure history, date of symptom onset, and activity trajectory of confirmed cases. Identifiable personal information was removed for privacy protection.

### Case Definition

Although the definition of COVID-19 cases has been changed several times, which has greatly affected the observed epidemic curve in Wuhan (19), the change of cases in Henan Province has been relatively stable, and the diagnosis of all cases in this study were based on the sixth edition of Diagnosis and Treatment Scheme for COVID-19 released by the National Health Commission of China (20). A laboratory-confirmed case was defined if the patient had a positive test of SARS-CoV-2 virus by real-time reverse-transcription-polymerase-chain-reaction (RT-PCR) assay or high-throughput sequencing of nasal and pharyngeal swab specimens. Only laboratory-confirmed cases were included in this study.

### Modeling the Epidemic of COVID-19 in Henan Province

To consider asymptomatic infected individuals, we constructed the susceptible-exposed-asymptomatic-confirmed-unconfirmed symptomatic-hospitalized-removed (SEAIUHR) model by extending the classic susceptible-exposed-infectious-removed (SEIR) model to include asymptomatic cases, unconfirmed symptomatic cases who did not seek medical attention or get tested for mild symptoms, and quarantined confirmed cases. In this model, we divided the population into seven compartments: S (susceptible), E (latent), A (asymptomatic infectious), I (confirmed symptomatic infectious), U (unconfirmed symptomatic individuals), H (hospitalized), and R (removed). Susceptible individuals could acquire the virus after contact with infected cases (both symptomatic and asymptomatic) and became latent when they were infected but non-infectious. After a period of time, some of the latent individuals developed into symptomatic infections; some of these were confirmed and treated until they progressed into the removed stage and some went unconfirmed because they did not present themselves to healthcare facilities or get tested for mild symptoms. Others developed into asymptomatic infections and remained infectious until they progressed into the removed stage. Removed stage included individuals who were recovered or had died (Figure 2).

**Figure 2**. Schematic diagram of SEAIUHR model propagation process. S, susceptible; E, latent; A, asymptomatic infectious; I, confirmed symptomatic infectious; U, unconfirmed symptomatic infectious; H, hospitalized; R, removed.

Dynamics of these seven parts over time could be expressed by the following ordinary differential equation:

where β_{t} was the transmission rate due to symptomatic infected individuals at time *t*, defined as the proportion of cases from susceptible individuals to infected individuals, both asymptomatic and symptomatic, caused by symptomatic infected cases; θ was the ratio of the transmission rate due to asymptomatic over symptomatic cases; μ_{1} and μ_{2} were the proportion of the asymptomatic and unconfirmed symptomatic cases among infected individuals, respectively; *z* was the latent period; *r*_{1}, *r*_{2}, and *r*_{3} were infectious periods of confirmed symptomatic, asymptomatic, and unconfirmed symptomatic cases, respectively; and *r* was the duration from hospitalization to recovery or death. Assume that *N* = *S* + *E* + *I* + *A* + *U* + *H* + *R*.

The differential equations in the model were numerically solved using a 4th order Runge-Kutta (RK4) method. Specifically, for each step of the algorithm, each term on the right side of the equation was determined using a random sample of the Poisson distribution (5).

On January 25, 2020, Henan Province implemented a first-level public health emergency response to the epidemic and took a series of prevention and control measures, such as traffic restriction, quarantine, contact tracing, isolated treatment of confirmed cases, and so on (21, 22). We assumed that these major government measures caused the transmission rate to change from a constant rate to a time dependent exponentially decreasing rate (23).

Then, the formula of β_{t} could be expressed by the following step function:

where β_{0} was the transmission rate due to symptomatic infected individuals before implementing measures; *a* was the decreasing rate of transmission rate; and *t*_{1} was the date to start implementing measures.

The effective reproductive number, *R*_{t}, could be computed as:

In the initial state, namely, *t* = 0, *R*_{t} = *R*_{0} is the basic reproductive number.

### Estimation of Parameters in the Model

Initial states and parameter's setting in the model were presented in Table 1. We assumed that the initial latent population, asymptomatic infected population, and unconfirmed symptomatic cases were drawn from uniform distribution [0,10], the initial confirmed symptomatic infected population was 0, and the rest of Henan Province were susceptible. For parameters, we estimated β_{0}, μ_{1}, μ_{2}, θ, and α by assuming that the values of parameters *z*, *r*_{1}, *r*_{2}, *r*_{3}, and *r* were fixed throughout the process. We assumed that the initial values of each parameter to be estimated were drawn using Latin hypercube sampling in uniform distribution. The initial ranges of μ_{1}, μ_{2}, and θ were chosen to cover most possible values, i.e. [0,1]; the initial range of α was selected to more broadly cover what the previous research covered (23). The initial range of β was selected from the widest possible range of basic reproductive number (*R*_{0}).

We used the Ensemble Adjustment Kalman Filter (EAKF) to infer epidemiological parameters of the model based on the number of cases presenting symptoms per day in Henan Province (31–33). The EAKF is a data assimilation algorithm that only needs hundreds of ensembles to obtain good results, especially suitable for the estimation of high-dimensional parameters of the model (34, 35), and has been successfully applied to epidemics such as cholera and influenza (32, 35). In this study, we used 1,000 ensembles and 1,000 independent realizations to infer parameters and their corresponding 95% confidence intervals (Cls).

### Sensitivity Analysis

#### Synthetic Testing

Before applying the model-inference framework to the number of daily incidence data, we tested the effect of model-inference framework with model-generated outbreak data. Specifically, we fixed the parameters of the model to specified values and used the model to generate synthetic outbreak data. We then applied the EAKF algorithm to the synthetic daily outbreak data and assessed the model-inference framework by analyzing whether the model could fit the synthetic outbreak data and estimate parameters.

#### Sensitivity of Parameters Estimation to the Range of Initial States and Values of Fixed Parameters

In initial states, the quantities of *E*_{0}, *A*_{0}, and *U*_{0} were unknown, and our assumptions may affect the estimation of other parameters. Therefore, this study simultaneously investigated the results of parameter's estimation when shortening and expanding their ranges. At the same time, we changed values of fixed parameters, respectively, to test the robustness of our results.

## Results

As shown in Figure 3, our model could fit reported daily incidence data well and accurately capture the peak and tendency of the epidemic. The numbers of reported daily cases were within the confidence interval estimated by the model, except for a few days in the later stages of the outbreak.

**Figure 3**. Comparison of the number of cases estimated and observed. The asterisk represents the number of cases with symptoms observed on a daily basis; the curve shows the change in the average number of confirmed symptomatic cases per day estimated by the model. The light blue shade was the 95% confidence interval of the estimation.

The mean estimation of transmission rate due to symptomatic infected individuals was 1.14 (95% CI:1.07-1.23) at the beginning of the epidemic and the decreasing rate of transmission rate after implementing prevention and control measures was 0.16 (95% CI: 0.12-0.19). Our model estimated that the asymptomatic rate among COVID-19 infected individuals was 42% (95% CI: 41-47%), and the mean ratio of the transmission rate of asymptomatic over symptomatic cases was 0.1 (95% CI: 0.02-0.11). At the same time, our model estimated that 11% (95% CI: 9-22%) of infected individuals were unconfirmed symptomatic cases who did not seek medical attention or get tested for mild symptoms (Table 2). Then, the fraction of undocumented infections in Henan Province was 53% (95% CI: 50-68%). Based on above parameters, we estimated the average effective reproduction number, *R*_{t}, to be 2.73(95% CI: 2.64-3.31) at the beginning of the epidemic, which was equal to the basic reproduction number (*R*_{0}). With the implementation of measures, *R*_{t} fell below 1 on January 31.

The results of the synthetic test were shown in Figure 4 and Table 3. All generated values were within the confidence interval estimated by the model and values of all parameters were within the estimated 95% confidence interval, which demonstrated the ability of the model-inference-framework to fit the synthetic outbreak data and estimate all five target model parameters accurately.

**Figure 4**. Comparison of the number of cases estimated and generated. The asterisk represents the number of cases with symptoms observed on a daily basis; the curve shows the change in the average number of confirmed symptomatic cases per day estimated by the model. The light blue shade was the 95% confidence interval of the estimation.

Results of parameter estimation when changing the range of initial states and values of fixed parameters were shown in Supplementary Table 2. It could be seen that values of the resampled epidemiological parameters fall near the values estimated from the original data, with small fluctuations, indicating that the estimated results of our model are robust.

## Discussion

Taking Henan Province as an example, we constructed a SEAIUHR model to estimate the prevalence of asymptomatic COVID-19 cases and their contribution in transmission with EAKF algorithm. This model-inference framework is also applicable to studies of asymptomatic infected individuals in other regions.

Asymptomatic proportion, which is broadly defined as the proportion of asymptomatic infections among all infections of the disease, is important for estimating the true burden of disease and its transmission potential. At present, results of different studies on the asymptomatic proportion vary greatly (15). We estimated that the proportion of asymptomatic infections among infected individuals during the entire epidemic was 42% in Henan Province, within the confidence interval of the estimated asymptomatic rate of 13 cases imported from Wuhan to Japan (14). But it was higher than that of the Diamond Princess cruise ship, which showed that only 17.9% of those infected were asymptomatic (36). It could be that passengers and crew on the Diamond Princess were not drawn from a random sample of the general population, most of whom were older than 60 years and tended to have more severe symptoms after infection. Our model estimated that the mean ratio of transmission rate due to asymptomatic over symptomatic cases was 0.1, corresponding to a study showing that prolonged exposure to infected persons and short exposure to symptomatic persons (such as coughing) is associated with a higher risk of transmission, while short exposure to asymptomatic contacts is associated with a lower risk of transmission (24). The less contagious nature of asymptomatic individuals may be the result of a convolution of the shedding fraction of viable virus, the titer of viable virus in the primary/upstream case, and possibly behavioral factors.

The fraction of undocumented infections, including asymptomatic cases and unconfirmed symptomatic cases who did not seek medical attention or get tested for mild symptoms, was lower than that of Wuhan in the early stage of the epidemic (5, 6, 8), which may be caused by following reasons. Firstly, in the early stage, the medical configuration was not perfect and public awareness was still insufficient, while the undocumented rate gradually decreased with the development of the epidemic (5, 10, 37). Secondly, contact tracing measures implemented in China may become unfeasible when the number of cases in Wuhan rose sharply in the early stage (3). Finally, we need to point out that the differences in the estimated proportions of asymptomatic cases and unconfirmed symptomatic cases may be due to unidentifiability of parameters in epidemiological models. The theoretical analysis of identifiability of parameters in epidemiological models needs to be done in the future.

Basic reproductive number (*R*_{0}) is an important parameter to determine whether an infectious disease is prevalent or not. If *R*_{0} < 1, infectious disease would gradually decline and die out without an epidemic; if *R*_{0} > 1, an epidemic would break out. In this study, our estimation of *R*_{t} = 2.73 at the beginning of the epidemic measured the basic reproductive number *R*_{0}, that is, without intervention, each infected individual could infect an average of 2.73 susceptible individuals. This result was similar to some studies in other regions of China (28, 38, 39), although it was smaller than results from some other research (38). Combined with the latent period, the number of cases without intervention would increase exponentially (25, 29). However, Henan Province implemented a first level response on January 25, 2020, and adopted a series of prevention and control measures. The isolation treatment of confirmed cases and the testing of suspected cases aimed at removing infected individuals from the process of transmission. The closing of public places and the change of crowd behavior were to protect susceptible groups. Contact tracing, which identified possible chains of transmission between known infected persons and their close contacts, affected both susceptible and asymptomatic individuals and can effectively interrupt transmission. With the help of these measures, *R*_{t} dropped below 1 on January 31, 2020.

This study also has some limitations. Firstly, our estimation of the asymptomatic proportion and infectivity was obtained by a model, which could not be generalized because it has not been confirmed by serological investigation. Secondly, we only used data from Henan Province, which might limit the interpretation of our results, although our model-inference framework is also applicable to studies of asymptomatic infected individuals in other regions. Therefore, large-scale relevant studies are needed in the future. Thirdly, this study estimated the average asymptomatic infection rate in Henan Province over time, but the asymptomatic rate may vary in different periods of the epidemic.

## Conclusion

The epidemic situation developed rapidly in Henan Province, and there were a large number of asymptomatic infected individuals with relatively low infectivity. Our study further explored the prevalence of asymptomatic COVID-19 cases and their contribution to transmission so as to deepen people's understanding of asymptomatic cases and provide a reference for the prevention and control of COVID-19.

## Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

## Author Contributions

CL and XL conceived of and designed the research. CL, YZ, CQ, LL, DZ, XW, KS, YJ, and TL did the analyses. CL wrote and revised the paper. DH, MX, and XL contributed to the writing and revisions. All the authors have read and approved the submitted version. All the authors have agreed both to be personally accountable for the author's own contributions and to ensure that questions related to the accuracy or integrity of any part of the work are answered.

## Funding

This research was supported by the National Natural Science Foundation of China (Grant No. 81673238), COVID-19 Emergency Research Project of Shandong University (Grant No. 2020XGC01), National Key Research and Development Program of China (Grant Nos. 2019YFC1200500, 2019YFC1200502), and the General Research Fund (Grant No. 15205119) of the Research Grants Council (RGC) of Hong Kong, China and Alibaba (China) Co., Ltd. Collaborative Research project (P0031768). Funders have no role in the design of the study and collection, analysis, interpretation of data, or in writing the manuscript.

## Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

## Acknowledgments

We appreciate the Health Commission of Henan Province and its subsidiaries for providing data for our research.

## Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2021.591372/full#supplementary-material

## Abbreviations

COVID-19, coronavirus disease 2019; *R*_{0}, the reproductive number; *R*_{t}, the effective reproductive number; SEIAUHR model, susceptible-exposed-asymptomatic-confirmed-unconfirmed symptomatic-hospitalized-removed model.

## References

1. Chen N, Zhou M, Dong X, Qu J, Gong F, Han Y, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. *Lancet.* (2020) 395:507–13. doi: 10.1016/S0140-6736(20)30211-7

2. Guan W, Ni Z, Hu Y, Liang W, Ou C, He J, et al. Clinical characteristics of coronavirus disease 2019 in China. *N Engl J Med.* (2020) 382:1708–20. doi: 10.1056/NEJMoa2002032

3. Maier BF, Brockmann D. Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China. *Science.* (2020) 368:742–6. doi: 10.1126/science.abb4557

4. WHO. *Coronavirus Disease (COVID-19) Situation Report-181* (2020). Available online at: https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200719-covid-19-sitrep-181.pdf?sfvrsn=82352496_2 (accessed July 20, 2020).

5. Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, et al. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). *Science.* (2020) 368:489–93. doi: 10.1126/science.abb3221

6. Chinazzi M, Davis JT, Ajelli M, Gioannini C, Litvinova M, Merler S, et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. *Science.* (2020) 368:395–400. doi: 10.1126/science.aba9757

7. GLEAMviz.org. Available online at: http://www.gleamviz.org/ (accessed April 7, 2021).

8. Hao X, Cheng S, Wu D, Wu T, Lin X, Wang C. Reconstruction of the full transmission dynamics of COVID-19 in Wuhan. *Nature.* (2020) 584:420–4. doi: 10.1038/s41586-020-2554-8

9. Qiu J. Covert coronavirus infections could be seeding new outbreaks. *Nature.* (2020). doi: 10.1038/d41586-020-00822-x. [Epub ahead of print].

10. Zou L, Ruan F, Huang M, Liang L, Huang H, Hong Z, et al. SARS-CoV-2 viral load in upper respiratory specimens of infected patients. *N Engl J Med.* (2020) 382:1177–9. doi: 10.1056/NEJMc2001737

11. Bai Y, Yao L, Wei T, Tian F, Jin DY, Chen L, et al. Presumed asymptomatic carrier transmission of COVID-19. *JAMA.* (2020) 323:1406–7. doi: 10.1001/jama.2020.2565

12. Chan JFW, Yuan S, Kok KH, To KKW, Chu H, Yang J, et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. *Lancet.* (2020) 395:514–23. doi: 10.1016/S0140-6736(20)30154-9

13. Munster VJ, Koopmans M, van Doremalen N, van Riel D, de Wit E. A novel coronavirus emerging in China - key questions for impact assessment. *N Engl J Med.* (2020) 382:692–4. doi: 10.1056/NEJMp2000929

14. Nishiura H, Kobayashi T, Miyama T, Suzuki A, Jung S-M, Hayashi K, et al. Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19). *Int J Infect Dis.* (2020) 94:154–5. doi: 10.1016/j.ijid.2020.03.020

15. Heneghan C, Brassey J, Jefferson T. *COVID-19: What Proportion are Asymptomatic?* Available online at: https://www.cebm.net/covid-19/covid-19-what-proportion-are-asymptomatic/ (accessed July 23, 2020).

16. Park SY, Kim YM, Yi S, Lee S, Na BJ, Kim CB, et al. Coronavirus disease outbreak in call center, South Korea. *Emerg Infect Dis.* (2020) 26:1666–70. doi: 10.3201/eid2608.201274

17. Cheng HY, Jian SW, Liu DP, Ng TC, Huang WT, Lin HH. Contact tracing assessment of COVID-19 transmission dynamics in Taiwan and risk at different exposure periods before and after symptom onset. *JAMA Intern Med.* (2020) 180:1–8. doi: 10.1001/jamainternmed.2020.2020

18. He D, Zhao S, Lin Q, Zhuang Z, Cao P, Wang MH, et al. The relative transmissibility of asymptomatic COVID-19 infections among close contacts. *Int J Infect Dis.* (2020) 94:145–7. doi: 10.1016/j.ijid.2020.04.034

19. Tsang TK, Wu P, Lin Y, Lau EHY, Leung GM, Cowling BJ. Effect of changing case definitions for COVID-19 on the epidemic curve and transmission parameters in mainland China: a modelling study. *Lancet Public Heal.* (2020) 5:e289–96. doi: 10.1016/S2468-2667(20)30089-X

20. National Health Commission of the Peoples's Republic of China. *Clinical Diagnosis and Treatment Guidance of 2019 Novel Coronavirus (COVID-19) Caused Pneumonia* (2020). Available online at: http://www.nhc.gov.cn/yzygj/s7652m/202002/54e1ad5c2aac45c19eb541799bf637e9.shtml (accessed July 21, 2020).

21. Henan.gov. *Announcement on Prevention and Control of Novel Coronavirus Pneumonia*. Available online at: https://www.henan.gov.cn/2020/01-26/1285120.html (accessed April 7, 2021).

22. Zhang J, Litvinova M, Wang W, Wang Y, Deng X, Chen X, et al. Evolving epidemiology and transmission dynamics of coronavirus disease 2019 outside Hubei province, China: a descriptive and modelling study. *Lancet Infect Dis.* (2020) 20:793–802. doi: 10.1016/S1473-3099(20)30230-9

23. Liu Z, Magal P, Seydi O, Webb G. Predicting the cumulative number of cases for the COVID-19 epidemic in China from early data. *Math Biosci Eng.* (2020) 17:3040–1. doi: 10.1101/2020.03.11.20034314

24. Wiersinga WJ, Rhodes A, Cheng AC, Peacock SJ, Prescott HC. Pathophysiology, transmission, diagnosis, and treatment of coronavirus disease 2019 (COVID-19): a review. *JAMA.* (2020) 324:782–93. doi: 10.1001/jama.2020.12839

25. Lauer SA, Grantz KH, Bi Q, Jones FK, Zheng Q, Meredith HR, et al. The incubation period of coronavirus disease 2019 (CoVID-19) from publicly reported confirmed cases: estimation and application. *Ann Intern Med.* (2020) 172:577–82. doi: 10.7326/M20-0504

26. He X, Lau EHY, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. *Nat Med.* (2020) 26:672–5. doi: 10.1038/s41591-020-0869-5

27. Wei WE, Li Z, Chiew CJ, Yong SE, Toh MP, Lee VJ. Presymptomatic transmission of SARS-CoV-2-Singapore. *Morb Mortal Wkly Rep.* (2020) 69:411–5. doi: 10.15585/mmwr.mm6914e1

28. Xu XW, Wu XX, Jiang XG, Xu KJ, Ying LJ, Ma CL, et al. Clinical findings in a group of patients infected with the 2019 novel coronavirus (SARS-Cov-2) outside of Wuhan, China: retrospective case series. *BMJ.* (2020) 368:m606. doi: 10.1136/bmj.m606

29. Linton N, Kobayashi T, Yang Y, Hayashi K, Akhmetzhanov A, Jung S-M, et al. Incubation period and other epidemiological characteristics of 2019 novel coronavirus infections with right truncation: a statistical analysis of publicly available case data. *J Clin Med.* (2020) 9:538. doi: 10.3390/jcm9020538

30. Wang D, Hu B, Hu C, Zhu F, Liu X, Zhang J, et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China. *JAMA.* (2020) 323:1061–9. doi: 10.1001/jama.2020.1585

31. King AA, Ionides EL, Pascual M, Bouma MJ. In apparent infections and cholera dynamics. *Nature.* (2008) 454:877–80. doi: 10.1038/nature07084

32. Pei S, Morone F, Liljeros F, Makse H, Shaman JL. Inference and control of the nosocomial transmission of methicillin-resistant *Staphylococcus aureus*. *Elife*. (2018) 7:e40977. doi: 10.7554/eLife.40977

33. Anderson JL. An ensemble adjustment Kalman Filter for data assimilation. *Mon Weather Rev.* (2001) 129:2884–903. doi: 10.1175/1520-0493(2001)129

34. Pei S, Kandula S, Yang W, Shaman J. Forecasting the spatial transmission of influenza in the United States. *Proc Natl Acad Sci USA.* (2018) 115:2752–7. doi: 10.1073/pnas.1708856115

35. Portet S. A primer on model selection using the Akaike information criterion. *Infect Dis Model.* (2020) 5:111–28. doi: 10.1016/j.idm.2019.12.010

36. Mizumoto K, Kagaya K, Zarebski A, Chowell G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. *Eurosurveillance.* (2020) 25:2000180. doi: 10.2807/1560-7917.ES.2020.25.10.2000180

37. Zhao S, Musa SS, Lin Q, Ran J, Yang G, Wang W, et al. Estimating the unreported number of novel coronavirus (2019-nCoV) cases in China in the first half of January 2020: a data-driven modelling analysis of the early outbreak. *J Clin Med.* (2020) 9:388. doi: 10.3390/jcm9020388

38. Liu Y, Gayle AA, Wilder-Smith A, Rocklöv J. The reproductive number of COVID-19 is higher compared to SARS coronavirus. *J Travel Med.* (2020) 27:1–4. doi: 10.1093/jtm/taaa021

Keywords: COVID-19, asymptomatic cases, infectious dynamic model, the effective reproductive number, prevention and control measures

Citation: Li C, Zhu Y, Qi C, Liu L, Zhang D, Wang X, She K, Jia Y, Liu T, He D, Xiong M and Li X (2021) Estimating the Prevalence of Asymptomatic COVID-19 Cases and Their Contribution in Transmission - Using Henan Province, China, as an Example. *Front. Med.* 8:591372. doi: 10.3389/fmed.2021.591372

Received: 04 August 2020; Accepted: 04 May 2021;

Published: 23 June 2021.

Edited by:

Roger Hewson, Public Health England, United KingdomReviewed by:

Senzhong HUANG, Nankai University, ChinaPaul Fenimore, Los Alamos National Laboratory (DOE), United States

Copyright © 2021 Li, Zhu, Qi, Liu, Zhang, Wang, She, Jia, Liu, He, Xiong and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiujun Li, xjli@sdu.edu.cn