# Estimation of Some Epidemiological Parameters With the COVID-19 Data of Mayotte

^{1}Institut Montpelliérain Alexander Grothendieck, Univ. Montpellier, CNRS, Montpellier, France^{2}Centre Universitaire de Formation et de Recherche, Dembeni, France^{3}Laboratoire de Mathématiques et Applications, UMR CNRS 7031, Poitiers, France^{4}Agence Régionale de Santé de Mayotte, Centre Kinga, Mamoudzou, France

We study in this article some statistical methods to fit some epidemiological parameters. We first consider a fit of the probability distribution which underlines the serial interval distribution of the COVID-19 on a given set of data collected on the viral shedding in patients with laboratory-confirmed. The best-fit model of the non negative serial interval distribution is given by a mixture of two Gamma distributions with different shapes and rates. Thus, we propose a modified version of the generation time function of the package *R0*. Second, we estimate the time-varying reproduction number in Mayotte. Using a justified mathematical learning model, we estimate the transmission parameters range values during the outbreak together with a sensitivity analysis. Finally, using some regression and forecasting methods, we give some learning models of the hospitalized, intensive care, and death cases over a given period. We end with a discussion and the limit of this study together with some forthcoming theoretical developments.

## 1. Introduction

Following the emergence of the COVID-19 and its spread outside of China, all countries are experiencing the pandemic. The COVID-19 virus has caused great disruption to human health, social life, development, and economics. In response, several countries have implemented several interventions including nonpharmaceutical, vaccination, case isolation of symptomatic individuals and their contacts, restriction and social distancing in schools, at work, and universities, banning of mass gatherings and some public events, and local and national lockdowns of populations. Also in response to the rising numbers of cases and deaths, and to maintain the capacity of health systems to treat as many severe cases as possible, France like European countries and other continents has implemented some process measures to control the epidemic, (refer to [1–4]). In the French overseas department, especially in Mayotte, there is a high population density and certain local issues such as access to water, housing, hindering distance measures, and barrier gestures. The epidemic data in Mayotte started on 13 March 2020 when the first reported case traveled from metropolitan France (3 days ago with two other family members, which in turn were confirmed positive a few days later). Some other imported cases were confirmed later and led to the actual state of the spread of the virus in Mayotte. Many mathematical models of the COVID-19 coronavirus epidemic have been developed, and some of these are listed in the following articles [5–12, 42]. Based on the development of epidemiological characteristics of COVID-19 infection, a model of type SEIARS can be appropriate to study the dynamics of this disease. To understand trends in the development of the epidemic curve, it is important to know the serial interval distribution of the COVID-19 virus estimate and hence the generation time to be able to give some estimation of the epidemiological parameters like the time- varying reproduction number and observed transmission rates, among others. This article aims to study first the statistical methods to fit the probability distribution function which underlines the serial interval distribution of the COVID-19 virus on a given set of data collected on the viral shedding in patients with laboratory-confirmed COVID-19 (refer to [13, 14]) to obtain a best-fit model of the serial interval distribution. Based on this estimation, we estimate the time- varying reproduction number and the transmission rates observed on the island of Mayotte from March 2020 to January 2022. The paper is organized as follows. We present the material and methods sections, the data, and the statistical tools together with the mathematical learning model and tools. In the result section, we give the estimation of the serial interval distribution by several methods and the best-fit model with a mixture model. We derive an estimation of the effective reproduction number along the time. We fit the transmission rates parameters range values obtain through the mathematical learning model together with a sensitivity analysis of some parameters like the asymptotic ratio and infectious period ratio. The results are presented together with along with a discussion. We also present some regression and forecasting models for the hospitalized, intensive care, and death cases over a given period. We end with a conclusion and the limit of this study and some theoretical perspectives as forthcoming work.

## 2. Material

This study is concerned with a statistical analysis of some important parameters of the COVID-19 epidemic curve in Mayotte.

### 2.1. Observed COVID-19 Infected Cases in Mayotte

Many efforts have been made by the French Regional Health Agencies in Mayotte to collect the data. The present study is based on a reported dataset of confirmed cases of COVID-19 from 13 March 2020 to 11 January 2022. The reported temporal daily cases and total confirmed cases are illustrated in Figure 1. We also consider a subset of the database containing the daily observed cases of confirmed hospitalization, intensive care, and death cases from 13 March 2020 to 26 February 2021 in Figure 1.

**Figure 1**. Reported daily infected cases from 13 March 2020 to 11 January 2022 and reported daily cases of hospitalization, intensive care, and death cases from 13 March 2021 to 26 February 2021 in Mayotte.

### 2.2. COVID-19 Infectiousness Profiles From Transmission Pairs

Some major studies have been made on the determination of generation times through serial interval distributions and incubation time from real data, refer to for instance [13, 14]. In this section, we will focus on the data collected by He et al. [13] where the authors report temporal patterns of viral shedding in 94 patients with laboratory-confirmed COVID-19 and modeled COVID-19 infectiousness profiles from a separate sample of 77 infector-infectee transmission pairs. The variation between individuals (infector/infectee) and the transmission process is summarized, respectively by the distribution of incubation period and serial intervals. Figure 2 (picture above) contains the number of infected cases (in red), the lower limit of infectors (in black), and the upper limit of infectors (in green).

The variation between individuals and chains of transmission is summarized, respectively by the distribution of incubation periods and the distribution of serial intervals (refer to definition below). For simplicity, we consider non-negative serial interval values which represent 93% of the data. Figure 2 shows the histogram (picture below) of the serial interval from the underline database.

## 3. Methods

First let us recall some definitions concerning the infectivity process. Consider an infector *i* and infectee *j*. The generation time interval ${{G}}_{i,j}$ is the time interval from the infection of *i* to infection of *j*. It is the time lag between infection in a primary case and a secondary case; and should be obtained from the time lag between all infectee/infector pairs [15]. In this way, it describes infectiousness with respect to the point of infection. As it cannot be observed directly, it is often substituted with the serial interval distribution ${{S}}_{i,j}$ which is the time interval between symptom onset of the infector *i* and symptom onset of the infectee *j*. They can be captured by the following equations:

or

where ${E}$ is the time interval between symptom onset and the infection (incubation period) and ${{P}}_{i,j}$ is the time interval from the symptom onset of *i* to infection of *j*. We refer the reader to Lehtinen et al. [16] and Svensson [15] for further details. The serial interval that can be computed as follow (refer to [13]):

with

• *x*.*lb*: Lower limit dates of onset of symptoms of infectors.

• *x*.*ub*: Upper limit dates of onset of symptoms of infectors.

• *y*: Dates of onset of symptoms of the infectee.

There are many methods for generating estimates of parameters for standard probability distributions such as the method of moments and the maximum likelihood estimation. The use of other methods was generally confined to analytical work on mixture densities. Rapid analysis of the above histogram seems to show that the serial interval can be fit by a set of exponential- family distributions (e.g., «Gamma », «Lognormal », «Weibull », and «Exponential »distributions) or a mixture of them. The method of moments consists in estimating the parameters of the distributions from the theoretical finite moments with their empirical estimates. As concerned the Weibull distribution, we estimate the scale and shape parameters from the analytical expression of the mean, median, and the variance of the above serial interval data.

### 3.1. Estimation by the Maximum Likelihood Method

The maximum likelihood estimation (refer to [17] is a statistical method used to estimate the parameters of the probability law of a given observation *x* = (*x*_{1}, .., *x*_{n}) with probability densities *f*(*x*_{i}, θ) by determining the values of the parameters maximizing the log-likelihood function given by:

where θ is the (vector) parameter to be estimated. The likelihood function then represents the joint density of the individual observations *x*_{i} for any given level of the distribution parameters. The maximum likelihood estimate is the distribution parameter values, which maximize the likelihood function since these same values of the estimators also maximize the log-likelihood function.

### 3.2. Estimation by a Mixture Method

We present a mixture estimation which is a flexible tool (see [18, 19]) to model a known smooth probability density function as a weighted sum of parametric density functions *f*_{j}(*x*, θ_{j}) in a possibly multivariate independent observations *x* = (*x*_{1}, .., *x*_{n}) drawn from *k* several densities:

where *k* represents the number of classes or groups of the mixing, *p*_{j}>0 is the relative proportion of observations from each class *j* such that $\sum _{j=1}^{n}{p}_{j}=1.$ The mixture density will allow finding the parameter Θ = (*p*_{j}, θ_{j}) of the model where θ_{j} is the (vector) parameter from class *j*. The log-likelihood of the mixing density is defined by:

The maximum likelihood estimates are:

subject to

For *k*>1, the sum of the term appearing inside a logarithm makes this optimization quite difficult. Solution of this non-linear optimization problem has long been a difficult task for applied researchers. There are, of course, many general iterative procedures that are suitable for finding an approximate solution to the likelihood equations.

### 3.3. Non Parametric Estimation of the Time Varying Reproduction Number

Understanding the development of an epidemic is important as well as the basic reproduction number ℝ_{0} at the start of an epidemic and the time varying reproduction numbers. It is historically defined as the average number of new cases of infection generated by an individual during a period of infectivity (refer to [20]). There are several methods for computing this parameter, e.g., the next-generation matrix method. Let us mention here a non-parametric approach following Wallinga and Lipsitch [21]. Let η be the probability that an individual will remain infectious over a unit of time after being infected (i.e., age of infection) and β the transmission rate at age of infection *a*. Then τ(*a*) = η(*a*)β(*a*) is the transmissibility of an infectious individual at the age of infection *a*, assuming that the entire population is susceptible. Hence,

We can normalize τ(*a*) to be a probability density function:

so that

Let Γ(*t*) be the number of new infections during the time interval ]*t*; *t*+*dt*[ and *s*(*t*) the proportion of susceptibles in the population. Note that the new infections at time *t* are the sum of all infections caused by infectious individuals infected at a unit of time (i.e., at time *t* = *a*) if they remain infectious at time *t* (with an infectious age *a*). In other words

For *t*≠0, we have a non parametric formula of the time varying reproduction number

For practical calculation, this formula offers a discrete version as follows

implemented in the package *R0*, Obadia et al. [22] and Boelle et al. [23] and package *EpiEstim*, [24, 25]. In this study, we will compare our results on the reproduction numbers with the *R*_{0} package. For more on the calculation of the *R*_{0}, we refer to Batista [26]. Other improved estimation methods also exist in the literature, refer to Demongeot et al. [27] and Waku et al. [28], and references therein.

### 3.4. Regression and Forecasting Models

A regression method (refer to [29]) is a predictive statistical approach for modeling the relationship between a dependent variable say *Y* with a given set of observed variables say *X* = (*X*_{1}, *X*_{2}, …, *X*_{n}) under the form *Y* = *f*(*X*, η)+ω, where η is an unknown (vector) parameter and ω representing an additive error that may stand for white noise. The prediction models provide the insights that result in tangible outcomes.

The predictive statistical approach for disease research work found in the literature deals with ARIMA models [30, 31], polynomial regression and hierarchical polynomial regression models [32–34]. Recall that a process *Y*_{t} is said to be *ARIMA*(*p, d, q*) if

where *Y*_{t−1}, *Y*_{t−2}, ..., *Y*_{t−p} are the lags (past values); ϕ_{1}, ϕ_{2}, ..., ϕ_{p} are lag coefficients that are estimated by the model; ω_{t−1}, ω_{t−2}, ..., ω_{t−q} are error terms of the model for the respective lags; θ_{1}, θ_{2}, ..., θ_{q} their coefficients, and

with μ the mean of the process. When analyzing a model, it is important to measure its performance in order to draw the best conclusion and interpretation of the data. We will use the coefficient of determination or *R*^{2} as a metric given by

where for *n* values marked *y*_{1}, ..., *y*_{n}, each associated with a fitted value ŷ_{1}, ..., ŷ_{n}, the sum of squares of residuals $S{S}_{res}=\sum _{i}^{n}({y}_{i}-{\u0177}_{i})$, and the total sum of squares $S{S}_{tot}=\sum _{i=1}^{n}({y}_{i}-\u0233)$.

### 3.5. The Mathematical Learning Model

We propose a simple model that generalizes the SEIR model commonly used for virus disease modeling. The total population is partitioned into the following compartments: Susceptible **S**, Exposed **E**, Infected but not yet infectious, in a latent homogeneous period λ^{−1}; symptomatic Infected **I**_{s} with infectious period ${\gamma}_{1}^{-1}$ and asymptomatic Infected **I**_{a} in an infectious period ${\gamma}_{2}^{-1}$; Recovered **R** cases, and disease Dead cases **D**. We assume that every Exposed subject that enters **I**_{s} cohort later develops symptoms with probability 1−*q* and those who do not enter the **I**_{a} cohort with probability *q*. This assumption comes from real information in Mayotte and can be seen also in some studies, refer to Mizumoto et al. [35, 36]. We assume that a protected fraction ω of recovered individuals move to **S** (loss of immunity and re-infection). Given initial conditions *S*(0)>0, *E*(0)>0, *I*_{s}(0)>0, *I*_{a}(0)>0, *R*(0)>0, and *D*(0)≥0, the differential equations that govern the trajectories of our model are formulated as follow

where *N* = *S*+*E*+*I*_{s}+*I*_{a}+*R*+*D* is the fixed total population. With the initial conditions, the above system is well posed, with all variables remaining non-negative and *N* positive constant.

The epidemic equilibrium *X*_{0} of the system is obtained by setting all the derivatives to zero with *I*_{s} = *I*_{a} = 0, which yields to *X*_{0} = (*N*, 0, 0, 0, 0). To obtain the basic reproduction number, we use the next generation matrix (Dieckmann) by writing the Jacobian matrix of the flows of individuals between the different compartments at equilibrium:

The Jacobian matrix *FV*^{−1} has two zero and one positive eigenvalues so that

Considering the negligible death rate, we have

Thus

where ratio $r:=\frac{{\gamma}_{1}}{{\gamma}_{2}}.$ We assume in this article that, due to the isolation the symptomatic individual reduces their infectiousness so we shall consider values of *r* less than one.

## 4. Results and Discussion

The purpose of this section is to present the results not only on the parameter estimation of the serial interval distribution, and the reproduction numbers but also estimates of the transmission rates based on the above simple model.

### 4.1. Estimation of the Serial Interval

The basic method of moments (Figure 3A) and maximum likelihood estimation (Figure 3B) are summarized in Figure 3 for a set of classical distributions.

**Figure 3**. Summary of the method of moments, **(A)**, and by the maximum likelihood estimation (MLE), **(B)**, of fitted distributions.

As we can see in the method of moments, the «Gamma »distribution seems to fit better than others. Note that the method of moments does not give a good fit since it does not use any optimization techniques such as the Maximum Likelihood Estimation (MLE) method. The fit with the exponential distribution is not good and in the sequel, we will focus on «Gamma », «Lognormal », and «Weibull »distributions. The Kolmogorov-Smirnov test result for them is given also in Table 1 for the method of moments.

The maximum likelihood estimate is done using the R software function *nlm*, refer to Obadia et al. [22]. In Figure 3B, we represent several fitted densities estimated by this method.

We can see that, compared to the method of moments, the maximum likelihood improves in some way the estimation, refer to Table 2 for the Kolmogorov test. Akaike's Information Criterion (AIC) and the Bayesian Information Criterion (BIC) are written as follows:

where *k* is the number of parameters to estimate, *L* is the maximum of the likelihood function of the model, and *N* is the number of observations in the sample. The log-likelihood values, widely applicable information AIC and BIC criteria, and estimate parameters are summarize in Table 2. We note that the log-normal distribution is the best to fit the data in this context.

In the case of a mixture, a two-component mixture model is a reasonable model based on the bi-modality. Using the R software function **optim**, it is possible to minimize the log-likelihood functions of the different laws taken as arguments in addition to the initialization of the parameters to be optimized for the serial interval distributions. In Figures 3, 4, we summarize the estimation results of the mixture models.

The Log-likelihood, AIC, and BIC values are given in Table 3, and we conclude that the best-fit mixture is given by the mentioned mixture of «Gamma »distributions. This best-fit mixture is better than the MLE.

### 4.2. The Reproduction Number

In this section, we propose to modify the generation time function provided in the R0 package (refer to [22, 23]) to take into account a mixture modeling framework. Indeed, the choices of series interval estimation models in the R0 package are fixed distributions («Gamma », «Lognormal », and «Weibull ») and do not allow mixture distributions of the serial interval (SI). Furthermore, note that there is no reason for a mixture of Gamma distributions with different non-integer parameters to be Gamma distributed. Thus, using the epidemic incidence curve in Mayotte, we derive a generation time distribution in order to estimate the time varying reproduction number *R*_{0}(*t*) using the non parametric formula in Equation (1) and the best-fit mixture of the SI. The picture in Figure 5 represents the evolution of the time varying reproduction number *R*_{0}(*t*); we make the curve smooth using estimated values.

**Figure 5**. Evolution of the time varying reproductive number in Mayotte from 13 March 2020 to 11 January 2022 with the best-fit mixture of the SI.

The picture in Figure 6 shows not only the evolution of the time-varying reproduction number *R*(*t*) with package *R*0 but also a comparison with our method of estimation in a given period. We can see that at the start of the epidemic the basic reproductive number is around 3. This is consistent with the literature [1, 6, 37] and show that, at the start of the epidemic, an infected individual could contaminate an average of 3 individuals. The public policy measures taken by the authorities and health agency of Mayotte in response to the COVID-19 with a total lockdown on March 17 made it possible to observe a value of this indicator below 1 around the middle of April 2020, showing a temporary decline in the epidemic curve during this period. After then, unfortunately, the dynamic changed over time since the end of the lockdown at the end of May 2020. Subsequently, the Mayotte authorities implemented several measures including non-pharmaceutical intervention measures such as the establishment of curfews, the wearing of mandatory masks in public spaces, and now vaccination. As we can see two other maximum values (explained by the appearance of new variants being more contagious over time) were observed at the end of 2020 and 2021 which justified restriction decisions by the local authorities. Figure 7 shows the relative variation of the reproduction numbers under other mixtures of the SI distributions.

**Figure 6**. Evolution of the time varying reproductive number in Mayotte from 13 March 2020 to 11 January 2022 with a mixture of Log-normal distribution fit of the serial interval **(A)** and a mixture of Weibull distribution fit of the serial interval **(B)**.

**Figure 7**. Comparison of our time varying (figure below) reproductive number in Mayotte from 11 January 2021 to 11 January 2022 with the time-varying reproductive number from package R0 (figure above) with the best-fit of the SI by the MLE method.

### 4.3. Estimates of the Transmission Rates

We give some estimated results of the transmission rates for some given periods such as March 2020, December 2020 and 2021. The following pictures in Figure 8 show the range values of the time-varying transmission rates β(*t*) using the formula (2) according to the time-varying reproduction numbers *R*_{0}(*t*) and infectious rate γ_{1}. We make a sensitivity analysis for some values of the asymptotic ratio *q*. The information we have about the data allow to choose *q* = 0.2 or *q* = 0.3. For an infectious ratio *r*~0.7, one observed transmission rate going around up to 1.6 in March 2020 for a range from 20 % to 30 % of asymptomatic ratio. In the same conditions, the range of transmission rate goes up to 0.9 in December 2020 and up to 2.6 in December 2021. This show strong contact rates in December 2021, which can be explained by the traditional weddings in December in Mayotte. In December 2020, the contact rates were not strong and this can be explained by perhaps less contagiousness of the first variant at this time or by the fact that there were fewer traditional weddings in this same period of 2020 because of the fears of the population.

**Figure 8**. Estimates of the transmission rate with respect to time varying reproduction number *R*_{0}(*t*) and a range of infectious period of symptomatic ${\gamma}_{1}^{-1}\in (3,10)$.

### 4.4. Regression and Forecasting Results

The decomposition of the time series of daily hospitalized, intensive care unit, and death cases, allowed us to choose the best models summarized in Table 4. The prediction of the above models is given in Figure 9.

**Figure 9**. Predicted daily hospitalized, intensive care, and death cases from March 2020 to February 2021.

We analyze in Figure 10, the performance of a polynomial regression model on the cumulative cases with a common *R*^{2} = 0.998.. The model equations read as follows:

**Figure 10**. Predicted cumulative intensive care, hospitalized, and death cases from March 2020 to February 2021.

## 5. Conclusion and Further Studies

In this study, the outbreak dynamics of COVID-19 in Mayotte are discussed through the estimation of the time-varying reproduction number using a non parametric method. To this end, we first make use of statistical methods to fit the probability distribution function which underlines the serial interval distribution of the COVID-19 virus on a given set of data collected on the viral shedding in patients with laboratory-confirmed COVID-19 in He et al. [13]. We propose a modified version in package R0 of the time-varying reproduction number with a mixture estimation model of the serial interval distribution. After that, we considered a deterministic mathematical dynamic epidemic model to estimate the transmission rate parameters by using the Dykman method. In the model, we did not integrate the impact of vaccination because it is well known that vaccination does not prevent being infected, and instead, we introduce a re-infection parameter even though we do not deal in this article with the simulation of the dynamic model. The fit of the transmission rates is done in some given period of observation together with a sensitivity in the asymptotic ratio and infectious ratio between symptomatic and asymptomatic. We discuss some non-pharmaceutical interventions in the evolution of the time-varying reproduction number and the transmission rate coefficients. Any conclusion that can be drawn subject to our assumptions is that mitigation and non-pharmaceutical interventions of the COVID-19 have an impact on the transmission rate parameters. Note also that some conditions can be different in reality, e.g., the infectious rate, the virus mortality rate, and the generation time distribution. This is the limit of our study.

In terms of forthcoming study, we are currently interested in theoretical developments in a context of a cross-mixture in order to improve the estimates considering negative serial interval and α-stables distributions with stochastic Expectation-Maximization algorithm and Bayesian methods, refer to Castillo-Barnes et al. [38] and El Haj et al. [39]. As concerned the regression methods, we are currently working on some other non-parametric forms, refer to Slaoui [40] and Bouzebda and Slaoui [41].

## Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the author on reasonable request.

## Author Contributions

SM-A performed the conception, preparation, experiment, the statistical analysis of results, and the writing of the present article. YS contributed to the conception. JB contributed to some epidemiological information and interpretation. All authors contributed to the article and approved the submitted version.

## Funding

This work was partially funded by the Regional Health Agency (ARS) of Mayotte.

## Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

## Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

## Acknowledgments

The authors are indebted to the referees for their valuable comments, criticism and suggestions on a first version of this paper. We also thank the third author involved in the collection and processing of the data.

## References

1. Di Domenico L, Pullano G, Sabbatini CE, Boëlle PY, Colizza V. Impact of lockdown on COVID-19 epidemic in Île-de-France and possible exit strategies. *BMC Med*. (2020) 18:1–13. doi: 10.1186/s12916-020-01698-4

2. Salje H, Kiem CT, Lefrancq N, Courtejoie N, Bosetti P, Paireau J, et al. Estimating the burden of SARS-CoV-2 in France. *Science*. (2020) 369:208–11. doi: 10.1126/science.abc3517

3. Dudine P, Hellwig KP, Jahan S, Coady D. *A framework for estimating health spending in response to COVID-19*. IMF Working Papers (2020).

4. Di Domenico L, Pullano G, Sabbatini CE, Boëlle PY, Colizza V. Expected impact of reopening schools after lockdown on COVID-19 epidemic in Île-de-France. *MedRxiv*. (2020). doi: 10.1101/2020.05.08.20095521

5. Kucharski AJ, Russell TW, Diamond C, Liu Y, Edmunds J, Funk S, et al. Early dynamics of transmission and control of COVID-19: a mathematical modelling study. *Lancet Infect Dis*. (2020) 20:553–8. doi: 10.1016/S1473-3099(20)30144-4

6. Liu Z, Magal P, Seydi O, Webb G. A COVID-19 epidemic model with latency period. *Infect Dis Modell*. (2020) 5:323–37. doi: 10.1016/j.idm.2020.03.003

7. Liu Z, Magal P, Seydi O, Webb G. Predicting the cumulative number of cases for the COVID-19 epidemic in China from early data. *arXiv preprint arXiv:200212298*. (2020). doi: 10.1101/2020.03.11.20034314

8. Forien R, Pang G, Pardoux É. Epidemic models with varying infectivity. *SIAM J Appl Math*. (2021) 81:1893–930. doi: 10.1137/20M1353976

9. Forien R, Pang G, Pardoux É. Estimating the state of the COVID-19 epidemic in France using a model with memory. *R Soc Open Sci*. (2021) 8:202327. doi: 10.1098/rsos.202327

10. Peter OJ, Qureshi S, Yusuf A, Al-Shomrani M, Idowu AA. A new mathematical model of COVID-19 using real data from Pakistan. *Results Phys*. (2021) 24:104098. doi: 10.1016/j.rinp.2021.104098

11. Abioye AI, Peter OJ, Ogunseye HA, Oguntolu FA, Oshinubi K, Ibrahim AA, et al. Mathematical model of COVID-19 in Nigeria with optimal control. *Results Phys*. (2021) 28:104598. doi: 10.1016/j.rinp.2021.104598

12. Peter OJ, Shaikh AS, Ibrahim MO, Nisar KS, Baleanu D, Khan I, et al. Analysis and dynamics of fractional order mathematical model of COVID-19 in Nigeria using atangana-baleanu operator. *Comput Mater Continua*. (2021) 66:1823–48. doi: 10.32604/cmc.2020.012314

13. He X, Lau EH, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. *Nat Med*. (2020) 26:672–5. doi: 10.1038/s41591-020-0869-5

14. Nishiura H, Linton NM, Akhmetzhanov AR. Serial interval of novel coronavirus (COVID-19) infections. *Int J Infect Dis*. (2020) 93:284–6. doi: 10.1016/j.ijid.2020.02.060

15. Svensson Å. A note on generation times in epidemic models. *Math Biosci*. (2007) 208:300–11. doi: 10.1016/j.mbs.2006.10.010

16. Lehtinen S, Ashcroft P, Bonhoeffer S. On the relationship between serial interval, infectiousness profile and generation time. *J R Soc Interface*. (2021) 18:20200756. doi: 10.1098/rsif.2020.0756

17. Le Cam L. Maximum likelihood: an introduction. *Int Stat Rev*. (1990) 58:153–71. doi: 10.2307/1403464

18. McLachlan GJ, Lee SX, Rathnayake SI. Finite mixture models. *Ann Rev Stat Appl*. (2019) 6:355–78. doi: 10.1146/annurev-statistics-031017-100325

19. Benaglia T, Chauveau D, Hunter DR, Young DS. mixtools: an R package for analyzing mixture models. *J Stat Software*. (2010) 32:1–29. doi: 10.18637/jss.v032.i06

20. Diekmann O, Heesterbeek JAP, Metz JA. On the definition and the computation of the basic reproduction ratio R 0 in models for infectious diseases in heterogeneous populations. *J Math Biol*. (1990) 28:365–82. doi: 10.1007/BF00178324

21. Wallinga J, Lipsitch M. How generation intervals shape the relationship between growth rates and reproductive numbers. *Proc R Soc B Biol Sci*. (2007) 274:599–604. doi: 10.1098/rspb.2006.3754

22. Obadia T, Haneef R, Boëlle PY. The R0 package: a toolbox to estimate reproduction numbers for epidemic outbreaks. *BMC Med Inform Decis Making*. (2012) 12:1–9. doi: 10.1186/1472-6947-12-147

24. Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. *Am J Epidemiol.* (2013) 178:1505–12. doi: 10.1093/aje/kwt133

25. Cori A, Cauchemez S, Ferguson NM, Fraser C, Dahlqwist E, Demarsh PA, et al. Package ‘EpiEstim' (2021)

26. Batista M. On the reproduction number in epidemics. *J Biol Dyn*. (2021) 15:623–34. doi: 10.1080/17513758.2021.2001584

27. Demongeot J, Oshinubi K, Rachdi M, Seligmann H, Thuderoz F, Waku J. Estimation of daily reproduction numbers during the COVID-19 outbreak. *Computation*. (2021) 9:109. doi: 10.3390/computation9100109

28. Waku J, Oshinubi K, Demongeot J. Maximal reproduction number estimation and identification of transmission rate from the first inflection point of new infectious cases waves: COVID-19 outbreak example. *Math Comput Simulat*. (2022) 198:47–64. doi: 10.1016/j.matcom.2022.02.023

29. Marill KA. Advanced statistics: linear regression, part I: simple linear regression. *Acad Emerg Med*. (2004) 11:87–93. doi: 10.1111/j.1553-2712.2004.tb01378.x

30. Kumar N, Susan S. COVID-19 pandemic prediction using time series forecasting models. In: *2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT)*. Kharagpur: IEEE (2020). p. 1–7.

31. Ala'raj M, Majdalawieh M, Nizamuddin N. Modeling and forecasting of COVID-19 using a hybrid dynamic model based on SEIRD with ARIMA corrections. *Infect Dis Model*. (2021) 6:98–111. doi: 10.1016/j.idm.2020.11.007

32. Patibandla RL, Rao BT, Narayana VL. Prediction of COVID-19 using machine learning techniques. In: *Deep Learning for Medical Applications with Unique Data*. Academic Press (2022). p. 219–31.

33. Chauhan P, Kumar A, Jamdagni P. Regression analysis of covid-19 spread in india and its different states. *medRxiv*. (2020). doi: 10.1101/2020.05.29.20117069

34. Ekum M, Ogunsanya A. Application of hierarchical polynomial regression models to predict transmission of COVID-19 at global level. *Int J Clin Biostat Biom*. (2020) 6:27. doi: 10.23937/2469-5831/1510027

35. Mizumoto K, Kagaya K, Zarebski A, Chowell G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. *Eurosurveillance*. (2020) 25:2000180. doi: 10.2807/1560-7917.ES.2020.25.10.2000180

36. McDonald SA, Miura F, Vos ER, van Boven M, de Melker HE, van der Klis FR, et al. Estimating the asymptomatic proportion of SARS-CoV-2 infection in the general population: analysis of nationwide serosurvey data in the Netherlands. *Eur J Epidemiol*. (2021) 36:735–9. doi: 10.1007/s10654-021-00768-y

37. Lekone PE, Finkenstädt BF. Statistical inference in a stochastic epidemic SEIR model with control intervention: Ebola as a case study. *Biometrics*. (2006) 62:1170–7. doi: 10.1111/j.1541-0420.2006.00609.x

38. Castillo-Barnes D, Martínez-Murcia FJ, Ramírez J, Górriz J, Salas-Gonzalez D. Expectation-Maximization algorithm for finite mixture of α-stable distributions. *Neurocomputing*. (2020) 413:210–6. doi: 10.1016/j.neucom.2020.06.114

39. El Haj A, Slaoui Y, Solier C, Perret C. Bayesian Estimation of The Ex-Gaussian Distribution. *Stat Optim Inf Comput*. (2021) 9:809–19. doi: 10.19139/soic-2310-5070-1251

40. Slaoui Y. Recursive nonparametric regression estimation for independent functional data. *Stat Sin*. (2020) 30:417–37. doi: 10.5705/ss.202018.0069

41. Bouzebda S, Slaoui Y. Nonparametric recursive method for kernel-type function estimators for spatial data. *StatProbabil Lett*. (2018) 139:103–14. doi: 10.1016/j.spl.2018.03.017

Keywords: health statistics, parameter estimation, reproduction number, epidemiology, model, transmission rates

Citation: Manou-Abi SM, Slaoui Y and Balicchi J (2022) Estimation of Some Epidemiological Parameters With the COVID-19 Data of Mayotte. *Front. Appl. Math. Stat.* 8:870080. doi: 10.3389/fams.2022.870080

Received: 05 February 2022; Accepted: 17 June 2022;

Published: 05 August 2022.

Edited by:

Jacques Demongeot, Université Grenoble Alpes, FranceReviewed by:

Khalid Hattaf, Centre Régional des Métiers de l'Education et de la Formation, MoroccoOlumuyiwa James Peter, University of Medical Sciences, Nigeria

Copyright © 2022 Manou-Abi, Slaoui and Balicchi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Solym M. Manou-Abi, solym-mawaki.manou-abi@umontpellier.fr