Prediction of Mental Health in Medical Workers During COVID-19 Based on Machine Learning

Wang, Xiaofeng; Li, Hu; Sun, Chuanyong; Zhang, Xiumin; Wang, Tan; Dong, Chenyu; Guo, Dongyang

doi:10.3389/fpubh.2021.697850

ORIGINAL RESEARCH article

Front. Public Health, 07 September 2021

Sec. Digital Public Health

Volume 9 - 2021 | https://doi.org/10.3389/fpubh.2021.697850

This article is part of the Research TopicExtracting Insights from Digital Public Health Data using Artificial IntelligenceView all 15 articles

Prediction of Mental Health in Medical Workers During COVID-19 Based on Machine Learning

Xiaofeng Wang¹

Hu Li¹

Chuanyong Sun^1,2^*

¹Northeast Asian Research Center, Jilin University, Changchun, China
²Kuancheng Health Commission, Changchun, China
³Department of Social Medicine and Health Management, School of Public Health, Jilin University, Changchun, China

Mental health prediction is one of the most essential parts of reducing the probability of serious mental illness. Meanwhile, mental health prediction can provide a theoretical basis for public health department to work out psychological intervention plans for medical workers. The purpose of this paper is to predict mental health of medical workers based on machine learning by 32 factors. We collected the 32 factors of 5,108 Chinese medical workers through questionnaire survey, and the results of Self-reporting Inventory was applied to characterize mental health. In this study, we propose a novel prediction model based on optimization algorithm and neural network, which can select and rank the most important factors that affect mental health of medical workers. Besides, we use stepwise logistic regression, binary bat algorithm, hybrid improved dragonfly algorithm and the proposed prediction model to predict mental health of medical workers. The results show that the prediction accuracy of the proposed model is 92.55%, which is better than the existing algorithms. This method can be used to predict mental health of global medical worker. In addition, the method proposed in this paper can also play a role in the appropriate work plan for medical worker.

Introduction

Although the definition of mental health is not uniform in academic circles, the research significance of mental health is self-evident. Mental health has been widely used in psychology (1), sociology (2), psychiatry (3), pedagogy (4, 5), genetics (6), and other fields.

Currently, some representative scales are usually used to measure mental health, such as Self-reporting Inventory (SCL-90) (7), Minnesota Multiphasic Personality Inventory (MMPI) (8), Self-Rating Anxiety Scale (SAS) (9), Self-Rating Depression Scale (SDS) (10), Eysenck Personality Questionnaire (EPQ) (11), the Sixteen Personality Factor Questionnaire (16PF) (12). Above scales are widely used internationally because they are guided by various psychological theories and can transform abstract mental health concepts into observable specific indicators. However, some shortcomings are not considered in the scales mentioned above. First, the different emphasis of scale measurement leads to the differences in the evaluation criteria because many factors need to be considered in the measurement of mental status. Second, the existing way of answering the scales is self-evaluation, which inevitably makes the respondent hold something back. Third, a lot of time is spent in obtaining the results of the scale for judging mental status in emergency situations. Although the diagnosis and intervention of mental symptoms are significant, prevention is even more important. Therefore, using existing information to predict mental health is of great significance.

Mental health prediction is conductive to detecting mental disorders in advance, reducing the incidence of serious mental illnesses, and facilitating the health system to provide people with targeted health care services (13). In particular, the mental health of medical workers is seriously threatened by the global spread of COVID-19. These workers are prone to anxiety and depression (14). United Nations Secretary-General António Guterres indicated in “Message on COVID-19 and the demand for action of mental health” (15) that various mental health services must be shifted to the community and must be included in the all-people medical plan. Based on a survey conducted by WHO, COVID-19 pandemic has caused the disruption of major mental health services in 93% of countries worldwide (16). However, there are urgent demand for mental health services in many countries. In addition, the delta variants have appeared in at least 98 countries and regions, and continue to mutate and evolve. Almost all new cases of CIVID-19 are the delta variants (17), and the delta variants are becoming the main epidemic strain in many countries. The delta variant pandemic is likely to further exacerbate the fears of the public and medical workers. Therefore, predicting the potential psychological symptoms of medical workers contributes to the mental health of medical personnel, and helps maintain the high efficiency of global medical institutions.

The existing mental health prediction methods are divided into statistical model methods and artificial intelligence algorithms.

Among the statistical model methods that are used for mental health prediction, structural equation models are widely used (18–20). Moving average methods are also commonly used in health prediction. Autoregressive Integrated Moving Average model (ARIMA) (21–23) and Exponential Smoothing (ES) (24, 25) are the representative methods of the moving average model. Negative binomial model (NBM) (26, 27) and fractional polynomials (28) also provide new mentalities for predicting health. However, the statistical analysis of the data can be achieved by the above methods, but the inherent relationship between the characteristic variables and the prediction results cannot be identified by the above methods. Therefore, the accuracy of mental health prediction based on statistical models is low.

For the purpose of improving the accuracy of mental health prediction, machine learning technology has been used in the mental health prediction research since the 1980s. Basavappa et al. (29) proposed a depth-first search method according to reverse search strategy in 1996, which is used to diagnose depression or dementia. Basavappa et al. developed an expert system based on the subjects' behavior, cognition, symptoms of emotion, and neuropsychological assessment results. Gil. and Manuel (30) come up with a system according to Artificial Neural Network (ANN) and Support Vector Machine (SVM) in 2009, which is used to diagnose Parkinson's disease. The system improves the accuracy of diagnosis and reduces the cost of diagnosis. Seixas et al. (31) come up with a model called Bayesian Network (BN) in 2014, which is used to diagnose dementia and Alzheimer disease. The experimental results show that compared with most other well-known classifiers, the BN decision model has better performance. Dabek and Caban (32) proposed a neural network model in 2015, which is used to assess the possibility of suffering from psychological illnesses.

However, three problems are not solved in the algorithms mentioned above. First, statistical models are difficult to tackle the impact of random interference factors on mental health because of their limitations. Therefore, statistical models cannot reflect the high uncertainty of mental health and the non-linear relationship between feature variables and prediction results, which leads to their low prediction accuracy. Second, existing machine learning methods that are used for mental health prediction only focus on prediction accuracy without considering the impact of feature variables on the importance of mental health. Therefore, the influence weight cannot be determined according to the degree to which feature variables have effects on the prediction results. As a result, the above algorithms cannot provide a theoretical basis for the health department to work out psychological intervention plans for medical workers. Third, a large number of irrelevant or redundant features are usually included in the datasets that are used for mental health prediction. Statistical model methods merely choose significant features rather than important features, which cannot eliminate the irrelevant and redundant features in dataset that influence prediction results. Consequently, the mental health prediction methods based on statistical model not only have low prediction accuracy, but also waste computing time.

To deal with the above problems, the article proposes a novel mental health prediction algorithm called the Improved Global Chaos Bat Back Propagation Neural Network (IGCBA-BPNN). The purpose of this article is to monitor the mental health of medical workers in time to reduce the incidence of mental illness of medical workers, and to rationalize the distribution of global public health resources. Therefore, IGCBA-BPNN is applied to the mental health prediction of Chinese medical workers. The experimental results show that, compared with the existing mental health prediction methods, IGCBA-BPNN not only improves the accuracy of mental health prediction, but also selects the fewest feature variables.

The contribution of this paper is the proposal of a new mental health prediction algorithm. The proposed algorithm can predict more effectively the mental health of medical workers during COVID-19, and at the same time provides a theoretical basis for global public health departments to work out psychological intervention plans.

The remaining content of this article is arranged as followed: in section Materials and Methods, we introduced the data and methods of this research. In section Results, the effectiveness of proposed algorithm is evaluated. At last, the discussion and conclusion are illustrated in detail.

Materials and Methods

Data Preparation

Using dataset from the “Mental Health Status of Medical Workers During COVID-19” survey conducted in Changchun, Jilin Province, China from June 1, 2020 to June 7, 2020, this paper predicts the mental health of Chinese medical workers during COVID-19. The subjects of above survey are medical workers who participated in epidemic prevention and control. According to the population status and the characteristics of geographical distribution, we selected 150 grass-roots medical units from 220 grass-roots medical units in the Changchun city and then randomly selects 35 medical workers in each grass-roots medical unit. The questionnaire is conducted online and 5,260 questionnaires were obtained in this survey. Based on research need, 152 unqualified samples were eliminated and the final sample size is 5,108. There are 32 variables in the questionnaire. In the process of designing the questionnaire, we collected as much as possible the basic information of the subjects and the variables information that may affect mental status of medical workers during COVID-19. Studies have shown that the measurable factors affecting mental status mainly include the five respects of demography (33, 34), family (35, 36), employment (37, 38), lifestyle (39, 40), and work/living environment related to COVID-19 (41, 42). Based on the results of the existing literature and the actual situation of medical workers during COVID-19, 32 factors were decided.

The description of variables is presented in Table 1. The data and its description are published on GitHub (https://github.com/Hu-Li/mental-health-dataset).

TABLE 1

Table 1. The description of variables.

This study had been reviewed and approved by the Ethics Committee of the School of Public Health, Jilin University. This study does not involve questions about the identity of the respondents. An informed consent page was provided on the first page of the questionnaire for confirmation. All participants voluntarily joined this study with informed consent.

Feature Selection

Bat Algorithm

The Bat Algorithm (BA) (43) proposed by Yang is widely used in many fields because of its simplicity, fast convergence speed and few parameters. The bat algorithm has been used by many scholars for feature selection (44, 45). The excellent performance of the bat algorithm has also been verified in comparison with other most well-known algorithms such as genetic algorithm (GA) and particle swarm optimization (PSO) (46). Bat algorithm uses echolocation principles to simulate the predation process of bats. Bat algorithm is also an effective search method, and it is used to search for the global optimal solution. Original bat algorithm has three ideal hypotheses so as to simulate the predation behavior of bats:

First, bats use echolocation to perceive the distance between themselves and the target, and they can effectively distinguish targets and obstacles. Second, the ith bat flies randomly at a speed v_i in the space position x_i, and searches for targets with frequency f_i, wavelength λ and loudness A_i. Bats adjust the rate of emission of pulse r(r ∈ [0,1]) according to the distance between themselves and prey. Third, loudness changes from maximum A_max to minimum A_min.

Based on the above three ideal hypotheses, in the search space, the calculating equations of the frequency, velocity and position of bats as follows:

\begin{array}{l} f_{i} = f_{m i n} + (f_{m a x} - f_{m i n}) \times β & (1) \end{array}

\begin{array}{l} v_{i}^{t + 1} = v_{i}^{t} + (x_{i}^{t} - x_{*}) \times f & (2) \end{array}

\begin{array}{l} x_{i}^{t + 1} = x_{i}^{t} + v_{i}^{t + 1} & (3) \end{array}

where f_i is the pulse frequency of the ith bat, and f_min and f_max are the minimum and maximum value of the pulse frequency, respectively, β is a random number within [0,1], $v_{i}^{t + 1}$ is the flight speed of the ith bat at the t + 1th iteration, $v_{i}^{t}$ is the flight speed of the ith bat at the tth iteration, $x_{i}^{t}$ is the position where the ith bat stays at the tth iteration, $x_{i}^{t + 1}$ is the position where the ith bat stays at the t + 1th iteration, x_* is the optimal position of the bat in the current population.

In the process of searching for prey, the initial ultrasonic loudness of bats is large, but the emission rate is low. This helps bats search for prey in the entire space. When a bat finds prey, the loudness of volume that the bat emits is gradually reduced, and the rate of emission of pulse is gradually increased. Through the above adjustments, bats can more accurately determine the location of prey. The rate of emission of pulse and the loudness of volume that the bat emits are calculated as follows:

\begin{array}{l} r_{i}^{t + 1} = r_{i}^{0} [1 - exp (- γ \times t)] & (4) \end{array}

\begin{array}{l} A_{i}^{t + 1} = α \times A_{i}^{t} & (5) \end{array}

where $r_{i}^{t + 1}$ is the pulse emission rate of the ith bat at the t + 1th iteration, $r_{i}^{0}$ is the maximum of pulse emission rate of the ith bat, γ(γ > 0) is the enhancement coefficient of the pulse frequency, $A_{i}^{t + 1}$ and $A_{i}^{t}$ are the loudness of volume that the ith bat emits at the t + 1th iteration and at the tth iteration, respectively, α(α ∈ [0,1]) is the attenuation coefficient of the pulse loudness.

However, the bat algorithm is easy to fall into the local optimum, and the prediction accuracy of bat algorithm is low. The population initialization of the bat algorithm is randomly generated and does not have the ability to cover the entire solution space, which greatly affects the performance of the bat algorithm.

Improved Global Chaos Bat Algorithm

In order to overcome the shortcomings of the bat algorithm, Global Chaos Bat Algorithm (GCBA) (47) is introduced to eliminate redundant features and irrelevant features in the dataset. As a heuristic optimization algorithm, GCBA is used for feature selection. At first, in the initial stage, the chaotic map method is introduced to ensure the bat population traverse the entire solution space as much as possible. The chaotic map method also conducive to enriching the population diversity. Then, a fitness function based on accuracy and feature subset length is proposed to calculate the score of the feature subset after each update. Finally, GCBA selects the feature subset with the highest score from all feature subsets through the score calculated, which eliminate irrelevant features and redundant features from all feature variables.

To further improve the performance of GCBA, Improved Global Chaos Bat Algorithm (IGCBA) with higher accuracy and better performance is proposed, in which a nonlinear function based on the number of iterations is designed to balance IGCBA's exploitation and exploration capabilities. In the early stage of IGCBA, the algorithm is inclined toward the exploration capability. Global information is fully utilized to enable IGCBA to traverse the entire solution space as much as possible. In the later stage of IGCBA, the algorithm is inclined toward exploitation capability. Partial information is fully utilized to enable IGCBA to obtain the better solution through further exploitation.

Currently, the logistic method is widely used as a chaotic map method. The initial population generated by this method is diverse and can traverse the entire solution space. Therefore, in this paper, the initialization of the population is finished by using an improved logistic mapping method, and its mathematical model (48) is:

\begin{array}{l} y_{i}^{d + 1} = | 1 - 2 \times {(y_{i}^{d})}^{2} | & (6) \end{array}

where $y_{i}^{d} (i = 1, 2, \dots N, d = 1, 2, \dots D) (y_{i}^{d} \in [0, 1])$ is the chaotic variable, N is the amount of bat population, and D is the dimension of initial population. Then, the position $x_{i}^{d}$ of the bat individual in the solution space is obtained by inverse mapping of $y_{i}^{d}$ . The calculating equation of $x_{i}^{d}$ is:

\begin{array}{l} x_{i}^{d} = l_{i} + (u_{i} - l_{i}) y_{i}^{d} & (7) \end{array}

where l_i and u_i are the minimum and maximum value of the variable range, respectively.

The local optimum position of the bat and the global optimum position of the population are recorded when the position of each bat is updated. The position of the ith bat at the t + 1th iteration can be calculated as follows:

\begin{array}{l} x_{i}^{t + 1} = x_{i}^{t} + v_{i}^{t + 1} C_{1} r_{1} (P_{i} - x_{i}^{t}) + C_{2} r_{2} (P_{g} - x_{i}^{t}) & (8) \end{array}

where P_i is the local optimal position of the ith bat, P_g is the global optimal position of the bat population, r₁ and r₂ are two random numbers within [0,1].

C₁ is the control coefficient that balances the global exploration capability of IGCBA, represents the degree to which the historical optimal position of a bat individual has effect on the current state of the bat. The larger the C₁ is, the more the algorithm focuses on exploitation capability. C₂ is the control coefficient that balances the local exploitation capability of IGCBA, represents the degree to which the historical optimal position of the bat population has effect on the current state of the bat. The larger the C₂ is, the more the algorithm focuses on exploration capability.

In the preliminary stage of algorithm, it is necessary to traverse the entire solution space as much as possible to ensure that the algorithm does not converge prematurely. Therefore, in the early stage of the algorithm, C₂ should be as large as possible and C₁ as small as possible; in the later stage of the algorithm, C₁ should be as large as possible and C₂ as small as possible. In this way, the algorithm can get better performance. According to the above analysis, the calculating equation of C₁ and C₂ as follows:

\begin{array}{l} C_{1} = {\begin{matrix} e^{- (\frac{T}{2} - t) / 10} + 0.1, 0 \leq t < 40 \\ 0.0095 \times t - 0.0980, 40 \leq t < 70 \\ 4.8 + \frac{20}{loge (t + 70)}, 70 \leq t < 100 \end{matrix} & (9) \end{array}

\begin{array}{l} C_{2} = {\begin{matrix} 0.9 - e^{- (\frac{T}{2} - t) / 10}, 0 \leq t < 40 \\ 0.0095 \times t + 0.9120, 40 \leq t < 70 \\ \frac{- 20}{loge (t + 70)} - 3.8, 70 \leq t < 100 \end{matrix} & (10) \end{array}

where t represents the current iteration times, T represents the maximum iteration times.

When initializing the bat population, we use a matrix of size N × D. N is the number of bat population, D is the number of features. In this paper, a transfer equation is used to perform discrete binary operations on the bat's position. The transfer equation is:

\begin{array}{l} S (x_{i}^{d} (t)) = \frac{1}{1 + e^{- x_{i}^{d} (t)}} & (11) \end{array}

where $x_{i}^{d} (t)$ is the position of the ith bat individual in the dth dimension at the tth iteration.

The updating equation of position of the bat individual is:

\begin{array}{l} x_{i}^{d} (t) = {\begin{matrix} 0, r a n d < S (x_{i}^{d} (t)) \\ 1, r a n d \geq S (x_{i}^{d} (t)) \end{matrix} & (12) \end{array}

where rand is a random number within [0,1].

When the ith bat's position in the dth dimension at the tth iteration is 0, this bat will not be selected. When the ith bat's position in the dth dimension at the tth iteration is 1, this bat will be selected.

Back Propagation Neural Network

Back Propagation Neural Network (BPNN) is particularly suited for solving the non-linear problems (49), so it is widely used in the field of health prediction (50). In the process of back propagation of prediction errors, the connection weights and bias are constantly adjusted. Finally, the output predicted by BPNN is constantly close to the expected output.

Before using BPNN for prediction, the network needs to be trained. Through training, the network will have associative memory and predictive capabilities. The main steps of the BPNN training process are:

Step 1: Initialize the network. Based on the input and output sequence (X, Y), the number of the input layer nodes s and the output layer nodes m can be determined. The number of hidden layers and the number of the hidden layer nodes l are given by experience. The connection weight w_hj(h = 1, 2, ⋯s; j = 1, 2, ⋯l) between the input and the hidden layer, the connection weight w_jk(j = 1, 2, ⋯l; k = 1, 2, ⋯ , m) between hidden and the output layer, the hidden layer bias value a_j and the output layer bias value b_k are initialized. Given the learning rate η, the activation function g(x). In order to solve non-linear problems, the activation function usually uses the Sigmoid function, which is defined as follows:

\begin{array}{l} g (x) = \frac{1}{1 + e^{- x}} & (13) \end{array}

Step 2: The output of the hidden layer. The output H_j of the hidden layer is calculated based on the input vector X, ω_hj and a_j.

\begin{array}{l} H_{j} = g (\sum_{h = 1}^{n} ω_{h j} x_{h} + a_{j}) & (14) \end{array}

Step 3: The output of the output layer. The prediction output O_k of BPNN is calculated based on H_j, ω_jk and b_k.

\begin{array}{l} O_{k} = \sum_{j = 1}^{l} H_{j} ω_{j k} + b_{k} & (15) \end{array}

Step 4: Calculate prediction error. The prediction error of pth simple E_p is calculated based on prediction output of pth simple O_pk and expected output of pth simple Y_pk.

\begin{array}{l} E_{p} = \frac{1}{2} \sum_{k = 1}^{m} {(Y_{p k} - O_{p k})}^{2} & (16) \end{array}

Step 5: Calculate the reverse transmission value. The reverse transmission value of output layer δ_k, and the reverse transmission value of hidden layer δ_j are calculated as follows:

\begin{array}{l} δ_{k} = O_{p k} (1 - O_{p k}) (Y_{p k} - O_{p k}) & (17) \end{array}

\begin{array}{l} δ_{j} = H_{j} (1 - H_{j}) \sum_{k = 1}^{m} δ_{k} ω_{j k} & (18) \end{array}

Step 6: Update the weight. η is the learning rate, and the weight ω_hj and ω_jk are updated as follows:

\begin{array}{l} ω_{h j} = ω_{h j} + η δ_{j} x_{h} & (19) \end{array}

\begin{array}{l} ω_{j k} = ω_{j k} + η δ_{k} H_{j} & (20) \end{array}

Step 7: Update the bias value. The bias value a_j and b_k are updated based on δ_j and δ_k.

\begin{array}{l} a_{j} = a_{j} + η δ_{j} & (21) \end{array}

\begin{array}{l} b_{k} = b_{k} + η δ_{k} & (22) \end{array}

Step 7: Determine whether the algorithm iteration is over, if not, return to step 2.

Improved Global Chaos Bat Back Propagation Neural Network

Figure 1 illustrates the process of IGCBA-BPNN. First, initialize all variables. Second, IGCBA is used for feature selection to select a feature subset that can represent as much information as possible of the original features and as few numbers as possible. Existing research has proved that compared with other classifiers, SVM has higher classification accuracy (51) and better stability (52). Therefore, SVM is used to judge the quality of the feature subset selected by IGCBA. Third, the features selected by IGCBA are used as the input of the BPNN to reduce the model complexity of BPNN.

FIGURE 1

Figure 1. The flowchart of IGCBA-BPNN.

Results

Parameter Settings

Table 2 shows the parameter settings of feature selection algorithms. In binary bat algorithm (BBA) (46), GCBA, IGCBA, A is the loudness of volume that the bat emits and is set to 1.5, r is the rate of emission of pulse and is set to 0.5, f_max is the maximum value of the pulse frequency and is set to 1, and f_min is the minimum value of the pulse frequency and is set to 0. In GCBA, C₁ is the control coefficient, represents the degree to which the historical optimal position of a bat individual has effect on the current state of the bat. C₁ is set to 1.49618. C₂ is the control coefficient, represents the degree to which the historical optimal position of the bat population has effect on the current state of the bat. C₂ is set to 1.49618. In hybrid improved dragonfly algorithm (HIDA) (53), s and a are the separation weight and the alignment weight, respectively, and they are both set to 0.1. c is the cohesion weight and is set to 0.7. f and e are the food factor and the enemy factor, respectively, and they are both set to 1. w is the inertia weight and is set to 0.9. In information gain binary butterfly optimization algorithm (IG-bBOA) (54), N represents the number of butterflies and is set to 10, p is the transition probability and is set to 0.8, a is the power exponent and is set to 0.1, C is the sensory modality and is set to 0.01-0.25. α, β, and δ are set to 0.99, 0.001, and 0.009, respectively. In hyper learning binary dragonfly algorithm (HLBDA) (55), the parameters of s, a, c, f, e, and w are consistent with HIDA. The pl is the personal learning rate and is set to 0.4, and gl is the global learning rate and is set to 0.7.

TABLE 2

Table 2. The parameter settings of feature selection algorithms.

After combining BPNN with SR, BBA, HIDA, GCBA, IGCBA, IG-bBOA, and HLBDA, the relevant parameters are set in Tables 3, 4. q is the number of the hidden layer and is set to 1. p is the training goal and is set to 1,000. g is the training goal and is set to 1e-4. η is the learning rate and is set to 0.08. l is the number of the hidden layer nodes, and as a matter of experience, it is often set to half of the number of input layer nodes. The number of hidden layer nodes of SR-BPNN-4, BBA-BPNN-4, BBA-BPNN-8, HIDA-BPNN-4, HIDA-BPNN-16, GCBA-BPNN-4, GCBA-BPNN-16 and IGCBA-BPNN-4, IG-bBOA-BPNN-4, IG-bBOA-BPNN-10, HLBDA-BPNN-4, and HLBDA-BPNN-14 is set to 4, 4, 8, 4, 16, 4, 9, 4, 4, 10, 4, and 14, respectively.

TABLE 3

Table 3. Common parameter settings in BPNN.

TABLE 4

Table 4. The number of hidden layer nodes in BPNN.

Experiment Results

We make experiments to compare the IGCBA algorithm with stepwise regression (SR) (56), BBA, HIDA, GCBA, IG-bBOA, and HLBDA methods on the survey dataset in this section. At the same time, we also perform experiments to compare the IGCBA-BPNN algorithm with SR-BPNN, BBA-BPNN, HIDA-BPNN, GCBA-BPNN, IG-bBOA-BPNN and HLBDA methods on the survey dataset. Given that BPNN, K-Nearest Neigbour (KNN) (57) and decision tree (DT) (58) are important methods for classification, we also add the comparison results of BPNN with KNN and DT. Table 5 shows the experimental results.

TABLE 5

Table 5. Comparison of prediction accuracy of different algorithms.

Compared with SR, BBA, HIDA and GCBA, HIDA and HLBDA have the highest prediction accuracy followed by IGCBA. However, the number of features finally found by IGCBA is 23 and 20 fewer than HIDA and HLBDA, respectively. Besides, the number of features selected by IGCBA is also less than other methods. By comparing the performance of the feature selection algorithms, it can be proved that IGCBA can reduce the irrelevant and redundant features in the original features as much as possible without reducing the prediction accuracy of the classifier.

The prediction accuracy of SR-BPNN-4 is 0.98% higher than that of SR. The prediction accuracy of BBA-BPNN-4 and BBA-BPNN-8 is 0.19 and 0.59% higher than that of BBA, respectively. The prediction accuracy of HIDA-BPNN-4 and HIDA-BPNN-16 is 2.15 and 2.55% higher than that of HIDA, respectively. The prediction accuracy of GCBA-BPNN-4 and GCBA-BPNN-9 is 1.71 and 2.49% higher than that of GCBA, respectively. The prediction accuracy of IG-bBOA-BPNN-4 and IG-bBOA-BPNN-10 is 1.51 and 1.90% higher than that of IG-bBOA, respectively. The prediction accuracy of HLBDA-BPNN-4 and HLBDA-BPNN-14 is 1.73 and 2.32% higher than that of HLBDA. The prediction accuracy of IGCBA-BPNN-4 is 4.04% higher than IGCBA. The above experimental results prove that compared with the feature selection algorithms, the feature selection algorithms combined with BPNN can improve the prediction accuracy.

The prediction accuracy of SR-BPNN-4, BBA-BPNN-4, BBA-BPNN-8, HIDA-BPNN-4, HIDA-BPNN-16, GCBA-BPNN-4, GCBA-BPNN-9, IG-bBOA-BPNN-4, IG-bBOA-BPNN-10, HLBDA-BPNN-4, and HLBDA-BPNN-14 is 88.43, 88.62, 89.02, 90.78, 91.18, 90.20, 90.98, 90.00, 90.39, 90.39, and 90.98%, respectively. The prediction accuracy of IGCBA-BPNN-4 is 92.55%, which is 4.12, 3.93, 3.53, 1.77, 1.37, 2.35, 1.57, 2.55, 2.16, 2.16, and 1.57% higher than that of SR-BPNN-4, BBA-BPNN-4, BBA-BPNN-8, HIDA-BPNN-4, HIDA-BPNN-16, GCBA-BPNN-4, GCBA-BPNN-9, IG-bBOA-BPNN-4, IG-bBOA-BPNN-10, HLBDA-BPNN-4, and HLBDA-BPNN-14. The experimental results of combining each feature selection algorithm with BPNN prove that IGCBA-BPNN-4's performance is better than other algorithms. At the same time, the prediction accuracy of IGCBA-KNN and IGCBA-DT is 87.52 and 79.56%, respectively. The prediction accuracy of IGCBA-BPNN-4 is 5.03 and 12.99% higher than that of IGCBA-KNN and IGCBA-DT, respectively. It can be proved that BPNN is better than KNN and DT for classification on survey dataset. Therefore, IGCBA-BPNN-4 model has good applicability in predicting the mental health of medical workers in public health events.

For the purpose of better verifying the superior convergence performance of the IGCBA algorithm on the test dataset, Figure 2 shows the convergence performance of the six algorithms. By directly plotting the classification accuracy curve with the iteration times, we can see that the classification accuracy increases monotonously at each iteration until level off. Figure 2 shows that GCBA converges faster than BBA. From Figure 2, it can be analyzed that GCBA does not solve the shortcoming that BBA falls into the local optimal solution easily. IGCBA falls into the local optimal solution at the 46th iteration, and it jumped out of the local optimal solution at the 66th iteration. Although the ability of IGCBA to jump out of the local optimal solution is not as good as HIDA and HLBDA, it is significantly better than IG-bBOA, BBA and GCBA.

FIGURE 2

Figure 2. Convergence curves of the six algorithms on the survey dataset.

Figure 3 shows that the alteration trend of the number of features selected by the six algorithms in the survey dataset with the number of iterations. Since the non-linear equation balances the exploitation and exploration capabilities of IGCBA, IGCBA has strong exploitation capabilities in the later stage. Therefore, IGCBA finds fewer features at the 65th iterations. Particularly, although the prediction accuracy of HIDA and HLBDA in Figure 2 is 0.12 and 0.12% higher than that of IGCBA, the number of features finally found by IGCBA is 23 and 20 fewer than HIDA and HLBDA. Combining Figures 2, 3, the experimental results show that IGCBA has strong exploitation ability and superior performance in the later optimization stage.

FIGURE 3

Figure 3. The feature numbers curves of the six algorithms on the survey dataset.

Analysis of the Degree of Feature Variables on Mental Health

Mean Impact Value (MIV) is currently considered to be one of the best algorithms for evaluating the correlation between input variables and output variables. Sorting the variables according to MIV's absolute value can determine the degree of influence of input variables on network output variables. The symbol of the MIV value represents the relative direction, and the relative importance of the impact is represented by MIV's absolute value.

The IGCBA-BPNN-4 prediction model eliminates irrelevant and redundant features in the original dataset, decreases model running time, and improves the prediction accuracy of the classifier. At the same time, the feature variables that affect mental health are sorted according to the degree of their importance. Table 6 shows that IGCBA-BPBB-4 selects a total of nine feature variables that affect mental health. The nine feature variables are “Have patients with COVID-19 or not in the living place,” “age,” “employment type,” “Have patients with COVID-19 or not in the workplace,” “the work unit is a designated treatment point or not,” “changes in work intensity,” “usual sleep time,” “place of residence,” and “marital status.” We analyze the factors affecting mental health according to their degree of importance.

TABLE 6

Table 6. Feature variables that affect mental health: sorted by importance.

Variables in statistics are divided into numerical variables and categorical variables. When considering the impact of input variables on output variables, the direction of the symbol is only meaningful for numerical variables, and has no meaning for categorical variables. Since the variables in this article are mostly categorical variables, the positive or negative influence of the symbol is not considered in this analysis.

In the community transmission stage of the epidemic, according to a study that cluster transmission occurs in multiple communities and families. On average, each patient transmits the infection to 2.2 people (59). When relatives, friends, and nearby people in the living place are determined to be suspected or confirmed cases, people will have psychological problems such as fear and anxiety due to fear of infection.

Patients with COVID-19 are mostly elderly people. Under normal circumstances, the deterioration of body function with age decreases the health levels of the elderly. The elderly are more vulnerable to the threat of diseases because their immune system is relatively weak. In the “Questions and Answers About COVID-19 and the Elderly” on the WHO official website, a clear answer is also given to the question “Who is at risk of severe illness,” that is, the elderly and all ages of people who are diagnosed with diseases such as hypertension, heart disease, lung disease, diabetes or cancer are more likely to suffer from severe illness than others (60).

Differences in employment type lead to differences in the psychological status of medical workers. In contrast with formal medical personnel, temporarily hired medical personnel may show a stronger sense of anxiety and fear during COVID-19. On the one hand, due to the absence of both manpower capital and social capital, temporary medical workers are more likely to engage in low-tech and labor-intensive jobs. The work pressure caused by high labor intensity make easily temporary medical workers prone to anxiety and hostility. On the other hand, most temporary medical workers are exposed to such a severe epidemic for the first time. They lack the work experience and sufficient mental preparation to deal with severe infectious diseases. At the same time, due to the lack of objective cognition of COVID-19, they are in a highly alert state at work, and their anxiety and fear are more prominent.

There are patients with COVID-19 in the workplace, especially the workplace is a designated treatment point for COVID-19, which will have a greater impact on the mental status of medical workers. In face of high-intensity work pressure and the risk of being infected, medical workers are more likely to become a high-risk group with psychological symptoms. Less sleep and poor sleep during COVID-19 can cause sleep disorders, and sleep disorders are often accompanied by symptoms such as depression, tension, anxiety, hostility and irritability (61). For people who do not have a spouse, they cannot get timely help when they encounter difficulties and need a good listener. They are prone to anxiety and depression (62). The farther the place of residence is from the city center, the lower the population density. It is difficult for COVID-19 to spread rapidly in rural areas (63), and people living in rural areas have less fear of COVID-19 than people living in cities.

Discussion

According to the above observations, we can make a conclusion that the performance of IGCBA-BPNN-4 is better than other algorithms. First, BPNN learns the non-linear relationship between feature variables and prediction results, which improves the accuracy of mental health prediction. The results in Table 5 indicate that the accuracy of the feature selection algorithms combined with BPNN is higher than that of the feature selection algorithms without BPNN, with an average increase of 2.46%. Particularly, the accuracy of IGCBA-BPNN-4 is 4.04% higher than that of IGCBA. Second, the value calculated by MIV is used as the influence weight, which assesses the extent to which feature variables contribute to mental health. It can be seen from Table 6 that through the calculation of MIV, the nine feature variables that affect mental health are sorted by their importance. The top three important factors affecting mental health are “whether there are patients with COVID-19 in the workplace,” “age” and “employment type.” The result corresponds with our expectations. Third, GCBA eliminates irrelevant and redundant features in the original features, which reduces BPNN's complexity. The results in Table 5 indicate that GCBA reduces the number of features in the survey dataset from 32 to 18. Although GCBA selects more features than SR and BBA, it has higher prediction accuracy. Fourth, the non-linear equation in IGCBA balances the exploitation and exploration capabilities of IGCBA, which accelerates the convergence speed of IGCBA and prevents IGCBA from falling into a local optimal solution. It can be seen from Figures 2, 3 that IGCBA does not fall into the local optimal solution due to its certain exploration capabilities in the later stage. As a result, IGCBA obtains a feature subset that can represent as much information as possible of the original features and as few features as possible. The number of features selected by IGCBA is only half of the number of features selected by GCBA. Besides, the prediction accuracy rate of IGCBA is higher than that of GCBA.

It should be pointed out that although many people have been vaccinated against COVID-19, the COVID-19 epidemic is far from over due to the spread of mutant strains. COVID-19 directly endangers people's lives, and it is extremely important to diagnose COVID-19 quickly and accurately. The latest method proposed by Wang et al. (64, 65) may help diagnose COVID-19 more quickly and effectively. In the fight against COVID-19, when the psychological symptoms of medical workers are discovered and intervened in time, the work efficiency of the entire health system will be improved. The algorithm proposed in this article can more effectively predict the mental health of medical staff, and the research results can also be directly used by global public health departments. However, several limitations also exist in our research. First, the data in this article was obtained through an online survey, and this research is an observational study. As a result, self-report problems and recall biases are inevitable to some extent. Secondly, mental health is affected by personal, family, economic, social environment and other factors. The factors affecting mental health in this article are incomprehensive. Finally, some parameters that are set manually are used in our algorithm. The parameters of the neural network are given by experience rather than obtained from adaptive changes or learning. We will solve this problem in future work.

Conclusions

The accuracy of existing mental health prediction methods is low because the relationship between the feature variables and the prediction results is non-linear and the prediction dataset contains a lot of irrelevant and redundant features. At the same time, current mental health prediction methods cannot estimate the extent to which the feature variables are important to the prediction results. Therefore, this paper proposes IGCBA-BPNN. First, BPNN is introduced to deal with the non-linear problem between prediction results and feature variables, which improves the accuracy of mental health prediction. Second, MIV is introduced to calculate the influence weight, which assesses the extent to which feature variables contribute to mental health. Third, GCBA is introduced to eliminate redundant and irrelevant features in the original features, which reduces the model complexity of BPNN and improves the performance of BPNN. Fourth, a non-linear equation is designed in IGCBA to speeds up the convergence speed of IGCBA and prevents IGCBA from falling into a local optimal solution. Experiment results show that the performance of IGCBA-BPNN is better than existing algorithms. The IGCBA-BPNN prediction model can obtain good results in mental health prediction.

However, IGCBA only reduces BPNN's input dimension. The BPNN's structure is not improved, and the parameters in the BP network is not optimized. Therefore, how to ascertain the number of neural network nodes is an important challenge in the future.

In a word, with the development of swarm intelligence algorithms and neural network technology, the methods based on swarm intelligence algorithms combined with neural networks are playing an increasingly significant role in the field of prediction. In the future health prediction research, the prediction method based on swarm intelligence algorithm combined with neural network will have a wider application prospect.

Data Availability Statement

The datasets presented in this study can be found in an online repository: https://github.com/Hu-Li/mental-health-dataset.

Ethics Statement

The studies involving human participants were reviewed and approved by the Ethics Committee of the School of Public Health, Jilin University. The ethics committee waived the requirement of written informed consent for participation.

Author Contributions

XW and HL came up with the original idea. HL and TW designed this study and provided research methods. CS and XZ completed the data collection and performed the statistical analysis. TW conducted the experiments. XW supervised the research. HL drafted the manuscript. XW, HL, DG, and CD improved the manuscript. All authors contributed to the article and approved the final version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We sincerely thank all participants for their support in this research.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2021.697850/full#supplementary-material

References

1. Angehrn A, Sapach MJNT, Ricciardelli R, MacPhee RS, Anderson GS, Carleton RN. Sleep quality and mental disorder symptoms among Canadian public safety personnel. Int J Environ Res Public Health. (2020) 17:2708. doi: 10.3390/ijerph17082708

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Lee MH, Seo MK. Community integration of persons with mental disorders compared with the general population. Int J Environ Res Public Health. (2020) 17:1596. doi: 10.3390/ijerph17051596

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Tang J, Wu L, Huang HL, Feng J, Yuan YF, Zhou YP, et al. Back propagation artificial neural network for community Alzheimer's disease screening in China. Neural Regen Res. (2013) 8:270-6. doi: 10.3969/j.issn.1673-5374.2013.03.010

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Francis L, Barling J. Organizational injustice and psychological strain. Can J Behav Sci. (2005) 37:250-61. doi: 10.1037/h0087260

CrossRef Full Text | Google Scholar

5. Ko HC. A sustainable approach to mental health education: an empirical study using Zhuangzi's self-adaptation. Sustainability. (2019) 11:3677. doi: 10.3390/su11133677

CrossRef Full Text | Google Scholar

6. Pirooznia M, Seifuddin F, Judy J, Mahon PB, Potash JB, Zandi PP, et al. Data mining approaches for genome-wide association of mood disorders. Psychiat Genet. (2012) 22:55-61. doi: 10.1097/YPG.0b013e32834dc40d

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Derogatis LR, Melisaratos N. The brief symptom inventory-an introductory report. Psychol Med. (1983) 13:595-605. doi: 10.1017/S0033291700048017

CrossRef Full Text | Google Scholar

8. Hathaway SR, McKinley JC. A multiphasic personality schedule (Minnesota): I. construction of the schedule. J Psychol. (1940) 10:249-54. doi: 10.1080/00223980.1940.9917000

CrossRef Full Text | Google Scholar

9. Zung WWK. A rating instrument for anxiety disorders. Psychosomatics. (1971) 12:371-9. doi: 10.1016/S0033-3182(71)71479-0

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Zung WWK. A self-rating depression scale. Arch Gen Psychiat. (1965) 12:63-70. doi: 10.1001/archpsyc.1965.01720310065008

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Eysenck SBG, Eysenck HJ, Barrett P. A revised version of the psychoticism scale. Pers Indiv Differ. (1985) 6:21-9. doi: 10.1016/0191-8869(85)90026-1

CrossRef Full Text | Google Scholar

12. Cattell RB. Scree test for number of factors. Multivar Behav Res. (1966) 1:245-76. doi: 10.1207/s15327906mbr0102_10

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Center for Global Development. Mapping and Realigning Incentives in the Global Health Supply Chain. Available online at: http://www.cgdev.org/doc/DemandForecasting/Principles.pdf (accessed September 24, 2020).

14. Cosic K, Popovic S, Sarlija M, Kesedzic I, Jovanovic T. Artificial intelligence in prediction of mental health disorders induced by the COVID-19 pandemic among health care workers. Croat Med J. (2020) 61:279-88. doi: 10.3325/cmj.2020.61.279

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Inter-Agency Standing Committee. António Guterres (UN Secretary-General) on COVID-19 and the Need for Action on Mental Health. Available online at: https://interagencystandingcommittee.org/iasc-reference-group-mental-health-and-psychosocial-support-emergency-settings/antonio-guterres-un (accessed June 12, 2020).

16. World Health Organization. The Impact of COVID-19 on Mental, Neurological and Substance Use Services: Results of a Rapid Assessment. Available online at: https://www.who.int/docs/default-source/mental-health/ppt-who-covid19-mental-health-rapid-assessment-v10.pdf?sfvrsn=2f45b88a_2 (accessed August 20, 2020).

Google Scholar

17. O'Dowd A. Covid-19: cases of delta variant rise by 79%, but rate of growth slows. BMJ. (2021) 373:n1596. doi: 10.1136/bmj.n1596

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Yang X, Jin M, Zheng L. The prediction model of college students' life stressor on mental health. China J Health Psychol. (2018) 26:775-8. doi: 10.13342/j.cnki.cjhp.2018.05.001

CrossRef Full Text | Google Scholar

19. Margraf J, Zhang XC, Lavallee KL, Schneider S. Longitudinal prediction of positive and negative mental health in Germany, Russia, and China. PLoS ONE. (2020) 15:e0234997. doi: 10.1371/journal.pone.0234997

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Wilson MG, DeJoy DM, Vandenberg RJ, Richardson HA, McGrath AL. Work characteristics and employee health and well-being: test of a model of healthy work organization. J Occup Organ Psych. (2004) 77:565-88. doi: 10.1348/0963179042596522

CrossRef Full Text | Google Scholar

21. Boyle J, Jessup M, Crilly J, Green D, Lind J, Wallis M, et al. Predicting emergency department admissions. Emerg Med J. (2012) 29:358-65. doi: 10.1136/emj.2010.103531

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Champion R, Kinsman LD, Lee GA, Masman KA, May EA, Mills TM, et al. Forecasting emergency department presentations. Aust Health Rev. (2007) 31:83-90. doi: 10.1071/AH070083

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Reis BY, Mandl KD. Time series modeling for syndromic surveillance. BMC Med Inform Decis Mak. (2003) 3:2. doi: 10.1186/1472-6947-3-2

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Medina DC, Findley SE, Guindo B, Doumbia S. Forecasting non-stationary diarrhea, acute respiratory infection, and malaria time-series in Niono, Mali. PLoS ONE. (2007) 2:e1181. doi: 10.1371/journal.pone.0001181

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Hyndman RJ, Koehler AB, Snyder RD, Grose S. A state space framework for automatic forecasting using exponential smoothing methods. Int J Forecasting. (2002) 18:439-54. doi: 10.1016/S0169-2070(01)00110-8

CrossRef Full Text | Google Scholar

26. Soyiri I, Reidpath D, Sarran C. Determinants of asthma length of stay in London hospitals: individual versus area effects. Emerg Health Threats J. (2011) 4:143. doi: 10.3402/ehtj.v4i0.11179

CrossRef Full Text | Google Scholar

27. Soyiri I, Reidpath D, Sarran C. Asthma length of stay in hospitals in London 2001-2006: demographic, diagnostic and temporal factors. PLoS ONE. (2011) 6:e27184. doi: 10.1371/journal.pone.0027184

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Williams JS. Assessing the suitability of fractional polynomial methods in health services research: a perspective on the categorization epidemic. J Health Serv Res Po. (2011) 16:147-52. doi: 10.1258/jhsrp.2010.010063

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Basavappa SR, Rao SL, Harish B. Expert system for dementia/depression diagnosis. Nimhans J. (1996) 14:99-106.

Google Scholar

30. Gil D, Manuel DJ. Diagnosing Parkinson by using artificial neural networks and support vector machines. J Comput Sci Technol. (2009) 9:63-71. https://lup.lub.lu.se/record/1776690

Google Scholar

31. Seixas FL, Zadrozny B, Laks J, Conci A, Saade DCM. A Bayesian network decision model for supporting the diagnosis of dementia, Alzheimer's disease and mild cognitive impairment. Comput Biol Med. (2014) 51:140-58. doi: 10.1016/j.compbiomed.2014.04.010

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Dabek F, Caban JJ. A neural network based model for predicting psychological conditions. In: Proceedings of the 2015 8th International Conference on Brain Informatics and Health (BIH). London (2015). p. 252-61.

Google Scholar

33. Liu CY, Yang YZ, Zhang XM, Xu XY, Dou QL, Zhang WW. The prevalence and influencing factors in anxiety in medical workers fighting COVID-19 in China: a cross-sectional survey. Epidemiol Infect. (2020) 148:e98. doi: 10.1017/S0950268820001107

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Lei L, Huang XM, Zhang S, Yang JR, Yang L, Xu M. Comparison of prevalence and associated factors of anxiety and depression among people affected by versus people unaffected by quarantine during the COVID-9 epidemic in Southwestern China. Med Sci Monitor. (2020) 26:e924609. doi: 10.12659/MSM.924609

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Covinsky KE, Newcomer R, Fox P, Wood J, Sands L, Dane K, et al. Patient and caregiver characteristics associated with depression in caregivers of patients with dementia. J Gen Intern Med. (2003) 18:1006-14. doi: 10.1111/j.1525-1497.2003.30103.x

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Lu JJ, Kong JX, Song JS, Li L, Wang HM. The health-related quality of life of nursing workers: a cross-sectional study in medical institutions. Int J Nues Pract. (2019) 25:e12754. doi: 10.1111/ijn.12754

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Gomez MAL, Sabbath E, Boden L, Williams JAR, Hopcia K, Hashimoto D, et al. Organizational and psychosocial working conditions and their relationship with mental health outcomes in patient-care workers. J Occup Environ Med. (2019) 61:e480-5. doi: 10.1097/JOM.0000000000001736

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Maunder RG, Lancee WJ, Balderson KE, Bennett JP, Borgundvaag B, Evans S. Long-term psychological and occupational effects of providing hospital healthcare during SARS outbreak. Emerg Infect Dis. (2006) 12:1924-32. doi: 10.3201/eid1212.060584

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Lopresti AL, Hood SD, Drummond PD. A review of lifestyle factors that contribute to important pathways associated with major depression: diet, sleep and exercise. J Affect Disord. (2013) 148:12-27. doi: 10.1016/j.jad.2013.01.014

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Hoare E, Milton K, Foster C, Allender S. The associations between sedentary behaviour and mental health among adolescents: a systematic review. Int J Behav Nutr Phy. (2016) 13:108. doi: 10.1186/s12966-016-0432-4

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Lai JB, Ma SM, Wang Y, Cai ZX, Hu JB, Wei N, et al. Factors associated with mental health outcomes among health care workers exposed to coronavirus disease 2019. JAMA Netw Open. (2020) 3:e203976. doi: 10.1001/jamanetworkopen.2020.3976

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Lu W, Wang H, Lin YX, Li L. Psychological status of medical workforce during the COVID-19 pandemic: a cross-sectional study. Psychiat Res. (2020) 288:112936. doi: 10.1016/j.psychres.2020.112936

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Yang XS. (2010) A New Metaheuristic Bat-Inspired Algorithm. In: González J.R., Pelta D.A., Cruz C., Terrazas G., Krasnogor N. (eds) Nature Inspired Cooperative Strategies for Optimization (NICSO 2010). Studies in Computational Intelligence, vol 284. Berlin, Heidelberg:Springer, p. 65–74. https://doi.org/10.1007/978-3-642-12538-6_6

Google Scholar

44. Nakamura RYM, Pereira LAM, Costa KA, Rodrigues D, Papa JP, Yang XS. BBA: a binary bat algorithm for feature selection. In: 2012 25th SIBGRAPI Conference on Graphics, Patterns and Images. Ouro Preto: IEEE (2012). p. 291-7. https://ieeexplore.ieee.org/abstract/document/6382769.

Google Scholar

45. Rodrigues D, Pereira LAM, Nakamura RYM, Costa KAP, Yang XS, Souza AN. A wrapper approach for feature selection and optimum-path forest based on bat algorithm. Expert Syst Appl. (2014) 41:2250-8. doi: 10.1016/j.eswa.2013.09.023

CrossRef Full Text | Google Scholar

46. Mirjalili S, Mirjalili SM, Yang X. Binary bat algorithm. Neural Comput Appl. (2014) 25:663-81. doi: 10.1007/s00521-013-1525-5

CrossRef Full Text | Google Scholar

47. Cui X, Li Y, Fan J. Global chaotic bat optimization algorithm. J Northeast Univ (Nat Sci). (2020) 41:488-91. doi: 10.12068/j.issn.1005-3026.2020.04.006

CrossRef Full Text | Google Scholar

48. Kazimipour B, Li XD, Qin AK. A review of population initialization techniques for evolutionary algorithms. In: Proceedings of the 2014 IEEE Congress on Evolutionary Computation (CEC). Beijing (2014). p. 2585-92.

Google Scholar

49. Grossi E, Buscema M. Introduction to artificial neural networks. Eur J Gastroen Hepat. (2007) 19:1046-54. doi: 10.1097/MEG.0b013e3282f198a0

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Ramesh AN, Kambhampati C, Monson JRT, Drew PJ. Artificial intelligence in medicine. Ann Roy Coll Surg. (2004) 86:334-8. doi: 10.1308/147870804290

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Huang CL, Chen MC, Wang CJ. Credit scoring with a data mining approach based on support vector machines. Expert Syst Appl. (2007) 33:847-56. doi: 10.1016/j.eswa.2006.07.007

CrossRef Full Text | Google Scholar

52. Subasi A. Classification of EMG signals using PSO optimized SVM for diagnosis of neuromuscular disorders. Comput Biol Med. (2013) 43:576-86. doi: 10.1016/j.compbiomed.2013.01.020

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Cui X, Li Y, Fan J, Wang T, Zheng Y. A hybrid improved dragonfly algorithm for feature selection. IEEE Access. (2020) 8:155619-29. doi: 10.1109/ACCESS.2020.3012838

CrossRef Full Text | Google Scholar

54. Sadeghian Z, Akbari E, Nematzadeh H. A hybrid feature selection method based on information theory and binary butterfly optimization algorithm. Eng Appl Artif Intel. (2021) 97:104079. doi: 10.1016/j.engappai.2020.104079

CrossRef Full Text | Google Scholar

55. Too J, Mirjalili S. A hyper learning binary dragonfly algorithm for feature selection: a COVID-19 case study. Knowl Based Syst. (2021) 212:106553. doi: 10.1016/j.knosys.2020.106553

CrossRef Full Text | Google Scholar

56. Lucero RJ, Lindberg DS, Fehlberg EA, Bjarnadottir RI, Li Y, Cimiotti JP, et al. A data-driven and practice-based approach to identify risk factors associated with hospital-acquired falls: applying manual and semi- and fully-automated methods. Int J Med Inform. (2019) 122:63-9. doi: 10.1016/j.ijmedinf.2018.11.006

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Peng NB, Zhang YX, Zhao YH. A SVM-kNN method for quasar-star classification. Sci China Phys Mech. (2013) 56:1227-34. doi: 10.1007/s11433-013-5083-8

CrossRef Full Text | Google Scholar

58. Bui DT, Pradhan B, Lofman O, Revhaug I. Landslide susceptibility assessment in Vietnam using support vector machines, decision tree, and naive bayes models. Math Probl Eng. (2012) 2012:974638. doi: 10.1155/2012/974638

CrossRef Full Text | Google Scholar

59. Chan JFW, Yuan SF, Kok KH, To KKW, Chu H, Yang J, et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. Lancet. (2020) 395:514-23. doi: 10.1016/S0140-6736(20)30154-9

PubMed Abstract | CrossRef Full Text | Google Scholar

60. World Health Organization. Coronavirus Disease (COVID-19): Risks and Safety for Older People. Available online at: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/question-and-answers-hub/q-a-detail/coronavirus-disease-covid-19-risks-and-safety-for-older-people (accessed September 22, 2020).

Google Scholar

61. Metse AP, Fehily C, Clinton-McHarg T, Wynne O, Lawn S, Wiggers J, et al. Self-reported suboptimal sleep and receipt of sleep assessment and treatment among persons with and without a mental health condition in Australia: a cross sectional study. BMC Public Health. (2021) 21:463. doi: 10.1186/s12889-021-10504-6

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Skapinakis P, Bellos S, Koupidis S, Grammatikopoulos I, Theodorakis PN, Mavreas V. Prevalence and sociodemographic associations of common mental disorders in a nationally representative sample of the general population of Greece. BMC Psychiatry. (2013) 13:163. doi: 10.1186/1471-244X-13-163

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Eilersen A, Sneppen K. SARS-CoV-2 superspreading in cities vs the countryside. APMIS. (2021) 129:401-7. doi: 10.1111/apm.13120

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Wang SH, Satapathy SC, Anderson D, Chen SX, Zhang YD. Deep fractional max pooling neural network for COVID-19 recognition. Front Public Health. (2021) 9:1117. doi: 10.3389/fpubh.2021.726144

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Wang SH, Zhang Y, Cheng X, Zhang X, Zhang YD. PSSPNN: PatchShuffle Stochastic Pooling Neural Network for an explainable diagnosis of COVID-19 with multiple-way data augmentation. Comput Math Methods Med. (2021) 2021:6633755. doi: 10.1155/2021/6633755

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: COVID-19, mental health, prediction, machine learning, artificial intelligence, neural network, public health

Citation: Wang X, Li H, Sun C, Zhang X, Wang T, Dong C and Guo D (2021) Prediction of Mental Health in Medical Workers During COVID-19 Based on Machine Learning. Front. Public Health 9:697850. doi: 10.3389/fpubh.2021.697850

Received: 20 April 2021; Accepted: 16 August 2021;
Published: 07 September 2021.

Edited by:

Yu-Dong Zhang, University of Leicester, United Kingdom

Reviewed by:

Siyuan Lu, University of Leicester, United Kingdom
Dimas Lima, Federal University of Santa Catarina, Brazil

Copyright © 2021 Wang, Li, Sun, Zhang, Wang, Dong and Guo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chuanyong Sun, Y2h1YW55b25nc3VuQGhvdG1haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.