Can machine learning models predict maternal and newborn healthcare providers’ perception of safety during the COVID-19 pandemic? A cross-sectional study of a global online survey

Hammoud, Bassel; Semaan, Aline; Elhajj, Imad; Benova, Lenka

doi:10.1186/s12960-022-00758-5

Research
Open access
Published: 19 August 2022

Can machine learning models predict maternal and newborn healthcare providers’ perception of safety during the COVID-19 pandemic? A cross-sectional study of a global online survey

Bassel Hammoud ORCID: orcid.org/0000-0002-3464-749X¹,
Aline Semaan²,
Imad Elhajj³ &
…
Lenka Benova²

Human Resources for Health volume 20, Article number: 63 (2022) Cite this article

3519 Accesses
2 Citations
9 Altmetric
Metrics details

Abstract

Background

Maternal and newborn healthcare providers are essential professional groups vulnerable to physical and psychological risks associated with the COVID-19 pandemic. This study uses machine learning algorithms to create a predictive tool for maternal and newborn healthcare providers’ perception of being safe in the workplace globally during the pandemic.

Methods

We used data collected between 24 March and 5 July 2020 through a global online survey of maternal and newborn healthcare providers. The questionnaire was available in 12 languages. To predict healthcare providers’ perception of safety in the workplace, we used features collected in the questionnaire, in addition to publicly available national economic and COVID-19-related factors. We built, trained and tested five machine learning models: Support Vector Machine (SVM), Random Forest (RF), XGBoost, CatBoost and Artificial Neural Network (ANN) for classification and regression. We extracted from RF models the relative contribution of features in output prediction.

Results

Models included data from 941 maternal and newborn healthcare providers from 89 countries. ML models performed well in classification and regression tasks, whereby RF had 82% cross-validated accuracy for classification, and CatBoost with 0.46 cross-validated root mean square error for regression. In both classification and regression, the most important features contributing to output prediction were classified as three themes: (1) information accessibility, clarity and quality; (2) availability of support and means of protection; and (3) COVID-19 epidemiology.

Conclusion

This study identified salient features contributing to maternal and newborn healthcare providers perception of safety in the workplace. The developed tool can be used by health systems globally to allow real-time learning from data collected during a health system shock. By responding in real-time to the needs of healthcare providers, health systems could prevent potential negative consequences on the quality of care offered to women and newborns.

Peer Review reports

Introduction

In the last 20 years, coronaviruses have caused several outbreaks, such as severe acute respiratory syndrome (SARS) in 2002, and middle east respiratory syndrome (MERS) in 2012 [1]. During December 2019, several cases of respiratory distress were reported in Wuhan City in China, due to a novel coronavirus (SARS-CoV-2). Following an exponential increase in the number of cases, the World Health Organization (WHO) declared COVID-19 a “global health emergency” on January 2020, and in March 2020 a “pandemic” [2]. Globally, as of 18 May 2022, there have been more than 520 million confirmed cases of COVID-19, including more than 6.2 million deaths [3].

The impact of the COVID-19 pandemic is not limited to physical health; it has repercussions on the psychological, social, and economic level, including on healthcare infrastructure [4]. Healthcare providers are particularly vulnerable to the risks associated with COVID-19, with several studies reporting an increased prevalence of depression, anxiety, insomnia and psychological distress [5]. According to a study among healthcare workers in China, this is associated with many factors, including the rapidly increasing number of cases and deaths, the quick spread of the virus, the overwhelming workload, lack of access to personal protective equipment (PPE), absence of clear guidelines especially at the beginning of the pandemic, and fear of spreading the virus to family members [6]. A longitudinal study among healthcare providers in Argentina shows a worsening of mental health among providers who expressed concern about infection with COVID-19 [7].

Maternal and newborn healthcare providers faced personal and professional challenges during the COVID-19 pandemic as they continued to provide essential health services to women, babies and families. In the United Kingdom (UK) and Italy, many deaths were reported among midwives due to COVID-19 [8]. Few studies were conducted to document those challenges and experiences, with the majority being from high- and middle-income countries [9,10,11,12,13,14,15,16]; including one global survey of providers caring specifically for small and sick newborns [17]. With the inadequate supply of PPE, which in many settings was prioritised for healthcare providers working in COVID-19 treatment wards, maternal and newborn healthcare providers worried about their own health and were concerned over occupational exposure to COVID-19 in the workplace, and transmitting the infection to patients, family and friends [9, 12, 14, 17]. A survey conducted by the Royal College of Midwives in the UK revealed that more than half of the midwives did not feel safe to conduct home visits in April 2020 [18]. Loss of social support and increased levels of stress and anxiety were common among maternal and newborn healthcare providers during this period [13, 15, 17]. In Nigeria, the majority of maternal and newborn healthcare providers worried about stigmatisation or discrimination as a result of their potential exposure to COVID-19, and 87% experienced work-related burnout [9]. Maternal and newborn healthcare providers were additionally overwhelmed by the amount of new information and guidelines that were frequently changing in the early phase of the pandemic [17]. This was not universal, however, and in some settings, healthcare providers reported being adequately informed [15, 16].

Based on the above summary of the literature, we hypothesise that factors at various levels could influence maternal and newborn healthcare providers’ wellbeing and their perception of safety during the COVID-19 pandemic. A first factor is the perceived ability to protect themselves against infection (e.g. through the availability of PPE) [6, 9, 12, 14, 17]. Second, healthcare providers’ perceived risk of infection with COVID-19 can influence their wellbeing, and this risk can be reflected by the number of confirmed COVID-19 cases and deaths in the country and the numbers of cases and deaths among healthcare providers themselves [6, 8, 18]. Third, the wellbeing of healthcare providers depends on the perceived adequacy of information guidelines [6, 17].

During the COVID-19 pandemic, hundreds of machine learning (ML) models were built and applied to address various issues related to the pandemic, including automated diagnosis by extracting COVID-19 specific patterns from chest X-rays and CT-scans, predicting epidemiologic outbreaks, discovering therapeutics and designing novel vaccines [19,20,21], and to predict the effect of non-pharmaceutical interventions on the COVID-19 epidemiology globally [22]. Some studies applied these techniques to tackle mental health issues and psychological stressors for the general population, including for healthcare providers during the pandemic [23, 24]. However, the majority of studies assessing the mental health of healthcare providers (using ML or not) included individuals from single countries (e.g. China [25], USA [26], Turkey [27]) or from high-income countries [28]. In India, a study is planned to predict burnout among healthcare providers due to the COVID-19 pandemic using ML [29]. No study has used data from a diversity of country income groups.

The objective of this study is to use ML algorithms to create a predictive tool based on main drivers contributing to maternal and newborn healthcare providers’ perception of being safe in the workplace globally and compare its performance to standard statistical models. Specifically, we aim to identify the most salient factors contributing to perception of safety among maternal and newborn healthcare providers.

Methods

Study design and data collection

This cross-sectional study uses data collected between 24 March 2020 and 5 July 2020, during the first round of a global online survey of maternal and newborn healthcare providers during the COVID-19 pandemic. The survey targeted various cadres of maternal and newborn healthcare providers, including midwives, nurse-midwives, nurses, obstetricians/gynaecologists, neonatologists and paediatricians, among others. Participants were invited to complete the survey through personal and professional networks, and social media channels (e.g. Twitter, Facebook, WhatsApp groups, etc.). Additional details about the study design and sampling are available elsewhere [30]. The questionnaire was available in 12 languages (Arabic, Chinese, Dutch, English, French, German, Italian, Japanese, Kiswahili, Portuguese, Russian and Spanish), and it was published online using KoboToolbox’s online data collection feature [31].

Questionnaire and definitions

The questionnaire was developed by an international multidisciplinary team including health professionals, experts in health systems, maternal and newborn health epidemiologists and public health researchers. The questionnaire consisted of four main modules including questions about (1) respondents’ background information and characteristics of the facilities where they worked; (2) preparedness for the COVID-19 pandemic, including access to information and training; (3) facility-level response to the COVID-19 pandemic including setting-up screening areas and PPE availability; and (4) healthcare providers’ work-related experiences since the start of the COVID-19 pandemic, including stress levels and concerns. The full questionnaire is available on the study website [32].

In the disciplines of computer engineering/science and public health, different terminologies are used to describe similar concepts. Throughout this manuscript, we use terminologies adopted in computer engineering/science. The term “output” is used in computer engineering/science disciplines and is equivalent to the term “dependent variable” used in public health. It refers to the predicted factor which is a respondent’s perception of feeling protected from infection with COVID-19 in the workplace at the time of the survey. This was collected on a 5-point Likert scale: (1) not at all protected, (2) minimally protected, (3) some protection, (4) well protected, and (5) completely protected. The term “features” or “model inputs” used in computer engineering/science is equivalent to the term “explanatory variables” or “independent variables” in public health and refers to the factors that are fixed and used to predict/explain the output (listed in Additional file 1).

A few features were added to the dataset after data collection was completed. These capture characteristics of the countries where respondents worked. The country income level variable (high-income, middle-income, low-income countries) was defined using the World Bank classification of the worlds’ economies (according to 2019 gross national income) [33]. Another economic indicator from the World Bank database was the gross domestic product per capita expressed in current international dollars for the year 2019 [34]. The national estimates of the maternal mortality ratio (MMR) were added based on the WHO’s 2017 estimates [35]. National level events relevant to the COVID-19 pandemic, including cumulative number of cases and deaths, lockdowns, curfews, domestic and foreign travel bans, mask mandates were sourced from the Oxford COVID-19 Government Response Tracker, Blavatnik School of Government, University of Oxford [36]. These data were merged with the survey data based on the country and the date of data collection. The complete list of features, including their data sources (n = 71 features; 41 from survey, 23 from the Government Response Tracker, 4 from the WHO COVID-19 dashboard, 2 from the World Bank database; 1 from the WHO estimates on maternal mortality) is provided in Additional file 1.

Data management

Missing answers

Features collected through the survey to which more than 30% of the respondents did not provide answers were removed from the dataset. Furthermore, respondents who had at least one missing feature were excluded from analysis. The remaining dataset contained a total of 941 respondents out of 1641 submissions originally made.

Data pre-processing

To deal with class imbalance, the data were augmented using Synthetic Minority Oversample Technique (SMOTE) from the Imbalanced-Learn Library [37]. The technique consists of oversampling examples in the minority class (the class of the output with fewest individuals), by randomly selecting an instance from this class, choosing a certain number of nearest neighbours to that instance and interpolating new datapoints between the selected neighbours in the feature space. This leads to an augmented dataset with balanced classes of the output. The features were then appropriately encoded based on their type: one-hot encoding [38] for categorical features and ordinal encoding for ordinal features. Numerical features were standardised using Scikit-learn Standard Scaler (features become centred around their mean with a unit standard deviation), to allow faster convergence of the models. The dataset was then randomly split into a training set (80% of sample) used to train models and a testing set (20%) used to test the performance of different models for new respondents.

Data analysis

Machine learning models

Two different approaches were used to predict the output: classification and regression. In classification models, the output was employed as a categorical variable, and the goal was to train the model to predict a discrete class of the output to which the respondent belongs. In regression models, the output feature was employed as a continuous variable, and the goal was to predict a decimal score from 1 to 5 reflecting the output.

We built, trained and tested five machine learning models, and compared them to the conventional statistical methods (Linear Regression and Logistic Regression). The 5 ML algorithms (sequence of steps that lead to the model when implemented on the data) used are: Support Vector Machine (SVM), Random Forest (RF), XGBoost, CatBoost and Artificial Neural Network (ANN). These models were chosen due to their predictive abilities in healthcare settings in general and in public health and mental health applications in particular [39,40,41,42,43]. The models are described individually in detail in Additional file 2.

Hyperparameters tuning

To build a robust ML model, the optimal set of “hyperparameters” should be identified. Hyperparameters are a group of tuneable variables related to the architecture of the model and not learned from the dataset (unlike “parameters” which refer to the group of variables that are learned from the dataset during the training process). In our study, in order to determine the best hyperparameters, a grid search was performed for each model based on its performance, reflected by the accuracy. In addition, to validate the results and assure better generalisability, a tenfold cross-validation was performed. Finally, the best model was extracted to be tested on the testing set.

Training and testing

To predict to which extent maternal and newborn healthcare providers felt protected in the workplace, two sets of experiments were conducted. In each set, several ML models were trained and tested for a particular task.

Experiments set 1: classification models

Experiment 1A—classification with all features. Several classification models (Logistic Regression, SVM, RF, XGBoost, CatBoost, ANN) were trained and tested to predict a discrete class of the output describing the feeling of protection of the healthcare providers during the pandemic. After training and testing the models, features’ importance was extracted from the RF model, to determine the features that contribute the most to the prediction of the feeling of protection among maternal and newborn healthcare providers in the pandemic.

Experiment 1B—classification with selected features. Experiment 1A was repeated using only the top 10 selected features from the RF model for training. The RF was used for two reasons: (1) its tree-based strategy naturally ranks the features by how well they maximise the gain of information (or minimise the error) and contribute to the prediction, and (2) because it’s widely used in the literature for feature selection, especially for medical applications [44, 45]. This experiment was conducted to compare the performance of the models when trained using only the 10 most important features, to that of the models trained using all 71 features.

Experiments set 2: regression models

Experiment 2A—regression. Several regression models (Linear Regression, SVM, RF, XGBoost, CatBoost and ANN) were trained and tested to predict a decimal score from 1 to 5 reflecting the feeling of protection among healthcare providers during the pandemic. Features’ importance is once again extracted from the RF Regression model after training and compared to the results obtained in Experiment 1A.

Experiment 2B—regression with selected features. Experiment 2A was repeated using only the top 10 selected features from the RF model for training.

Performance metrics

Accuracy was used to evaluate and compare the performance of the classification models (Experiments 1A and 1B). The accuracy was chosen instead of the F1 score since the data became balanced after oversampling. On the other hand, root mean square error (RMSE) was used as a performance metric for the regression models (Experiments 2A and 2B). Equations (1) and (2) show the mathematical equations of accuracy and RMSE, respectively.

Equation 1 . Mathematical formula for accuracy

$$Accuracy= \frac{Number \, of \, correct \, predictions}{Total \, number \, of \ predictions}.$$

Equation 2 . Mathematical formula for RMSE

$$RMSE=\sqrt{\frac{1}{N} \sum_{i=1}^{N}{({y}_{i}-\widehat{{y}_{i}})}^{2},}$$

where $N$ is the number of elements in the sample, ${y}_{i}$ is the true value of the output of the ith element and $\widehat{{y}_{i}}$ is the predicted value of the output of the ith element.

Results

Sample description

The 941 respondents included in this analysis were from high-income countries (73%), middle-income (22%) and low-income countries (5%). The complete distribution of respondents across the 89 unique countries is available in Additional file 3. Table 1 displays the characteristics of respondents by country income group. Overall, half of respondents were midwives/nurse-midwives/nurses (50%), followed by obstetricians/gynaecologists (27%), and neonatologists/paediatricians (13%). About a third of respondents provided both inpatient and outpatient care services, and 22% provided at least two inpatient care services. More than half the respondents were team members (54%), followed by head of team (12%), head of department or ward (10%) and head of facility (6%). Overall, the majority of respondents were female (80%).

Table 1 Characteristics of the respondents (n = 941)

Full size table

Table 2 shows the distribution of respondents according to the characteristics of the facility where they primarily work. Most of the respondents worked in referral hospitals and district/regional hospitals (39% and 30%, respectively). About 70% of respondents worked in public facilities, and 15% worked in private facilities.

Table 2 Characteristics of the facilities where respondents’ mainly work (n = 941)

Full size table

Figure 1 shows the distribution of respondents by perception of being protected in the workplace, on a 5-point Likert scale, by country income group. The majority of respondents in low-income countries (74%) reported feeling minimally or not at all protected, whereas this was reported by 36% and 22% of respondents from middle- and high-income countries, respectively. None of the respondents in low-income countries reported feeling completely protected, whereas this was reported by 3% and 5% of respondents in middle- and high-income countries.

Hyperparameters

Table 3 summarises the set of hyperparameters for different models. Note that for the XGBoost and CatBoost models, the default hyperparameters were used. The ANN used is a multilayer perceptron composed of four layers: an input layer with one neuron for each feature (total of 71 neurons), two hidden layers with 120 neurons each and Relu activation function, and an output layer. Categorical cross-entropy loss with Adamax optimizer are set, and a batch size of 32 with 150 epochs are used for training.

Table 3 Set of hyperparameters used for support vector machine and random forest models in experiments 1A, 1B and 2

Full size table

Experiment 1: classification

Experiment 1A: classification with all features

Figure 2A illustrates the accuracies of different ML models obtained by tenfold cross-validation, using a subset of the training set as a validation set. The best performing model was the RF (82%) followed by the ANN (80%), XGBoost (79%), CatBoost (79%) and SVM (78%). All models demonstrated better performance when compared to the conventional statistical technique, i.e. Logistic Regression, with an accuracy of 68%. This also applies to the performance on the testing set (the set of data that was obtained initially from the training–testing split and was never used during the training process), shown in Table 4.

Table 4 Accuracies of different models from Experiment 1A (classification with all features) on the testing set

Full size table

If we examine the confusion matrix of the RF model on the testing set (Fig. 2B), we notice that the number of missed classifications is low for classes 1 (“Not at all protected”) and 5 (“Completely protected”): 1 out of 38 (2.6%) and 0, respectively. This percentage progressively increased when we move centrally (to the middle classes) to achieve 45% of erroneous predictions for individuals belonging to class 3 (“Some Protection”). When compared to the Logistic Regression, the same pattern of distribution of misclassifications was present with overall higher percentages of error: 24% and 0% for classes 1 and 5, respectively, and the percentage increased centrally with up to 61% of missed classifications for class 3 (Fig. 2C). In addition, more predictions in the LR model are missing the class by more than 1 class (e.g. predicts 1 or 5 instead of 3), than with the RF: 12 versus 3, respectively. For instance, in class 3, we had 4 predictions that missed by 2 classes in the LR models, versus none in the RF model.

Feature extraction

The RF model was used to extract features that contributed the most to the classification. Figure 2D shows a list of features sorted based on their relative contribution to the classification of the output. The most salient feature was the knowledge of what to do in case of receiving a maternity patient confirmed with COVID-19. Other important features were: respondent reporting that their facility addressed concerns of healthcare providers, the perception that that the information provided by their health facilities has value in making respondents feel safe, is helpful in their daily work, and is clear, availability of sufficient PPE (masks, aprons), the cumulative and daily number of COVID-19 cases at the national level at the time of the survey, and the cumulative number of deaths due to COVID-19 at the national level. In other words, those features were found to be the primary predictors for classification.

Experiment 1B: classification with selected features

Figure 3 and Table 5 show the tenfold cross-validated and the testing set accuracies of different machine learning models, respectively, after training the model using only the 10 most important features extracted from the RF model in experiment 1A (classification with all features).

Table 5 Accuracies of different models from Experiment 1B (classification with selected features) on the testing set

Full size table

Based on these results, the RF, XGBoost and CatBoost were the top performing models with tenfold cross-validated accuracies ranging from 74 to 77% and testing accuracies ranging from 76 to 81%; whereas, the ANN and SVM models scored lower with 65% and 62%, respectively. However, all the models performed better than LR that had a cross-validated accuracy of 57%.

When compared to the results of experiment 1A (classification with all features), the models were less performant when they were trained using only top 10 features rather than the entire dataset with 71 features. However, not all the models were affected the same way. In fact, the LR model was affected with a drop of 11% in accuracy (from 68% in experiment 1A to 57% in experiment 1B), the SVM with a drop of 16% and the ANN with a drop of 15%. On the other hand, the drop was generally much smaller in the RF, XGBoost and CatBoost models: 6%, 5% and 3%, respectively, which was expected since we the feature selection was based on the RF and since XGBoost and CatBoost also use tree-based strategy for classification.

Experiment 2

Experiment 2A: regression with all features

Figure 4.A shows the tenfold cross-validated RMSE (using each time a subset of the training set as a validation set) and the testing set RMSE (testing set obtained by train-test split and never used during training) of Linear Regression and machine learning models. The tenfold cross-validated RMSE of the Logistic Regression was 0.65. However, it was lower (reflecting a better performance) for all machine learning models, ranging between 0.46 for Catboost and 0.61 for SVM.

Feature extraction

Just like in Experiment 1A (classification with all features), top features were extracted from the RF Regression model, and are shown in Fig. 4B, sorted based on their contribution to the predictions. The two most important features for regression were facilities addressing the concerns of healthcare providers and knowledge of what to do in case of receiving a maternity patient confirmed with COVID-19. Other features included availability of sufficient PPE, facilities providing information that are considered helpful by respondents, the perception that healthcare providers are valued by their community, the national MMR, the level of healthcare of the institution, and the cumulative number of COVID-19 cases and deaths of the country at the time of the survey. The least contributing features include restrictions applied at the country-level, country income group, whether the facility received referrals or has an intensive care unit, and the respondents’ gender.

The classification and regression models yielded almost the same list of salient features, with slight changes in ranking.

Experiment 2B: regression with selected features

Figure 4C shows the tenfold cross-validated and the testing set RMSE of different machine learning models, after training the model using only the 10 most important features extracted from the RF model in experiment 2A (regression with all features). Even when trained on a subset of the features, machine learning models, with cross-validated RMSE ranging between 0.5 for CatBoost and 0.6 for ANN, outperformed Logistic Regression, with cross-validated RMSE of 0.72.

When compared to the results of Experiment 2A (regression with all features), all the models are less performant when trained on 10 features only, except the SVM which improves (RMSE of 0.53 vs 0.61 in Experiment 1A (classification with all features)).

Discussion

This study was conducted by a multidisciplinary team representing computer engineering/science, clinical sciences, and public health. Therefore, our interpretations cover two distinct areas: lessons about the application of the ML method and implications for maternal health service provision. We discuss both in turn.

This study explored the potential for using ML models to predict the perception of being safe in the workplace among maternal and newborn healthcare providers during the COVID-19 pandemic. Our analysis shows that ML models perform better than conventional statistical methods in terms of accuracy and margins of error. This was the case for all the models across different experiments, with the RF, XGBoost and CatBoost being the most robust models. By analysing the confusion matrices of the Logistic Regression model and the RF model from experiment 1A (classification with all features), we notice that (1) ML models (and particularly RF in this case) have overall a better performance when compared to the conventional techniques and are less likely to make large class prediction deviation, and (2) the likelihood of misclassification errors in the prediction process increases as we move to the middle class (i.e. class 3). This has an important significance on the interpretation of the Likert scale output, as the feeling of being protected is a subjective perception that results, just like any other human perception, from the complex interaction between environmental, genetic, biologic, and psychosocial factors, and this complexity is difficult to capture accurately in surveys. Despite that, ML models, due to their architecture and algorithms, are capable of more accurately capturing these interactions and this explains why the number of erroneous predictions is lowest for individuals belonging to class 0 (Not at all protected) and 5 (Completely Protected), and highest for individuals belonging to class 3 (Some Protection), because the former are certain about their feeling while the latter have already a certain level of uncertainty.

Experiment 1B (classification with selected features) also shows that some ML models (RF, XGBoost and CatBoost) are capable of making accurate predictions when trained on a small number of features without losing much accuracy, which is not the case for conventional statistical models. This is particularly important because it allows the use of such a tool to screen for the perception of feeling protected among healthcare providers without needing to collect a large number of features (fewer questions).

Experiments 2A (regression with all features) and 2B (regression with selected features), on the other hand, attempt to solve the same problem using regression. These experiments are implemented for several reasons. First, by considering the output as a continuous variable, we are capable of representing the perception of being protected as a spectrum which is more realistic than the discrete categories. Second, this allows to quantify the exact amount of error at the individual level to avoid under or overestimation of the model’s performance. For instance, if the classification model predicts 2 instead of 3, we cannot detect how far the model was from making the correct prediction, whereas in the regression model, we are able to quantify the error. Third, re-iterating the problem using a different ML model, contributes to confirming the validity of the models when similar results are obtained from the various models; which was the case in this study. The results of the experiments show that even when the problem is solved using regression, ML models are more robust at making the predictions than conventional techniques, with a mean error of 0.5 class.

By applying the RF algorithm, we are able to extract and rank features by the extent to which they contribute to the prediction of healthcare providers’ feeling of protection in the workplace. The findings from both experiments were cross-validated by comparing the features’ rankings between both experiments. The top ten features in both experiments 1A (classification with selected features) and 2A (regression with selected features) were classified in three main themes: (1) information accessibility, clarity and quality; (2) availability of support and means of protection; and (3) COVID-19 epidemiology at the national level. The three themes are discussed below in detail.

1—Information accessibility, clarity and quality

Features belonging to this theme include healthcare providers’ knowledge on what to do in case of having a COVID-19 maternity case (ranked 1 and 2, respectively, in both experiments), and healthcare providers’ perception of the information that they received from the facility regarding COVID-19 and maternity care (in terms of value in feeling safe, helpfulness in daily work, and clarity). This suggests that access to information and knowledge, particularly clear information and feasible recommendations, plays a key role in the morale of maternal and newborn healthcare providers. Our results also highlight that the quality of the information received relative to each healthcare providers’ needs and perceptions, has an important contribution to healthcare providers’ attitudes and wellbeing. Previous studies, at global and national levels, show that healthcare providers struggled with the lack of knowledge, guidance and prevailing uncertainty during the early days of the pandemic [15, 17, 30]. Particularly in the case of maternity care, global guidelines and recommendations took some time to be established, and evidence regarding the risk of COVID-19 for women and newborn continues to emerge to this day [46]. This lack of clarity can be stressful for those providing care to women and newborns in these uncertain circumstances [47], and be translated as perceptions of unsafety when providing care. On the other hand, some facilities established clear guidelines on referring women with confirmed COVID-19 to other facilities or to COVID-19 treatment centres. This could have contributed to a perception of low exposure to COVID-19 risks among healthcare providers working in the referring facilities and consequently a perception of protection in the workplace. Future studies exploring whether differences in perception of protection exist between healthcare providers who work in facilities that refer COVID-19 obstetric cases and those who treat them on site.

2—Availability of support and means of protection

Two main features were grouped to represent the support received from the health facility where healthcare providers work: whether the facility addressed their concerns (ranked 2 and 1, respectively, in both experiments), and the availability of sufficient PPE (masks and aprons). Healthcare providers are a core building block of the healthcare system, and providing quality care can only be achieved when human resources are empowered and supported. The healthcare system must be responsive and adaptive to the needs of its workforce and therefore able to address their concerns and worries, regularly and in times of crises [48]. Globally, PPE shortage was a significant issue in the early days of the pandemic for all cadres of healthcare providers. Essential healthcare providers such as maternal and newborn care workers who were not caring directly for COVID-19 patients, may have experienced this shortage more acutely, as they might not have been prioritised to receive PPE and had to continue providing clinical care. Research showed that this was a source of concern for maternal and newborn healthcare providers as many of them worried about their own safety and becoming infected with COVID-19 in the workplace as a result of the lack of PPE [9, 12, 14, 17]. Additionally, the mere availability of PPE is not sufficient, and maternal and newborn healthcare providers must have access to appropriate support and training on PPE use. This includes training on adequate donning and doffing, as well as learning to provide empathic care while wearing them [14, 47]. In our survey, these questions were specific to support received from the health facility where respondents worked. Nonetheless, it is worth mentioning that the support that health facilities can provide is conditional upon the support and resources that facilities receive from higher structures in the healthcare system, nationally and globally. For example, facilities cannot ensure PPE availability to care providers if there is a national and global shortage. Additionally, facilities cannot communicate guidelines and information to frontline care providers unless those have been officially issued by health authorities. Therefore, the interpretation of these features as a responsibility of health facilities should be made with caution, as we consider the responsiveness of health facilities to be a mere reflection of the responsiveness of the healthcare system.

3—COVID-19 epidemiology at the national level

Features grouped under this theme represent the level of spread of the COVID-19 outbreak at the country level including the cumulative number of COVID-19 cases and deaths due to COVID-19 and the daily number of cases reported on the day of data collection. Our results show that the extent of the transmission of the virus contributes to the prediction of healthcare providers’ perception of protection in the workplace. Healthcare providers, much like the rest of the community, are sensitive to these kinds of changes at the national level, and it is reflected in their attitudes in the workplace. The higher the number of COVID-19 cases and deaths in the community, the higher the likelihood of having to provide care to women confirmed or suspected with COVID-19. This influences the level of risk perceived by healthcare providers and their perceptions of being protected in the workplace. These values are publicly available data at the national level, making the prediction of the output at the individual level easier to achieve.

Least contributing factors

Further analysis reveals that restriction measures applied at the national-level are among the least contributing factors to the prediction of the outcome. In a previous analysis using qualitative data from the same survey conducted at a time point further into the pandemic, we identified that maternal and newborn healthcare providers’ perception of being safe was linked to the extent of the COVID-19 restrictions applied at the country-level [49]. However, the results from the current quantitative analysis contradict our qualitative findings. This shows that ML analysis, although can be valuable in informing a rapid response, can be supplemented by qualitative data in order to represent a clearer, more in-depth assessment of the wellbeing of healthcare providers in emergency situations. The country-income group also had a minimal contribution in predicting healthcare providers’ safety feeling. This highlights the need to consider healthcare providers’ wellbeing in various context, particularly considering the gap in research conducted in low- and middle-income countries on this issue. Some facility-level characteristics, such as the reception of referrals or the presence of an intensive care unit were also among the least contributing factors to the outcome. Although higher level facilities have been given the responsibility to handle COVID-19 cases in many countries, healthcare providers in lower level facilities have had similar experiences of safety perception as those working in higher level facilities. The gender of the healthcare providers was also a minimally contributing factor to the perception of safety feeling. This finding may warrant further exploration in future studies designed to unpack gendered differences in the impact of the pandemic on maternal and newborn healthcare providers, the majority of whom are women.

Strengths and limitations

This is one of the first studies that uses ML to develop an algorithm that predicts maternal and newborn healthcare providers’ feeling of protection in the workplace during the early phases of the COVID-19 pandemic, using data collected through an online survey. This work is one of the few applications of ML models to subjective survey data, and despite the large number of limitations and assumptions associated with analysing “perceptions and opinions” quantitatively, the results are promising and the method has a relatively high level of accuracy (81%).

Nonetheless, with the application of ML in public health research, the results must not be taken at face value, and must be interpreted with caution [50]. To ensure the relevance of our findings beyond numbers, and to confirm the validity of the applied methods we adopted two approaches: (1) a cross-comparison of the features identified in two experiments, which shows that most features exist in the top 10 across the two experiments (convergent validity); and (2) and a thorough qualitative interpretation of the top-ranked features contributing to the prediction of the output in light of pre-existing literature and knowledge, which supported/confirmed the conceptual validity of the tool. This process highlights the importance of the multidisciplinary collaboration between computer engineering/science and public health, which leveraged the value of the work and validated the findings from different perspectives.

One possible limitation of our work is that additional features that could have contributed to the prediction of the output were absent from the analysis. This includes information that was not initially collected in the survey such as personal features (e.g. age, years of experience, experience with previous outbreaks and disruptive events) and individual risk-factors for COVID-19. Other information was collected in the survey but in an open-ended manner, and therefore were not included in this analysis, such as being re-assigned to COVID-19 treatment wards, being diagnosed/suspected to have COVID-19, colleagues diagnosed with COVID-19 or the number of deaths due to COVID-19 among healthcare providers at the country level, etc. Future applications of this tool should consider expanding the list of features, including an additional feature on the availability of COVID-19 vaccines to healthcare providers.

The study’s sampling technique and online data collection meant that the data are not representative of the healthcare provider population, and we acknowledge the potential of a selection bias given that there was no sampling frame for the global study participants. Additionally, many respondents to the original sample were excluded from the final analysis because they had incomplete fields or missing information, which could have affected the sampling bias. Information bias could also exist in the data, particularly related to the quality of reporting national estimates of the number of COVID-19 cases and deaths.

The scope of our work and survey and research area is limited to maternal and newborn healthcare providers. There is potential to evaluate such advanced methods in research related to other cadres of healthcare providers, including those who are at the frontline of providing care to COVID-19 patients.

This study provides factors that predict the perception of safety among a global sample of healthcare providers who work in different settings. It was not possible to assess context-specific factors that could predict the outcome differentially based on the country setting or income-group because of the small size of the sub-samples. Future developments of ML models at the country-level can unpack context-specific factors that can be addressed at the local level, particularly for low- and middle-income countries.

Finally, it is important to mention that we do not underestimate the utility and importance of conventional techniques, but rather embrace both techniques and take advantage of their strengths based on the problem to be solved. For instance, for some problems with small datasets, conventional techniques offer a fast and cost-effective solution, whereas for complicated problems with large datasets and nonlinear interaction between different variables, machine learning algorithms might offer a better alternative.

Conclusion

The COVID-19 pandemic has challenged health systems globally, not only by having to respond to an overwhelming number of COVID-19 cases, but also by having to adjust quickly to severe restriction measures and their impact on the health workforces (quarantine, isolation, sickness or death, inability to reach the workplace, etc.). Our study shows that both pandemic-related and health system-related factors contributed significantly to maternal and newborn healthcare providers’ perceptions of feeling safe during the pandemic. According to the WHO quality of care framework, “competent and motivated human resources” are essential for ensuring quality care to women and newborns [51]. It is critical to prioritise the wellbeing of maternal and newborn healthcare providers, by ensuring they have adequate access to up-to-date, clear, and practical information, and essential means of protection during the COVID-19 pandemic [47].

The tool developed in this study can have two applications: on an individual level, it can inform the development of a future screening tool for perceptions of being safe among maternal and newborn healthcare providers; and it could be used as a simulation model to assess the impact of personal, facility-based, health systems related and policy-level measures on the perception of being safe among maternal and newborn healthcare providers.

The latter application can be used in healthcare settings (either in health facilities or within professional organisations) to guide policy and planning during shocks to the healthcare systems, including the ongoing COVID-19 pandemic. This tool could have the ability to better leverage real-time insights and translate them to preventive interventions efficiently and rapidly, with a specific focus on the wellbeing of healthcare providers [50]. By responding in real-time to the needs of healthcare providers, the health system could prevent potential negative consequences on the quality of care offered to women and newborns.

Availability of data and materials

Due to ethical constraints, the data underlying this analysis cannot be made publicly available. The dataset cannot be completely de-identified without removing key variables such as country, cadre, facility level, facility sector, area type. This de-identification would limit the value of the dataset, making any replication of the analysis impossible. Data requests can be sent to the study PI Prof Lenka Benova at lbenova@itg.be and the ethics committee at the Institute of Tropical Medicine at irb@itg.be.

Abbreviations

AI:: Artificial intelligence
ANN:: Artificial Neural Network
HIC:: High-income country
LIC:: Low-income country
ML:: Machine learning
MMR:: Maternal mortality ratio
MERS:: Middle east respiratory syndrome
MIC:: Middle-income country
PPE:: Personal protective equipment
RF:: Random Forest
SARS:: Severe acute respiratory syndrome
SVM:: Support Vector Machine
SMOTE:: Synthetic minority oversample technique
UK:: United Kingdom
WHO:: World Health Organization

References

Rodriguez-Morales AJ, Bonilla-Aldana DK, Balbin-Ramon GJ, Rabaan AA, Sah R, Paniz-Mondolfi A, Pagliano P, Esposito S. History is repeating itself: probable zoonotic spillover as the cause of the 2019 novel coronavirus epidemic. Infez Med. 2020;28:3–5.
CAS PubMed Google Scholar
Goel S, Hawi S, Goel G, Thakur VK, Agrawal A, Hoskins C, Pearce O, Hussain T, Upadhyaya HM, Cross G. Resilient and agile engineering solutions to address societal challenges such as coronavirus pandemic. Mater Today Chem. 2020;17: 100300.
Article CAS PubMed PubMed Central Google Scholar
WHO coronavirus disease (COVID-19) dashboard. https://covid19.who.int/.
Şahada A, Tekindor AN, Abbadi MB, Malluhi MA, HURİ PY. Role of biomedical engineering during COVID-19 pandemic. Nat Appl Sci J. 2020;3:1–16.
Google Scholar
Pappa S, Ntella V, Giannakas T, Giannakoulis VG, Papoutsi E, Katsaounou P. Prevalence of depression, anxiety, and insomnia among healthcare workers during the COVID-19 pandemic: a systematic review and meta-analysis. Brain Behav Immun. 2020;88:901–7.
Article CAS PubMed PubMed Central Google Scholar
Lai J, Ma S, Wang Y, Cai Z, Hu J, Wei N, Wu J, Du H, Chen T, Li R. Factors associated with mental health outcomes among health care workers exposed to coronavirus disease 2019. JAMA Netw Open. 2020;3:e203976–e203976.
Article PubMed PubMed Central Google Scholar
López Steinmetz LC, Herrera CR, Fong SB, Godoy JC. A longitudinal study on the changes in mental health of healthcare workers during the COVID-19 pandemic. Psychiatry. 2022;85:56–71.
Article PubMed Google Scholar
Coxon K, Turienzo CF, Kweekel L, Goodarzi B, Brigante L, Simon A, Lanau MM. The impact of the coronavirus (COVID-19) pandemic on maternity care in Europe. Midwifery. 2020;88:102779–102779.
Article PubMed PubMed Central Google Scholar
Ameh C, Banke-Thomas A, Balogun M, Makwe CC, Afolabi BB. Reproductive maternal and newborn health providers? Assessment of facility preparedness and its determinants during the COVID-19 pandemic in Lagos, Nigeria. Am J Trop Med Hyg. 2021;104:1495–506.
Article CAS PubMed PubMed Central Google Scholar
Baumann S, Gaucher L, Bourgueil Y, Saint-Lary O, Gautier S, Rousseau A. Adaptation of independent midwives to the COVID-19 pandemic: a national descriptive survey. Midwifery. 2021;94: 102918.
Article PubMed Google Scholar
Bradfield Z, Hauck Y, Homer CS, Sweet L, Wilson AN, Szabo RA, Wynter K, Vasilevski V, Kuliukas L. Midwives’ experiences of providing maternity care during the COVID-19 pandemic in Australia. Women Birth. 2022;35:262–71.
Article PubMed Google Scholar
Bradfield Z, Wynter K, Hauck Y, Vasilevski V, Kuliukas L, Wilson AN, Szabo RA, Homer CSE, Sweet L. Experiences of receiving and providing maternity care during the COVID-19 pandemic in Australia: a five-cohort cross-sectional comparison. PLoS ONE. 2021;16: e0248488.
Article CAS PubMed PubMed Central Google Scholar
Erin R, Bayoğlu Tekin Y: Psychosocial outcomes of COVID-19 pandemic on healthcare workers in maternity services. J Psychosom Obstetr Gynecol 2021:1-7.
Huysmans E, Audet C, Delvaux T, Galle A, Semaan A, Asefa A, Benova L. How COVID-19 challenged care for women and their newborns: a qualitative case study of the experience of Belgian midwives during the first wave of the pandemic. medRxiv. 2021052121257440 2021.
Rimmer M, Al Wattar B, Members U. Provision of obstetrics and gynaecology services during the COVID-19 pandemic: a survey of junior doctors in the UK National Health Service. BJOG Int J Obstetr Gynaecol. 2020;127:1123–8.
Article CAS Google Scholar
Szabo RA, Wilson AN, Homer C, Vasilevski V, Sweet L, Wynter K, Hauck Y, Kuliukas L, Bradfield Z. Covid-19 changes to maternity care: Experiences of Australian doctors. Austr New Zeal J Obstetr Gynaecol. 2021;61:408–15.
Article Google Scholar
Rao SPN, Minckas N, Medvedev MM, Gathara D, Y N P, Seifu Estifanos A, Silitonga AC, Jadaun AS, Adejuyigbe EA, Brotherton H, et al. Small and sick newborn care during the COVID-19 pandemic: global survey and thematic analysis of healthcare providers’ voices and experiences. BMJ Global Health 2021, 6:e004347.
New RCM survey reveals more than half of midwives do not feel safe carrying out home visits. https://www.rcm.org.uk/media-releases/2020/april/new-rcm-survey-reveals-more-than-half-of-midwives-do-not-feel-safe-carrying-out-home-visits/#:~:text=out%20home%20visits-,New%20RCM%20survey%20reveals%20more%20than%20half%20of%20midwives%20do,safe%20carrying%20out%20home%20visits&text=Of%20those%20who%20did%20receive,their%20home%20to%20be%20tested.
Albahli S, Albattah W. Detection of coronavirus disease from X-ray images using deep learning and transfer learning algorithms. J X-ray Sci Technol. 2020;28:841–50.
Article CAS Google Scholar
Batra R, Chan H, Kamath G, Ramprasad R, Cherukara MJ, Sankaranarayanan SK. Screening of therapeutic agents for COVID-19 using machine learning and ensemble docking studies. J Phys Chem Lett. 2020;11:7058–65.
Article CAS PubMed PubMed Central Google Scholar
Ahamad MM, Aktar S, Rashed-Al-Mahfuz M, Uddin S, Liò P, Xu H, Summers MA, Quinn JM, Moni MA. A machine learning model to identify early stage symptoms of SARS-Cov-2 infected patients. Expert Syst Appl. 2020;160: 113661.
Article PubMed PubMed Central Google Scholar
Nader IW, Zeilinger EL, Jomar D, Zauchner C. Onset of effects of non-pharmaceutical interventions on COVID-19 infection rates in 176 countries. BMC Public Health. 2021;21:1–7.
Article Google Scholar
Li S, Wang Y, Xue J, Zhao N, Zhu T. The impact of COVID-19 epidemic declaration on psychological consequences: a study on active Weibo users. Int J Environ Res Public Health. 2020;17:2032.
Article CAS PubMed Central Google Scholar
Low DM, Rumker L, Talkar T, Torous J, Cecchi G, Ghosh SS. Natural language processing reveals vulnerable mental health support groups and heightened health anxiety on reddit during covid-19: observational study. J Med Internet Res. 2020;22: e22635.
Article PubMed PubMed Central Google Scholar
Wang X, Li H, Sun C, Zhang X, Wang T, Dong C, Guo D. Prediction of mental health in medical workers during COVID-19 based on machine learning. Front Publ Health 2021:9:697850.
Bender WR, Srinivas S, Coutifaris P, Acker A, Hirshberg A. The psychological experience of obstetric patients and health care workers after implementation of universal SARS-CoV-2 testing. Am J Perinatol. 2020;37:1271–9.
Article PubMed PubMed Central Google Scholar
Yalçın Bahat P, Aldıkaçtıoğlu Talmaç M, Bestel A, Topbas Selcuki NF, Karadeniz O, Polat I. Evaluating the effects of the COVID-19 pandemic on the physical and mental well-being of obstetricians and gynecologists in Turkey. Int J Gynecol Obstet. 2020;151:67–73.
Article Google Scholar
Motrico E, Bina R, Domínguez-Salas S, Mateus V, Contreras-García Y, Carrasco-Portiño M, Ajaz E, Apter G, Christoforou A, Dikmen-Yildiz P. Impact of the Covid-19 pandemic on perinatal mental health (Riseup-PPD-COVID-19): protocol for an international prospective cohort study. BMC Public Health. 2021;21:1–11.
Article Google Scholar
Gupta MD, Bansal A, Sarkar PG, Girish M, Jha M, Yusuf J, Kumar S, Kumar S, Jain A, Kathuria S. Design and rationale of an intelligent algorithm to detect BuRnoUt in HeaLthcare workers in COVID era using ECG and artificiaL intelligence: the BRUCEE-LI study. Indian Heart J. 2021;73:109–13.
Article PubMed Google Scholar
Semaan A, Audet C, Huysmans E, Afolabi B, Assarag B, Banke-Thomas A, Blencowe H, Caluwaerts S, Campbell OMR, Cavallaro FL, et al. Voices from the frontline: findings from a thematic analysis of a rapid online global survey of maternal and newborn health professionals facing the COVID-19 pandemic. BMJ Glob Health. 2020;5: e002967.
Article PubMed Google Scholar
KoBoToolbox. https://www.kobotoolbox.org/.
MATCO: Global Study of Maternal Health Provision during the COVID-19 Pandemic. https://www.itg.be/E/matco-global-study-of-maternal-health-provision-during-the-covid-19-pandemic.
World Bank Country and Lending Groups. https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups.
GDP per capita, PPP (current international $). https://data.worldbank.org/indicator/NY.GDP.PCAP.PP.CD.
Maternal mortality. Levels and trends—2000 to 2017. https://www.who.int/reproductivehealth/publications/maternal-mortality-2000-2017/en/.
Hale T, Angrist N, Goldszmidt R, Kira B, Petherick A, Phillips T, Webster S, Cameron-Blake E, Hallas L, Majumdar S. A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker). Nat Hum Behav. 2021;5:529–38.
Article PubMed Google Scholar
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–57.
Article Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30.
Google Scholar
Husain W, Xin LK, Jothi N. Predicting generalized anxiety disorder among women using random forest approach. In 2016 3rd International Conference on Computer and Information Sciences (ICCOINS). IEEE; 2016: 37–42.
Sharma A, Verbeke WJ. Improving diagnosis of depression with XGBOOST machine learning model and a large biomarkers Dutch Dataset (n = 11,081). Front Big Data. 2020;3:15.
Article PubMed PubMed Central Google Scholar
Sau A, Bhakta I. Screening of anxiety and depression among seafarers using machine learning technology. Inform Med Unlocked. 2019;16: 100228.
Article Google Scholar
Aldarwish MM, Ahmad HF: Predicting depression levels using social media posts. In,. IEEE 13th international symposium on autonomous decentralized system (ISADS). IEEE. 2017;2017:277–80.
Google Scholar
Shafiei SB, Lone Z, Elsayed AS, Hussein AA, Guru KA. Identifying mental health status using deep neural network trained by visual metrics. Transl Psychiatry. 2020;10:1–8.
Article Google Scholar
Alam MZ, Rahman MS, Rahman MS. A Random Forest based predictor for medical data classification using feature ranking. Inform Med Unlocked. 2019;15: 100180.
Article Google Scholar
Yun T-G, Yi G-S. Application of random forest algorithm for the decision support system of medical diagnosis with the selection of significant clinical test. Trans Korean Inst Electr Eng. 2008;57:1058–62.
Google Scholar
Villar J, Ariff S, Gunier RB, Thiruvengadam R, Rauch S, Kholin A, Roggero P, Prefumo F, do Vale MS, Cardona-Perez JA, et al. Maternal and neonatal morbidity and mortality among pregnant women with and without COVID-19 infection: the INTERCOVID multinational cohort study. JAMA Pediatr. 2021;175:817–26.
Article PubMed Google Scholar
Wilson AN, Ravaldi C, Scoullar MJL, Vogel JP, Szabo RA, Fisher JRW, Homer CSE. Caring for the carers: ensuring the provision of quality maternity care during a global pandemic. Women Birth. 2021;34:206–9.
Article PubMed Google Scholar
Global strategy on human resources for health: Workforce 2030. https://apps.who.int/iris/bitstream/handle/10665/250368/?sequence=1.
Kolié D, Semaan A, Day L-T, Delvaux T, Delamou A, Benova L. Maternal and newborn healthcare providers’ work-related experiences during the COVID-19 pandemic, and their physical, psychological, and economic impacts: Findings from a global online survey. PLOS Glob Publ Health. 2022;2:e0000602.
Morgenstern JD, Rosella LC, Daley MJ, Goel V, Schünemann HJ, Piggott T. “AI’s gonna have an impact on everything in society, so it has to have an impact on public health”: a fundamental qualitative descriptive study of the implications of artificial intelligence for public health. BMC Public Health. 2021;21:40.
Article PubMed PubMed Central Google Scholar
WHO recommendations: Intrapartum care for a positive childbirth experience. https://www.who.int/reproductivehealth/publications/intrapartum-care-guidelines/en/.

Download references

Acknowledgements

We would like to thank the maternal and newborn healthcare providers who contributed their valuable time to respond to the survey during the second round, despite ongoing difficult circumstances and high workload. We thank all study collaborators and colleagues who supported in questionnaire development, translation and played a key role in distributing the invitation for this survey. We also acknowledge the Institutional Review Committee at the Institute of Tropical Medicine for providing helpful suggestions on this study protocol, and for the expedited review of this study.

Funding

This study was funded by the Institute of Tropical Medicine’s COVID-19 Pump Priming fund supported by the Flemish Government, Science & Innovation and by the Embassy of the United Kingdom in Belgium. LB is funded in part by the Research Foundation—Flanders (FWO) as part of her Senior Postdoctoral Fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations

Biomedical Engineering Program, Faculty of Medicine-Maroun Semaan Faculty of Engineering and Architecture, American University of Beirut, Beirut, Lebanon
Bassel Hammoud
Department of Public Health, Institute of Tropical Medicine, Antwerp, Belgium
Aline Semaan & Lenka Benova
Electrical and Computer Engineering Department, Maroun Semaan Faculty of Engineering and Architecture, American University of Beirut, Beirut, Lebanon
Imad Elhajj

Authors

Bassel Hammoud
View author publications
You can also search for this author in PubMed Google Scholar
Aline Semaan
View author publications
You can also search for this author in PubMed Google Scholar
Imad Elhajj
View author publications
You can also search for this author in PubMed Google Scholar
Lenka Benova
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the conceptualisation of the study. BH conducted data curation and formal analysis. BH and AS wrote the original draft of the manuscript. All authors critically reviewed, commented on, and edited the manuscript. LB was responsible of funding acquisition for the global survey. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Bassel Hammoud.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Institutional Review Board at the Institute of Tropical Medicine in Antwerp Belgium under the number 1372/20. Respondents provided informed consent online by checking a box affirming that they voluntarily agreed to participate in the survey.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Patient and public involvement

Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Complete list of features used in the models.

Additional file 2.

Description of different ML algorithms.

Additional file 3.

Distribution of respondents across countries.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Hammoud, B., Semaan, A., Elhajj, I. et al. Can machine learning models predict maternal and newborn healthcare providers’ perception of safety during the COVID-19 pandemic? A cross-sectional study of a global online survey. Hum Resour Health 20, 63 (2022). https://doi.org/10.1186/s12960-022-00758-5

Download citation

Received: 19 November 2021
Accepted: 08 August 2022
Published: 19 August 2022
DOI: https://doi.org/10.1186/s12960-022-00758-5

Can machine learning models predict maternal and newborn healthcare providers’ perception of safety during the COVID-19 pandemic? A cross-sectional study of a global online survey

Abstract

Background

Methods

Results

Conclusion

Introduction

Methods

Study design and data collection

Questionnaire and definitions

Data management

Missing answers

Data pre-processing

Data analysis

Machine learning models

Hyperparameters tuning

Training and testing

Experiments set 1: classification models

Experiments set 2: regression models

Performance metrics

Results

Sample description

Hyperparameters

Experiment 1: classification

Experiment 1A: classification with all features

Feature extraction

Experiment 1B: classification with selected features

Experiment 2

Experiment 2A: regression with all features

Feature extraction

Experiment 2B: regression with selected features

Discussion

1—Information accessibility, clarity and quality

2—Availability of support and means of protection

3—COVID-19 epidemiology at the national level

Least contributing factors

Strengths and limitations

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Patient and public involvement

Additional information

Publisher's Note

Supplementary Information

Additional file 1.

Additional file 2.

Additional file 3.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Human Resources for Health

Contact us