Abstract:
Objective To construct a model for predicting the risk of postpartum depression (PPD) in parturients within 6 to 8 weeks after childbirth based on machine learning (ML) model and Shapley additive interpretation (SHAP).
Methods The Edinburgh Postnatal Depression Scale was used to evaluate the occurrence of PPD in 201 parturients within 6 to 8 weeks after delivery. The Boruta algorithm was used to screen the risk factors of PPD. Four ML models, namely support vector machine, Naive Bayes model, linear discriminant analysis and mixed discriminant analysis (MDA), were constructed to predict the risk of PPD. The prediction accuracy of four ML models was evaluated using the calibration curves, precision-recall curves, precision-recall gain curves and receiver operating characteristic curves (ROC), and the optimal ML model was interpreted and visualized through SHAP.
Results The incidence of PPD in 201 parturients was 15.92% (32/201). The Boruta algorithm screened out the Pittsburgh Sleep Quality Index (PSQI) score, feeding method, Social Support Rating Scale (SSRS) score, educational attainment and per capita monthly income of family were the risk variables of PPD. Among the four ML models, the consistency index, recall rate, recall gain and area below the ROC curve of the MDA model were all superior to those of the other three ML models. The SHAP interpretation and visualization of the MDA model could accurately predict the risk of PPD in parturients.
Conclusions The SSRS score, PSQI score, educational attainment, feeding style and per capita monthly household income are associated with the PPD risk. The MDA model based on the interpretation and visualization of SHAP values can effectively predict the risk of PPD.