基于机器学习的中国老年慢性疼痛病人抑郁风险预测研究

    A study on the prediction of depression risk in elderly Chinese patients with chronic pain based on machine learning

    • 摘要:
      目的: 构建中国老年慢性疼痛病人抑郁风险预测模型,验证不同模型的预测性能。
      方法: 采用中国健康与养老追踪调查(CHARLS)2020年的全国调查数据,根据纳入与排除标准选取研究对象,按照7∶3的比例随机分为训练集与验证集,纳入研究变量包括人口统计学信息、健康状况与功能及工作状态。基于决策树模型、随机森林模型和XGBoost模型构建老年慢性疼痛病人抑郁状态风险预测模型。比较各模型的准确率、灵敏度、特异度、F1分数和受试者工作特征(receiver operating characteristic,ROC)曲线下面积(area under the curve,AUC),评价模型的预测性能。
      结果: 1 037名老年人中抑郁症状检出率为29.89%。决策树模型筛选出疼痛干扰程度、睡眠质量、认知功能、社交活动频率及自评健康状况为重要解释变量,其中疼痛干扰程度为根节点;随机森林模型显示疼痛干扰程度(基尼系数下降值0.213)、睡眠质量(0.186)、社交活动频率(0.152)为主要影响因素;XGBoost模型中疼痛干扰程度特征增益值最高,且SHAP值分析显示疼痛与睡眠、社交因素存在交互效应。
      结论: 构建的3种老年慢性疼痛病人抑郁风险预测模型中,XGBoost模型性能最优,可有效识别抑郁高风险人群;疼痛干扰程度、睡眠质量、社交活动频率是关键影响因素,其交互作用对抑郁风险预测具有重要意义。研究结果可为老年慢性疼痛病人抑郁的早期识别及后续风险分层管理策略制定提供科学依据。

       

      Abstract:
      Objective To construct a depression risk prediction model for elderly patients with chronic pain in China, and verify the predictive performance of different models.
      Methods The national survey data of the China Health and Retirement Longitudinal Survey (CHARLS) in 2020 was adopted. The research subjects were selected according to the inclusion and exclusion criteria, and randomly divided into the training set and validation set at a ratio of 7∶3. The included research variables were demographic information, health status and function, and working status. A risk prediction model for depressive state in elderly patients with chronic pain was constructed based on the decision tree model, random forest model and XGBoost model. The accuracy rate, sensitivity, specificity, F1 score and area under the receiver operating characteristic (ROC) curve (AUC) of each model were compared to evaluate the predictive performance of the models.
      Results The detection rate of depressive symptoms among 1037 elderly people was 29.89%. The decision tree model screened out the degree of pain interference, sleep quality, cognitive function, frequency of social activities and self-rated health status as important explanatory variables, among which the degree of pain interference was the root node. The random forest model shows that the degree of pain disturbance (the decrease in Gini coefficient for 0.213), sleep quality (0.186) and frequency of social activities (0.152) are the main influencing factors. The XGBoost model had the highest characteristic gain value of pain interference degree, and the SHAP value analysis shows that there was an interaction effect between pain and sleep, as well as social factors.
      Conclusions Among the three constructed prediction models for depression risk in elderly patients with chronic pain, the XGBoost model has the best performance, and can effectively identify high-risk groups for depression. The degree of pain interference, sleep quality and frequency of social activities are the key influencing factors, and their interaction is of great significance for predicting the risk of depression. The research results can provide a scientific basis for the early identification of depression in elderly patients with chronic pain, and the formulation of subsequent risk stratification management strategies.

       

    /

    返回文章
    返回