基于多中心宫颈活检组织的P16–铁死亡风险分层模型:机器学习辅助的临床验证

    The clinical validation assisted by machine learning of P16-ferroptosis risk stratification model based on multi-center cervical biopsy tissues

    • 摘要:
      目的: 探讨P16蛋白(P16)与铁死亡标志物在宫颈病变进展中的表达特征及其风险分层价值,构建基于P16–铁死亡通路的临床预测模型,并借助机器学习方法评估其在宫颈癌精准干预中的潜力。
      方法: 选取宫颈活检病人共计438例,涵盖宫颈上皮内瘤变(CIN)1、CIN2、CIN3及宫颈癌4个病理阶段。收集病人临床资料及七项铁死亡相关评分(GPX4、SLC7A11、ACSL4、LPO、Ferritin、TFRC及P16评分),评估不同评分在病理分级及进展状态中的分布特征。通过k–means聚类分析实现进展风险分层,并构建基于P16与铁死亡评分的XGBoost预测模型和logistic回归模型进行对比,利用SHAP分析明确特征贡献度。进一步构建Nomogram列线图,评估其在风险评估中的可视化应用效果。
      结果: 不同病理分型病人的P16及各项铁死亡评分均有统计学意义(P < 0.01)。P16评分与SLC7A11评分呈明显负相关关系(r = –0.78,P < 0.01),铁死亡负荷与炎症评分呈明显正相关关系(r = 0.811,P < 0.01)。logistic回归分析显示,P16评分最高四分位为疾病进展的独立危险因素(OR = 7.006,95%CI:1.382 ~ 39.764,P = 0.02)。k–means聚类将病人分为6个亚型,其中C5和C6亚型进展比例显著较高(分别为84.1%、76.3%)。XGBoost模型与logistic模型预测效能相当(AUC分别为0.685、0.700),SHAP分析提示P16评分贡献度最高(SHAP = 2.075)。基于上述特征构建的Nomogram列线图具备良好直观性和临床实用性。
      结论: P16与铁死亡通路与宫颈病变进展具有密切关联,二者联合建模可有效实现进展风险分层。构建的XGBoost预测模型和Nomogram列线图为精准识别高风险人群提供了可靠工具。该模型在多中心样本中表现稳定,具备进一步临床转化应用价值。

       

      Abstract:
      Objective To explore the expression characteristics and risk stratification value of P16 protein (P16) and ferroptosis markers in the progression of cervical lesions, construct a clinical prediction model based on the P16-ferroptosis pathway, and evaluate its potential in precise intervention for cervical cancer with the help of machine learning methods.
      Methods A total of 438 patients with cervical biopsy were selected, covering four pathological stages of cervical intraepithelial neoplasia (CIN) 1, CIN2, CIN3 and cervical cancer. The clinical data of patients and seven ferroptos-related scores (GPX4, SLC7A11, ACSL4, LPO, Ferritin, TFRC and P16 scores) were collected to evaluate the distribution characteristics of different scores in pathological grade and progression status. The progress risk stratification was achieved through k-means clustering analysis, and an XGBoost prediction model based on P16 and ferroptosis scores, and logistic regression model were constructed for comparison. SHAP analysis was used to clarify the contribution degree of features. The nomogram was further constructed to evaluate its visualization application effects in risk assessment.
      Results The differences of P16 and various ferroptosis scores in the patients with different pathological types were statistically significant (P < 0.01). The P16 score was significantly negatively correlated with the SLC7A11 score (r = –0.78, P < 0.01), and the ferroptosis load was significantly positively correlated with the inflammation score (r = 0.811, P < 0.01). The results of logistic regression analysis showed that the highest quartile of the P16 score was an independent risk factor of disease progression OR = 7.006, 95%CI: 1.382–39.764, P = 0.02). The patients were divided into six subtypes by k-means clustering, among which the progression rates of subtypes C5 and C6 were significantly higher (84.1% and 76.3%, respectively). The predictive efficacy of the XGBoost model was comparable to that of the logistic model (AUC was 0.685 and 0.700, respectively), and the results of SHAP analysis indicated that the contribution of the P16 score was the highest (SHAP = 2.075). The nomogram constructed based on the above features had good intuitiveness and clinical practicability.
      Conclusions P16 and ferroptosis pathways are closely associated with the progression of cervical lesions. The combined modeling of the two can effectively achieve risk stratification of progression. The XGBoost prediction model and nomogram can provide the reliable tools for accurately identifying high-risk populations. This model shows stable performance in multi-center samples, and has further clinical transformation and application value.

       

    /

    返回文章
    返回