高级检索

基于机器学习的催乳素瘤术后激素早期缓解预测模型构建

Construction of a prediction model for early hormone remission after prolactinoma surgery based on machine learning

  • 摘要:
    目的 探讨催乳素瘤患者术后激素早期缓解的相关因素,并基于临床及影像学指标构建机器学习预测模型。
    方法 回顾性收集2020年1月至2024年12月于南京医科大学附属淮安第一医院接受经蝶手术治疗的107例催乳素瘤患者临床资料,包括一般临床特征、术前实验室指标及影像学特征。根据术后早期催乳素(prolactin, PRL)水平是否恢复正常,将患者分为激素缓解组(n=76)与未缓解组(n=31)。采用单因素logistic回归对候选变量进行初步评估,采用LASSO回归进行特征筛选,并构建多种机器学习模型,包括logistic回归、随机森林、支持向量机、K近邻、朴素贝叶斯、决策树、神经网络及梯度提升决策树(GBDT)。所有模型均采用十折交叉验证进行训练与评估,并通过ROC曲线的曲线下面积(AUC)、准确率、灵敏度、特异度、精确率及F1值对模型性能进行综合评价。
    结果 未缓解组肿瘤最大径大于缓解组(P=0.019),肿瘤卒中发生率和术前PRL水平显著高于缓解组(P<0.001)。单因素分析显示,性别、肿瘤最大径、肿瘤卒中及术前PRL水平是催乳素瘤患者术后激素早期缓解情况的影响因素(P<0.05)。机器学习模型比较结果显示,神经网络模型表现最佳(AUC=0.921),且该模型具有较好的临床应用价值,其次为GBDT模型(AUC=0.893)及支持向量机模型(AUC=0.884),其他模型亦表现出一定的预测能力。
    结论 性别、术前PRL水平、肿瘤最大径、椎体亨氏单位值及肿瘤卒中是影响催乳素瘤术后激素早期缓解的重要因素。基于上述变量构建的机器学习模型具有良好的预测性能,其中神经网络模型表现最佳,具有较好的临床应用价值。

     

    Abstract:
    Objective To explore factors associated with early postoperative hormonal remission in patients with prolactinoma and to develop prediction models based on clinical and radiological features.
    Methods Data from 107 patients with prolactinoma who underwent transsphenoidal surgery at The First People’s Hospital of Huai’an, Nanjing Medical University between January 2020 and December 2024 was collected, included general clinical characteristics, preoperative laboratory indicators, and imaging features. Based on whether early postoperative prolactin (PRL) levels normalized, patients were divided into a remission group (n=76) and a non-remission group (n=31). Univariate logistic regression was used for preliminary evaluation of candidate variables, followed by LASSO regression for feature selection. Multiple machine learning models were constructed, including logistic regression, random forest, support vector machine, K-nearest neighbors, naive Bayes, decision tree, neural network, and gradient boosting decision tree (GBDT). All models were trained and evaluated using ten-fold cross-validation, with comprehensive assessment of model performance based on the area under the ROC curve (AUC), accuracy, sensitivity, specificity, precision, and F1 score.
    Results The maximum diameter of tumors in the non remission group was larger than that in the remission group (P=0.019), and the incidence of tumor stroke and preoperative PRL levels were significantly higher than those in the remission group (P<0.001). Univariate analysis showed that sex, maximum tumor diameter, tumor stroke, and preoperative PRL levels were influencing factors for early postoperative hormone response in patients with prolactinoma (P<0.05). The comparison results of machine learning models show that the neural network model performs the best (AUC=0.921) and has good clinical application value, followed by the GBDT model (AUC=0.893) and the support vector machine model (AUC=0.884). Other models also show certain predictive ability.
    Conclusions Sex, preoperative PRL levels, maximum tumor diameter, Hounsfield unit value, and tumor stroke are important factors affecting early hormone response after prolactinoma surgery. The machine learning model constructed based on the above variables has good predictive performance, and performs the best and has good clinical application value.

     

/

返回文章
返回