术前脑胶质瘤分级与异柠檬酸脱氢酶突变状态机器学习预测模型构建及分析

王雨婷; 朱俊乐; 覃塽; 孙赛非; 张鑫; 吕琦

doi:10.12025/j.issn.1008-6358.2025.20251459

术前脑胶质瘤分级与异柠檬酸脱氢酶突变状态机器学习预测模型构建及分析

Construction and analysis of machine learning models for preoperative prediction of glioma grading and isocitrate dehydrogenase mutation status

摘要

摘要:
目的基于炎症指标及影像学特征构建机器学习模型，并分析这些模型在术前预测脑胶质瘤分级与异柠檬酸脱氢酶（isocitrate dehydrogenase, IDH）突变状态的价值，筛选出最优的预测模型。
方法回顾性分析2019年3月至2023年3月在同济大学附属同济医院诊治的经病理诊断为脑胶质瘤的患者数据。采用LASSO回归筛选特征变量，并基于逻辑回归（logistic regression, LR）、随机森林（random forest, RF）、支持向量机（support vector machine, SVM）、梯度提升树（gradient boosting decision tree, XGBoost）和K近邻（k-nearest neighbor, KNN）算法构建脑胶质瘤分级与IDH突变状态预测模型。通过判断力、精确率-召回率曲线下面积（area under curve, AUC）、准确率、F1分数及Brier分数等指标综合评估模型性能。采用DeLong检验比较不同模型的AUC；采用Friedman秩和检验判断模型的总体性能，采用Nemenyi检验进行多重比较校正。
结果在脑胶质瘤分级预测任务中，LR模型的综合评分最高（0.726），但与其他4个模型间差异无统计学意义；年龄与脑胶质瘤分级正相关（P＝0.003）。在IDH突变状态预测任务中，XGBoost模型的综合评分最高（0.832），优于LR（0.762，P＝0.035）和KNN模型（0.754，P＝0.025），与RF和SVM模型间差异无统计学意义。
结论基于任务导向策略构建的脑胶质瘤分级预测LR模型和IDH突变状态预测XGBoost模型，在保证性能优化的同时实现了较好的解释性，能为脑胶质瘤个体化诊疗提供可靠决策支持。

Abstract:
Objective To construct machine learning models based on preoperative inflammatory and radiological features for the prediction of glioma grading and isocitrate dehydrogenase (IDH) mutation status, and to analyze application values of these models and identify the optimal predictive models.
Methods A retrospective analysis was conducted on the data of pathologically confirmed glioma patients admitted to Tongji Hospital Affiliated to Tongji University from March 2019 to March 2023. LASSO regression was used to screen feature variables, and predictive models were constructed based on logistic regression (LR), random forest (RF), support vector machine (SVM), gradient boosting decision tree (XGBoost) and K-nearest neighbor (KNN) algorithms. The model performance was comprehensively evaluated using metrics including discrimination ability, area under the precision-recall curve (AUC), accuracy, F1 score and Brier score. The DeLong test was adopted to compare the AUC values among different models; Friedman rank-sum test was used to determine the overall performance differences of the models, with the Nemenyi test applied for multiple comparison correction.
Results In the task of glioma grading prediction, the LR model achieved the highest comprehensive score (0.726), and no significant difference was observed between the LR model and the other four models; age was positively correlated with glioma grading (P=0.003). In the task of IDH mutation status prediction, the XGBoost model obtained the highest comprehensive score (0.832), which was superior to the LR (0.762, P=0.035) and KNN models (0.754, P=0.025), while no statistical differences were found between the XGBoost model and the RF or SVM models.
Conclusions The LR model for glioma grading prediction and XGBoost model for IDH mutation prediction constructed based on a task-oriented strategy achieve a favorable interpretability while ensuring optimized performance, thereby providing reliable decision support for the individualized diagnosis and treatment of glioma.

HTML全文

参考文献(22)

施引文献

资源附件(0)