期刊导航

论文摘要

基于机器学习的苏州地区9岁儿童第一恒磨牙龋病预测模型研究

Machine learning-based prediction model for caries in the first molars of 9-year-old children in Suzhou

作者:陈灵芝, 王霞琴, 朱凯飞, 任坤, 吴桢

Author:Chen Lingzhi, Wang Xiaqin, Zhu Kaifei, Ren Kun, Wu Zhen

收稿日期:2025-04-10          年卷(期)页码:2025,43(6):871-871-880

期刊名称:华西口腔医学杂志

Journal Name:West China Journal of Stomatology

关键字:第一恒磨牙,机器学习,影响因素,预测模型,

Key words:first permanent molar,machine learning,influencing factor,prediction model,

基金项目:苏州市吴中区科技计划项目(医疗卫生领域)青年项目(WZYW2021017)

中文摘要

目的 利用机器学习算法构建苏州地区9岁儿童第一恒磨牙龋病预测模型,筛选危险因素。 方法 采用随机分层整群抽样的方法,在吴中区14个乡镇、街道的38所小学中随机抽取9岁在校学生进行口腔检查和问卷调查。采用Logistic多因素回归分析龋齿的危险因素。将数据集按8∶2随机分为训练集及验证集,使用R 4.3.1构建随机森林、决策树、极端梯度提升(XGBoost)、Logistic回归、轻量级梯度提升(LightGBM)5种机器学习算法,应用受试者工作特征曲线下面积(AUC)评估5种模型的预测效果。通过沙普利加和解释(SHAP)量化特征对龋齿预测模型的边际贡献。 结果 研究纳入符合标准的样本7 225例,其中第一恒磨牙患龋率为54.96%,多因素Logistic回归分析显示,甜饮料、甜点心和糖果、零食频率、刷牙后睡前零食等与第一恒磨牙龋齿的发生存在关联(P<0.05)。决策树、Logistic回归、轻量级梯度提升、随机森林、极端梯度提升这5种预测模型的AUC值分别为75.5%、83.9%、88.6%、88.9%、90.1%。对比独热编码后的变量,高频甜食(如甜点心糖果每天≥2次、母亲含糖饮食每天≥2次)与不良口腔卫生习惯(如刷牙后睡前常吃零食、刷牙不规律)的SHAP值为正。 结论 基于极端梯度提升算法构建苏州地区9岁儿童第一恒磨牙龋病的预测模型,具有较好的预测效果。高频甜食和不良口腔卫生习惯对第一恒磨牙患龋有强正向影响,是关键的驱动因素,可用于针对性干预措施的制定。

英文摘要

ObjectiveThis study aimed to use machine learning algorithms to build a prediction model of the first permanent molar caries of 9-year-old children in Suzhou and screen out risk factors.MethodsRandom stratified whole group sampling was applied to randomly select 9-year-old students from 38 primary schools in 14 townships and streets in Wuzhong District for oral examination and questionnaire survey. Multifactor Logistics regression was used to analyze the risk factors of tooth decay. The data set was randomly divided into training sets and verification sets according to 8∶2, and R 4.3.1 was used to build five machine learning algorithms: random forest, decision tree, extreme gradient boosting (XGBoost), Logistics regression, and lightweight gradient enhancement (LightGBM). The predictive effect of these five models was evaluated using the area under the characteristic curve (AUC). The marginal contribution of quantitative characteristics to the caries prediction model was determined through Shapley additive explanations (SHAP).ResultsThis study included 7 225 samples that met the standard. The caries rate of the first permanent molar was 54.96%. Multifactor Logistic regression analysis showed that sweet drinks, dessert and candy, snack frequency, and snacks before going to bed after brushing teeth were correlated with the occurrence of first permanent molar caries (P<0 .05). the auc values of decision tree, logistic regression, lightgbm, random forest, and xgboost were 75.5%, 83.9%, 88.6%, 88.9%, and 90.1%, respectively. compared with the variables after single heat coding, the shap value of high-frequency sweets (such as dessert candy ≥2 times a day, mother’s sugary diet ≥2 times a day) and bad oral hygiene habits (such as frequent snacks before going to bed after brushing teeth and irregular brushing teeth) exhibited the highest positive.ConclusionXGBoost algorithm has a good prediction effect for first permanent molar caries in 9-year-old children. High-frequency sweet factors and bad oral hygiene habits have a strong positive impact on the risk of first permanent molar caries and are key drivers that can be used in the formulation of targeted interventions.

上一条:基于结构—过程—结果模型的口腔门诊护理质量管理评价指标体系构建

关闭

Copyright © 2020四川大学期刊社 版权所有.

地址:成都市一环路南一段24号

邮编:610065