Search Articles

View query in Help articles search

Search Results (1 to 10 of 21 Results)

Download search results: CSV END BibTex RIS


Size-Specific Predictors for Malignancy Risk in Follicular Thyroid Neoplasms: Machine Learning Analysis

Size-Specific Predictors for Malignancy Risk in Follicular Thyroid Neoplasms: Machine Learning Analysis

We trained eight classification models including logistic regression, lasso regression, weighted k-nearest neighbor, decision tree, random forest, naive bayes, XGBoost, and support vector machine (SVM). In total, 70% of the dataset was allocated for model training and selection, while the remaining 30% was reserved for internal validation. We developed and validated the models stepwise through predictor preprocessing, model training, hyperparameter tuning, and 5-fold cross-validation.

Xin Li, Wen-yu Yang, Fan Zhang, Rui Shan, Fang Mei, Shi-Bing Song, Bang-Kai Sun, Jing Chen, Run-ze Hu, Yang Yang, Yi-hang Yang, Jing-yao Liu, Chun-Hui Yuan, Zheng Liu

JMIR Cancer 2025;11:e73069

Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population–Based Study

Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population–Based Study

Extreme Gradient Boosting (XGBoost) is an ML algorithm that improves model performance by using a series of decision trees, where each tree corrects the mistakes of the previous one. This sequential approach helps make predictions more accurate. Light Gradient-Boosting Machine (Light GBM) is another efficient algorithm that works similarly to XGBoost but is designed to be faster and more scalable, especially when working with large datasets and many features.

Thien Vu, Yoshihiro Kokubo, Mai Inoue, Masaki Yamamoto, Attayeb Mohsen, Agustin Martin-Morales, Research Dawadi, Takao Inoue, Jie Ting Tay, Mari Yoshizaki, Naoki Watanabe, Yuki Kuriya, Chisa Matsumoto, Ahmed Arafa, Yoko M Nakao, Yuka Kato, Masayuki Teramoto, Michihiro Araki

JMIR Cardio 2025;9:e68066

Identifying Adolescent Depression and Anxiety Through Real-World Data and Social Determinants of Health: Machine Learning Model Development and Validation

Identifying Adolescent Depression and Anxiety Through Real-World Data and Social Determinants of Health: Machine Learning Model Development and Validation

We used Extreme Gradient Boosting (XGBoost), a robust ML framework known for its efficiency, flexibility, and portability [24]. It is an ensemble learning algorithm based on the gradient boosting framework, in which models are built sequentially to boost (ie, increase) the performance of the previous models by using the gradient descent algorithm to minimize errors [24].

Mamoun T Mardini, Georges E Khalil, Chen Bai, Aparna Menon DivaKaran, Jessica M Ray

JMIR Ment Health 2025;12:e66665

Interpretable Machine Learning Model for Predicting Postpartum Depression: Retrospective Study

Interpretable Machine Learning Model for Predicting Postpartum Depression: Retrospective Study

We used the following 3 machine learning algorithms to develop the PPD prediction model: extreme gradient boosting (XGBoost), random forest (RF), and gradient boosting machine (GBM). XGBoost is a powerful and efficient machine learning algorithm known for its exceptional performance in regression, classification, and ranking problems. It is an extension of the traditional gradient boosting method that combines multiple weak classifiers to create a strong classifier that minimizes the loss function [34].

Ren Zhang, Yi Liu, Zhiwei Zhang, Rui Luo, Bin Lv

JMIR Med Inform 2025;13:e58649

Enhancing Interpretable, Transparent, and Unobtrusive Detection of Acute Marijuana Intoxication in Natural Environments: Harnessing Smart Devices and Explainable AI to Empower Just-In-Time Adaptive Interventions: Longitudinal Observational Study

Enhancing Interpretable, Transparent, and Unobtrusive Detection of Acute Marijuana Intoxication in Natural Environments: Harnessing Smart Devices and Explainable AI to Empower Just-In-Time Adaptive Interventions: Longitudinal Observational Study

This resulted in three datasets: (1) e Xtreme Gradient Boosting (XGBoost)-Mobile: mobile phone only, (2) XGBoost-Fitbit: Fitbit-only, and (3) XGBoost-Mobi Fit: combined mobile and Fitbit data. The rationale for choosing Machine Learning (ML) models is detailed in Multimedia Appendix 3 and model comparison with different classifiers can be found in Multimedia Appendix 4. Marijuana use episodes and labeling principle.

Sang Won Bae, Tammy Chung, Tongze Zhang, Anind K Dey, Rahul Islam

JMIR AI 2025;4:e52270

A Machine Learning Model for Risk Stratification of Postdiagnosis Diabetic Ketoacidosis Hospitalization in Pediatric Type 1 Diabetes: Retrospective Study

A Machine Learning Model for Risk Stratification of Postdiagnosis Diabetic Ketoacidosis Hospitalization in Pediatric Type 1 Diabetes: Retrospective Study

We did not use additional feature selection methods, relying instead on XGBoost to select relevant features in the construction of the final decision ensemble. We used a simple piecewise linear interpolation technique to fill in missing values in Hb A1c records. Hb A1c imputation was done only between 2 known values—for example, if the 3-month and 9-month Hb A1c values for a patient were known, then the 6-month value was imputed as the average. We did not perform any other imputation.

Devika Subramanian, Rona Sonabend, Ila Singh

JMIR Diabetes 2024;9:e53338

Machine Learning–Based Prediction of Suicidal Thinking in Adolescents by Derivation and Validation in 3 Independent Worldwide Cohorts: Algorithm Development and Validation Study

Machine Learning–Based Prediction of Suicidal Thinking in Adolescents by Derivation and Validation in 3 Independent Worldwide Cohorts: Algorithm Development and Validation Study

Among the 4 models tested, XGBoost consistently yielded the highest AUROC scores across all data sets, leading to its selection as the primary model. We performed hyperparameter tuning using Grid Search CV to optimize the performance of the XGBoost model, prioritizing the maximization of the AUROC score to determine the optimal hyperparameter combination.

Hyejun Kim, Yejun Son, Hojae Lee, Jiseung Kang, Ahmed Hammoodi, Yujin Choi, Hyeon Jin Kim, Hayeon Lee, Guillaume Fond, Laurent Boyer, Rosie Kwon, Selin Woo, Dong Keon Yon

J Med Internet Res 2024;26:e55913

Predicting the Risk of Sleep Disorders Using a Machine Learning–Based Simple Questionnaire: Development and Validation Study

Predicting the Risk of Sleep Disorders Using a Machine Learning–Based Simple Questionnaire: Development and Validation Study

In this work, we used XGBoost for classification. XGBoost is a tree-based machine learning algorithm using gradient boosting [36] and has been widely used in medical diagnosis because of its high accuracy and performance, straightforward interpretation, and ability to handle missing values [43-47]. Logistic regression (LR), random forest (RF), and support vector classifier were also implemented for comparison.

Seokmin Ha, Su Jung Choi, Sujin Lee, Reinatt Hansel Wijaya, Jee Hyun Kim, Eun Yeon Joo, Jae Kyoung Kim

J Med Internet Res 2023;25:e46520

Early Triage of Critically Ill Adult Patients With Mushroom Poisoning: Machine Learning Approach

Early Triage of Critically Ill Adult Patients With Mushroom Poisoning: Machine Learning Approach

Four popular machine learning classification algorithms, including extreme gradient boosting (XGBoost) [32], random forest, support vector machine, and logistic regression, were applied in this study to build the classification models. We implemented machine learning algorithms using Python (version 3.9; Python Software Foundation) and several Python modules (panda, numpy, scipy, sklearn, xgboost, shap, and matplotlib).

Yuxuan Liu, Xiaoguang Lyu, Bo Yang, Zhixiang Fang, Dejun Hu, Lei Shi, Bisheng Wu, Yong Tian, Enli Zhang, YuanChao Yang

JMIR Form Res 2023;7:e44666