• Contact us
  • E-Submission
ABOUT
BROWSE ARTICLES
JOURNAL POLICIES
FOR CONTRIBUTORS

Page Path

2
results for

"Computer neural networks"

Filter

Article category

Keywords

Publication year

Authors

"Computer neural networks"

Original article

[English]
Feature-based ensemble modeling for addressing diabetes data imbalance using the SMOTE, RUS, and random forest methods: a prediction study
Younseo Jang
Received March 31, 2025  Accepted April 10, 2025  Published online April 15, 2025  
DOI: https://doi.org/10.12771/emj.2025.00353    [Epub ahead of print]
Purpose
This study developed and evaluated a feature-based ensemble model integrating the synthetic minority oversampling technique (SMOTE) and random undersampling (RUS) methods with a random forest approach to address class imbalance in machine learning for early diabetes detection, aiming to improve predictive performance.
Methods
Using the Scikit-learn diabetes dataset (442 samples, 10 features), we binarized the target variable (diabetes progression) at the 75th percentile and split it 80:20 using stratified sampling. The training set was balanced to a 1:2 minority-to-majority ratio via SMOTE (0.6) and RUS (0.66). A feature-based ensemble model was constructed by training random forest classifiers on 10 two-feature subsets, selected based on feature importance, and combining their outputs using soft voting. Performance was compared against 13 baseline models, using accuracy and area under the curve (AUC) as metrics on the imbalanced test set.
Results
The feature-based ensemble model and balanced random forest both achieved the highest accuracy (0.8764), followed by the fully connected neural network (0.8700). The ensemble model had an excellent AUC (0.9227), while k-nearest neighbors had the lowest accuracy (0.8427). Visualizations confirmed its superior discriminative ability, especially for the minority (high-risk) class, which is a critical factor in medical contexts.
Conclusion
Integrating SMOTE, RUS, and feature-based ensemble learning improved classification performance in imbalanced diabetes datasets by delivering robust accuracy and high recall for the minority class. This approach outperforms traditional resampling techniques and deep learning models, offering a scalable and interpretable solution for early diabetes prediction and potentially other medical applications.
  • 151 View
  • 9 Download
Review

Special topic: recent clinical approach to shoulder diseases in older adults

[English]

Shoulder diseases pose a significant health challenge for older adults, often causing pain, functional decline, and decreased independence. This narrative review explores how deep learning (DL) can address diagnostic challenges by automating tasks such as image segmentation, disease detection, and motion analysis. Recent research highlights the effectiveness of DL-based convolutional neural networks and machine learning frameworks in diagnosing various shoulder pathologies. Automated image analysis facilitates the accurate assessment of rotator cuff tear size, muscle degeneration, and fatty infiltration in MRI or CT scans, frequently matching or surpassing the accuracy of human experts. Convolutional neural network-based systems are also adept at classifying fractures and joint conditions, enabling the rapid identification of common causes of shoulder pain from plain radiographs. Furthermore, advanced techniques like pose estimation provide precise measurements of the shoulder joint's range of motion and support personalized rehabilitation plans. These automated approaches have also been successful in quantifying local osteoporosis, utilizing machine learning-derived indices to classify bone density status. DL has demonstrated significant potential to improve diagnostic accuracy, efficiency, and consistency in the management of shoulder diseases in older patients. Machine learning-based assessments of imaging data and motion parameters can help clinicians optimize treatment plans and improve patient outcomes. However, to ensure their generalizability, reproducibility, and effective integration into routine clinical workflows, large-scale, prospective validation studies are necessary. As data availability and computational resources increase, the ongoing development of DL-driven applications is expected to further advance and personalize musculoskeletal care, benefiting both healthcare providers and the aging population.

  • 223 View
  • 7 Download
TOP