وثيقة

A Comparative Study for Hybrid Machine Learning Algorithms in Detecting Polycystic Ovary Syndrome

وكيل مرتبط
Zeki, Ahmed M., مشرف الرسالة العلمية
Hilal, Sawsan, مختصر
تاريخ النشر
2022
اللغة
الأنجليزية
مدى
[1], 12, 69, [5] Pages
مكان المؤسسة
Sakhir, Bahrain
نوع الرسالة الجامعية
Thesis (Master)
الجهه المانحه
"""University of Bahrain, College of Science, Department of Postgraduate Programs
الملخص الإنجليزي
Abstract : This study focuses on identifying PolyCystic Ovary Syndrome (PCOS), which is a serious medical hormonal disorder condition that affects a woman's ability in childbearing age and causes infertility, diabetes, heart problems, endometrial cancer, and other related health issues as it raises the possibility of long-term problems. Therefore, early detection is of a great importance. Taking into account Machine Learning (ML) and ensemble learning algorithms with their superior detection capabilities, especially in the medical field. The Hybrid Random Forest Logistic Regression, combined Extreme Boosting with Random Forest, Linear Support Vector Machine, Light Gradient Boosting Model, and CatBoost model are among the hybrid ML models used in this study. To support the unique techniques, the classification models have been investigated using the PCOS dataset containing physical and clinical parameters of women collected from 10 different hospitals across Kerala, India, and downloaded from the Kaggle repository. Data were resampled based on Synthetic Minority Oversampling Techniques (SMOTE) to address outliers, overfitting, and data imbalance issues to fully support this classification performance. The top 14 physical and clinical features were selected to detect PCOS by using the univariate feature selection approach. Follicle No. (R) and Follicle No. (L) were found to be the most significant among all features. The evaluation metrics used to test all the models are; Accuracy, Precision, Recall, F1-Score, ROC curve plot, Area Under the Curve Score, and K-fold Cross Validation (CV). Finally, results were discussed and compared to indicate that CatBoost outperforms other models, with a CV accuracy score of 94% applied on the top 14 features only using SMOTE and 80:20 data split ratio. Consequently, CatBoost is effective at detecting PCOS patients.
ملاحظة
Title on cover :

دراسة مقارنة لخوارزميات التعلم الآلي الهجينة في الكشف عن متلازمة تكيس المبايض
المجموعة
المعرف
https://digitalrepository.uob.edu.bh/id/0f141849-8ca5-46c7-a50c-10250b41d6b4