مهيار رضايي قوام آباد

عنوان

تحليل اثر كيفيت زندگي بر روي بيماري‌ سرطان با بهره‌گيري از الگوريتم‌هاي يادگيري ماشين

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي صنايع

سال تحصيل

1399

تاريخ دفاع

1402/1/20

استاد راهنما

دكتر روزبه قوسي و دكتر محمدرضا محمدعليها

استاد مشاور

دكتر عليرضا آتشي

دانشكده

مهندسي صنايع

چكيده

سرطان امروزه يكي از شايع‌ترين انواع بيماري‌ها است كه سالانه افراد زيادي را درگير مي‌كند و به كام مرگ مي‌كشاند. اين بيماري گاه ناشي از عواملي است كه از آن‌ها غافل بوده‌ايم و پيامد اين ناآگاهي، تحمل درد و رنج طاقت‌فرساي سرطان است. پيشگيري، تشخيص زود هنگام، كم كردن هزينه‌ها، صرفه‌جويي در وقت و انرژي و همچنين پيش‌بيني وضعيت آينده بيمار با توجه به وضعيت فعلي، نكاتي است كه در اين حوزه اهميت دارند. تلفيق ابزارهاي حوزه يادگيري ماشيني با مبحث كيفيت زندگي كه تمام ابعاد مختلف زندگي روزمره را شامل مي‌شود، مسئله‌اي است كه با پياده‌سازي آن در حوزه سرطان، اهداف مذكور را مرتفع مي‌سازد. در اين تحقيق تاثير 113 ويژگي تحت عنوان كيفيت زندگي، از داده‌هاي كوهورت گلستان با 50059 نمونه، بر روي ابتلا به بيماري سرطان، توسط ابزارهاي يادگيري ماشيني به كمك زبان برنامه‌نويسي پايتون و آر، و به ترتيب در محيط‌هاي نرم‌افزاري اسپايدر، ژوپيتر، گوگل كولب و آر استوديو، بررسي شده است. پس از تحليل اكتشافي داده و استفاده از روش‌هايي براي انتخاب ويژگي‌ها و كاهش بعد همچون روش استپ‌وايز و مولفه‌هاي اصلي، 14 روش يادگيري ماشيني همچون رگرسيون‌ لجستيك، درخت تصميم، جنگل تصادفي، ماشين‌هاي بردار پشتيبان، شبكه‌هاي عصبي، به وسيله چهار شاخص دقت، صحت، حساسيت و اختصاصي، مورد ارزيابي قرار گرفتند. در انتها روش جنگل تصادفي با ميزان دقت 0.005 89±% ، صحت 0.007 94.1±% ، حساسيت 0.009 84±% و اختصاصي 0.006 ±94.4% بهترين مدل‌سازي جهت تشخيص بيمار سرطاني را ارائه كرده است. در تحليل برخي از ضرايب رگرسيون نيز انواع فعاليت بدني، افزايش سطح رفاه، آموزش و تحصيلات بيشتر، مصرف كمتر سرخ‌كردني‌ها و استفاده بيشتر از ميوه‌ها و سبزيجات تازه و محلي از عوامل مهمي هستند كه خطر ابتلا به بيماري سرطان را كاهش مي‌دهند. افزايش سن نيز عامل بسيار مهم و تاثيرگذاري در خصوص ابتلا شناخته شده است.

تاريخ ورود اطلاعات

1402/04/12

عنوان به انگليسي

Analysis of the effect of quality of life on cancer using machine learning algorithms

تاريخ بهره برداري

4/8/2024 12:00:00 AM

دانشجوي وارد كننده اطلاعات

مهيار رضايي قوام اباد

Name: مهيار رضايي قوام اباد
Author: مهيار رضايي قوام آباد

چكيده به لاتين

Cancer is currently one of the most preva‎lent diseases that affect a large number of individuals annually and leads to death. This disease is sometimes caused by factors we have been unaware of, and the consequence of this is the enduring pain and suffering of cancer. Prevention, early diagnosis, cost reduction, time and energy saving, and predicting the future status of the patient based on the current situation, are crucial aspects in this field. Integrating machine learning tools with the concept of quality of life, which encompasses various dimensions of everyday life, addresses these objectives when implemented in the cancer domain. In this research, the impact of 113 quality-of-life features from the Golestan Cohort Study with 50,059 samples on cancer incidence has been investigated using machine learning tools aided by Python and R programming languages, and analyzed in software environments such as Spyder, Jupyter, Google Colab, and RStudio. After exploratory data analysis and employing methods for feature selection and dimensionality reduction, such as stepwise method and principal components, 14 machine learning methods, including logistic regression, decision trees, random forests, support vector machines, and neural networks, were eva‎luated using four performance metrics: accuracy, precision, sensitivity, and specificity. Finally, the random forest method with an accuracy rate of 89% ± 0.005, precision of 94.1% ± 0.007, sensitivity of 84% ± 0.009, and specificity of 94.4% ± 0.006 had the best performance among the models and led us to the desired goals. In the analysis of some regression coefficients, various physical activities, increased welfare level, Education and training, reduced consumption of fried foods, and increased consumption of fresh and local fruits and vegetables were identified as important factors that reduce the risk of cancer. Aging is also recognized as a significant and influential factor in cancer incidence.

كليدواژه هاي فارسي

سرطان , يادگيري ماشين , كيفيت زندگي , داده كاوي

كليدواژه هاي لاتين

Cancer , Machine learning , Quality of Life , Data mining

Author

Mahyar Rezaei qhavam abad

SuperVisor

Dr. Mohammadreza Mohammad aliha

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=28483&Field=0&DTC=6