پريسا خطير

عنوان

ارائه رويكرد رده بندي تركيبي براي پيش بيني ديابت براساس وزن دهي به رده بندها

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

نرم افزار

تاريخ دفاع

اسفند 1395

استاد راهنما

دكتر عين اله خنجري ميانه

دانشكده

كامپيوتر

چكيده

بيماري ديابت، شايعترين بيماري غدد در جهان است و در اكثر كشورها، چهارمين يا پنجمين علل مرگ و مير است. تشخيص به موقع بيماري، در كاهش عوارض آن بسيار موثر بوده و بيمار ميتواند با مصرف دارو و تغيير شيوه زندگي، عمري طبيعي داشته باشد. دراين تحقيق، از مدلهاي پيش بينانه دادهكاوي براي شناسايي افراد مبتلا به ديابت استفاده شده است. هدف اين پژوهش، افزايش دقت در پيش بيني بيماري ديابت با استفاده از رويكرد رده بندي تركيبي براساس وزن دهي به رده بندها است. روش پيشنهادي شامل مراحل استخراج ويژگي، تقسيم بندي فضاي مسئله، انتخاب ويژگي در هر زير فضا و انتساب وزن به الگوريتم هاي پيش بينانه در هر زير فضا است. ابتدا به منظور غني تر شدن فضاي ويژگي، تعداد محدودي ويژگي با استفاده از الگوريتم MVU ، استخراج كرده و به ويژگيهاي اوليه مجموعه داده بيماري ديابت ، اضافه مي شود. سپس با اعمال الگوريتم خوشه بندي kmeans فضاي اوليه مسئله به زيرفضاها تقسيم مي شود. پس ازآن با انتخاب ويژگي در هر زيرفضا، ويژگي هايي كه بيشترين وابستگي را با ويژگي برچسب كلاس دارند انتخاب مي شود كه از معيار مستقل Hilbert Schimit براي انتخاب ويژگي استفاده ميشود. آخرين مرحله، انتساب وزن مناسب به الگوريتم هاي پيش بينانه در زيرفضاها است. در اين تحقيق، از مجموعه داده PIMA Indian Diabetes از ديتابيس يادگيري ماشين UCI استفاده شده است. نتايج تجربي بر روي داده هاي آزمايش ديابت نشان ميدهد كه روش پيشنهادي در مقايسه با روش هاي موجود از دقت بيشتري برخوردار است. واژه‌هاي كليدي: استخراج ويژگي، خوشه بندي، انتخاب ويژگي، الگوريتم هاي پيش بينانه، وزن دهي به الگوريتم ها.

تاريخ ورود اطلاعات

1396/04/19

تاريخ بهره برداري

1/1/1900 12:00:00 AM

دانشجوي وارد كننده اطلاعات

پريسا خطير

Name: پريسا خطير
Author: پريسا خطير

چكيده به لاتين

ABSTRACT Diabetes is the most prevalent endocrine disease in the world and is the fourth or fifth leading cause of death in most counties. Timely diagnosis of diabetes plays a major role in reducing its complications, and the patient can live a full life by taking medications and changing lifestyle. In the present study, data mining predictive models were used to identify individuals with diabetes. The objective of this study was to increase the accuracy of identification of diabetes using a Hybrid classification approach based on assigning weights to the classifiers. The proposed method includes the following stages: feature extraction, division of the problem space, feature selection in each subspace, and assigning weights to the predictive algorithms in each subspace. In order to enrich the feature space, a limited number of features was extracted using the MVU algorithm to be added to the primary features of the diabetes dataset. Then, kmeans clustering algorithm was applied to divide the primary space of the problem into subspaces. A feature selection was then performed using the Hilbert-Schmidt Independence Criterion in each subspace to select the features with the highest association with the label feature class.As the last stage, weights were assigned to the predictive algorithms in the subspaces. In this study, the PIMA Indian Diabetes dataset from the UCI machine learning database was employed. Experimental results on the diabetes test data show that the proposed method is more accurate than existing methods. KEYWORDS Feature extraction,Clustering, Feature selection, Weighting to classifiers, predictive algorithms.

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=17606&Field=0&DTC=6