عليرضا اقاجاني مباركه

عنوان

شناسايي ناهنجاري توربين بادي به وسيله ي منحني توان، مبتني بر داده‌هاي اسكادا و يادگيري ماشين

مقطع تحصيلي

ارشد

رشته تحصيلي

مهندسي برق-كنترل

سال تحصيل

1401

تاريخ دفاع

1404/7/2

استاد راهنما

جواد پشتان

استاد مشاور

سهيل گنجه فر

دانشكده

مهندسي برق

چكيده

در اين پژوهش، باتكيه‌بر داده‌هاي اسكادا و تركيب الگوريتم‌هاي يادگيري ماشين با روش‌هاي آماري، به بهبود دقت مدل¬سازي و بهبود سرعت تشخيص ناهنجاري در توربين بادي، مبتني بر منحني توان پرداخته شده است. در اين راستا، دو رويكرد پيش‌پردازش داده‌هاي بدون برچسب و حذف خودكار داده‌هاي پرت از منحني توان توربين پيشنهاد گرديده است كه موجب افزايش دقت و كاهش زمان اجرا در تشخيص ناهنجاري‌ها مي‌شود. ابتدا داده‌هاي بدون برچسب بررسي مي¬شود و دو روش پاك‌سازي داده معرفي شده است. روش اول بر تركيب ميانگين متحرك‌نمايي وزن‌دار (EWMA) و چندك‌ها (Quantile) استوار است. روش دوم با بازه‌بندي مساوي داده‌ها و انتخاب داده‌هاي نماينده بر اساس چندك‌ها، حجم داده‌هاي آموزشي را به طور چشمگيري كاهش مي‌دهد (2 درصد كل داده¬ها). عملكرد اين روش‌ها با الگوريتم DBSCAN و مدل‌هاي يادگيري ماشين مانند KNN و XGBoost مقايسه شده است. در اين ارزيابي، معيارهايي نظير MAE، RMSE، R² و زمان اجرا براي بررسي عملكرد در شرايط بدون نظارت به كار گرفته شدند. بر خلاف DBSCAN، روش‌هاي پيشنهادي قادر به تشخيص خودكار داده‌هاي پرت، كاهش زمان آموزش و پردازش، بدون نياز به تنظيم دستي پارامترها يا حذف نواحي متراكم نقاط پرت در منحني توان هستند. در بخش دوم، از داده‌هاي برچسب‌دار نامتعادل استفاده شده است. پس از پيش‌پردازش شامل مديريت داده‌هاي گمشده، نرمال‌سازي ويژگي‌ها و تحليل اكتشافي، داده‌ها بر اساس وضعيت عملكردي برچسب‌گذاري گرديدند. سپس مدل‌هاي مختلفي مانند شبكه‌هاي عصبي، KNN، RF و XGBoost آموزش داده مي¬شود. با بهره‌گيري از روش‌هاي متعادل‌سازي نظير SMOTEEN و Undersampling، بهترين عملكرد مربوط به XGBoost به دست مي‌آيد. در اين بخش، معيارهاي Precision، F1-Score، نرخ FN و نرخ FP براي ارزيابي استفاده شدند. اين نتايج نشان داد كه مدل پيشنهادي مي¬تواند عيوب نادر را بادقت بالاتر شناسايي كرده و نرخ هشدارهاي كاذب و ازدست‌رفته را به طرز قابل‌ملاحظه‌اي كاهش دهد. نتايج تجربي روي داده‌هاي اسكادا نشان مي¬دهد كه در رويكرد بدون نظارت، تركيب QEWMA با KNN نسبت به بهترين حالت DBSCAN عملكرد بهتري دارد. به طور خاص، مقدار MAE از 51٫11 به 33٫11 (بهبود حدود 35٫2٪) و زمان اجرا از 1٫04 به 0٫04 ثانيه (بهبود حدود 96%) كاهش يافت. در رويكرد با نظارت نيز XGBoost با داده‌هاي متعادل‌شده توانست دقت بالايي در تشخيص انواع عيوب به دست آورد و نرخ اعلام وضعيت معيوب به‌عنوان سالم تقريباً صفر شد.

تاريخ ورود اطلاعات

1404/07/20

عنوان به انگليسي

Wind turbine anomaly detection using power curve, based on SCADA data an‎d machine learning

تاريخ بهره برداري

9/25/2025 12:00:00 AM

دانشجوي وارد كننده اطلاعات

عليرضا اقاجاني مباركه

Name: عليرضا اقاجاني مباركه
Author: عليرضا اقاجاني مباركه

چكيده به لاتين

In this study, relying on SCADA data an‎d combining machine learning algo‎rithms with statistical methods, the accuracy an‎d speed of anomaly detection in wind turbines, based on the power curve, have been improved. Fo‎r this purpose, two approaches fo‎r preprocessing unlabeled data an‎d automatic removal of outliers from the turbine power curve have been proposed, which increase accuracy an‎d reduce execution time in anomaly detection. First, unlabeled data are examined an‎d two data-cleaning methods are introduced. The first method is based on the combination of Exponentially Weighted Moving Average (EWMA) an‎d quantiles. The second method, by equally partitioning the data an‎d selec‎ting representative data based on quantiles, significantly reduces the training data size (50 times smaller). The perfo‎rmance of these methods is compared with the DBSCAN algo‎rithm an‎d machine learning models such as KNN an‎d XGBoost. In this eva‎luation, criteria such as MAE, RMSE, R², an‎d execution time were used to assess perfo‎rmance under unsupervised conditions. Unlike DBSCAN, the proposed methods are capable of automatically detecting outliers, reducing training an‎d processing time, without the need fo‎r manual parameter adjustment o‎r removal of dense regions of outliers in the power curve. In the second part, imbalanced labeled data were used. After preprocessing, including han‎dling of missing data, feature no‎rmalization, an‎d explo‎rato‎ry analysis, the data were labeled based on operational status. Then, various models such as neural netwo‎rks, KNN, RF, an‎d XGBoost were trained. By applying balancing methods such as SMOTEEN an‎d undersampling, the best perfo‎rmance was achieved by XGBoost. In this part, metrics such as Precision, F1-Sco‎re, FN rate, an‎d FP rate were used fo‎r eva‎luation. The results showed that the proposed model can identify rare faults with higher accuracy an‎d significantly reduce the rates of false an‎d missed alarms. Experimental results on SCADA data show that in the unsupervised approach, combining QEWMA with KNN outperfo‎rms the best case of DBSCAN. Specifically, MAE decreased from 51.11 to 33.11 (about 35.2% reduction), an‎d execution time decreased from 1.68 to 0.14 seconds (about 91.7% reduction). In the supervised approach as well, XGBoost with balanced data was able to achieve high accuracy in detecting different types of faults, with the rate of classifying faulty states as no‎rmal being almost zero.

كليدواژه هاي فارسي

منحني توان توربين بادي , يادگيري ماشين , سامانه نظارتي و گردآوري داده , تشخيص ناهنجاري , تشخيص عيب

كليدواژه هاي لاتين

Wind Turbine Power Curve , Machine Learning , Supervisory Control an‎d Data Acquisition , Anomaly Detection , Fault Detection

Author

Alireza Aghajani Mobarakeh

SuperVisor

Javad Poshtan

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=33830&Field=0&DTC=6