عليرضا اطلاعي

شماره ركورد
27134
پديد آورنده
عليرضا اطلاعي
عنوان
طراحي كنترل‌كننده تحمل‌پذير عيب براي يك راكتور با رويكرد يادگيري تقويتي
مقطع تحصيلي
كارشناسي ارشد
رشته تحصيلي
مهندسي برق - كنترل
سال تحصيل
1398
تاريخ دفاع
1401/04/20
استاد راهنما
دكتر سيد مجيد اسماعيل‌زاده
دانشكده
مهندسي برق
چكيده
در اين پژوهش رويكرد جديدي مبتني بر يادگيري تقويتي در طراحي ساختار كنترل تحمل‌پذير عيب موردتوجه قرار گرفته است. در اين رويكرد سعي شده با تعريف مناسب فضاي حالت عامل يادگيري تقويتي امكان مواجهه با انواع مختلفي از عيوب و اغتشاشات فراهم گردد. اين متغيرهاي حالت شامل خروجي‌هاي كنترل شونده، مانده، خطاي رديابي و نوع عيب است. علاوه بر اين طراحي سيگنال كنترل‌كننده به نحوي انجام شده است كه اين سيگنال از لحاظ دامنه و سرعت مقيد باشد. جهت امكان‌پذيري يادگيري عامل در تعامل با فضاي حالت و فضاي عمل پيوسته از الگوريتم بازيگر نقاد نرم استفاده شده كه يكي از الگوريتم‌هاي جديد در حوزه يادگيري تقويتي عميق به شمار مي‌آيد. جهت بررسي عملكرد ساختار پيشنهادي مدل يك راكتور شيميايي مورد استفاده قرار گرفته است. ديناميك اين راكتور به‌صورت يك دستگاه ديفرانسيل مرتبه چهار غيرخطي با ضرايب متغير مدل‌سازي شده است و يكي از نقاط كار عملياتي آن داراي ويژگي‌هاي چالش برانگيزي از جمله تغيير علامت بهره حالت دائم و تغيير مشخصه‌هاي پايداري ديناميك صفر است. نتايج حاصل از پياده‌سازي ساختار كنترل تحمل‌پذير پيشنهادي بيانگر عملكرد مطلوب اين كنترل‌كننده در مواجهه با انواع مختلفي از عيوب و اغتشاشاتي است كه به‌صورت ناگهاني يا تدريجي به اين سيستم دو ورودي دو خروجي اعمال مي‌گردد.
تاريخ ورود اطلاعات
1401/07/16
عنوان به انگليسي
Design of fault tolerant controller for a reactor with reinforcement learning approach
تاريخ بهره برداري
7/11/2023 12:00:00 AM
دانشجوي وارد كننده اطلاعات
عليرضا اطلاعي
چكيده به لاتين
In this research, a new approach based on reinforcement learning has been considered to design a fault tolerant control structure. In this approach, an attempt has been made to deal with different types of defects and disturbances by properly defining the state-space of the reinforcement learning agent. These state variables include controlled outputs, residue, tracking error, and fault type. In addition, the design of the controller signal has been done in such a way that this signal is limited in terms of amplitude and variation. In order to enable agent learning in interaction with continuous state space and continuous action space, the soft actor critic algorithm is used, which is one of the new algorithms in the field of deep reinforcement learning. A chemical reactor model has been used to eva‎luate the performance of the proposed structure. The dynamics of this reactor is modeled as a nonlinear four-order differential equation with variable coefficients, and one of its operating points has challenging features, including changing the steady-state gain sign and changing zero dynamic stability characteristics. The results of the implementation of the proposed fault tolerant control structure indicate the desirable performance of this controller in the face of various types of faults and disturbances that are applied to the system suddenly or gradually.
كليدواژه هاي فارسي
كنترل تحمل‌پذير عيب , يادگيري تقويتي , كنترل فرآيندهاي صنعتي
كليدواژه هاي لاتين
Fault tolerance control , Reinforcement learning , Industrial processes control
Author
Alireza Ettelaei
SuperVisor
Dr. Seyyed Majid Esmaeilzadeh
لينک به اين مدرک :
http://dl.iust.ac.ir/dL/search/default.aspx?Term=27134&Field=0&DTC=6

کلیه حقوق این اثر برای شرکت مهندسی ارتباطات پيام مشرق محفوظ می باشد