محمدحسين كرباسي

عنوان

ارايه مدلي براي مديريت انرژي در خانه هوشمند با استفاده از روش هاي يادگيري تقويتي.

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

رشته مهندسي صنايع گرايش سيستم هاي اطلاعاتي

سال تحصيل

1402

تاريخ دفاع

1404/6/8

استاد راهنما

دكتر محمدرضا رسولي

استاد مشاور

دكتر هادي صاحبي

دانشكده

دانشكده مهندسي صنايع

چكيده

با رشد روزافزون تقاضاي انرژي در بخش ساختمان و توسعه منابع تجديدپذير، سيستم‌هاي مديريت انرژي خانگي نقش حياتي در بهينه‌سازي مصرف و كاهش هزينه‌ها ايفا مي‌كنند. با اين حال، چالش‌هايي نظير عدم قطعيت در توليد انرژي خورشيدي، نوسانات پوياي قيمت برق و پيچيدگي كنترل همزمان تجهيزات مختلف، كارايي روش‌هاي كنترلي سنتي و مبتني بر مدل را محدود ساخته است. اين پژوهش با هدف رفع اين چالش‌ها، يك مدل جامع مبتني بر يادگيري تقويتي عميق با استفاده از الگوريتم گراديان سياست قطعي عميق ارائه مي‌دهد كه قادر است در فضاي عمل پيوسته و بدون نياز به مدل فيزيكي دقيق ساختمان، مديريت يكپارچه منابع را بر عهده گيرد.در مدل پيشنهادي، سه مؤلفه كليدي شامل سيستم تهويه مطبوع ، سيستم ذخيره‌سازي انرژي و پمپ آب هوشمند به‌صورت همزمان كنترل مي‌شوند. نوآوري اصلي اين تحقيق، گنجاندن پمپ آب به عنوان يك بار قابل‌انعطاف و طراحي يك تابع پاداش چندهدفه است كه تعادلي پويا ميان اهداف متضادِ كاهش هزينه، حفظ آسايش حرارتي و افزايش طول عمر تجهيزات برقرار مي‌كند. فرآيند آموزش عامل در يك محيط شبيه‌سازي‌شده با داده‌هاي واقعي و در حضور نويزهاي محيطي انجام پذيرفت. نتايج شبيه‌سازي‌ها نشان مي‌دهد كه مدل پيشنهادي عملكردي پايدار و همگرا داشته و توانسته است هزينه كل انرژي مصرفي را تا 4/25 درصد كاهش دهد. همزمان، اين سيستم موفق شده است در 2/99 درصد از زمان‌ها، دماي محيط داخلي را در محدوده آسايش مطلوب كاربران حفظ نمايد. علاوه بر اين، سياست‌هاي آموخته‌شده توسط عامل هوشمند، منجر به رفتار محافظه‌كارانه در مديريت باتري (جهت كاهش استهلاك) و تأمين حدود 27 درصد از انرژي مورد نياز از طريق فروش مازاد برق خورشيدي به شبكه شده است. يافته‌هاي اين پژوهش نشان‌دهنده برتري و استحكام رويكرد DDPG در مواجهه با عدم قطعيت‌ها و نويزهاي محيطي نسبت به روش‌هاي پيشين است.

تاريخ ورود اطلاعات

1404/09/30

عنوان به انگليسي

Proposing a Model for Energy Management in a Smart Home Using Reinforcement Learning Methods

تاريخ بهره برداري

8/30/2026 12:00:00 AM

دانشجوي وارد كننده اطلاعات

محمدحسين كرباسي

Name: محمدحسين كرباسي
Author: محمدحسين كرباسي

چكيده به لاتين

With the increasing growth of energy deman‎d in the building sector an‎d the expansion of renewable energy resources, home energy management systems play a vital role in optimizing energy consumption an‎d reducing costs. However, challenges such as uncertainty in solar power generation, dynamic fluctuations in electricity prices, an‎d the complexity of simultaneously controlling multiple devices have limited the effectiveness of traditional model-based control methods. To address these challenges, this study proposes a comprehensive framework based on deep reinforcement learning using the Deep Deterministic Policy Gradient (DDPG) algorithm. The proposed approach is capable of managing integrated energy resources in a continuous action space without requiring an accurate physical model of the building. In this model, three key components—namely the heating, ventilation, an‎d air conditioning (HVAC) system, the energy storage system (ESS), an‎d a smart water pump—are controlled simultaneously. The main innovation of this research lies in incorporating the water pump as a flexible load an‎d designing a multi-objective reward function that dynamically balances conflicting goals, including minimizing energy costs, maintaining thermal comfort, an‎d extending equipment lifespan. The agent training process is conducted in a simulated environment using real-world data an‎d in the presence of environmental noise. Simulation results demonstrate that the proposed model exhibits stable an‎d convergent performance, achieving up to a 25.4% reduction in total energy cost. At the same time, the system successfully maintains indoor temperature within the desired comfort range for 99.2% of the time. Furthermore, the learned policies lead to conservative battery management behavior—reducing degradation—an‎d enable approximately 27% of the required energy to be supplied through selling surplus solar power to the grid. Overall, the findings highlight the superiority an‎d robustness of the DDPG-based approach in han‎dling uncertainties an‎d environmental noise compared to previous methods.

كليدواژه هاي فارسي

مديريت انرژي خانگي , يادگيري تقويتي عميق , الگوريتم DDPG , خانه هوشمند , بهينه‌سازي چندهدفه , عدم قطعيت

كليدواژه هاي لاتين

Home Energy Management , Deep Reinforcement Learning , DDPG Algorithm , Smart Home , Multi-objective Optimization , Uncertainty

Author

Mohammadhosein Karbasi

SuperVisor

DR.Mohammadreza Rasouli

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=34282&Field=0&DTC=6