حنانه سادات معجزي

عنوان

مديريت انرژي در خودروهاي هيبريدي با استفاده از يادگيري تقويتي

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي سيستم هاي انرژي -انرژي و محيط زيست

سال تحصيل

1404

تاريخ دفاع

1404/07/01

استاد راهنما

دكتر ميثم فرج الهي

استاد مشاور

استاد مشاور نداشته ام

دانشكده

فناوري هاي نوين

چكيده

خودروهاي هيبريدي با تركيب موتور احتراقي و الكتريكي، ضمن كاهش چشمگير مصرف سوخت و انتشار آلاينده‌ها، توانايي ارتقاي كارايي ديناميكي را نيز دارا هستند. بهره‌گيري از يادگيري ماشين و به‌ويژه يادگيري تقويتي عميق اين امكان را مي‌دهد سامانه‌هاي مديريت انرژي اين خودروها، با تحليل داده‌هاي رانندگي و پيش‌بيني نياز توان، تخصيص بهينه‌ي انرژي را در زمان واقعي انجام دهند. اين رويكرد هوشمند، ضمن حفظ سلامت باتري و كاهش هزينه‌هاي عملياتي، نقشي محوري در بهبود پايداري و كاهش اثرات زيست‌محيطي حمل‌ونقل ايفا مي‌كند. در اين پژوهش، يك سامانه‌ي هوشمند مديريت انرژي براي خودروي هيبريدي تويوتا پريوس مبتني بر يادگيري تقويتي عميق توسعه يافته است. ابتدا، مدل فيزيكي دقيقي از اجزاي پيشرانه شامل موتور احتراقي ، موتور الكتريكي ، ژنراتور و باتري طراحي شد. سپس سه روش يادگيري تقويتي عميق شامل شبكه عصبي Q عميق ، كنترل نرم بازيگر-منتقد و گراديان سياست قطعي عميق به‌همراه تنظيم خودكار پارامترها روي چرخه‌ رانندگي استاندارد شبيه‌سازي شدند. تابع پاداش، تركيبي مشتمل بر كمينه‌سازي مصرف سوخت ، كاهش هزينه‌ي عملكرد موتور احتراقي و جريمه‌ي خروج وضعيت شارژ باتري از محدوده‌ي تعريف شده(6/0 -8/0)و براي سنجش پايداري، انحراف معيار مصرف سوخت و دامنه‌ي نوسان وضعيت شارژ به‌عنوان معيارهاي كمكي در نظر گرفته شدند. نتايج نشان داد كه الگوريتم كنترل نرم بازيگر-منتقد با ميانگين مصرف سوخت 135/4 ليتر در هر صد كيلومتر، بهترين عملكرد را در كاهش مصرف سوخت ارائه مي‌كند. الگوريتم گراديان سياست قطعي عميق در حفظ وضعيت شارژ باتري برتر است و ميانگين وضعيت شارژ‌ باتري در آن به حدود 615/0 مي‌رسد. الگوريتم شامل شبكه عصبي Q عميق ، به دليل ميانگين مصرف بالاتر (348/4 ليتر در صد كيلومتر) و پراكندگي گسترده در وضعيت شارژ باتري، براي كاربردهاي عملي در خودروهاي هيبريدي مناسب به نظر نمي‌رسد. تركيب دو الگوريتم كنترل نرم بازيگر-منتقد و گراديان سياست قطعي عميق براي تضمين وضعيت شارژ راهكاري مؤثر براي سامانه‌هاي مديريت انرژي نسل آينده خودروهاي هيبريدي ارائه مي‌دهد.

تاريخ ورود اطلاعات

1404/09/05

عنوان به انگليسي

Energy management of hybrid electric vehicle using reinforcement learning

تاريخ بهره برداري

9/23/2026 12:00:00 AM

دانشجوي وارد كننده اطلاعات

حنانه سادات معجزي

Name: حنانه سادات معجزي
Author: حنانه سادات معجزي

چكيده به لاتين

Hybrid vehicles, by combining an internal combustion engine an‎d an electric motor, not only significantly reduce fuel consumption an‎d pollutant emissions but also have the capability to enhance dynamic performance. The use of machine learning—an‎d particularly deep reinforcement learning—enables the energy-management systems of these vehicles to perform real-time optimal energy allocation by analyzing driving data an‎d predicting power deman‎d. This intelligent approach, while preserving battery health an‎d reducing operational costs, plays a pivotal role in improving sustainability an‎d reducing the environmental impacts of transportation. In this study, an intelligent energy-management system for the Toyota Prius hybrid vehicle was developed based on deep reinforcement learning. First, an accurate physical model of the powertrain components—including the internal combustion engine, electric motor, generator, an‎d battery—was designed. Then, three deep reinforcement learning methods—Deep Q-Network (DQN), Soft Actor-Critic (SAC), an‎d Deep Deterministic Policy Gradient (DDPG)—along with automatic parameter tuning, were simulated on a stan‎dard driving cycle. The reward function was formulated as a combination of minimizing fuel consumption, reducing the operational cost of the internal combustion engine, an‎d penalizing deviations of the battery’s state of charge (SOC) from the defined range (٪60–٪80). To assess stability, the stan‎dard deviation of fuel consumption an‎d the SOC fluctuation range were included as auxiliary metrics The results showed that the Soft Actor-Critic algorithm, with an average fuel consumption of 4.135 liters per 100 kilometers, delivered the best performance in reducing fuel usage. The DDPG algorithm excelled in maintaining battery SOC, achieving an average SOC of approximately 0.615. The DQN-based algorithm, due to its higher average fuel consumption (4.348 liters per 100 kilometers) an‎d large SOC variability, does not appear suitable for practical hybrid-vehicle applications. Combining the SAC an‎d DDPG algorithms provides an effective solution for ensuring SOC maintenance in next-generation hybrid-vehicle energy-management systems.

كليدواژه هاي فارسي

خودرو هيبريدي ،يادگيري تقويتي عميق ،مصرف سوخت، وضعيت شارژ باتري

كليدواژه هاي لاتين

Hybrid vehicle, Deep reinforcement learning, Fuel consumption, Battery state of charge (SOC)

Author

Hananeh Sadat Mojezi

SuperVisor

Dr. Meisam Farajoali

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=34081&Field=0&DTC=6