زهرا رشيدي

شماره ركورد
30411
پديد آورنده
زهرا رشيدي
عنوان
بهينه‌سازي جانمايي محتوا در شبكه‌هاي سلولي كوچكِ مجهّز به كش با اطلاعات ناكامل
مقطع تحصيلي
دكتري
رشته تحصيلي
مهندسي كامپيوتر
سال تحصيل
1396
تاريخ دفاع
1402/8/20
استاد راهنما
دكتر وصال حكمي
دانشكده
مهندسي كامپيوتر
چكيده
با ﻇﻬﻮر و گسترش ﺑﺮﻧﺎﻣﻪﻫﺎى كاربردي و ﭼﻨﺪرﺳﺎﻧﻪاى ﺟﺪﻳﺪ، يكي از چالش‌هاي عمده شبكه‌هاي سلولي، رشد فزاينده ﺗﺮاﻓﻴﻚ تحويل محتوا به دستگاه‌هاي سيّار است كه مي‌تواند منجر به افزايش تأخير دسترسي به محتوا شود. از ﻃﺮف دﻳﮕﺮ، اﻧﺘﻘﺎل محتوا از ﺳِﺮورﻫﺎي مبدأ ﺑﻪ ﻛﺎرﺑﺮان، ﭘﻬﻨﺎى ﺑﺎﻧﺪ زيادي را از پيوندهاي پُشتي شبكه اِشغال ﻣﻰﻛﻨﺪ. ﺑﺮاى ﻣﻘﺎﺑﻠﻪ ﺑﺎ اﻳﻦ ﭼﺎﻟﺶﻫﺎ، ايده ذﺧﻴﺮهﺳﺎزى (كَش) ﻣﺤﺘﻮا در ايستگاه‌هاي پايه كوچك (SBSها) مطرح شده است. در اين رساله، مسئله جانمايي محتواها در كَش SBSها به شكل يك مسئله بهينه‌سازي با هدف كمينه كردن متوسط تأخير دسترسي كاربران سيّار به محتوا فرمول‌بندي مي‌شود. برخلاف رويكردهاي غالب در كارهاي پيشين كه بر مبناي بهينه‌سازي برون‌خط يا مبتني بر مدل هستند، در اين رساله، رويكرد مبتني بر يادگيري ماشين اتخاذ شده است. در اين رويكرد، مسئله جانمايي محتوا به هر دو صورت متمركز و توزيع‌شده، تحت اطلاعات ناكامل فرمول‌بندي شده و سياست بهينه جانمايي محاسبه مي‌شود. در فرمول‌بندي متمركز، مسئله جانمايي به صورت فرآيند تصميم ماركُف مدل‌سازي مي‌گردد كه در آن، محبوبيت لحظه‌اي محتوا، پروفايل CSI كاربران و نيز ظرفيت لحظه‌اي پيوندهاي پُشتي SBSها به عنوان بردار وضعيت لحظه‌اي سيستم، به صورت پويا و تصادفي با زمان تغيير مي‌كند. با توجه به بُعديت بزرگ فضاي وضعيت سيستم، سياست بهينه جانمايي با استفاده از تقريب تابع ارزش و بر مبناي تكنيك‌هاي يادگيري تقويتي عميق محاسبه مي‌شود كه در نهايت موجب بهبود 12 درصدي در ميزان تأخير كاربران شبكه مي‌گردد. در فرمول‌بندي توزيع‌شده، مسئله جانمايي به صورت بازي پتانسيل ميان SBSها مدل مي‌شود كه در آن، هر SBS در پي ذخيره‌سازي محتوايي است كه متوسط تأخير كاربرانِ در محدوده پوشش آن را كمينه كند. براي محاسبه تعادل نَش بازي، الگوريتمي مبتني بر روال يادگيري چندعاملي ويژه بازي‌هاي پتانسيل ارائه شده كه قادر است تحت اطلاعات ناكامل از پارامترهاي توابع هزينه SBSها، تصميمات جانمايي آنها را به سوي تعادل سوق دهد. اين الگوريتم موجب كاهش 20 درصدي تأخير كاربران مي‌شود. تحت اطلاعات ناكامل، SBSها با پيچيدگي ديگري نيز براي كَش محتوا مواجه هستند و آن، عدم قطعيت در مورد ماهيت يا تركيب جمعيتي كاربران سيّار مي‌باشد. در واقع، ممكن است كسري از كاربران تحت عنوان كاربران متخاصم با ارسال درخواست براي محتواهايي كه در SBS پوشش‌دهنده آن‌ها كَش نشده‌اند، موجب بالا رفتن نسبت عدم موفقيت كَش و ازدحام در پيوندهاي پُشتي شوند. فرمول‌بندي اين مسئله به صورت يك بازي سلسله‌مراتبي دو-سطحي (اِستَكِلبِرگ) ارائه شده كه در اين حالت، محاسبه تعادل اِستَكِلبِرگ از طريق الگوريتم پويايي بهترين پاسخ انجام مي‌شود كه باعث بهبود 15 درصدي در كاهش ازدحام پيوندهاي پُشتي مي‌گردد. در صورت عدم پيروي درخواست‌هاي كاربران متخاصم از يك الگوي راهبردي، مسئله با استفاده از روش قمار چند-بازويي تخاصمي-تركيبياتي مدل شده و يك الگوريتم يادگيري برخط با معيار پشيماني ضعيف جهت ارزيابي آن ارائه مي‌شود. اين الگوريتم موجب بهبود 13 درصدي در افزايش نسبت موفقيت كَش مي‌گردد.
تاريخ ورود اطلاعات
1402/11/10
عنوان به انگليسي
Optimizing Content Placement in Cache-Enabled Small Cell Networks with Incomplete Information
تاريخ بهره برداري
1/1/1900 12:00:00 AM
دانشجوي وارد كننده اطلاعات
زهرا رشيدي
چكيده به لاتين
With the exponential proliferation of mobile users (MUs) and the emergence of many new applications and multimedia services, the content delivery traffic over cellular networks is explosively growing. Such rapid growth increases the content downloading latency as well as the congestion in the backhaul links. To deal with these challenges, caching popular files at the small base stations (SBSs) has proved to be an effective strategy to reduce the content delivery delay and to alleviate the backhaul congestion. In this dissertation, the SBS content placement problem is formulated as an optimization problem with the objective of minimizing the average content delivery latency. Unlike mainstream research, which is mostly based on offline or model-based optimization, we adopt the machine-learning-based approach. The content placement problem with incomplete information is formulated in both centralized and distributed fashions. In the centralized formulation, the placement problem is modeled based on the formalism of Markov decision process (MDP) in which the time-varying system state captures the instantaneous content popularity, the CSI profile of the MUs, and the instantaneous capacity of the SBS backhaul links. Due to the large dimensionality of the system state space, the optimal placement policy will be determined using the approximation of the value function by deep reinforcement learning techniques. In the distributed formulation, the computation of the content placement policy is delegated to the SBSs themselves. To this end, the placement problem is modeled as a potential game among SBSs in which the objective of each SBS is to minimize the average delay of the MUs within its coverage range. In order to compute the Nash equilibrium of the game, we present two algorithms based on multi-agent learning techniques for potential games. Both algorithms can shape the placement strategies of the SBSs to induce a global equilibrium behavior under the incomplete information of the cost function parameters. The first algorithm learns the equilibrium in the joint action space of the SBSs, and the second one operates in the independent action space. Under incomplete information, there is yet another complexity imposed on caching, which concerns the behavioral model of the MUs in fetching contents. In open access cellular communications, the SBSs may be uncertain about whether all MUs whiten their coverage behave legitimately. In fact, some MUs may send requests for contents not cached in their associated SBSs, aiming at increasing the cache miss ratio, and at aggravating the congestion in backhaul links. More precisely, the probability distribution of such requests does not necessarily follow the standard content popularity distribution. When the adversary users’ objective is maximizing the delay in backhaul links, we formulate the problem as a two-level hierarchical game (Steckelberg). In fact, in addition to the game played at the top level by the SBSs to place the appropriate content into their caches, there is a second game played between the SBSs as a group and the adversary users at the bottom level. In this case, the Stackelberg equilibrium is computed through the best response dynamics algorithm. However, when the requests of the adversary users do not follow a strategic pattern, the problem is modeled using the adversarial-combinatorial multi-armed bandit problem and an online learning algorithm is presented with a weak regret criterion to eva‎luate it.
كليدواژه هاي فارسي
اطلاعات بازخوردي , بهينه‌سازي جانمايي محتوا , شبكه‌هاي بي‌سيم , كَش كردن در لبه , يادگيري تقويتي
كليدواژه هاي لاتين
Causal Information , Content Placement Optimization , Edge Caching , Reinforcement Learning , Wireless Networks
Author
Zahra Rashidi
SuperVisor
Dr. Vesal Hakami
لينک به اين مدرک :
http://dl.iust.ac.ir/dL/search/default.aspx?Term=30411&Field=0&DTC=6

کلیه حقوق این اثر برای شرکت مهندسی ارتباطات پيام مشرق محفوظ می باشد