اميرحسين دژبرو

عنوان

داده‌كاوي براي كشف روند رفتاري و پاسخ اجتماعات در مواجه با حوادث در شبكه اجتماعي با رويكرد ادغام اطلاعات چندبعدي : مورد مطالعاتي هاريكين دوريان

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي صنايع - سيستم هاي كلان

سال تحصيل

1399-1400

تاريخ دفاع

1400/08/08

استاد راهنما

دكتر مهدي غضنفري

استاد مشاور

دكتر بابك اميري

دانشكده

مهندسي صنايع

چكيده

امروزه با گسترش روز افزون استفاده از شبكه‌هاي اجتماعي، هنگام وقوع هرگونه بحران قربانياني كه به‌طور مستقيم و غيرمستقيم تحت تأثير اين فاجعه قرار مي‌گيرند، اغلب حجم وسيعي از داده‌ها (مانند متن، تصاوير، گفتار، ويدئو) را با استفاده از شبكه‌هاي اجتماعي متعدد به اشتراك مي‌گذارند. اين امر به اين دليل است كه رسانه‌هاي اجتماعي اخيراً به يك كانال اصلي ارتباطي بين مردم تبديل‌شده است تا به مردم يا به مراكز مديريت بحران گزارش دهند. مراكز مديريت بحران و سازمان‌هاي مختلف واكنش اضطراري در سال‌هاي اخير معمولاً به‌منظور آگاهي از وضعيت به‌منظور پاسخگويي به بلاياي رخ‌داده استفاده از اين منبع عظيم و در دسترس را در نظر مي‌گيرند. بااين‌حال، با وقوع فاجعه، در عرض چند دقيقه، بسترهاي رسانه‌هاي اجتماعي مملو از انواع مختلف داده‌ها مي‌شود كه اين مراكز با داده‌هاي بزرگ روبرو مي‌شود. علاوه بر اين، در اين داده‌هاي ارسال‌شده، ممكن است اكثريت داده‌ها شامل محتواي اضافي و نامربوط باشد. با اين كار ، منطقي بودن و تصميم‌گيري در مورد داده‌هاي بزرگ موجود براي مديريت مؤثرتر بحران چالش‌برانگيز مي‌شود. باوجود پيشرفت‌هاي اخير درزمينه فناوري، پردازش و تجزيه و تحليل داده‌هاي بزرگ مربوط به فاجعه همچنان يك كار چالش‌برانگيز است. بنابراين، در اين پايان‌نامه، ما بر ارائه يك چهارچوب تحليل داده‌هاي مربوط به شبكه‌ اجتماعي توييتر در هنگام فاجعه و پس از آن با استفاده از روش‌هاي نوين پردازش داده و يادگيري عميق براي تجزيه و تحليل/پردازش داده‌هاي بزرگ رسانه‌هاي اجتماعي براي مديريت كارآمد بلايا تمركز مي‌كنيم. ابتدا داده‌هاي خام توييتر به دو قسمت متن و بقيه ويژگي‌ها تقسيم شده و روي هر قسمت پيش‌پردازش انجام شده. در مرحله بعد از روي داده‌هاي غير متني شبكه ارتباطات بر اساس رابطه بازنشر توييت‌ها ساخته‌شده، مكان كاربران هم با توجه به مكان اعلام شده در صفحه شخصي درج شد، ويژگي‌هاي توييتر مثل تعداد دنبال كنندگان هم در كنار متن تميز شده به‌عنوان 4 نوع منبع اطلاعاتي قرار گرفتند. با توجه به مجموعه دادگان مرجعي، مدل‌هايي مخصوص دامنه حادثه در 3 حوزه احساسات، اطلاع بخش بودن يا نبودن و 11 كلاس طبقه‌بندي شده محتواي انساني ساخته شد و با مفاهيم يادگيري انتقالي استنتاج شد كه مي‌توان از اين مدل‌ها براي طبقه‌بندي دادگان مربوط به حادثه مورد مطالعه استفاده كرد. شبكه‌هاي تشكيل شده در هر روز به‌صورت پويا تحت الگوريتم‌هاي تشخيص اجتماعات قرار گرفتند و الگوريتمي براي دنبال كردن اجتماعات توسعه داده شد و با يك مجموعه دادگان مرجع اعتبار سنجي شد. در نهايت همه خروجي‌هاي داده‌كاوي به‌صورت پايا و پويا در چارچوب ارائه شده تحليلي، تحت ادغام و تحليل قرار گرفتند.

تاريخ ورود اطلاعات

1400/11/16

عنوان به انگليسي

Data Mining to Discover the Community Response and Behavior at Disasters in Social Media with a Multi-dimensional Information Fusion Approach: A Case Study of Hurricane Dorian

تاريخ بهره برداري

10/30/2022 12:00:00 AM

دانشجوي وارد كننده اطلاعات

اميرحسين دژبرو

Name: اميرحسين دژبرو
Author: اميرحسين دژبرو

چكيده به لاتين

Nowadays, with the increasing use of social networks, in the event of any crisis, the victims who are directly and indirectly affected by the disasters, often use large amounts of data (such as text, images, speech, video) in social networks. This is because social media has recently become a major channel of communication between people to report to people or to crisis management centers. Crisis management centers and various emergency response organizations in recent years have often considered using this vast and accessible resource to raise awareness of the situation in order to respond to disasters. However, with a catastrophe, within minutes, social media platforms are filled with different types of data that these centers are faced with. In addition, in this huge data, the majority of the data may contain additional and irrelevant content. By being so, it becomes challenging to be rational and make decisions about available data for more effective crisis management. Despite recent advances in technology, processing and analyzing big data related to disaster remains a challenging task. Therefore, in this dissertation, we present a framework for analyzing Twitter social network data during and after disasters using state-of-the-art data processing and machine learning methods for analyzing social media big data for efficient management. First, the raw Twitter data is divided into two parts, the text and the rest of the features, and preprocessing is done on each part. In the next step, based on non-textual data, the social network (graph) was built based on the relationship of retweets, users' location was also discovered according to the location announced on the personal page, Twitter features such as number of followers, likes and etc. were placed next to the cleaned text as 4 types of information sources. Then the research enters the phase of data mining and knowledge discovery. According to a reference data set, models specific to the scope of the incident were constructed in 3 areas including emotion, informing or not, and 12 classes of humanitarian content. The structure of the retweet network was discovered as a power law distribution and was proved by statistical testing. The networks formed each day were dynamically subjected to community detection algorithms, and an algorithm for community tracking was developed and validated with a set of reference data. Finally, all data mining outputs were analyzed statically and dynamically.

كليدواژه هاي فارسي

مديريت بحران , پاسخ اجتماع , تحليل شبكه اجتماعي , كشف اجتماعات پويا , تحليل محتوا , تحليل توصيفي

كليدواژه هاي لاتين

Crisis Management , Community Response , Descriptive Analysis , Social Network Analysis , Dynamic Community Detection , Content Analysis

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=26059&Field=0&DTC=6