اميرحسين همايوني

عنوان

بررسي كاربرد تكنيك‌هاي متن‌كاوي در تحليل سوانح ريلي

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي ايمني در راه آهن

سال تحصيل

1398

تاريخ دفاع

1401/08/04

استاد راهنما

دكتر مسعود يقيني

دانشكده

دانشكده راه آهن

چكيده

در سال‌هاي اخير با توجه به اهميت تصميم‌گيري داده محور و تاثير آن بر صنايع، جمع‌آوري و نگهداشت داده‌ در صنعت ريلي همانند صنايع ديگر دستخوش تغييرات زيادي شده است. در حال حاضر بعد از هر سانحه گزارشي از آن در پايگاه داده سوانح تهيه مي‌شود كه شامل 83 ستون است. اين ستون‌ها شامل اطلاعات عددي نظير وزن قطار، سرعت هنگام سانحه، مدت زمان مسدودي و تعداد واگن و همچنين شامل اطلاعات طبقه‌بندي شده نظير نوع قطار، نوع سانحه و عوامل تاثيرگذار در سانحه مي‌شود. علاوه بر آن، يك ستون متني غير ساختار يافته حاوي توضيحي از شرح سانحه نيز در پايگاه داده سوانح وجود دارد كه تاكنون مورد بررسي قرار نگرفته است. مطالعه حاضر در راستاي ارتقاي ايمني حمل‌و‌نقل ريلي و كاهش سوانح راه‌آهن جمهوري اسلامي ايران با استفاده از تكنيك‌هاي متن‌كاوي انجام شده است. هدف از انجام اين تحقيق پاسخ به اين سوال است كه آيا مي‌توان با استفاده از تحليل متن، ديد وسيع‌تري نسبت به سوانح و افزايش دقت پيش‌بيني آن يافت يا خير. بدين منظور، ابتدا پيش‌پردازش و بررسي اوليه داده‌هاي سوانح راه‌آهن ايران از سال 1388 تا 1398 انجام شده است. سپس ستون‌هاي مهم داده انتخاب و آماده مدلسازي با هدف پيش‌بيني شدت سانحه شده است. در ادامه، نتايج حاكي از آن است كه استفاده از ستون متني و تحليل آن منجر به افزايش دقت پيش‌بيني شدت سوانح مي‌شود. مدلسازي و پياده‌سازي الگوريتم‌هاي پيش‌بيني شدت سوانح با استفاده از متدولوژي داده‌كاوي CRISP_DM و زبان برنامه‌نويسي پايتون انجام شده است.

تاريخ ورود اطلاعات

1401/11/02

عنوان به انگليسي

Examining the applications of text mining techniques in the analysis of rail accidents

تاريخ بهره برداري

10/26/2023 12:00:00 AM

دانشجوي وارد كننده اطلاعات

اميرحسين همايوني صادقي

Name: اميرحسين همايوني صادقي
Author: اميرحسين همايوني

چكيده به لاتين

In recent years, due to the importance of data-driven decision-making and its impact on industries, data collection and storage in the railway industry, like other industries, has undergone many changes. Nowadays, after every accident, a report is generated in the accident database, which contains 83 columns. These columns include numerical information such as the weight of the train, speed at the time of the accident, the duration of the blockage and the number of cars, and also include classified information such as the type of train, the type of accident, and the factors affecting the accident. In addition, there is an unstructured text column containing an explanation of the accident description in the accident database, which has not been investigated so far. The present study has been carried out in order to improve the safety of rail transportation and reduce railway accidents in the Islamic Republic of Iran using text mining techniques. The purpose of this research is to answer the question of whether it is possible to find a broader view of accidents and increase the accuracy of predicting their severity by using text analysis. For this purpose, first, pre-processing and preliminary analysis of the data of railway accidents in Iran from 2008 to 2018 has been done using statistical methods. Then the important columns of data have been selected and prepared for modeling with the aim of predicting the severity of the accident. Further, the results indicate that the use of the text column and its analysis leads to an increase in the accuracy of predicting the severity of accidents. Modeling and implementation of accident severity prediction algorithms has been done using CRISP_DM data mining methodology and Python programming language.

كليدواژه هاي فارسي

متن‌كاوي، سوانح ريلي، پيش‌بيني شدت سوانح، راه‌آهن جمهوري اسلامي ايران

كليدواژه هاي لاتين

Text mining, railway accidents, predicting the severity of accidents, the railways of the Islamic Republic of Iran

Author

Amirhossein Homayuni

SuperVisor

Dr. Masoud Yaghini

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=27755&Field=0&DTC=6