نويد عباسي

عنوان

ارائه مدل پيش‌بيني خوشه مشتريان با استفاده از يادگيري اجماعي (مطالعه موردي:مسافران راه آهن ايران)

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي صنايع- سيستم‌هاي كلان

سال تحصيل

1400

تاريخ دفاع

1403/7/2

استاد راهنما

مهدي غضنفري

استاد مشاور

دانشكده

دانشكده مهندسي صنايع

چكيده

مشتريان مهمترين دارايي يك سازمان هستند. بدون در نظر گرفتن مشترياني كه وفادار مي مانند و روابط خود را با سازمان توسعه مي دهند، هيچ اميدي براي بقا و توسعه يك سازمان وجود نخواهد داشت. به همين دليل است كه يك سازمان بايد استراتژي خاصي را براي حفظ مشتريان و تقويت روابط با آنها به كار گيرد. يكي از اصلي ترين فعاليت ها به جهت رسيدن به اين هدف، درك رفتار منحصر به فرد مشتريان است زيرا ارائه خدمات شخصي‌سازي‌شده‌ بسيار موثرتر و كاراتر از بازاريابي انبوه مي باشد. به منظور دستيابي به اين مهم، سازمان ها از تقسيم بندي مشتريان كه يكي از شاخه هاي اصلي مديريت ارتباط با مشتري مي باشد، استفاده مي كنند. با توجه به رشد فزاينده اطلاعات، بدون استفاده از روش هاي نوين داده كاوي و يادگيري ماشين، راهي براي تقسيم بندي مشتريان وجود نخواهد داشت. به عبارت ديگر روش ها سنتي در اين حوزه ديگر كارايي ندارد و منسوخ شده است. در اين مطالعه، يك مدل دو مرحله اي تركيبي به جهت پيش بيني خوشه مشتريان ارائه مي گردد. لازم به ذكر است مدل اين مطالعه بر اساس روش Crisp پياده سازي مي شود. در ابتدا از داده هاي اوليه مدلي بر اساس RFM توسعه يافته با 6 متغير ساخته مي شود. در مرحله اول مشتريان توسط چند الگوريتم متفاوت به خوشه هاي مختلف تقسيم مي شوند. سپس به جهت پيش بيني خوشه، خروجي مدل اول به الگوريتم يادگيري تجمعي وارد مي شود. هدف اصلي اين مطالعه يافتن بهترين تركيب اين دو مدل بر اساس شاخص هاي ارزيابي دقت است. به جهت ارزيابي دقيق‌تر از مدل ابداعي بر اساس فاصله اقليدسي استفاده مي شود. در نهايت تركيب استفاده از الگوريتم K-Means و XGBoost با دقت 78 درصد به عنوان بهترين تركيب انتخاب مي شود. 6 دسته اين خوشه بندي نيز به صورت كامل تحليل مي گردد. استفاده از يك مدل يادگيري تجمعي به جاي خوشه بندي چند باره از سرعت و دقت بالاتري برخوردار است. همچنين استفاده از اين مدل، از تحليل چند باره خوشه ها جلوگيري مي كند. اين كار باعث مي گردد تا مشتري به محض خريد از سازمان در دسته خود قرار گيرد و از همان ابتدا عمليات بازاريابي به ويژه خدمات شخصي سازي شده به آن ارائه گردد. همچنين اين فعاليت ها باعث افزايش رضايتمندي و در نتيجه وفاداري مشتري شده و در بازار رقابتي امروز مي تواند كمك به دائمي شدن آن مشتري براي سازمان بنمايد. طبيعتا استفاده از مدل هاي جديد مانند مدل ارائه شده در اين طرح باعث پويايي سازمان شده و در نتيجه سازمان را به سمت هدف اصليش كه درآمدزايي بيشتر است، سوق مي دهد.

تاريخ ورود اطلاعات

1403/08/01

عنوان به انگليسي

Presenting a customer cluster prediction model using ensemble learning: a case study of Iranian railway passengers

تاريخ بهره برداري

1/1/1900 12:00:00 AM

دانشجوي وارد كننده اطلاعات

نويد عباسي

Name: نويد عباسي
Author: نويد عباسي

چكيده به لاتين

Customers are the most important asset of an organization. Without customers who remain loyal and develop relationships with the organization, there is no hope for an organization to survive and grow. That is why an organization must adopt a specific strategy to retain customers and strengthen relationships with them. One of the main activities to achieve this goal is to understand the unique behavior of customers because providing personalized services is much more effective and efficient than mass marketing. In order to achieve this goal, organizations use customer segmentation, which is one of the main branches of customer relationship management. Due to the increasing growth of information, without the use of modern data mining and machine learning methods, there will be no way to segment customers. In other words, traditional methods in this field are no longer effective and are obsolete. In this study, a hybrid two-stage model is presented to predict the customer cluster. It should be noted that the model of this study is implemented based on the Crisp method. At first, a model based on the developed RFM with 6 variables is built from the primary data. In the first stage, customers are divided into different clusters by several different algorithms. Then, in order to predict the cluster, the output of the first model is entered into the cumulative learning algorithm. The main goal of this study is to find the best combination of these two models based on accuracy eva‎luation indices. In order to eva‎luate more accurately, the innovative model based on Euclidean distance is used. Finally, the combination of K-Means and XGBoost algorithm is selected as the best combination with 78% accuracy. 6 categories of this clustering are also fully analyzed. Using a cumulative learning model instead of multiple clustering has higher speed and accuracy. Also, the use of this model prevents multiple analysis of clusters. This allows the customer to be placed in his category as soon as he buys from the organization, and marketing operations, especially personalized services, are provided to him from the very beginning. Also, these activities increase customer satisfaction and loyalty, and in today's competitive market, it can help to make that customer permanent for the organization. Naturally, the use of new models such as the one presented in this project will make the organization more dynamic, and as a result, it will lead the organization towards its main goal, which is to generate more income.

كليدواژه هاي فارسي

داده كاوي , يادگيري ماشين , خوشه بندي , يادگيري تجمعي , تقسيم بندي مشتريان

كليدواژه هاي لاتين

data mining , Machine Learning , clustring , Ensemble learning , Customer Segmentation

Author

Navid Abbasi

SuperVisor

Dr. Mahdi Ghazanfari

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=31365&Field=0&DTC=6