نرجس اسلوبي خواه

عنوان

ارائه مدلي جهت كشف تقلب در سازمان‌هاي بيمه با استفاده از تكنيك‌هاي داده‌كاوي ( مطالعه موردي: بيمه ما )

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي صنايع- گرايش بهينه سازي سيستمها

سال تحصيل

1397

تاريخ دفاع

1399/11/29

استاد راهنما

دكتر مهدي غضنفري

دانشكده

مهندسي صنايع

چكيده

انسان به دليل آسيب‌پذيري در برابر حوادث مختلف و نياز به امنيت خاطر، همواره به دنبال راهكارهايي جهت كاهش پيامدهاي ناشي از حوادث بوده‌است. بيمه به اشخاصي كه متحمل زيان يا حادثه ناخواسته‌اي شده‌اند كمك مي‌كند پيامدهاي ناشي از اين حوادث را جبران كنند. امكان تقلب در بيمه و تعامل آن با انواع قشرهاي اجتماعي باعث شده سوءاستفاده‌هايي از آن صورت گيرد. تقلب يكي از چالش‌هايي است كه شركت‌هاي بيمه همواره با آن روبرو بوده‌اند. به همين دليل شركت‌هاي بيمه به دنبال راهي جهت پيش‌بيني و كشف تقلب به منظور پيش‌گيري از بروز آن هستند. يكي از راه‌هاي شناسايي تقلب، كشف الگوها در داده‌هاي گذشته با استفاده از تكنيك‌هاي داده‌كاوي است. استفاده از اين روش‌ها مي‌تواند در شناسايي حادثه مشكوك به تقلب در صنعت بيمه مفيد باشد. با توجه به هزينه‌هاي بسياري كه شركت‌هاي بيمه بابت خسارت‌هاي جعلي متحمل مي‌شوند، همواره اين شركت‌ها به دنبال راهي به منظور كاهش اين زيان‌ها هستند. در همين راستا اين پژوهش مي‌تواند بر شناسايي تقلب و كاهش زيان‌هاي ناشي از تقلب مؤثر باشد و مبنايي علمي جهت كشف تقلب پيش روي شركت‌هاي بيمه بگذارد. در اين پژوهش در ابتدا با بهره‌گيري از روش‌هاي داده‌كاوي بدون نظارت، داده‌هاي جمع‌آوري‌شده از پرونده خسارت‌هاي مربوط به بيمه بدنه اتومبيل شركت بيمه ما، با استفاده از سه الگوريتم خوشه‌بندي K-Means، K-Modes و AGNES خوشه‌بندي مي‌شوند. پس از تحليل نتايج حاصل از خوشه‌بندي و مقايسه نتايج با نظر خبره، الگوريتم K-Means به عنوان الگوريتم برتر خوشه‌بندي انتخاب شد و بيمه‌گزاران مشكوك به تقلب مورد شناسايي قرار گرفتند. با استفاده از نتايج حاصل از الگوريتم K-Means، بيمه‌گزاران در يك ويژگي دو مقداري جديد، به دو دسته بيمه‌گزاران مشكوك به تقلب و بيمه‌گزاران غير مشكوك به تقلب تقسيم شدند. در قسمت دوم مسئله، با استفاده از روش‌هاي داده‌كاوي با نظارت نظير شبكه عصبي مصنوعي و ماشين‌هاي بردار پشتيبان، به پيش‌بيني و كشف تقلب در مجموعه داده پرداخته مي‌شود و اين دو الگوريتم با استفاده از شاخص دقت و معيار F مورد ارزيابي و مقايسه قرار مي‌گيرند. نتايج نشان مي‌دهد كه الگوريتم شبكه عصبي در مقايسه با الگوريتم ماشين‌هاي بردار پشتيبان، در پيش‌بيني و كشف تقلب در اين پايگاه داده كارايي بهتري دارد.

تاريخ ورود اطلاعات

1400/02/05

عنوان به انگليسي

A model for insurance fraud detection using data mining techniques ( case study: Ma insurance company )

تاريخ بهره برداري

2/17/2021 12:00:00 AM

دانشجوي وارد كننده اطلاعات

نرجس اسلوبي خواه

Name: نرجس اسلوبي خواه
Author: نرجس اسلوبي خواه

چكيده به لاتين

Due to vulnerability to various accidents and the need for security of mind, human beings have always sought solutions to reduce the consequences of accidents. Insurance helps people who have suffered a damage or accident to recover the consequences of those accidents. The possibility of insurance fraud and its interaction with various social strata has led to its abuse. Fraud is one of the challenges that insurance companies have always faced. For this reason, insurance companies are looking for a way to predict and detect fraud in order to prevent its occurrence. One way to detect fraud is to detect patterns in past data using data mining techniques. Using these methods can be useful in identifying fraud in the insurance industry. Given the high costs that insurance companies incur for counterfeit damages, these companies always look for a solution to reduce these damages. So this study can be effective in identifying fraud and reducing the losses caused by fraud and provide a scientific basis for detecting fraud in front of insurance companies. In this study, first by using unsupervised data mining methods, the data collected from the car body insurance claims file of Ma insurance company, are clustered using three clustering algorithms, K-Means, K-Modes and AGNES. After analyzing the results of clustering and comparing the results with the expert opinion, K-Means algorithm was selected as the best clustering algorithm and the insurers suspected of fraud were identified. Using the results of the K-Means algorithm, insurers in a new two-value feature were divided into two categories: insurers suspected of fraud and insurers not suspected of fraud. In the second part of the problem, using supervised data mining methods such as artificial neural network (ANN) and support vector machines (SVM), the prediction and detection of fraud in the data set are discussed and the two algorithms are compared using the accuracy metric. The results show that the artificial neural network algorithm is more efficient in predicting and detecting fraud in this database than the support vector machines algorithm.

كليدواژه هاي فارسي

داده‌كاوي , كشف تقلب در بيمه , خوشه‌بندي , شبكه عصبي مصنوعي , ماشين‌هاي بردار پشتيبان

كليدواژه هاي لاتين

Data Mining , Insurance Fraud Detection , Clustering , Artificial Neural Network , Support Vector Machines

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=23394&Field=0&DTC=6