فرزانه علوي آذر

عنوان

بررسي روند تغييرات موضوعي اسناد علمي در حوزه تجارت الكترونيكي با استفاده از مدل‌سازي موضوعي

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

فناوري اطلاعات - تجارت الكترونيكي

سال تحصيل

94-98

تاريخ دفاع

1398/3/20

استاد راهنما

دكتر عبدالرحمن حائري

استاد مشاور

دكتر سيد مهدي حسيني مطلق

دانشكده

صنايع

چكيده

با افزايش حجم اسناد و اطلاعات و از طرفي نمايش آن‌ها به صورت ديجيتال در قالب اخبار، وبلاگ‌ها، مقالات علمي، كتاب‌هاي الكترونيكي، عكس، صوت و تصوير و شبكه‌هاي اجتماعي پيدا كردن مطالبي كه به دنبال آن هستيم، مشكل مي‌شود. ازاين‌رو نياز به ابزار محاسباتي جديدي براي سازمان‌دهي، جستجو و درك اين حجم عظيم اطلاعات وجود دارد. الگوريتم‌هاي مدل‌سازي موضوعي روش‌هاي آماري هستند كه كلمات داخل يك متن را تحليل كرده و از اين طريق موضوعات داخل متون را استخراج مي‌كنند. هم‌چنين ارتباط اين موضوعات با يكديگر و نيز تغيير آن‌ها در طول زمان را مشخص مي‌كنند. اين الگوريتم‌ها نيازي به هيچ فرض اوليه‌اي در مورد موضوعات متون و يا برچسب‌گذاري متون ندارند. بلكه ورودي آن‌ها متن اصلي است. الگوريتم‌هاي مدل‌سازي موضوعي به ما اين امكان را مي‌دهند تا سازمان‌دهي و خلاصه‌سازي آرشيوهاي الكترونيكي‌مان را در ابعادي كه از عهده‌ي انسان برنمي‌آيد، انجام دهيم. مدل‌سازي موضوعي يك نوع مدل آماري براي كشف "موضوعات" پنهان است و در مجموعه‌اي از اسناد از طريق يادگيري ماشين صورت مي‌گيرد. در حال حاضر، رويكرد تخصيص پنهان ديريكله يك رويكرد مدل‌سازي موضوعي بسيار متداول در اين زمينه است. هدف از ارائه اين پژوهش، بررسي روند تغييرات موضوعي اسناد علمي در حوزه تجارت الكترونيكي با استفاده از روش تخصيص پنهان ديريكله مي باشد. بدين منظور از مقالات مربوط به 10 سال اخير حوزه تجارت الكترونيكي منتشر شده در ژورنال هاي معتبر استفاده شده است.

تاريخ ورود اطلاعات

1398/09/16

عنوان به انگليسي

Analyzing of Topic changes of scientific documents in the field of e-commerce using Topic Modeling

تاريخ بهره برداري

12/7/2019 12:00:00 AM

دانشجوي وارد كننده اطلاعات

فرزانه علوي اذر

Name: فرزانه علوي اذر
Author: فرزانه علوي آذر

چكيده به لاتين

By extremely increasing in the volume of documents and information, and on other hand representing them in digital format in the news, blogs, scientific articles, electronic books, photos, audio and video, and social networks, it’s difficult to find what we’re looking for in these digital archives. Therefore, there is a need for new computing tools to organize, search and understand this massive amount of information. Topic modeling algorithms are statistical methods that analyze the words within a text, thus extracting the topics within the texts. It also identifies the relationship between these topics and their changes over time. These algorithms do not require any initial assumptions about the topic matter of texts or the labeling of texts. But their input is the original text. Topic modeling algorithms allow us to organize and summarize our electronic archives in such a way that it does not come from the point of view of human beings. Topic modeling is a type of statistical model for discovering the latent ‘‘topics’’ that occur in a collection of documents through machine learning. Currently, latent Dirichlet allocation (LDA) is a popular and common modeling approach. In this research, we investigate the procedure of topic changes in scientific documents in the field of e-commerce using latent Dirichlet allocation algorithm. For this purpose, articles about the 10 years e- commerce published in authentic journals have been used.

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=21386&Field=0&DTC=6