محمد ربيعي

عنوان

شناسايي كاستي‌هاي پژوهش با به‌كارگيري تكنيك‌هاي متن‌كاوي و تحليل رفتار جويشگري كاربران (موردمطالعه: محيط‌زيست ايران)ت

مقطع تحصيلي

دكتري

رشته تحصيلي

مهندسي فناوري اطلاعات

سال تحصيل

۱۳۹۳-۱۳۹۸

تاريخ دفاع

۱۳۹۸/۰۷/۱۵

استاد راهنما

دكتر سيدمهدي حسيني‌مطلق

استاد مشاور

دكتر بهروز مينايي بيدگلي - دكتر عبدالرحمن حاِيري

دانشكده

صنايع

چكيده

گسترش علم و فناوري و امكان تعامل و ارتباط بين پژوهشگران و دانشمندان موجب شده است تا از يك‌سو شاهد تعدد و گوناگوني پژوهش‌هاي صورت پذيرفته در حوزه‌هاي مختلف علوم باشيم و از سوي ديگر با تنوع نيازها و تقاضاهاي پژوهشي مواجه شويم. به‌منظور مديريت نظام پژوهش، داشتن شناخت كافي نسبت به تقاضاها و نيازهاي پژوهشي و توانمندي‌ها و منابع پژوهشي موجود ضروري است. ازاين‌رو، پژوهش‌هاي علم‌سنجي متعددي با هدف شناسايي روندهاي پژوهش، تعيين كاستي‌هاي پژوهش و نيازسنجي و اولويت‌بندي پژوهش‌ها انجام شده است. اين پژوهش درصدد است راهكار مناسبي را مبتني بر تحليل محتواي پژوهش‌هاي موجود و تقاضاهاي جامعه پژوهشي از طريق كاربست تكنيك‌هاي متن‌كاوي و پردازش زبان طبيعي و تحليل رفتار جويشگري كاربران ارائه نمايد كه از طريق آن كاستي‌هاي پژوهشي به‌صورت مستقل از نظر خبرگان و در فرايندي رايانشي شناسايي شود. براي اين منظور حوزه محيط‌زيست به عنوان موردمطالعه انتخاب شد و منابع پژوهشي موجود در پايگاه ايرانداك و نيز جستجوهاي صورت پذيرفته توسط كاربران اين پايگاه به عنوان داده‌هاي پژوهش مورداستفاده قرار گرفت. به‌منظور شناسايي پژوهش‌هاي مرتبط با حوزه موردنظر و نيز شناسايي و تحليل موضوعات موجود در اين حوزه، از تكنيك‌هاي مختلف پردازش زبان طبيعي، متن‌كاوي، داده‌كاوي، مدل‌سازي موضوعي و تحليل كتاب‌شناختي و محتوايي منابع استفاده شد. درنهايت موضوعات مختلف اين حوزه شناسايي و تحليل كتاب‌شناختي آن‌ها ارائه گرديد. همچنين اين موضوعات از منظر عرضه و تقاضاي پژوهش مورد رصد قرار گرفت و روشي براي ترسيم و تحليل وضعيت كنوني و آينده اين موضوعات ارائه گرديد. نتايج نشان داد كه روش وزن‌دهي NGTF2 كه در اين پژوهش ارائه شده است، موجب كارايي بالاتر مدل رده‌بند خواهد شد، همچنين از طريق رده‌بندي تك‌رده‌اي نزديك به يك ميليون و دويست هزار جستجو و بيش از يكصد هزار منبع پژوهشي مرتبط با حوزه محيط‌زيست شناسايي شد و با استفاده از مدل‌سازي موضوعي 20 موضوع در اين حوزه شناسايي و استخراج شد كه پس از ارزيابي عرضه و تقاضاي اين موضوعات، مشخص شد كه موضوعات «صنعت»، «رودخانه»، «حقوق بين‌الملل» و «زباله» در حوزه محيط‌زيست داراي كاستي پژوهش هستند و با توجه به رفتار كاربران اين پايگاه، پيش‌بيني وضعيت آتي حوزه‎هاي مختلف نيز ارائه شد. پژوهش ارائه‌شده مي‌تواند سياست‌گذاران و مديران پژوهشي را در اتخاذ تصميمات كلان پژوهشي و تعيين نيازها و اولويت‌بندي پژوهشي ياري رساند. همچنين به دليل رويكرد ميان‌رشته‌اي و روش تلفيقي مورداستفاده، پژوهشگران و علاقه‌مندان حوزه‌هاي مختلف فناوري اطلاعات، هوش مصنوعي، كتابداري و اطلاع‌رساني، سياست‌گذاري علم و فناوري و ... مي‌توانند از نتايج آن بهره‌مند شوند.

تاريخ ورود اطلاعات

1398/08/06

عنوان به انگليسي

Using Text Mining and Behavior Analysis Techniques for Research Gaps Identification (Case Study: Iranian Theses and Dissertations)

تاريخ بهره برداري

10/6/2020 12:00:00 AM

دانشجوي وارد كننده اطلاعات

محمد ربيعي

Name: محمد ربيعي
Author: محمد ربيعي

چكيده به لاتين

The expansion of science and technology and the possibility of interaction and communication among researchers have led us to conduct numerous and varied researches in different fields of science and to meet a variety of research needs and demands. In order to manage the research system, it is necessary to have sufficient knowledge of current and future research demands and needs and existing research sources. Therefore, numerous scientometric studies have been carried out with the aim of identifying research trends, determining research gaps and needs, and assessing and prioritizing the research topics. This research attempts to apply text mining techniques and natural language processing to provide an appropriate solution based on the content analysis of research supply and demands and the analysis of information seeking behavior. This solution could identify the research gaps via a machinery process and independent from experts’ knowledge. For this purpose, the “environment” field is selected as the case study and the theses and dissertations available at the IRANDOC repository and the searches made by users of this database are used as the research data. Different techniques of natural language process sing, text mining, data mining, topic modeling, and bibliographic and content analysis were used to draw the boundary for research studies and identify and analyze the topics in this field. Finally, research topics of this field were identified and their bibliographic analysis were presented. These topics were also monitored from the perspective of supply and demand of the research and provided a method for drawing and analyzing the current and future status of these topics. The results showed that the NGTF2 weighting method presented in this study will improve the classification performance. Nearly 1,2 million queries and more than 100,000 studies were identified as environmental resources through the classification approach, and 20 topics were identified and extracted using topic modeling technique. After evaluating research supply and demand, it was found that the topics of "Industry", "River", "International Law" and "Waste" in the field of environment are gap topics and considering the behavior of users of the repository, the prediction of the future status of different topics was also presented. This study could support policy makers and research managers in making decisions and identifying research needs and priorities. Moreover, due to the interdisciplinary approach and the hybrid method of this research, it is applicable for information technology, artificial intelligence, library and information science, technology and science policy making, etc. researchers.

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=21226&Field=0&DTC=6