چكيده به لاتين
The expansion of science and technology and the possibility of interaction and communication among researchers have led us to conduct numerous and varied researches in different fields of science and to meet a variety of research needs and demands. In order to manage the research system, it is necessary to have sufficient knowledge of current and future research demands and needs and existing research sources. Therefore, numerous scientometric studies have been carried out with the aim of identifying research trends, determining research gaps and needs, and assessing and prioritizing the research topics.
This research attempts to apply text mining techniques and natural language processing to provide an appropriate solution based on the content analysis of research supply and demands and the analysis of information seeking behavior. This solution could identify the research gaps via a machinery process and independent from experts’ knowledge.
For this purpose, the “environment” field is selected as the case study and the theses and dissertations available at the IRANDOC repository and the searches made by users of this database are used as the research data.
Different techniques of natural language process sing, text mining, data mining, topic modeling, and bibliographic and content analysis were used to draw the boundary for research studies and identify and analyze the topics in this field. Finally, research topics of this field were identified and their bibliographic analysis were presented. These topics were also monitored from the perspective of supply and demand of the research and provided a method for drawing and analyzing the current and future status of these topics. The results showed that the NGTF2 weighting method presented in this study will improve the classification performance. Nearly 1,2 million queries and more than 100,000 studies were identified as environmental resources through the classification approach, and 20 topics were identified and extracted using topic modeling technique. After evaluating research supply and demand, it was found that the topics of "Industry", "River", "International Law" and "Waste" in the field of environment are gap topics and considering the behavior of users of the repository, the prediction of the future status of different topics was also presented.
This study could support policy makers and research managers in making decisions and identifying research needs and priorities. Moreover, due to the interdisciplinary approach and the hybrid method of this research, it is applicable for information technology, artificial intelligence, library and information science, technology and science policy making, etc. researchers.