محمد مهدي كامكار

عنوان

تحليل احساس خشم در مصاحبه هاي شغلي ناظران ساختماني با استفاده از پردازش زبان طبيعي

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي عمران - گرايش مهندسي و مديريت ساخت

سال تحصيل

1401

تاريخ دفاع

1404/6/3

استاد راهنما

دكتر مصطفي خانزادي

استاد مشاور

دكتر زهرا كرمي

دانشكده

عمران

چكيده

اين پژوهش به طراحي، پياده‌سازي، و ارزيابي يك سامانه هوشمند مصاحبه‌كننده براي تحليل احساس خشم در مصاحبه‌هاي شغلي ناظران ساختماني با بهره‌گيري از فناوري‌هاي پردازش زبان طبيعي و تشخيص خودكار گفتار اختصاص دارد. هدف اين مطالعه، توسعه سامانه اي خودكار براي شناسايي و ارزيابي پاسخ‌هاي احساسي ناظران در موقعيت‌هاي فرضي شغلي بود تا به مديران عمراني در انتخاب افراد با توانايي مديريت خشم كمك كند. در گام نخست، با همكاري خبرگان روانشناسي و با استفاده از روش دلفي، 8 سؤال روانشناختي طراحي شد تا واكنش‌هاي احساسي، به‌ويژه خشم، را در ناظران برانگيزد. پاسخ‌هاي ممكن از طريق پرسشنامه‌اي از 75 ناظر ساختماني (54 مرد و 21 زن، رده سني 22 تا 55 سال) جمع‌آوري و به سه دسته مثبت (+1، مديريت مناسب خشم)، منفي (-1، واكنش خشمگينانه)، و خنثي (0، بي‌تفاوتي) برچسب‌گذاري شد. براي تبديل صوت به متن، يك سامانه تركيبي شامل مدل گوگل (مدل اصلي، آنلاين) ومدل ويسپر(مدل پشتيبان، آفلاين) پياده‌سازي شد. مدل ويسپر با استفاده از 1407 جمله (1125 نمونه آموزشي و 282 نمونه اعتبارسنجي، كه80درصدجملات مرتبط با صنعت ساخت بودند و 20درصدعمومي) تنظيم شد كه نرخ خطاي كلمه را از 45درصدبه 17٫63درصدكاهش داد. سپس 18 مصاحبه با استفاده از سامانه از ناظرين ساختماني و 2 مصاحبه از سامانه ي چت جي پي تي گرفته شد. در ارزيابي هاي انجام شده با استفاده از معيار هاي دقت، يادآوري وصحت ميزان دقت تبديل صوت به متن براي گوگل و ويسپر به‌ترتيب 96درصدو 82درصدو براي تشخيص برچسب پاسخ‌ها 78درصدو 73درصدبود. تحليل معنايي پاسخ‌ها با مدل مبدل چند زبانه انجام گرفت كه شباهت‌هاي معنايي را با دقت بالا به برچسب ‌هاي مناسب اختصاص داد. نمره نهايي هر مصاحبه‌شونده از جمع جبري برچسب ‌هاي 8 سؤال محاسبه شد تا توانايي مديريت خشم آن‌ها ارزيابي شود. نتايج نشان داد كه سامانه پيشنهادي در مقايسه با روش‌هاي دستي (دقت تقريبي 85-90 درصد، بر اساس مطالعات تشخيص باليني) و مدل‌هاي پيشرفته مانند GPT-4 (دقت 80-87درصددر تحليل‌هاي مشابه) عملكردي قابل‌اعتماد ارائه مي‌دهد. نوآوري اصلي اين پژوهش، بهينه‌سازي سامانه براي زبان فارسي و صنعت ساخت، و استفاده از مدل ويسپر به‌صورت آفلاين براي كاهش وابستگي به منابع محاسباتي است. اين سامانه مي‌تواند به‌عنوان ابزاري كارآمد براي مديران عمراني در فرآيند غربالگري و انتخاب ناظران با مهارت‌هاي احساسي مناسب مورد استفاده قرار گيرد، ضمن اينكه قابليت گسترش به تحليل‌هاي چندوجهي (صوتي-تصويري) را نيز دارد.

تاريخ ورود اطلاعات

1405/02/08

عنوان به انگليسي

The anger sentiment analysis in construction superintendentʹs job interviews using natural language processing (NLP)

تاريخ بهره برداري

8/25/2026 12:00:00 AM

دانشجوي وارد كننده اطلاعات

محمدمهدي كامكار

Name: محمدمهدي كامكار
Author: محمد مهدي كامكار

چكيده به لاتين

This study focuses on the design, implementation, an‎d eva‎luation of an intelligent interview system for analyzing anger in job interviews of construction supervisors using natural language processing (NLP) an‎d automatic speech recognition (ASR) technologies. The objective was to develop an automated system capable of identifying an‎d eva‎luating supervisors’ emotional responses in hypothetical job-related scenarios in order to assist construction managers in selec‎ting can‎didates with effective anger management skills. In the first phase, in collaboration with psychology experts an‎d using the Delphi method, eight psychological questions were designed to elicit emotional reactions—particularly anger—from supervisors. Possible responses were collected through a questionnaire administered to 75 construction supervisors (54 men an‎d 21 women, aged 22–55) an‎d were labeled into three categories: positive (+1, appropriate anger management), negative (–1, angry reaction), an‎d neutral (0, indifference).For speech-to-text conversion, a hybrid system was implemented consisting of Google’s model (primary, online) an‎d the Whisper model (backup, offline). The Whisper model was fine-tuned using 1,407 sentences (1,125 training samples an‎d 282 validation samples; 80% construction-industry-related an‎d 20% general-domain sentences), reducing the word error rate (WER) from 45% to 17.63%. Subsequently, 18 interviews were conducted with construction supervisors using the system, along with 2 interviews generated via ChatGPT for comparative purposes. eva‎luation results based on precision, recall, an‎d accuracy metrics showed that speech-to-text accuracy reached 96% for Google an‎d 82% for Whisper. For response label classification, accuracy was 78% an‎d 73%, respectively. Semantic analysis of responses was performed using a multilingual transformer model, which assigned appropriate labels based on high semantic similarity. The final score for each interviewee was calculated as the algebraic sum of the labels across the eight questions to assess their anger management ability. The results indicate that the proposed system provides reliable performance compared to manual assessment methods (approximately 85–90% accuracy based on clinical detection studies) an‎d advanced models such as GPT-4 (80–87% accuracy in similar analyses). The primary innovation of this research lies in optimizing the system for the Persian language an‎d the construction industry, as well as employing the Whisper model in offline mode to reduce dependence on computational resources. The proposed system can serve as an effective tool for construction managers in the screening an‎d selec‎tion process of supervisors with appropriate emotional competencies, while also offering the potential for future expansion into multimodal (audio-visual) analysis.

كليدواژه هاي فارسي

پردازش زبان طبيعي , تشخيص خودكار گفتار , تحليل احساس

كليدواژه هاي لاتين

Natural Language Processing (NLP) , Automatic Speech Recognition (ASR) , Sentiment Analysis

Author

Muhammad Mehdi Comcar

SuperVisor

Mostafa Khanzadi

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=34739&Field=0&DTC=6