محمد ميرزائي

شماره ركورد
33076
پديد آورنده
محمد ميرزايي
عنوان
بهبود عملكرد يادگيري فعال با انتخاب داده‌هاي ابتدايي بر اساس مدل‌هاي بنيادين
مقطع تحصيلي
كارشناسي ارشد
رشته تحصيلي
مهندسي كامپيوتر گرايش هوش مصنوعي
سال تحصيل
1401
تاريخ دفاع
1403/11/8
استاد راهنما
محمدرضا محمدي
استاد مشاور
-
دانشكده
كامپيوتر
چكيده
يادگيري فعال به‌ عنوان يكي از تكنيك‌هاي مؤثر براي كاهش هزينه و زمان برچسب‌گذاري در حوزه بينايي ماشين و تشخيص اشياء شناخته مي‌شود. در تشخيص فعال اشياء، انتخاب مجموعه اوليه برچسب خورده بسيار مهم است، به ‌ويژه در مجموعه‌داده‌هاي پيچيده با هزينه بالاي برچسب‌گذاري و بودجه محدود منابع، اين مجموعه اوليه از اهميت بيشتري برخوردار هستند. اين مطالعه، چارچوبي براي يادگيري فعال ارائه مي‌دهد كه بهينه‌سازي چرخه اوليه آموزش براي وظيفه تشخيص اشياء را هدف قرار مي‌دهد. رويكرد ما از روش‌هاي سنتي كه اين مجموعه را به روش تصادفي انتخاب مي‌كنند فاصله گرفته و از خوشه‌بندي مبتني بر ويژگي‌هاي به ‌دست ‌آمده از مدل YOLO-World استفاده مي‌كند سپس تصاوير را بر اساس امتياز «شيء بودن» در مدل YOLO، پالايش مي‌كند. اين رويكرد بر يافتن نمونه‌هايي با ويژگي‌هاي متنوع تمركز دارد و بر آموزش در مناطقي تأكيد مي‌كند كه مدل بنيادين بيشترين اطمينان را دارد. اين كار به مدل كمك مي‌كند تا انواع مختلف اشياء را بهتر درك كند. براي ارزيابي كارايي اين روش، از دو مجموعه‌داده پيچيده به نام‌هاي Aquarium و NWPU VHR-10 استفاده شده است كه مدل بنيادين بدون نمونه آموزشي در آن‌ها به ‌دقت قابل‌توجهي دست پيدا نمي‌كند و به دليل اندازه‌هاي متنوع اشياء و شرايط چالش‌برانگيز، مناسب ارزيابي روش پيشنهادي ما هستند. در چرخه‌هاي بعدي، تكنيكي مبتني بر ناسازگاري براي نمونه‌برداري به كار گرفته مي‌شود تا پايداري فرايند آموزش حفظ شود. در مجموعه‌داده NWPU، روش خوشه‌بندي ما باعث افزايش ميانگين دقت mAP50 به طور متوسط 6.6% در ارزيابي مبتني بر تصوير نسبت به نمونه‌برداري تصادفي مي‌شود. علاوه بر اين، روش خوشه‌بندي همراه با پالايش تصوير، بهبود تقريبي 11% در ارزيابي مبتني بر اشياء نشان مي‌دهد. نتايج آزمايش‌ها، مزاياي استفاده از مدل‌هاي بنيادين در چرخه‌هاي اوليه يادگيري فعال را اثبات مي‌كنند. اين دستاورد مي‌تواند نقش مهمي در توسعه كاربردهاي يادگيري ماشين در شرايط محدوديت منابع و بودجه داشته باشد و بهره‌وري فرايند يادگيري را بهبود بخشد.
تاريخ ورود اطلاعات
1403/12/13
عنوان به انگليسي
Improving Active Learning Performance Through Initial Data selection Based on Foundation Models
تاريخ بهره برداري
1/1/1900 12:00:00 AM
دانشجوي وارد كننده اطلاعات
محمد ميرزائي
چكيده به لاتين
Active learning is recognized as one of the effective techniques for reducing the cost and time of labeling in the fields of computer vision and object detection. In active object detection, the selection of the initial labeled dataset is crucial, especially in complex datasets with high labeling costs and limited resources. This initial dataset holds even greater importance. This study presents a framework for active learning aimed at optimizing the initial training cycle for object detection tasks. Unlike traditional methods that select this dataset randomly, our approach leverages clustering based on features extracted from the YOLO-World model and refines the images using an "objectness" score obtained from the YOLO model. The proposed method focuses on identifying samples with diverse features and prioritizing training in areas where the foundation model exhibits the highest confidence. This helps the model better understand a variety of objects. To eva‎luate the effectiveness of the proposed method, two complex datasets, Aquarium and NWPU VHR-10, were utilized. These datasets challenge the foundation model, as it achieves low accuracy without training samples, and their diverse object sizes and challenging conditions make them suitable for testing our approach. In subsequent cycles, an inconsistency-based sampling technique is employed to maintain the stability of the training process. On the NWPU dataset, our clustering-based method improves the mean average precision (mAP50) by an average of 6.6% in image-based eva‎luations compared to random sampling. Furthermore, the clustering method combined with image refinement achieves an approximate 22% improvement in object-based eva‎luations. Experimental results demonstrate the advantages of using foundation models in the initial cycles of active learning. This achievement can play a significant role in advancing machine learning applications under resource and budget constraints and improving the efficiency of the learning process.
كليدواژه هاي فارسي
يادگيري فعال , مدل‌هاي بنيادين , تشخيص اشياء , يادگيري عميق، , برچسب‌گذاري داده‌ها
كليدواژه هاي لاتين
active learning , foundation models , object detection , deep learning , data labeling
Author
Mohammad Mirzaee
SuperVisor
Mohammad Reza Mohammadi
لينک به اين مدرک :
http://dl.iust.ac.ir/dL/search/default.aspx?Term=33076&Field=0&DTC=6

کلیه حقوق این اثر برای شرکت مهندسی ارتباطات پيام مشرق محفوظ می باشد

محمد ميرزايي

محمد ميرزائي