شماره ركورد
15261
عنوان
مروري بر روشهاي طبقهبندي تصوير با نمونههاي اندك با استفاده از مدلهاي پايه چندوجهي
سال تحصيل
1402
استاد راهنما
محمد رضا محمدي
چکيده
Multimodal foundation models such as CLIP have demonstrated remarkable zero-shot generalization by aligning images and texts in a shared semantic space. However, their performance often declines in domain-specific or low-resource scenarios due to the scarcity of annotated data. Few-shot adaptation methods have emerged as a promising solution, enabling efficient transfer with minimal supervision. This work reviews and categorizes state-of-the-art few-shot adaptation strategies, including prompt-based tuning, adapter-based methods, training-free and retrieval-augmented approaches, and knowledge-augmented inference. We analyze their core mechanisms, strengths, and limitations, alongside challenges such as domain shift, overfitting, and the balance between efficiency and generalization. Furthermore, we discuss benchmarks, evaluation protocols, and open problems to guide future research. By synthesizing these developments, the seminar highlights pathways toward more robust and adaptable multimodal models for practical applications
نام دانشجو
نوران الحسين
تاريخ ارائه
10/29/2025 12:00:00 AM
متن كامل
88031
پديد آورنده
نوران الحسين
تاريخ ورود اطلاعات
1404/08/10
عنوان به انگليسي
A Review of Few-Shot Image Classification Methods Using Multimodal Foundation Models
كليدواژه هاي لاتين
Multimodal Learning , Few-Shot Learning , CLIP Model , prompt Tuning , Adapter-Based Tuning, , Domain Adaptation