علي بحري

عنوان

طبقه بندي تصاوير هوايي با استفاده از يادگيري عميق

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

الكترونيك - سيستم هاي الكترونيك ديجيتال

سال تحصيل

۱۳۹۵

تاريخ دفاع

۱۳۹۷/۱۲/۱۸

استاد راهنما

دكتر كريم محمدي

دانشكده

برق

چكيده

در سال هاي اخير، از تكنيك انتقال يادگيري براي به دست آوردن دقت هاي بالا در طبقه بندي تصاوير هوايي استفاده شده است. در اين تكنيك، از قسمت استخراج كننده ويژگي براي به دست آوردن ويژگي هاي سطح بالا استفاده مي شود و نيز با جايگزين كردن يك شبكه به جاي كلاسه بند از قبل آموزش ديده مي توان آن را در كاربرد مورد نظر استفاده كرد. زماني كه شبكه از ابتدا بر روي داده مورد نظر آموزش داده مي شود به داده هاي بيشتري جهت عملكرد خوب نياز دارد. براي حل اين مشكل، در اين پژوهش، شبكه جستجوي معماري عصبي موبايل كه بر روي تصاوير Imagenet ، آموزش ديده شده است، به عنوان استخراج كننده ويژگي پيشنهاد مي گردد. هم چنين يك شبكه پرسپترون چند لايه به عنوان كلاسه بند طراحي مي گردد. ضمنا، در اين پژوهش ،يك تابع هزينه جديد جهت بهبود نتايج پيشنهاد مي شود. درواقع در اين تابع هزينه يك ترم جديد بهCross-Entropy(CE) جهت بهبود عملكرد اضافه شده است. در تابع هزينه پيشنهاد شده بر خلاف CE ساده از ميزان تفاوت احتمالي بين كلاس اشتباه تشخيص داده شده و كلاس صحيح استفاده مي شود تا تمركز شبكه را روي تصاوير پيچيده افزايش دهد. همان طور كه در نتايج نشان داده شده است، روش پيشنهادي ما به دقت ۹۹/۵۴٪، ۹۸/۱۰٪ و ۹۳/۵۶٪ به لحاظ دقت كلي در مجموعه داده هاي UCMerced land-use، AID و NWPU-RESISC45 به ترتيب رسيده است. اين روش بهبود نسبي ۰/۶۲٪، ۱/۱۱٪ و ۱/۷۵٪ را در مقايسه با بهترين نتايجات گزارش شده نشان مي دهد. اين روش به بالاترين دقت در بين روش هاي ديگر رسيده است. علاوه بر اين، اين شبكه (استخراج كننده ويژگي) بر خلاف شبكه هايي مانند VGG (۱۴ ميليون پارامتر)، ResNet50 (۲۳ ميلون پارامتر) و ... تعداد پارامترهاي بسيار كمتري دارد (حدود ۴ ميليون پارامتر) كه اين موجب مي شود پياده سازي از لحاظ حافطه حداقل ۳/۵ برابر بهبود يابد. كلمات كليدي: تصاوير هوايي، شبكه جستجوي معماري عصبي ، استراتژي انتقال يادگيري، پرسپترون چند لايه و Cross-Entropy

تاريخ ورود اطلاعات

1398/03/11

عنوان به انگليسي

Remote sensing image classification using deep learning

تاريخ بهره برداري

3/9/2019 12:00:00 AM

دانشجوي وارد كننده اطلاعات

علي بحري

Name: علي بحري
Author: علي بحري

چكيده به لاتين

The Transfer Learning Technique has been adopted in recent years to obtain high precision classification in the field of aerial imagery. In this technique, the descriptor (a pre-trained ConvNet) is employed to acquire high-level features, which can be adapted to the given application by replacing the pre-trained classifier with a network. To achieve a good performance, more data is required if the network is being trained on an arbitrary set of data from scratch. To overcome this issue, in this paper the Neural Architecture Search Network Mobile (NasNet Mobile), trained with ImageNet images, is proposed as the descriptor. Moreover, an MLP network is designed in the form of a classifier. In addition, a new loss function is proposed in this paper to further improve the results by adding a new term to the categorical-crossentropy (CE). In the proposed loss function, in contrast with the conventional CE, the probability difference between the misclassified category and the category correctly classified is utilized to enhance the network focus on complex images. As the results indicate, our proposed method achieves 99.54, 98.10, and 93.56 percent (in terms of Overall Accuracy) on UC Merced land-use, AID, and NWPU-RESISC45 dataset, respectively. This indicates a relative improvement of 0.62%, 1.11%, and 1.75% compared to the best results reported in the literature, and thus sets a new state-of-the-art result. In addition, unlike the networks such as VGG (14 million parameters), ResNet50 (23 million parameters), etc., the proposed descriptor network possess a considerably fewer number of parameters (about 4 million), which improves the implementation at least 3.5 times in terms of memory. Keywords: Aerial imagery, NASNet Mobile architecture, Transfer Learning strategy, MLP and categorical-crossentropy

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=20586&Field=0&DTC=6