حميد كشميري

عنوان

شناسايي موضوع سه بعدي در داده‌هاي تصويربرداري ليدار با استفاده از شبكه‌هاي عصبي كانولوشن

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

مهندسي برق- الكترونيك ـ ديجيتال

سال تحصيل

1399

تاريخ دفاع

1402/10/25

استاد راهنما

شهريار برادران شاه حسيني

استاد مشاور

ندارم

دانشكده

مهندسي برق

چكيده

تكنولوژي، خودروهاي خودران در صنعت خودرو به تازگي رشد يافته‌اند و اساس اين تكنولوژي نحوه تصويربرداري خودروهاي خودران مي‌باشد. در تصويربرداري ليدار امكان تشخيص دقيق فاصله با شي در بردهاي طولاني مهيا شده است. در خروجي تصويربرداري ليدار تعداد نقاط زيادي است كه به آن ابرنقطه‌اي گفته مي شود. از اينرو، تشخيص نقاط كليدي از ابرهاي نقطه‌اي حاصل از تصويربرداري نقش مهمي را در فاصله سنجي دقيق ايفا مي‌كند. جهت تشخيص نقاط كليدي با استفاده از شبكه‌هاي عصبي، شبكه¬هاي عصبي رمزگذار (CAE) بررسي شده است. ورودي اين نوع شبكه¬هاي عصبي رمزگذار، ابر نقطه‌اي سه¬بعدي است. كه با پردازش ابرنقطه¬اي به نقاط كليدي و ويژگي¬هاي ابرنقطه¬اي دست يافته مي¬شود. بر اساس ويژگي‌هاي استخراج شده، فاصله اوليه ناشي از تخمين موقعيت فريم به فريم تعيين مي‌شود و از طرفي نقاط كليدي توسعه يافته (EIPS) جهت اصلاح فاصله¬سنجي در فريم‌هاي كليدي به كار مي‌رود. مجموعه داده‌ها، از داده‌هاي KITTI اخذ شده است كه رايج‌ترين مجموعه داده‌ها در صنعت تصويربرداري خودروهاي خودران است. با توجه به اين نوع داده‌ها، مقدارخطاي ناشي از جابجايي نقاط كليدي آموزش داده شده 054/0 متر و مقدار خطاي ناشي از چرخش نقاط كليدي 179/0 درجه در هر فريم مي باشد. مقدار تكرار الگوريتم نيز به 100 رسيده است كه درمقايسه با ديگر روش‌ها 30 درصد كمتر گرديده است در نتيجه به سرعت پردازش بالا ومصرف حافظه كمتر مي انجامد از طرفي شاخص‌هايي نظير كاهش مصرف فضاي حافظه و سرعت پردازش بالا، ملاك مهم ديگر است كه با عنوان معيار نرخ موفقيت عمليات بيان مي‌شود.

تاريخ ورود اطلاعات

1403/02/16

عنوان به انگليسي

3D object recognition based on Lidar imaging data using convolutional neural networks

تاريخ بهره برداري

1/14/2025 12:00:00 AM

دانشجوي وارد كننده اطلاعات

حميد كشميري

Name: حميد كشميري
Author: حميد كشميري

چكيده به لاتين

The technology of self-driving cars has recently developed in the automotive industry, and the basis of this technology is the imaging of self-driving cars. In lidar imaging, it is possible to accurately detect the distance to the object in long ranges. There are many points in the lidar imaging output, which are called hyperpoints. Therefore, the detection of key points from point clouds obtained from imaging plays an important role in accurate distance measurement. In order to detect key points using neural networks, Encoder Neural Networks (CAE) have been investigated. The input of this type of encoder neural network is a three-dimensional point cloud. which is achieved by hyperpoint processing to key points and hyperpoint features. Based on the extracted features, the initial distance resulting from frame-by-frame position estimation is determined, and on the other hand, extended key points (EIPS) are used to correct distance measurement in key frames. The dataset is derived from the KITTI dataset, which is the most common dataset in the self-driving car imaging industry. According to this type of data, the error amount due to the displacement of the trained key points is 0.054 meters and the error amount due to the rotation of the key points is 0.179 degrees in each frame. The amount of repetition of the algorithm has also reached 100, which is 30% less compared to other methods, as a result, it leads to high processing speed and less memory consumption. On the other hand, indicators such as reducing memory space consumption and high processing speed are another important criteria that It is called the measure of success rate of the operation.

كليدواژه هاي فارسي

تصويربرداري ليدار , نقاط كليدي , ابرنقطه اي , نقشه ويژگي , شبكه عصبي كانولوشن رمزگذار

كليدواژه هاي لاتين

Lidar imaging , Key points , Point cloud , Feature Map , Convolutional auto encoder (CAE)

Author

Hamid Keshmiri

SuperVisor

Dr. Shah Hosseini

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=30835&Field=0&DTC=6