پريسا انشائي

عنوان

تشخيص و طبقه‌بندي ژست‌هاي دست با استفاده از سيگنال‌هاي الكترومايوگرافي

مقطع تحصيلي

كارشناسي ارشد

رشته تحصيلي

الكترونيك ديجيتال

سال تحصيل

1399

تاريخ دفاع

1402/10/26

استاد راهنما

دكتر علي صدر

دانشكده

مهندسي برق

چكيده

به طور كلي، انسان‌ها براي انجام حركت‌هاي كاربردي يا انتقال اطلاعات، آزادانه دست‌ها و انگشتان خود را حركت مي‌دهند. از اين رو، تشخيص و طبقه‌بندي صحيح ژست‌هاي دست، كاربردهاي متعددي مانند طراحي بازوي مصنوعي بهينه براي افراد قطع عضو، كنترل پروتز اندام فوقاني، توانبخشي غيرفعال به منظور بهبود حركت‌هاي اساسي، تشخيص زبان اشاره‌ي تاكتيكي بلادرنگ براي برقراري ارتباط در يك محيط امن و خارج از محدوده‌ي بصري، تعامل انسان-رايانه و احراز هويت بيومتريك در تأييد كاربران دارد. در سال‌هاي اخير، كاهش هزينه و افزايش دسترسي به سخت‌افزارهاي لازم، پردازش سيگنال‌هاي EMG‌ را به يك راه‌حل مناسب براي شناسايي ژست‌هاي دست تبديل كرده است. اين پايان‌نامه به ارائه‌ي راهكاري جديد براي تشخيص و طبقه‌بندي ژست‌هاي دست با استفاده از سيگنال‌هاي EMG مي‌پردازد. اين راهكار شامل مراحل پيش‌پردازش، استخراج، انتخاب و طبقه‌بندي ويژگي‌‌هاي داده‌هاي EMG است. از آن‌جايي كه سيگنال‌هاي EMG، ضعيف، غيرايستا و تأثيرپذير از منابع نويز هستند، در مرحله‌ي پيش‌پردازش از روشي مبتني بر ميانگين توان فركانسي ضرايب تبديل Stockwell در هر زمان، براي تعيين ناحيه‌ي فعال شدن عضلات استفاده مي‌شود. سپس اين پژوهش به منظور تضمين عملكرد بلادرنگ كارآمد، داده‌هاي EMG را به پنجره‌هايي به طول ms200 با هم‌پوشاني %95 تقسيم و از هر پنجره 17 ويژگي زماني و 6 ويژگي فركانسي استخراج مي‌كند. براي محاسبه‌ي ويژگي‌هاي فركانسي، تعريفي جديد با بهره‌گيري از مفهوم چگالي طيفي توان بر پايه‌ي ضرايب تبديل Stockwell به منظور تحليل زمان-فركانس سيگنال‌هاي EMG ارائه مي‌شود. همچنين مقادير آستانه‌ي بهينه در تعريف بعضي از ويژگي‌هاي زماني با استفاده از الگوريتم‌هاي AOA، GWO، SCA تعيين مي‌شوند. مرحله‌ي بعديِ راهكار پيشنهادي، نرمال‌سازي ويژگي‌ها با استفاده از روش Z-Score و تقسيم‌بندي آن‌ها به دو بخش آموزشي و آزمايشي با نسبت %70 به %30 است. براي تعيين دقيق‌ترين طبقه‌بندي‌كننده، بخش آموزشي داده‌ها با روش اعتبارسنجي متقابل 5 تايي در اختيار 4 الگوريتم‌ سنتي يادگيري ماشين kNN، ANN، Tree و SVM قرار مي‌گيرد. پس از مشخص شدن kNN به عنوان طبقه‌بندي‌كننده‌ي برتر و بهينه‌سازي پارامترهاي آن، به منظور دستيابي به مجموعه ويژگي‌هايي با بالاترين دقت، دو روش انتخاب ويژگي SFS و EFS استفاده مي‌شود كه به نتايج يكساني مي‌انجامند. از آن‌جايي كه كمتر بودن داده‌هاي عضلاني افراد قطع عضو را مي‌توان متناظر با از دست دادن اطلاعات كانال‌هاي دريافت EMG در نظر گرفت، مرحله‌ي پايانيِ راهكار پيشنهادي، روش جديدي را براي تعيين ارزش اطلاعات هر كانال در دقت تشخيص ارائه مي‌دهد. به منظور نشان دادن برتري راهكار پيشنهادي، آزمايش‌ها بر روي سيگنال‌هاي 5 پايگاه بزرگ داده از نظر تعداد آزمودني و ژست انجام و پارامترهاي ارزيابي ماتريس‌هاي سردرگمي استخراج شدند. حداقل دقت تشخيص طبقه‌بندي‌كننده‌ي بهينه‌ي kNN در بين تمام پايگاه‌هاي داده با در نظر گرفتن بهترين تركيب ويژگي‌هاي زماني و فركانسي به ترتيب %99.84 و %99.33 به دست آمد. حتي هنگامي كه تعداد كانال‌ها به 3 كاهش يافت، دقتي بيش از %95 حاصل شد. اين درصدها حاكي از بهبود چشمگير دقت روش پيشنهادي نسبت به مقادير گزارش شده در تعداد قابل‌توجهي از مقالات سال‌هاي اخير است. از مزاياي ديگر اين روش مي‌توان به كاهش بار محاسباتي و بهبود سرعت به دليل استفاده از كمترين تعداد ويژگي اشاره كرد.

تاريخ ورود اطلاعات

1402/11/17

عنوان به انگليسي

Hand Gestures Recognition and Classification Using Electromyography Signals

تاريخ بهره برداري

1/1/1900 12:00:00 AM

دانشجوي وارد كننده اطلاعات

پريسا انشائي

Name: پريسا انشائي
Author: پريسا انشائي

چكيده به لاتين

In general, humans freely move their hands and fingers to perform functional movements or convey information. Hence, the correct recognition and classification of hand gestures has many applications such as optimal prosthetic arm design for amputees, control of upper limb prosthesis, passive rehabilitation to improve basic movements, real-time tactical sign language recognition for communicating in a safe and outdoor environment. From the visual range, it has human-computer interaction and biometric authentication in user verification. In recent years, cost reduction and increased availability of necessary hardware have made EMG signal processing a suitable solution for hand gesture recognition. This thesis presents a new solution for recognizing and classifying hand gestures using EMG signals. This solution includes the stages of preprocessing, extraction, selection and classification of EMG data features. Since EMG signals are weak, unstable and affected by noise sources, in the pre-processing stage, a method based on the average frequency power of the Stockwell transform coefficients is used at any time to determine the area of muscle activation. Then, in order to ensure efficient real-time performance, this research divides the EMG data into 200 ms windows with 95% overlap and extracts 17 time features and 6 frequency features from each window. To calculate the frequency characteristics, a new definition is presented using the concept of power spectral density based on Stockwell transform coefficients for the time-frequency analysis of EMG signals. Also, the optimal threshold values in defining some time features are determined using AOA, GWO, SCA algorithms. The next step of the proposed solution is to normalize the features using the Z-Score method and divide them into two training and testing parts with a ratio of 70% to 30%. To determine the most accurate classifier, the training part of the data is submitted to 4 traditional machine learning algorithms, kNN, ANN, Tree and SVM, with the 5-fold cross-validation method. After identifying kNN as the superior classifier and optimizing its parameters, in order to obtain a set of features with the highest accuracy, two feature selection methods, SFS and EFS, are used, which lead to the same results. Since less muscle data of amputees can be considered as the loss of EMG receiving channel information, the final stage of the proposed solution provides a new method to determine the value of each channel's information in recognition accuracy. In order to show the superiority of the proposed solution, experiments were conducted on the signals of 5 large databases in terms of the number of subjects and gestures, and the eva‎luation parameters of the confusion matrices were extracted. The minimum recognition accuracy of the optimal kNN classifier among all databases considering the best combination of temporal and frequency features was obtained as 99.84% and 99.33%, respectively. Even when the number of channels was reduced to 3, an accuracy of over 95% was achieved. These percentages indicate a significant improvement in the accuracy of the proposed method compared to the values reported in a significant number of papers in recent years. Other advantages of this method include reducing the computational load and improving the speed due to the use of the least number of features.

كليدواژه هاي فارسي

ژست‌ دست , سيگنال‌ EMG , تبديل Stockwell , انتخاب ويژگي , يادگيري ماشين

كليدواژه هاي لاتين

Hand gesture , EMG signal , Stockwell transform , feature selection , machine learning

Author

Parisa Enshaee

SuperVisor

Dr. Ali Sadr

لينک به اين مدرک

https://dl.iust.ac.ir/dl/search/default.aspx?Term=30481&Field=0&DTC=6