چكيده به لاتين
In recent years, gunshot classification has been given much attention by researchers and engineers, which has practical applications in the fields of security, military, criminal, and weapons control. In this field, as in similar fields such as the classification of different ambient sounds, we face challenges such as reflection, attenuation, and noise. In addition to these challenges, in the field of gunshot classification, there is a strong correlation between system performance and spatial information such as the distance of the shooter from the sensor and the direction of the firing weapon relative to the sensor. In this study, in order to improve performance in adverse environmental conditions, an innovative pre-processing is first performed on the raw gunshot signals, which greatly reduces the effect of noise on the information required for classification. In this pre-processing, the effective and prominent parts of the sound signal are determined. The degree of prominence of the signal in this application is determined by the amount of signal energy at high frequencies. Since gunshot is an impulsive signal, it has more energy at higher frequencies compared to normal background sounds. After determining the effective and prominent areas of the signals, we extract the appropriate features from them in the form of gray images, using three different feature extraction methods. By combining these features, color images are obtained that contain various information of raw signals and provides good resolution between classes in adverse environmental noise conditions. Finally, the class of the gunshot is determined by presenting these images to the convolutional neural networks. Transfer learning technique has been used in the training of convolutional neural network to use the experience of pattern recognition gained in other areas for the purpose of this research. Considering a fixed length of one second for different gunshot, the proposed method, by providing a combined feature, increases the average accuracy of classification in different conditions of ambient noise by 19.74%, so that compared to the average accuracy of reference method (first reference) which is 46.25%, has reached an average accuracy of 65.99 in different noise conditions (noise-free to zero dB mode). Although in the noise-free condition, the proposed method with an accuracy of 92.68% has improved only 2.39% compared to the reference method, but in this study, in addition to the accuracy of classification of methods, we have also considered their resistance to noise, because in real conditions we are often challenged by the noisy input signal. Therefore, we have considered the mean and variance of the classification accuracy of methods in different noise conditions. By using the proposed method to determine the effective and prominent parts of the signal, we have further reduced the destructive effect of noise on the classification accuracy. By selecting the useful part of the signals and extracting a combination of features of them in different noise conditions, the average classification accuracy has reached 74.74%, which is 8.75% increase compared to the case we do not determine the prominent parts, and it also has increased by 28.49% compared to reference method. In order to make classification resultd independent of the shooter spatial information, we distribute the data samples recorded at different distances and directions to the sensor uniformly between the train and test data sets. In the second reference, several sensors are used to classify gunshots and have the location information of the shooter in addition to the input signal of the sensors, but we have shown that the proposed method, which divides the data properly, can achieve the accuracy of 92.68% without using additional information distance and direction of the shooter.