چكيده به لاتين
Although the decision tree is a simple model, it is still one of the most popular classifiers to solve real world problems. One of the most important features of the decision tree is to generate rules that can be easily interpreted. However, one of the biggest problems in the way of the decision tree is that most of the methods of making decision tree are greedy approach that make the tree in a top-down divide and conquer approach. In these algorithms, it is not possible to examine all the possible scenarios, and as a result, the constructed tree would not necessarily be the best possible tree. Another challenge that the decision tree learning methods are faced with is the presence of noise in the data set, leading to reduced tree accuracy. With the advent of streaming data in the information world, some learning methods were represented for learning the decision tree from streaming data that are capable of online learning. One of these models is the Hoeffding tree, a model, which is also a greedy method such as the learning methods of the decision tree of static data. In this model, the presence of noise reduces the accuracy of the Hoeffding tree as well.
In this research, providing a decision tree learning model of the streaming data that utilizes the particle swarm optimization algorithm intermittently, we tried to improve the accuracy of the Hoeffding tree. The proposed method increases the size of the Hoeffding tree. Thus, we used a feature selection algorithm in this research to reduce the size of the produced that applies a binary particle swarm optimization algorithm. The test results showed that the decision tree learning algorithm, which alternatively uses the particle swarm optimization algorithm, improves the learning accuracy of noisy and non-noisy data. In addition, the experiments' results indicated that the size of the generated tree will improve by adding the feature selection algorithm to the proposed model, and its precision is also comparable to the Hoeffding tree.
Keywords: stream data, decision tree, particle swarm optimization