چكيده به لاتين
In order to be independent of the knowledge of system dynamics in the control of linear and nonlinear systems, many approaches have been proposed as model-independent or data-based systems. In the concept of control engineering, reinforcement learning fills the gap between traditional adaptive control and optimal control algorithms to some extent. The goal is to learn the optimal policy and value function for a physical system with uncertainty. Contrary to traditional optimal control, reinforcement learning calculates the solution to the HJB problem online. Reinforcement learning in control engineering has been widely used so far, and the issue of optimal multi-model control, especially fault-tolerant control, is an area in which the need for a model-independent method is evident. However, according to studies, due to the fact that the topics of reinforcement learning in control are not old, not much research has been done on the optimal control of multimodal learning with reinforcement learning, and the proposed methods have drawbacks and difficulties that use them in application. Widespread trouble. This research is based on the existing method in combining reinforcement learning algorithms and ART clustering to investigate changes in environmental dynamics with the aim of optimal control of multimodal systems. The most prominent problem of existing methods is the high computational volume and low speed for more complex systems. The method presented in this research aims to control the tolerable error and to consider the predicted errors in order to reduce the computational volume and maintain the online performance of the control system. In the simulation section, the proposed method is compared with the classical method. The results show the efficiency of the proposed method in the optimal control of multimodal systems.