论文部分内容阅读
In this paper,the optimal control of a class of general affine nonlinear discrete-time(DT) systems is undertaken by solving the Hamilton Jacobi-Bellman(HJB) equation online and forward in time.The proposed approach,referred normally as adaptive or approximate dynamic programming(ADP),uses online approximators(OLAs) to solve the infinite horizon optimal regulation and tracking control problems for affine nonlinear DT systems in the presence of unknown internal dynamics.Both the regulation and tracking controllers are designed using OLAs to obtain the optimal feedback control signal and its associated cost function.Additionally,the tracking controller design entails a feedforward portion that is derived and approximated using an additional OLA for steady state conditions.Novel update laws for tuning the unknown parameters of the OLAs online are derived.Lyapunov techniques are used to show that all signals are uniformly ultimately bounded and that the approximated control signals approach the optimal control inputs with small bounded error.In the absence of OLA reconstruction errors,an optimal control is demonstrated.Simulation results verify that all OLA parameter estimates remain bounded,and the proposed OLA-based optimal control scheme tunes itself to reduce the cost HJB equation.
In this paper, the optimal control of a class of general affine nonlinear discrete-time (DT) systems is contained by solving the Hamiltonian Jacobi-Bellman (HJB) equation online and forward in time. The proposed approach, referring normally as adaptive or approximate dynamic programming (ADP), using online approximators (OLAs) to solve the infinite horizon optimal regulation and tracking control problems for affine nonlinear DT systems in the presence of unknown internal dynamics. But the regulation and tracking controllers are designed using OLAs to obtain the optimal feedback control signal and its associated cost function. Additionally, the tracking controller design entails a feedforward portion that is derived and approximated using an additional OLA for steady state conditions. Novel update laws for tuning the unknown parameters of the OLAs online are derived. Lyapunov techniques are used to show that all signals are distributed ultimately bounded and that the approximated control signals approach the optimal control inputs with small bounded error. In the absence of OLA reconstruction errors, an optimal control is demonstrated. Simulation results verify that all OLA parameter estimates remain bounded, and the proposed OLA-based optimal control scheme tunes itself to reduce the cost HJB equation.