About the Authors |
|
xi | |
Preface |
|
xiii | |
Acronyms |
|
xv | |
Introduction |
|
xvii | |
|
1 Nonlinear Systems Analysis |
|
|
1 | (10) |
|
|
1 | (1) |
|
1.2 Nonlinear Dynamical Systems |
|
|
2 | (1) |
|
1.2.1 Remarks on Existence, Uniqueness, and Continuation of Solutions |
|
|
2 | (1) |
|
1.3 Lyapunov Analysis of Stability |
|
|
3 | (4) |
|
1.4 Stability Analysis of Discrete Time Dynamical Systems |
|
|
7 | (3) |
|
|
10 | (1) |
|
|
10 | (1) |
|
|
11 | (22) |
|
|
11 | (1) |
|
|
12 | (6) |
|
2.2.1 Principle of Optimality |
|
|
12 | (2) |
|
2.2.2 Hamilton--Jacobi--Bellman Equation |
|
|
14 | (1) |
|
2.2.3 A Sufficient Condition for Optimality |
|
|
15 | (1) |
|
2.2.4 Infinite-Horizon Problems |
|
|
16 | (2) |
|
2.3 Linear Quadratic Regulator |
|
|
18 | (12) |
|
2.3.1 Differential Riccati Equation |
|
|
18 | (5) |
|
2.3.2 Algebraic Riccati Equation |
|
|
23 | (3) |
|
2.3.3 Convergence of Solutions to the Differential Riccati Equation |
|
|
26 | (2) |
|
2.3.4 Forward Propagation of the Differential Riccati Equation for Linear Quadratic Regulator |
|
|
28 | (2) |
|
|
30 | (3) |
|
|
30 | (3) |
|
|
33 | (18) |
|
3.1 Control-Affine Systems with Quadratic Costs |
|
|
33 | (2) |
|
3.2 Exact Policy Iteration |
|
|
35 | (6) |
|
3.2.1 Linear Quadratic Regulator |
|
|
39 | (2) |
|
3.3 Policy Iteration with Unknown Dynamics and Function Approximations |
|
|
41 | (6) |
|
3.3.1 Linear Quadratic Regulator with Unknown Dynamics |
|
|
46 | (1) |
|
|
47 | (4) |
|
|
48 | (3) |
|
4 Learning of Dynamic Models |
|
|
51 | (26) |
|
|
51 | (1) |
|
|
51 | (1) |
|
|
51 | (1) |
|
|
52 | (2) |
|
4.2.1 Gray-Box vs. Black-Box |
|
|
52 | (1) |
|
4.2.2 Parametric vs. Nonparametric |
|
|
52 | (2) |
|
|
54 | (2) |
|
4.3.1 Model in Terms of Bases |
|
|
54 | (1) |
|
|
55 | (1) |
|
4.3.3 Learning of Control Systems |
|
|
55 | (1) |
|
4.4 Parametric Learning Algorithms |
|
|
56 | (4) |
|
|
56 | (1) |
|
4.4.2 Recursive Least Squares |
|
|
57 | (2) |
|
|
59 | (1) |
|
|
60 | (1) |
|
4.5 Persistence of Excitation |
|
|
60 | (1) |
|
|
61 | (3) |
|
|
62 | (1) |
|
|
62 | (1) |
|
|
63 | (1) |
|
|
64 | (9) |
|
4.7.1 Convergence of Parameters |
|
|
65 | (2) |
|
|
67 | (2) |
|
|
69 | (4) |
|
|
73 | (4) |
|
|
75 | (2) |
|
5 Structured Online Learning-Based Control of Continuous-Time Nonlinear Systems |
|
|
77 | (26) |
|
|
77 | (1) |
|
5.2 A Structured Approximate Optimal Control Framework |
|
|
77 | (4) |
|
5.3 Local Stability and Optimality Analysis |
|
|
81 | (2) |
|
5.3.1 Linear Quadratic Regulator |
|
|
81 | (1) |
|
|
82 | (1) |
|
|
83 | (4) |
|
5.4.1 ODE Solver and Control Update |
|
|
84 | (1) |
|
5.4.2 Identified Model Update |
|
|
85 | (1) |
|
|
85 | (1) |
|
5.4.4 Limitations and Implementation Considerations |
|
|
86 | (1) |
|
5.4.5 Asymptotic Convergence with Approximate Dynamics |
|
|
87 | (1) |
|
|
87 | (12) |
|
5.5.1 Systems Identifiable in Terms of a Given Set of Bases |
|
|
88 | (3) |
|
5.5.2 Systems to Be Approximated by a Given Set of Bases |
|
|
91 | (7) |
|
|
98 | (1) |
|
|
99 | (4) |
|
|
99 | (4) |
|
6 A Structured Online Learning Approach to Nonlinear Tracking with Unknown Dynamics |
|
|
103 | (18) |
|
|
103 | (1) |
|
6.2 A Structured Online Learning for Tracking Control |
|
|
104 | (7) |
|
6.2.1 Stability and Optimality in the Linear Case |
|
|
108 | (3) |
|
6.3 Learning-based Tracking Control Using SOL |
|
|
111 | (1) |
|
|
112 | (3) |
|
6.4.1 Tracking Control of the Pendulum |
|
|
113 | (1) |
|
6.4.2 Synchronization of Chaotic Lorenz System |
|
|
114 | (1) |
|
|
115 | (6) |
|
|
118 | (3) |
|
7 Piecewise Learning and Control with Stability Guarantees |
|
|
121 | (26) |
|
|
121 | (1) |
|
|
122 | (1) |
|
7.3 The Piecewise Learning and Control Framework |
|
|
122 | (3) |
|
7.3.1 System Identification |
|
|
123 | (1) |
|
|
124 | (1) |
|
|
125 | (1) |
|
7.4 Analysis of Uncertainty Bounds |
|
|
125 | (4) |
|
7.4.1 Quadratic Programs for Bounding Errors |
|
|
126 | (3) |
|
7.5 Stability Verification for Piecewise-Affine Learning and Control |
|
|
129 | (5) |
|
7.5.1 Piecewise Affine Models |
|
|
129 | (1) |
|
7.5.2 MIQP-based Stability Verification of PWA Systems |
|
|
130 | (3) |
|
7.5.3 Convergence of ACCPM |
|
|
133 | (1) |
|
|
134 | (8) |
|
|
134 | (4) |
|
7.6.2 Dynamic Vehicle System with Skidding |
|
|
138 | (2) |
|
7.6.3 Comparison of Runtime Results |
|
|
140 | (2) |
|
|
142 | (5) |
|
|
143 | (4) |
|
8 An Application to Solar Photovoltaic Systems |
|
|
147 | (40) |
|
|
147 | (3) |
|
|
150 | (4) |
|
|
151 | (1) |
|
8.2.2 DC-DC Boost Converter |
|
|
152 | (2) |
|
8.3 Optimal Control of PV Array |
|
|
154 | (11) |
|
8.3.1 Maximum Power Point Tracking Control |
|
|
156 | (6) |
|
8.3.2 Reference Voltage Tracking Control |
|
|
162 | (2) |
|
8.3.3 Piecewise Learning Control |
|
|
164 | (1) |
|
8.4 Application Considerations |
|
|
165 | (5) |
|
8.4.1 Partial Derivative Approximation Procedure |
|
|
165 | (2) |
|
8.4.2 Partial Shading Effect |
|
|
167 | (3) |
|
|
170 | (12) |
|
8.5.1 Model and Control Verification |
|
|
173 | (1) |
|
8.5.2 Comparative Results |
|
|
174 | (2) |
|
8.5.3 Model-Free Approach Results |
|
|
176 | (2) |
|
8.5.4 Piecewise Learning Results |
|
|
178 | (1) |
|
8.5.5 Partial Shading Results |
|
|
179 | (3) |
|
|
182 | (5) |
|
|
182 | (5) |
|
9 An Application to Low-level Control of Quadrotors |
|
|
187 | (18) |
|
|
187 | (2) |
|
|
189 | (1) |
|
9.3 Structured Online Learning with RLS Identifier on Quadrotor |
|
|
190 | (7) |
|
|
191 | (4) |
|
9.3.2 Asymptotic Convergence with Uncertain Dynamics |
|
|
195 | (1) |
|
9.3.3 Computational Properties |
|
|
195 | (2) |
|
|
197 | (4) |
|
|
201 | (4) |
|
|
201 | (4) |
|
|
205 | (10) |
|
|
205 | (1) |
|
|
205 | (2) |
|
|
206 | (1) |
|
|
207 | (1) |
|
|
207 | (4) |
|
|
208 | (1) |
|
|
208 | (2) |
|
|
210 | (1) |
|
|
210 | (1) |
|
|
211 | (3) |
|
10.4.1 Graphs and Printouts |
|
|
213 | (1) |
|
|
213 | (1) |
|
|
214 | (1) |
|
|
214 | (1) |
|
|
215 | (8) |
|
A.1 Supplementary Analysis of Remark 5.4 |
|
|
215 | (7) |
|
A.2 Supplementary Analysis of Remark 5.5 |
|
|
222 | (1) |
Index |
|
223 | |