當前位置：首頁 > 编程资源 > 编程问答 >内容正文

编程问答

【Paper】2010_Distributed optimal control of multiple systems

發布時間：2025/4/5 编程问答 25 豆豆

生活随笔收集整理的這篇文章主要介紹了【Paper】2010_Distributed optimal control of multiple systems 小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

Dong W. Distributed optimal control of multiple systems[J]. International Journal of Control, 2010, 83(10): 2067-2079.

3.2 General communication

由于所有智能體 $i$ 的動力學狀態都是相同的，針對系統 $i$ 我們可以構造一個估計器來估算其它所有系統的狀態。針對 $j$ 的估算器（ $i$ 來估算 $j$ ）形式如下：
$x˙ji=Axji+Buji\dot{x}_j^i = A x_j^i + B u_j^i$

為了便于表示，我們定義
$x_i^i = x_i$

$u_i^i = u_i$

整理狀態寫成一個緊湊的形式（compact form）
$x˙i=Aˉxi+Bˉui\dot{x}^i = \bar{A} x^i + \bar{B} u^i$

定理 3.2

分布式控制法則為：
$ui=((eieiT)?Ir)uiu_i = ((e_i e_i^T) \otimes I_r) u^i$

$ui=?R?1BˉTPxi?Bˉ?1∑j∈Niwij(xi?xj)u^i = -R^{-1} \bar{B}^T P x^i - \bar{B}^{-1} \sum_{j\in N_i} w_{ij} (x^i - x^j)$

$x˙ji=Axji+B((ejejT)?Ir)ui,1≤j≠i≤m\dot{x}^i_j = A x^i_j + B((e_j e_j^T) \otimes I_r)u^i, \quad 1 \le j \ne i \le m$

這里， $P$ 是黎卡提方程的對稱正定解， $w_{ij}>0$ 常量， $e_i$ 是 $n$ 維向量第 $i$ 個元素為 1 其他為 0。

證明

有控制法則
$ui=?R?1BˉTPxi?Bˉ?1∑j∈Niwij(xi?xj)u^i = -R^{-1} \bar{B}^T P x^i - \bar{B}^{-1} \sum_{j\in N_i} w_{ij} (x^i - x^j)$

我們可以得到
$x˙i=Aˉxi+Bˉui=Aˉxi+Bˉ(?R?1BˉTPxi?Bˉ?1∑j∈Niwij(xi?xj))=Aˉxi?BˉR?1BˉTPxi?∑j∈Niwij(xi?xj)=(Aˉ?BˉR?1BˉTP)xi?∑j∈Niwij(xi?xj)\begin{aligned} \dot{x}^i &= \bar{A} x^i +\bar{B} u^i \\ &= \bar{A} x^i + \bar{B} (-R^{-1} \bar{B}^T P x^i - \bar{B}^{-1} \sum_{j\in N_i} w_{ij} (x^i - x^j)) \\ &= \bar{A} x^i - \bar{B} R^{-1} \bar{B}^T P x^i - \sum_{j\in N_i} w_{ij} (x^i - x^j) \\ &= (\bar{A} - \bar{B} R^{-1} \bar{B}^T P) x^i - \sum_{j\in N_i} w_{ij} (x^i - x^j) \\ \end{aligned}$

令
$z^i = x^i - x^*$

那么有
$z˙i=(Aˉ?BˉR?1BˉTP)zi?∑j∈Niwij(zi?zj)\dot{z}^i = (\bar{A} - \bar{B} R^{-1} \bar{B}^T P) z^i - \sum_{j\in N_i} w_{ij} (z^i - z^j)$

令
$yi=e?(Aˉ?BˉR?1BˉTP)tziy^i = e^{-(\bar{A} - \bar{B} R^{-1} \bar{B}^T P)t} z^i$

那么有
$y˙i=?∑j∈Niwij(yi?yj)\dot{y}^i = -\sum_{j \in N_i} w_{ij} (y^i - y^j)$

拉普拉斯矩陣定義為
$lij={?wij,j∈Ni∑l∈Ni?wil,i=j0,j∈Ni\begin{aligned} l_{ij} =\left\{\begin{aligned} &-w_{ij}, \quad &j \in N_i \\ &\sum_{l \in N_i}-w_{il}, \quad &i=j \\ &0, \quad &j \in N_i \\ \end{aligned}\right.\end{aligned}$

那么結合拉普拉斯矩陣的定義， $y˙i\dot{y}^i$ 可以寫成
$y˙=?Lˉy\dot{y} = -\bar{L} y$

因此，
$e^{\bar{L}t} y(0)$

結合其他引理可得
$lim?t→∞(yi?c1)=0\lim_{t\rightarrow\infty} (y^i - c1) = 0$

Simulations

%% Distributed optimal control of multiple systems % Author: Zhao-Jichao % Date: 2021-07-05 clear clc%% Define Initial States A = [0 11 1]; B = [0 00 1]; x1 = [ 5 2]'; x2 = [-3 3]'; x3 = [-4 2]'; x4 = [ 1 -1]'; x5 = [-2 7]'; Q1 = [2 11 2]; R1 = [1 00 1]; m = 5; % Number of agents Abar = kron(eye(5), A); Bbar = kron(eye(5), B); Q = kron(eye(5), Q1); R = kron(eye(5), R1); % R = eye(5); [P, l, g] = care(Abar, Bbar, Q, R);in = [x1', x2', x3', x4', x5']';u = -inv(R) * Bbar' * P * in;global dx dx = (Abar - Bbar * inv(R) * Bbar' * P);%% [t, out] = ode45(@odeFun, [0, 10], in);%% Draw Results subplot(2,3,1) plot(t, out(:,1), t,out(:,2), 'linewidth',1.5); hold on; grid on legend('x_1(1)','x_1(2)'); subplot(2,3,2) plot(t, out(:,3), t,out(:,4), 'linewidth',1.5); hold on; grid on legend('x_2(1)','x_2(2)'); subplot(2,3,3) plot(t, out(:,5), t,out(:,6), 'linewidth',1.5); hold on; grid on legend('x_3(1)','x_3(2)'); subplot(2,3,4) plot(t, out(:,7), t,out(:,8), 'linewidth',1.5); hold on; grid on legend('x_4(1)','x_4(2)'); subplot(2,3,5) plot(t, out(:,9), t,out(:,10),'linewidth',1.5); hold on; grid on legend('x_5(1)','x_5(2)');%% DDE Function function out = odeFun(~, in)global dxout = dx * in; end

先把程序留著，下一步準備探討作者設計的分布式協議

%% Distributed optimal control of multiple systems % Author: Zhao-Jichao % Date: 2021-07-05 clear clc%% Define Initial States A = [0 11 1]; B = [0 00 1]; x1 = [ 5 2]'; x2 = [-3 3]'; x3 = [-4 2]'; x4 = [ 1 -1]'; x5 = [-2 7]'; Q1 = [2 11 2]; R1 = [1 00 1]; m = 5; % Number of agents Abar = kron(eye(5), A); Bbar = kron(eye(5), B); Q = kron(eye(5), Q1); R = kron(eye(5), R1); [P, l, g] = care(Abar, Bbar, Q, R);in = [x1', x2', x3', x4', x5']';u = -inv(R) * Bbar' * P * in;global dx dx = (Abar - Bbar * inv(R) * Bbar' * P);L = [2 0 0 -1 -1-1 1 0 0 00 -1 1 0 00 0 -1 1 00 -1 0 0 1];% L1 = [0 0 0 0 0 % 0 0 0 0 0 % 0 0 0 0 0 % 0 0 0 0 0 % 0 0 0 0 0]; % L2 = [0 0 0 0 0 % 0 0 0 0 0 % 0 0 0 0 0 % 0 0 0 0 0 % 0 0 0 0 0];%% [t, out] = ode45(@odeFun, [0, 10], in);%% Draw Results subplot(2,3,1) plot(t, out(:,1), t,out(:,2), 'linewidth',1.5); hold on; grid on legend('x_1(1)','x_1(2)'); subplot(2,3,2) plot(t, out(:,3), t,out(:,4), 'linewidth',1.5); hold on; grid on legend('x_2(1)','x_2(2)'); subplot(2,3,3) plot(t, out(:,5), t,out(:,6), 'linewidth',1.5); hold on; grid on legend('x_3(1)','x_3(2)'); subplot(2,3,4) plot(t, out(:,7), t,out(:,8), 'linewidth',1.5); hold on; grid on legend('x_4(1)','x_4(2)'); subplot(2,3,5) plot(t, out(:,9), t,out(:,10),'linewidth',1.5); hold on; grid on legend('x_5(1)','x_5(2)'); %% DDE Function function out = odeFun(~, in)global dxout = dx * in; end

總結

以上是生活随笔為你收集整理的【Paper】2010_Distributed optimal control of multiple systems的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

上一篇：【Matlab】dde23解时滞时延微分
下一篇：【控制】二阶含时滞多智能体系统一致性仿真