CAAI Transactions on Intelligent Systems, 2018, Vol. 13, Issue 5: 776-782. DOI: 10.11992/tis.201706064


WU Liqin, XU Yong, WANG Jinhuan, et al. Evolutionary enterprise innovation networked game based on the semi-tensor product of matrices[J]. CAAI Transactions on Intelligent Systems, 2018, 13(5): 776-782. DOI: 10.11992/tis.201706064.




XU Yong. E-mail: xuyong@hebut.edu.cn




Evolutionary enterprise innovation networked game based on the semi-tensor product of matrices
WU Liqin1, XU Yong1, WANG Jinhuan1, LI Jie2    
1. School of Sciences, Hebei University of Technology, Tianjin 300401, China;
2. School of Economics and Management, Hebei University of Technology, Tianjin 300401, China
Abstract: In the contemporary economic environment, innovation has become a necessary condition for the survival and development of enterprises. In this paper, enterprises are classified into two categories according to their scale, a double-layer coupled network of enterprise innovation is established, and the game process between enterprises is studied. First, taking the "Boxed Pig Game" as the basic game, the semi-tensor product method is used to obtain the strategy of each enterprise at every moment, rather than only the overall proportion of innovating enterprises. Second, the optimal stable Nash equilibrium of the whole enterprise innovation network is obtained from the payoff functions. Finally, government regulation is introduced to change the basic payoff matrix of the game, so that the network reaches the optimal stable Nash equilibrium in which all enterprises innovate.
Key words: enterprise innovation    semi-tensor product    innovation networks    evolutionary game    Nash equilibrium    government regulation    Boxed Pig game    strategy profile    





1 Preliminaries


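1) ${\mathcal{M}_{m \times n}}$ denotes the set of $m \times n$ real matrices.

2) ${{\rm Col}_i}({M})$ denotes the $i$-th column of the matrix ${M}$, and ${\rm Col}({M})$ denotes the set of columns of ${M}$.

3) ${\mathcal{D}_k} := \{ 1,2, \cdots ,k\}$.

4) ${\varDelta _n} := \{ \delta _n^i \mid i = 1,2, \cdots ,n\}$, where $\delta _n^i$ is the $i$-th column of the identity matrix ${I_n}$.

5) A matrix ${L} = [\delta _n^{i_1}\;\delta _n^{i_2}\; \cdots \;\delta _n^{i_t}]$ is called an $n \times t$ logical matrix, abbreviated as ${L} = {\delta _n}[{i_1}\;{i_2}\; \cdots \;{i_t}]$. The set of $n \times t$ logical matrices is denoted by ${\mathcal{L}_{n \times t}}$.

6) ${V_r}({A}) = {[{a_{1,1}}\;{a_{1,2}} \cdots {a_{1,n}} \cdots {a_{m,1}}\;{a_{m,2}} \cdots {a_{m,n}}]^{\rm{T}}}$ denotes the row stacking form of the matrix ${A} \in {\mathcal{M}_{m \times n}}$; the column stacking form ${V_c}({A})$ used below is defined analogously, column by column.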

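Definition 1[10]  Let ${A} \in {\mathcal{M}_{m \times n}}$, ${B} \in {\mathcal{M}_{p \times q}}$, and let $l = {\rm lcm}\{ n,p\}$ be the least common multiple of $n$ and $p$. The semi-tensor product of ${A}$ and ${B}$ is defined as

${A} \ltimes {B} \buildrel \Delta \over = ({A} \otimes {I_{l/n}})({B} \otimes {I_{l/p}}).$

The semi-tensor product generalizes the ordinary matrix product (the two coincide when $n = p$), so the symbol "$\ltimes$" is usually omitted.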
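Definition 1 translates almost directly into code. The following is a minimal sketch, assuming NumPy and Python 3.9 or later (for `math.lcm`); the helper names `stp` and `delta` are illustrative and not taken from the paper.

```python
import math
import numpy as np

def stp(A, B):
    """Semi-tensor product A ⋉ B = (A ⊗ I_{l/n})(B ⊗ I_{l/p}), l = lcm(n, p)."""
    A = np.atleast_2d(np.asarray(A, dtype=float))
    B = np.atleast_2d(np.asarray(B, dtype=float))
    n, p = A.shape[1], B.shape[0]          # columns of A, rows of B
    l = math.lcm(n, p)
    return np.kron(A, np.eye(l // n)) @ np.kron(B, np.eye(l // p))

def delta(n, i):
    """Column vector δ_n^i, with a 1-based index i."""
    v = np.zeros((n, 1))
    v[i - 1, 0] = 1.0
    return v

# For two strategy vectors the product is a single vector encoding the profile:
profile = stp(delta(2, 1), delta(2, 2))    # equals δ_4^2
```

When the inner dimensions already match, `stp(A, B)` returns the ordinary product `A @ B`, which is why the symbol can be dropped without ambiguity.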

Definition 2[10]  Let ${A} \in {\mathcal{M}_{p \times n}}$ and ${B} \in {\mathcal{M}_{q \times n}}$. Their Khatri-Rao product, denoted ${A} * {B}$, is defined as

${A} * {B} = [{\rm Col}_1({A}) \otimes {\rm Col}_1({B})\;\;{\rm Col}_2({A}) \otimes {\rm Col}_2({B})\;\; \cdots \;\;{\rm Col}_n({A}) \otimes {\rm Col}_n({B})] \in {\mathcal{M}_{pq \times n}}.$
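As a small illustration of Definition 2, the sketch below builds the Khatri-Rao product column by column with NumPy. The function name `khatri_rao` and the two sample matrices are assumptions made for this example only.

```python
import numpy as np

def khatri_rao(A, B):
    """Column-wise Kronecker product: Col_j(A * B) = Col_j(A) ⊗ Col_j(B)."""
    A, B = np.asarray(A), np.asarray(B)
    if A.shape[1] != B.shape[1]:
        raise ValueError("A and B must have the same number of columns")
    cols = [np.kron(A[:, j], B[:, j]) for j in range(A.shape[1])]
    return np.stack(cols, axis=1)

# δ_2[1 2] * δ_2[2 2] = δ_4[2 4]
A = np.array([[1, 0], [0, 1]])
B = np.array([[0, 0], [1, 1]])
C = khatri_rao(A, B)
```

For logical matrices the result is again a logical matrix, which is what allows the per-player update matrices to be merged into one network-level matrix later on.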

Proposition 1[11]  1) Let ${X} \in {\bf{R}}^m$ and ${Y} \in {\bf{R}}^n$ be two column vectors. Then ${W_{[m,n]}}{X}{Y} = {Y}{X}$ and ${W_{[n,m]}}{Y}{X} = {X}{Y}$, where the $mn \times mn$ matrix ${W_{[m,n]}}$ is called the swap matrix, and ${W_{[n,n]}} := {W_{[n]}}$.

2) Let ${A} \in {\mathcal{M}_{m \times n}}$. Then

${W_{[m,n]}}{V_r}({A}) = {V_c}({A}),\quad {W_{[n,m]}}{V_c}({A}) = {V_r}({A});$

3) Let ${X} \in {\bf{R}}^t$ and ${A} \in {\bf{R}}^{m \times n}$. Then ${X}{A} = ({I_t} \otimes {A}){X}$, which is called the pseudo-commutative property.
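The three properties of Proposition 1 can be checked numerically. The sketch below is one possible construction of the swap matrix (the index formula is derived from the requirement $W_{[m,n]}(x \otimes y) = y \otimes x$ and is not quoted from the paper); the name `swap_matrix` and the random test data are assumptions.

```python
import numpy as np

def swap_matrix(m, n):
    """W_[m,n] such that W_[m,n] (x ⊗ y) = y ⊗ x for x in R^m, y in R^n."""
    W = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            # entry x_i * y_j sits at i*n + j in x ⊗ y and at j*m + i in y ⊗ x
            W[j * m + i, i * n + j] = 1.0
    return W

rng = np.random.default_rng(0)
m, n, t = 2, 3, 2
x, y = rng.random(m), rng.random(n)
A = rng.random((m, n))
W = swap_matrix(m, n)

# 1) swapping two vector factors (for column vectors, ⋉ reduces to ⊗)
swap_ok = np.allclose(W @ np.kron(x, y), np.kron(y, x))
# 2) row stacking V_r(A) is mapped to column stacking V_c(A)
stack_ok = np.allclose(W @ A.reshape(-1), A.T.reshape(-1))
# 3) pseudo-commutativity: X ⋉ A = X ⊗ A and (I_t ⊗ A) ⋉ X = (I_t ⊗ A)(X ⊗ I_n)
X = rng.random((t, 1))
pseudo_ok = np.allclose(np.kron(X, A), np.kron(np.eye(t), A) @ np.kron(X, np.eye(n)))
```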

Lemma 1[18]  Suppose ${x_i} \in {\varDelta _k}$, $i = 1,2, \cdots ,n$, and ${x} = \ltimes _{i = 1}^n{x_i}$. Then ${x_i} = \pi _i^n{x}$, where $\pi _i^n = {\bf{1}}_{k^{i - 1}} \otimes {I_k} \otimes {\bf{1}}_{k^{n - i}}$, $i = 1,2, \cdots ,n$, with ${\bf{1}}_m$ the $m$-dimensional row vector of ones, and ${\Pi ^n} = {[{(\pi _1^n)^{\rm{T}}}\;{(\pi _2^n)^{\rm{T}}} \cdots {(\pi _n^n)^{\rm{T}}}]^{\rm{T}}}$.
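Lemma 1 says that each player's strategy can be read back from the product state. A minimal NumPy sketch, assuming that ${\bf 1}_m$ is the all-ones row vector (so that $\pi_i^n$ is $k \times k^n$); the names `delta` and `retriever` are illustrative.

```python
from functools import reduce
import numpy as np

def delta(n, i):
    """δ_n^i as a 1-D array (1-based index i)."""
    v = np.zeros(n)
    v[i - 1] = 1.0
    return v

def retriever(k, n, i):
    """π_i^n = 1_{k^(i-1)} ⊗ I_k ⊗ 1_{k^(n-i)}, a k × k^n matrix."""
    left = np.ones((1, k ** (i - 1)))
    right = np.ones((1, k ** (n - i)))
    return np.kron(np.kron(left, np.eye(k)), right)

k, n = 3, 3
parts = [delta(k, 2), delta(k, 3), delta(k, 1)]
x = reduce(np.kron, parts)                 # x = x_1 ⋉ x_2 ⋉ x_3
recovered = [retriever(k, n, i) @ x for i in range(1, n + 1)]
```

Stacking the retrievers vertically gives $\Pi^n$, so that $\Pi^n x$ lists all individual strategies at once.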

Lemma 2[11]  Let $f:\mathcal{D}_k^n \to {\bf{R}}$ be a pseudo-logical function. Then there exists a unique matrix ${M_f} \in {\bf{R}}^{1 \times {k^n}}$, called the structure matrix of $f$, such that

$f({x_1},\;{x_2},\; \cdots ,\;{x_n}) = {M_f} \ltimes _{i = 1}^n{x_i}$ (1)

where ${x_i} \in {\varDelta _k}$, $i = 1,2, \cdots ,n$, and ${M_f}$ lists all possible values of $f({x})$ as ${x} = \ltimes _{i = 1}^n{x_i}$ runs from $\delta _{k^n}^1$ to $\delta _{k^n}^{k^n}$. Lemma 2 shows how a logical function is expressed in its algebraic form.

Lemma 3[11]  Consider a $k$-valued logical dynamic network

${x}(t + 1) = {L}{x}(t)$ (2)

where ${x}(t) = \ltimes _{i = 1}^n{x_i}(t)$ and ${L} \in {\mathcal{L}_{{k^n} \times {k^n}}}$. Then:

1) $\delta _{k^n}^i$ is a fixed point of system (2) if and only if the diagonal entry ${\ell _{ii}}$ of ${L}$ equals 1. Hence the number of equilibrium points of (2), denoted by ${N_e}$, is

${N_e} = {\rm Trace}({L})$ (3)

2) The number of cycles of length $s$, denoted by ${C_s}$, is given recursively by

$\left\{ {\begin{array}{*{20}{l}} {{C_1} = {N_e}} \\ {{C_s} = \displaystyle\frac{{{\rm Trace}({L^s}) - \displaystyle\sum\limits_{k \in \rho (s)} {k{C_k}} }}{s},\;\;2 \leqslant s \leqslant {k^n}} \end{array}} \right.$ (4)

where $\rho (s)$ is the set of proper factors of $s$; a proper factor of $s$ is a positive integer $k < s$ such that $\displaystyle\frac{s}{k} \in {\mathbb{Z}_ + }$. (Inside the sum in (4), $k$ is only a summation index and is unrelated to the number of logical values.)
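Equations (3) and (4) give a direct counting procedure. The following sketch implements them with integer matrix powers; the function names and the 4-state example matrix are hypothetical and chosen only to show one fixed point together with one cycle of length 2.

```python
import numpy as np

def logical_matrix(n, idx):
    """δ_n[i_1 ... i_t] as an n × t 0/1 matrix (1-based indices)."""
    L = np.zeros((n, len(idx)), dtype=np.int64)
    for col, i in enumerate(idx):
        L[i - 1, col] = 1
    return L

def count_cycles(L):
    """Return {s: C_s} for every cycle length s with C_s > 0, by Eqs. (3)-(4)."""
    size = L.shape[0]
    all_C = {}
    P = np.eye(size, dtype=np.int64)
    for s in range(1, size + 1):
        P = P @ L                                        # P = L^s
        proper = [d for d in range(1, s) if s % d == 0]  # ρ(s)
        all_C[s] = (int(np.trace(P)) - sum(d * all_C[d] for d in proper)) // s
    return {s: c for s, c in all_C.items() if c > 0}

# State 1 -> 2, 2 -> 1, 3 -> 3, 4 -> 3
L = logical_matrix(4, [2, 1, 3, 3])
cycles = count_cycles(L)                                 # {1: 1, 2: 1}
fixed_points = [j + 1 for j in range(4) if L[j, j] == 1] # [3]
```

A constant matrix such as $\delta_{32}[4\;4\;\cdots\;4]$ gives `{1: 1}`: a single fixed point and no longer cycles.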

2 Evolutionary game on the enterprise innovation network


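$\left[ {\begin{array}{*{20}{c}} {x/y}&{s_1}&{s_2} \\ {s_1}&{(5,\;1)}&{(4,\;4)} \\ {s_2}&{(9,\; - 1)}&{(0,\;0)} \end{array}} \right]$ (5)

where $x$ denotes the group of large enterprises, $y$ denotes the group of small enterprises, ${s_1}$ is the strategy "innovate" and ${s_2}$ is the strategy "do not innovate". In each pair, the first entry is the payoff of the large enterprise and the second is the payoff of the small enterprise. Each enterprise chooses its strategy independently, so a game model can be built to analyze how the enterprise innovation network evolves over time.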

2.1 Networked game model of enterprise innovation


1) Double-layer coupled network of enterprise innovation: the upper-layer network is ${G_1} = ({V_v},{E_v})$, where ${V_v} = \{ {v_1},{v_2}, \cdots ,{v_{n_1}}\}$ is the set of large-enterprise players and ${E_v} \subset {V_v} \times {V_v}$ is the set of edges among large enterprises. The lower-layer network is ${G_2} = ({V_w},{E_w})$, where ${V_w} = \{ {w_1},{w_2}, \cdots ,{w_{n_2}}\}$ is the set of small-enterprise players and ${E_w} \subset {V_w} \times {V_w}$ is the set of edges among small enterprises. ${E_{vw}} \subset {V_v} \times {V_w}$ is the set of edges linking ${G_1}$ and ${G_2}$. The double-layer coupled network is therefore represented by the undirected graph $G = (V,E)$ with $V = {V_v} \cup {V_w} = \{ {v_1},{v_2}, \cdots ,{v_{n_1}},{w_1},{w_2}, \cdots ,{w_{n_2}}\} = \{ {v_1},{v_2}, \cdots ,{v_n}\}$, $n = {n_1} + {n_2}$, and $E = {E_v} \cup {E_w} \cup {E_{vw}}$.

2) Basic network game: the basic game $\mathcal{G}$ is played by two connected players in different layers, with strategy sets ${S_1} = {S_2} = \{ {s_1},{s_2}, \cdots ,{s_k}\}$ and payoff bi-matrix

${M} = \left[ {\begin{array}{*{20}{c}} {({c_1},\overline {c_1})}&{({c_1},\overline {c_2})}& \cdots &{({c_1},\overline {c_k})} \\ {({c_2},\overline {c_1})}&{({c_2},\overline {c_2})}& \cdots &{({c_2},\overline {c_k})} \\ \vdots & \vdots &{}& \vdots \\ {({c_k},\overline {c_1})}&{({c_k},\overline {c_2})}& \cdots &{({c_k},\overline {c_k})} \end{array}} \right]$ (6)

${M_1}$ denotes the matrix formed by the first entries of the pairs in ${M}$, and ${M_2}$ denotes the matrix formed by the second entries.

3) Strategy updating rule: the deterministic unconditional imitation rule is adopted. At time $t + 1$, player $i$ imitates the strategy of the same-layer neighbor $j \in {\rm SN}_i$ that obtained the best payoff at time $t$. Let

$\begin{gathered} j_v^ * = {\operatorname{argmax} _{j \in {\rm SN}_i}}\,{p_j}(x(t)) \\ j_w^ * = {\operatorname{argmax} _{j \in {\rm SN}_i}}\,{p_j}(y(t)) \\ \end{gathered}$ (7)

Then

${x_i}(t + 1) = {x_{j_v^ * }}(t),\;\;{y_i}(t + 1) = {y_{j_w^ * }}(t).$ (8)

This can be written compactly as

$({x_i}(t + 1),{y_i}(t + 1)) = f(x(t),y(t))$ (9)

where ${x_i}(t)$ and ${y_i}(t)$ are the strategies of players ${v_i}$ and ${w_i}$ at time $t$, $x(t) = ({x_1}(t),{x_2}(t), \cdots ,{x_{n_1}}(t))$ and $y(t) = ({y_1}(t),{y_2}(t), \cdots ,{y_{n_2}}(t))$.

If the best-payoff neighbor is not unique, say

$\begin{gathered} {\operatorname{argmax} _{j \in {\rm SN}_i}}\,{p_j}(x(t)) = \{ j_{v1}^ * ,j_{v2}^ * , \cdots ,j_{v{r_1}}^ * \} \\ {\operatorname{argmax} _{j \in {\rm SN}_i}}\,{p_j}(y(t)) = \{ j_{w1}^ * ,j_{w2}^ * , \cdots ,j_{w{r_2}}^ * \} \\ \end{gathered}$

then the neighbor with the smallest index is selected:

$\left\{ \begin{gathered} j_v^ * = \min \{ \lambda \mid \lambda \in {\operatorname{argmax} _{j \in {\rm SN}_i}}\,{p_j}(x(t))\} \\ j_w^ * = \min \{ \mu \mid \mu \in {\operatorname{argmax} _{j \in {\rm SN}_i}}\,{p_j}(y(t))\} \\ \end{gathered} \right.$ (10)

Here ${\mathcal{N}_i} = \{ {v_j} \mid ({v_i},{v_j}) \in E\} = {\rm SN}_i \cup {\rm DN}_i$ denotes the set of all neighbors of player $i$; ${\rm SN}_i = \{ {v_j} \mid ({v_i},{v_j}) \in {E_v} \cup {E_w}\}$ is the set of neighbors in the same group as $i$, and ${\rm DN}_i = \{ {v_j} \mid ({v_i},{v_j}) \in {E_{vw}}\}$ is the set of neighbors in the other group. Because large and small enterprises both cooperate and compete, this paper assumes that the game is played with neighbors in the other group, ${\rm DN}_i$, whereas strategy updating takes place among same-group neighbors, ${\rm SN}_i$.
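The updating rule (7), (8), (10) can be sketched for one layer as follows. This is an illustration under stated assumptions rather than the authors' implementation: the function name `imitate` is hypothetical, players are indexed from 0, and a player without same-layer neighbors is assumed to keep its strategy.

```python
def imitate(strategies, payoffs, same_layer_neighbors):
    """One synchronous update of a layer by unconditional imitation.

    strategies[i], payoffs[i]: strategy and payoff of player i at time t.
    same_layer_neighbors[i]: indices of the same-layer neighbours SN_i.
    """
    updated = []
    for i, nbrs in enumerate(same_layer_neighbors):
        if not nbrs:
            updated.append(strategies[i])   # assumption: isolated players do not change
            continue
        best = max(payoffs[j] for j in nbrs)
        j_star = min(j for j in nbrs if payoffs[j] == best)  # tie-break of Eq. (10)
        updated.append(strategies[j_star])
    return updated

# Three large enterprises on a path v1 - v2 - v3
nbrs = [[1], [0, 2], [1]]
updated = imitate(["s1", "s2", "s1"], [4, 9, 4], nbrs)
```

Here `v2` sees two neighbors with equal payoff and copies the one with the smaller index, as prescribed by (10).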

2.2 Analysis of the game dynamics


The adjacency matrix of the double-layer network is partitioned as

${A} = \left[ {\begin{array}{*{20}{c}} {{({A_v})}_{{n_1} \times {n_1}}}&{{({A_{vw}})}_{{n_1} \times {n_2}}} \\ {{({A_{wv}})}_{{n_2} \times {n_1}}}&{{({A_w})}_{{n_2} \times {n_2}}} \end{array}} \right]$

where ${A_v} = A_v^{\rm{T}}$, ${A_w} = A_w^{\rm{T}}$ and ${A_{wv}} = A_{vw}^{\rm{T}}$. To facilitate the analysis, the game is converted into an algebraic form. Let ${x_i}(t)$ $(1 \leqslant i \leqslant {n_1})$ be the state of player ${v_i}$ at time $t$, let ${y_i}(t)$ $(1 \leqslant i \leqslant {n_2})$ be the state of player ${w_i}$ at time $t$, and identify the $j$-th strategy ${s_j}$ with $\delta _k^j$, written ${s_j} \sim \delta _k^j$.

The payoff function of player ${v_i}$ $(1 \leqslant i \leqslant {n_1})$ is

$\begin{split} {p_{v_i}}(t) &= V_r^{\rm{T}}({M_1})\,{x_i}(t)\sum\limits_{j \in {\rm DN}_i} {y_j}(t) \\ &= V_r^{\rm{T}}({M_1})\,{x_i}(t)\,{\rm Row}_i({A_{vw}})\,{\Pi ^{n_2}}y(t) \\ &= V_r^{\rm{T}}({M_1})\,({I_k} \otimes {\rm Row}_i({A_{vw}})\,{\Pi ^{n_2}})\,\pi _i^{n_1}x(t)y(t) \\ &:= {M_{v_i}}x(t)y(t) \end{split}$ (11)

The payoff function of player ${w_i}$ $(1 \leqslant i \leqslant {n_2})$ is

$\begin{split} {p_{w_i}}(t) &= V_r^{\rm{T}}(M_2^{\rm{T}})\,{y_i}(t)\sum\limits_{j \in {\rm DN}_i} {x_j}(t) \\ &= V_r^{\rm{T}}(M_2^{\rm{T}})\,{W_{[k,k]}}\,{\rm Row}_i({A_{wv}})\,{\Pi ^{n_1}}x(t)\,\pi _i^{n_2}y(t) \\ &= V_r^{\rm{T}}(M_2^{\rm{T}})\,{W_{[k,k]}}\,{\rm Row}_i({A_{wv}})\,{\Pi ^{n_1}}({I_{k^{n_1}}} \otimes \pi _i^{n_2})\,x(t)y(t) \\ &:= {M_{w_i}}x(t)y(t) \end{split}$ (12)

where $x(t) = \ltimes _{i = 1}^{n_1}{x_i}(t)$ and $y(t) = \ltimes _{i = 1}^{n_2}{y_i}(t)$.
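The chain of equalities in (11) can be reproduced numerically. The sketch below builds $M_{v_i}$ from the last expression in (11) and reads payoffs off its columns. It assumes NumPy and Python 3.9 or later; the helper names are hypothetical, and the payoff matrix and inter-layer adjacency block are the ones used in the example of Section 3.

```python
import math
import numpy as np

def stp(A, B):
    """Semi-tensor product of Definition 1."""
    A, B = np.atleast_2d(A), np.atleast_2d(B)
    l = math.lcm(A.shape[1], B.shape[0])
    return np.kron(A, np.eye(l // A.shape[1])) @ np.kron(B, np.eye(l // B.shape[0]))

def retriever(k, n, i):
    """π_i^n of Lemma 1 (1-based i), assuming 1_m is a row of ones."""
    return np.kron(np.kron(np.ones((1, k ** (i - 1))), np.eye(k)),
                   np.ones((1, k ** (n - i))))

def payoff_matrix_v(M1, A_vw, i, k):
    """M_{v_i} = V_r^T(M_1) (I_k ⊗ Row_i(A_vw) Π^{n2}) π_i^{n1}, size 1 × k^(n1+n2)."""
    n1, n2 = A_vw.shape
    Pi = np.vstack([retriever(k, n2, j) for j in range(1, n2 + 1)])  # Π^{n2}
    R = stp(A_vw[i - 1:i, :], Pi)        # Row_i(A_vw) ⋉ Π^{n2}: sums neighbours' strategies
    Vr_T = M1.reshape(1, -1)             # V_r^T(M_1), row stacking
    return stp(stp(Vr_T, np.kron(np.eye(k), R)), retriever(k, n1, i))

M1 = np.array([[5, 4], [9, 0]])
A_vw = np.array([[1, 0], [0, 1], [1, 1]])
M_v3 = payoff_matrix_v(M1, A_vw, 3, k=2)
# Column j is the payoff of v_3 at the profile δ_32^j; for example,
# column 4 (x = (s1, s1, s1), y = (s2, s2)) gives 4 + 4 = 8.
```

The same construction with $M_2^{\rm T}$, $A_{wv}$ and the swap matrix yields $M_{w_i}$ of (12).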


Definition 3[18]  For a game $\mathcal{G}$, a strategy profile ${x^ * } = (x_1^ * ,x_2^ * , \cdots ,x_n^ * ) \in {S_1} \times {S_2} \times \cdots \times {S_n}$ is a Nash equilibrium if ${p_i}(x_i^ * ,x_{ - i}^ * ) \geqslant {p_i}({x_i},x_{ - i}^ * )$ holds for all $i \in N$ and ${x_i} \in {S_i}$, where $N = \{ 1,2, \cdots ,n\}$ is the set of players, ${S_i}$ is the strategy set of the $i$-th player, and $x_{ - i}^ * = (x_1^ * ,x_2^ * , \cdots ,x_{i - 1}^ * ,x_{i + 1}^ * , \cdots ,x_n^ * )$.

Proposition 2[18]  For any $x,y \in {\varDelta _k}$ with $x \ne y$, there exists an integer $1 \leqslant r \leqslant k - 1$ such that $x = M_{o,k}^r y$, where ${M_{o,k}} = {\delta _k}[2\;3\; \cdots \;k\;1]$ is the structure matrix of the $k$-valued logical operator ${\varTheta _k}$, which satisfies

${\varTheta _k}\left( {\frac{i}{k - 1}} \right) = \left\{ {\begin{array}{*{20}{l}} {\displaystyle\frac{i - 1}{k - 1},\quad i > 0} \\ {1,\quad i = 0} \end{array}} \right.$

Lemma 4[18]  Consider the double-layer networked evolutionary game $G = (\mathcal{G},{S_1},{S_2})$ with payoff functions (11) and (12). $G$ has a Nash equilibrium if and only if there exists an integer $1 \leqslant j \leqslant {2^n}$ such that ${\rm Col}_j({M_p}) \geqslant 0$. Moreover, $\{ \delta _{2^n}^j \mid {\rm Col}_j({M_p}) \geqslant 0,\;1 \leqslant j \leqslant {2^n}\}$ is the set of all Nash equilibria, where ${M_p} = {[M_{v_1}^{\rm{T}}\;M_{v_2}^{\rm{T}} \cdots M_{v_{n_1}}^{\rm{T}}\;M_{w_1}^{\rm{T}} \cdots M_{w_{n_2}}^{\rm{T}}]^{\rm{T}}}$ and, inside ${M_p}$, the blocks ${M_{v_i}}$ and ${M_{w_i}}$ denote the stacked matrices defined as follows.

For $1 \leqslant r \leqslant k - 1$,

${M_{{v_i},r}} = {M_{v_i}}[{I_{k^{i - 1}}} \otimes ({I_k} - M_{o,k}^r)],$
$M_{v_i}^{\rm{T}} = [M_{{v_i},1}^{\rm{T}}\;M_{{v_i},2}^{\rm{T}} \cdots M_{{v_i},k - 1}^{\rm{T}}];$

for $1 \leqslant \gamma \leqslant k - 1$,

${M_{{w_i},\gamma }} = {M_{w_i}}[{I_{k^{{n_1} + i - 1}}} \otimes ({I_k} - M_{o,k}^\gamma )],$
$M_{w_i}^{\rm{T}} = [M_{{w_i},1}^{\rm{T}}\;M_{{w_i},2}^{\rm{T}} \cdots M_{{w_i},k - 1}^{\rm{T}}].$




1) The payoff functions can be rewritten as

${p_{v_i}}({x_i}(t),y(t)) = M_{v_i}'\,y(t){x_i}(t),$
${p_{w_i}}(x(t),{y_i}(t)) = M_{w_i}'\,x(t){y_i}(t).$

2) For any strategies $x$ and $y$, the best-response strategy of a large-enterprise player is ${\rm BR}_{v_i} = L_{v_i}'\,y$, $1 \leqslant i \leqslant {n_1}$, where

${l_{j,{v_i}}} = \min \{ l \mid {\rm Col}_l({\rm Blk}_j(M_{v_i}')),\;1 \leqslant l \leqslant k\} ,$
${\rm Col}_j(L_{v_i}') = \delta _k^{l_{j,{v_i}}}.$

Similarly, the best-response strategy of a small-enterprise player is

${\rm BR}_{w_i} = L_{w_i}'\,x,\quad 1 \leqslant i \leqslant {n_2},$
${l_{j,{w_i}}} = \min \{ l \mid {\rm Col}_l({\rm Blk}_j(M_{w_i}')),\;1 \leqslant l \leqslant k\} ,$
${\rm Col}_j(L_{w_i}') = \delta _k^{l_{j,{w_i}}}.$

For the large-enterprise players, the update takes the form

${x_i}(t + 1) = {L_{v_i}}y(t),$
${\overline M_{v_i}} = {\rm Row}_i({A_v} + {I_{n_1}}){\left[ {\begin{array}{*{20}{c}} {L_{v_1}'}&{L_{v_2}'}& \cdots &{L_{v_{n_1}}'} \end{array}} \right]^{\rm{T}}},$

${\alpha _{s,{v_i}}} = \min \{ \alpha \mid {\rm Row}_\alpha {({\overline M_{v_i}})_{\alpha s}}\} ,\quad {\rm Col}_s({L_{v_i}}) = \delta _k^{\alpha _{s,{v_i}}}.$

For the small-enterprise players,

${y_i}(t + 1) = {L_{w_i}}x(t),$
${\overline M_{w_i}} = {\rm Row}_i({A_w} + {I_{n_2}}){\left[ {\begin{array}{*{20}{c}} {L_{w_1}'}&{L_{w_2}'}& \cdots &{L_{w_{n_2}}'} \end{array}} \right]^{\rm{T}}},$

${\beta _{s,{w_i}}} = \min \{ \beta \mid {\rm Row}_\beta {({\overline M_{w_i}})_{\beta s}}\} ,\quad {\rm Col}_s({L_{w_i}}) = \delta _k^{\beta _{s,{w_i}}}.$

Thus $x(t + 1) = {L_v}y(t)$ and $y(t + 1) = {L_w}x(t)$, where ${L_v} = {L_{v_1}} * {L_{v_2}} * \cdots * {L_{v_{n_1}}}$ and ${L_w} = {L_{w_1}} * {L_{w_2}} * \cdots * {L_{w_{n_2}}}$ are Khatri-Rao products.


By the pseudo-commutative property and the swap matrix of Proposition 1,

$\begin{aligned} x(t + 1)y(t + 1) &= {L_v}y(t){L_w}x(t) = {L_v}({I_{k^{n_2}}} \otimes {L_w})y(t)x(t) \\ &= {L_v}({I_{k^{n_2}}} \otimes {L_w}){W_{[{k^{n_1}},{k^{n_2}}]}}x(t)y(t) := Lx(t)y(t) \end{aligned}$ (13)

where ${L} = {L_v}({I_{k^{n_2}}} \otimes {L_w}){W_{[{k^{n_1}},{k^{n_2}}]}}$ is the structure matrix of the evolution on the double-layer coupled network; the evolutionary dynamics of the game can be analyzed through it.
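Identity (13) can be sanity-checked on small random data. In the sketch below, `Lv` and `Lw` are randomly generated logical matrices (not the ones of the paper), the sizes `k`, `n1`, `n2` are arbitrary, and the semi-tensor products in (13) are expanded by hand into Kronecker products.

```python
import numpy as np

def swap_matrix(m, n):
    """W_[m,n] with W_[m,n] (x ⊗ y) = y ⊗ x for x in R^m, y in R^n."""
    W = np.zeros((m * n, m * n))
    for i in range(m):
        for j in range(n):
            W[j * m + i, i * n + j] = 1.0
    return W

def random_logical(rows, cols, rng):
    """Random logical matrix: exactly one 1 per column."""
    L = np.zeros((rows, cols))
    L[rng.integers(rows, size=cols), np.arange(cols)] = 1.0
    return L

k, n1, n2 = 2, 2, 1
rng = np.random.default_rng(1)
Lv = random_logical(k ** n1, k ** n2, rng)   # x(t+1) = Lv y(t)
Lw = random_logical(k ** n2, k ** n1, rng)   # y(t+1) = Lw x(t)

# L = Lv ⋉ (I ⊗ Lw) ⋉ W, where Lv ⋉ (I ⊗ Lw) = (Lv ⊗ I)(I ⊗ Lw)
I = np.eye(k ** n2)
L = np.kron(Lv, I) @ np.kron(I, Lw) @ swap_matrix(k ** n1, k ** n2)

def identity_holds():
    """Check L (x ⊗ y) = (Lv y) ⊗ (Lw x) on every canonical state."""
    for a in range(k ** n1):
        for b in range(k ** n2):
            x = np.eye(k ** n1)[:, a]
            y = np.eye(k ** n2)[:, b]
            if not np.array_equal(L @ np.kron(x, y), np.kron(Lv @ y, Lw @ x)):
                return False
    return True
```

The resulting `L` is again a logical matrix, so Lemma 3 applies to it directly.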

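With the semi-tensor product method, the structure matrix ${L}$ of the game follows from the dynamic equation (13). Analyzing ${L}$ yields the fixed points and limit cycles of the game, which leads to Theorem 1.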

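Theorem 1  Suppose there exists an integer $1 \leqslant j \leqslant {2^n}$ satisfying both of the following conditions:

1) ${\rm Col}_j({M_p}) \geqslant 0$;

2) $\delta _{2^n}^j$ is one of the ${N_e} = {\rm Trace}({L})$ fixed points of ${L} = {L_v}({I_{k^{n_2}}} \otimes {L_w}){W_{[{k^{n_1}},{k^{n_2}}]}}$, that is, ${\ell _{jj}} = 1$.

Then $\delta _{2^n}^j$ is called a globally stable Nash equilibrium profile.

Proof  By Lemma 4, every $\delta _{2^n}^j$ satisfying condition 1) is a Nash equilibrium, but a Nash equilibrium need not be unique, nor need it be a fixed point. By Lemma 3, if $j$ satisfies condition 2), then the strategy profile corresponding to $\delta _{2^n}^j$ is a fixed point. Hence, if $j$ satisfies conditions 1) and 2) simultaneously, the conclusion holds and $\delta _{2^n}^j$ is a globally stable pure-strategy Nash equilibrium profile. This completes the proof.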
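The two conditions of Theorem 1 amount to intersecting two index sets. A minimal sketch, in which `stable_nash` is a hypothetical helper, $L$ is passed in its condensed form $\delta_N[i_1 \cdots i_N]$, and the numbers are made up for illustration (they are not data from the paper):

```python
import numpy as np

def stable_nash(Mp, L_idx):
    """1-based indices j with Col_j(M_p) >= 0 and Col_j(L) = δ^j."""
    Mp = np.asarray(Mp)
    nash = {j + 1 for j in range(Mp.shape[1]) if np.all(Mp[:, j] >= 0)}   # condition 1)
    fixed = {j for j, target in enumerate(L_idx, start=1) if target == j}  # condition 2)
    return sorted(nash & fixed)

# Hypothetical data for two players with two strategies each
Mp = np.array([[1, -1,  0, 2],
               [0,  3, -2, 1]])
result = stable_nash(Mp, [2, 4, 4, 4])   # profiles 1 and 4 are Nash, only 4 is fixed
```

In this toy case profile 1 is a Nash equilibrium that the dynamics leave, which is exactly the situation that condition 2) rules out.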

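Corollary 1  Let $J$ be the set of all $j\;(1 \leqslant j \leqslant {2^n})$ satisfying Theorem 1. If $h \in J$ satisfies

$\left\{ \begin{aligned} & {\rm Col}_h({M_{v_i}}) \geqslant {\rm Col}_j({M_{v_i}}) \\ & {\rm Col}_h({M_{w_i}}) \geqslant {\rm Col}_j({M_{w_i}}) \end{aligned} \right.\quad h,j \in J$ (14)

then $\delta _{2^n}^h$ is called the globally stable optimal Nash equilibrium profile.

Proof  By Theorem 1, every $\delta _{2^n}^j$ with $j \in J$ is a stable Nash equilibrium. If $h \in J$ satisfies (14), then $\delta _{2^n}^h$ is the stable Nash equilibrium that maximizes the total payoff of the players, and it is therefore called the optimal stable Nash equilibrium. This completes the proof.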

2.3 Evolutionary game under government regulation


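Suppose the government grants a direct subsidy $a$ to innovating enterprises and imposes a penalty $b$ on non-innovating enterprises that only want to free-ride. The corresponding payoff bi-matrix is

$\left[ {\begin{array}{*{20}{c}} {x/y}&{s_1}&{s_2} \\ {s_1}&{(5 + a,\;1 + a)}&{(4 + a,\;4 - b)} \\ {s_2}&{(9 - b,\; - 1 + a)}&{( - b,\; - b)} \end{array}} \right]$ (15)

This control means that subsidies and penalties are applied simultaneously. The design principle is that the government only regulates enterprise innovation and should invest or profit as little as possible. It is therefore assumed that the subsidy and the penalty have the same strength, that is, $a = b\;(a > 0,\;b > 0)$.

Without the condition $a = b$, the case $a = 0$ means that the government only penalizes non-innovating enterprises, and the case $b = 0$ means that the government only subsidizes innovating enterprises.

Suitable values of $a$ and $b$ are sought by simulation so that the globally stable point is exactly the optimal pure-strategy Nash equilibrium, all large and small enterprises innovate, and the difference between the government's revenue and its investment is as small as possible. The subsidy and the penalty are not chosen blindly: to make all large and small enterprises innovate, the subsidy $a = {a_1} + {a_2}$ and the penalty $b = {b_1} + {b_2}$ must satisfy the system (16):

$\left\{ {\begin{aligned} & {(5 + {a_1}) + (1 + {a_1}) \geqslant (4 + {a_1}) + (4 - {b_1})} \\ & {(5 + {a_2}) + (1 + {a_2}) \geqslant (9 - {b_2}) + ( - 1 + {a_2})} \\ & {{a_1} = {b_1}} \\ & {{a_2} = {b_2}} \end{aligned}} \right.$ (16)

Taking the smallest values that satisfy (16) gives $a = b = 2$; that is, the subsidy and the penalty are both equal to 2.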
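The value $a = b = 2$ can be traced through (16) step by step. The following short derivation only rearranges the inequalities of (16) and reads the result as the smallest admissible strengths:

$\begin{aligned} (5 + {a_1}) + (1 + {a_1}) \geqslant (4 + {a_1}) + (4 - {b_1}) &\iff {a_1} + {b_1} \geqslant 2, \\ (5 + {a_2}) + (1 + {a_2}) \geqslant (9 - {b_2}) + ( - 1 + {a_2}) &\iff {a_2} + {b_2} \geqslant 2, \\ {a_1} = {b_1},\;{a_2} = {b_2} &\implies {a_1} \geqslant 1,\;{a_2} \geqslant 1, \\ a = {a_1} + {a_2} \geqslant 2, &\qquad b = {b_1} + {b_2} \geqslant 2. \end{aligned}$

Equality holds for ${a_1} = {b_1} = {a_2} = {b_2} = 1$, which is the critical value $a = b = 2$.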

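After the payoff matrix is changed, (13) is recomputed to obtain the transition matrix ${L}$ of the game, and the evolutionary properties of the game are analyzed with Theorem 1 and Corollary 1. If the ideal state in which all enterprises innovate is not reached, the control strength is adjusted until all large and small enterprise players choose the innovation strategy ${s_1}$.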


3 Example


For ease of computation, consider an enterprise innovation network consisting of 3 large enterprises and 2 small enterprises, and analyze the evolutionary game on it. The network topology is shown in Fig. 1, where the first layer consists of the large enterprises, ${n_1} = \{ 1,2,3\}$, and the second layer consists of the small enterprises, ${n_2} = \{ 4,5\}$.

Fig. 1 The game structure graph of different scale enterprises

The blocks of the adjacency matrix are

${A_v} = {\left[ {\begin{array}{*{20}{c}}0&1&0\\1&0&1\\0&1&0\end{array}} \right]_{3 \times 3}},\;{A_w} = {\left[ {\begin{array}{*{20}{c}}0&1\\1&0\end{array}} \right]_{2 \times 2}},\;{A_{vw}} = {\left[ {\begin{array}{*{20}{c}}1&0\\0&1\\1&1\end{array}} \right]_{3 \times 2}}$

Case 1  The government does not intervene ($a = b = c = d = 0$), and the enterprises evolve under market regulation. The payoff matrices are

${M} = \left[ {\begin{array}{*{20}{c}}{(5,1)}&{(4,4)}\\{(9, - 1)}&{(0,0)}\end{array}} \right],\;{M_1} = \left[ {\begin{array}{*{20}{c}}5&4\\9&0\end{array}} \right],\;{M_2} = \left[ {\begin{array}{*{20}{c}}1&4\\{ - 1}&0\end{array}} \right]$


${M_p} = \left[ {\begin{array}{*{16}{r}}
-4&-4&4&4&-4&-4&4&4&-4&-4&4&4&-4&-4&4&4\\
-4&4&-4&4&-4&4&-4&4&4&-4&4&-4&4&-4&4&-4\\
-8&0&0&8&8&0&0&-8&-8&0&0&8&8&0&0&-8\\
-6&-6&6&6&-4&-4&4&4&-6&-6&6&6&-4&-4&4&4\\
-6&6&-6&6&-4&4&-4&4&-4&4&-4&4&-2&2&-2&2
\end{array}} \right.$
${\left. {\begin{array}{*{16}{r}}
4&4&-4&-4&4&4&-4&-4&4&4&-4&-4&4&4&-4&-4\\
-4&4&-4&4&-4&4&-4&4&4&-4&4&-4&4&-4&4&-4\\
-8&0&0&8&8&0&0&-8&-8&0&0&8&8&0&0&-8\\
-4&-4&4&4&-2&-2&2&2&-4&-4&4&4&-2&-2&2&2\\
-6&6&-6&6&-4&4&-4&4&-4&4&-4&4&-2&2&-2&2
\end{array}} \right]_{5 \times 32}}$

(The first block holds columns 1 to 16 of ${M_p}$ and the second block holds columns 17 to 32.)

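In this case ${\rm Col}_4({M_p}) > 0$, so the strategy profile $\delta _{32}^4 = \delta _2^1\delta _2^1\delta _2^1\delta _2^2\delta _2^2$ is a Nash equilibrium of the whole networked game. Among the 5 enterprises, all large enterprises innovate and all small enterprises do not.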
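The entries of ${M_p}$ can be cross-checked without the semi-tensor product machinery by enumerating all 32 profiles and computing, for each player, the payoff at the profile minus the payoff after a unilateral switch. The sketch below is such a brute-force check, not the computation used in the paper; it assumes the topology ${A_{vw}}$ and the payoff matrices ${M_1}$, ${M_2}$ of Case 1, and codes ${s_1}$ as 0 and ${s_2}$ as 1.

```python
import itertools
import numpy as np

M1 = np.array([[5, 4], [9, 0]])       # payoffs of large enterprises
M2 = np.array([[1, 4], [-1, 0]])      # payoffs of small enterprises
A_vw = np.array([[1, 0], [0, 1], [1, 1]])
n1, n2 = A_vw.shape

def payoffs(x, y):
    """Payoffs of v_1..v_n1 followed by w_1..w_n2 (game played across layers)."""
    pv = [sum(M1[x[i], y[j]] for j in range(n2) if A_vw[i, j]) for i in range(n1)]
    pw = [sum(M2[x[i], y[j]] for i in range(n1) if A_vw[i, j]) for j in range(n2)]
    return pv + pw

def deviation_matrix():
    """Column j: payoff at profile δ_32^j minus payoff after a unilateral switch."""
    cols = []
    # the last coordinate varies fastest, matching x_1 ⋉ x_2 ⋉ x_3 ⋉ y_1 ⋉ y_2
    for prof in itertools.product((0, 1), repeat=n1 + n2):
        base = payoffs(prof[:n1], prof[n1:])
        col = []
        for p in range(n1 + n2):
            alt = list(prof)
            alt[p] = 1 - alt[p]
            col.append(base[p] - payoffs(alt[:n1], alt[n1:])[p])
        cols.append(col)
    return np.array(cols).T

Mp = deviation_matrix()
nash = [j + 1 for j in range(Mp.shape[1]) if np.all(Mp[:, j] >= 0)]
```

With these inputs, column 4 of `Mp` is $(4, 4, 8, 6, 6)^{\rm T}$ and `nash` contains only the index 4, in agreement with the matrix above.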


The evolution of the game is described by

$x(t + 1)y(t + 1) = {L}x(t)y(t)$

with

${L} = {\delta _{32}}[\underbrace {4\;4\; \cdots \;4}_{32}]$

By Lemma 3, the final state of the game is a fixed point, namely the strategy profile $\delta _{32}^4 = \delta _2^1\delta _2^1\delta _2^1\delta _2^2\delta _2^2$, and no limit cycle appears. By Theorem 1 and Corollary 1, the profile $\delta _{32}^4$ is the globally optimal stable pure-strategy Nash equilibrium, but only the large enterprises innovate. To make all players innovate and reach a stable state, we consider the following case.

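Case 2  The government subsidizes innovating enterprises by $a$ and penalizes non-innovating enterprises by $b$. Equations (15) and (16) give the critical value $a = b = 2$. In a MATLAB simulation with $a = b = 2$, computing ${M_p}$ gives $J = \{ 1,5,9,13,17,21,25,29\}$ and ${L} = {\delta _{32}}[1\;1\;1\;1 \cdots 1\;1\;1\;1]$. By Theorem 1 and Corollary 1, the globally stable Nash equilibrium is $\delta _{32}^1 = \delta _2^1\delta _2^1\delta _2^1\delta _2^1\delta _2^1$, that is, all large and small enterprises choose to innovate, and this equilibrium is the optimal stable Nash equilibrium. If $a = b > 2$, say $a = b = 2.25$, computing ${M_p}$ gives $J = \{ 1\}$ and ${L} = {\delta _{32}}[1\;1\;1\;1 \cdots 1\;1\;1\;1]$; by Corollary 1, the globally optimal stable Nash equilibrium of the whole networked game is $\delta _{32}^1 = \delta _2^1\delta _2^1\delta _2^1\delta _2^1\delta _2^1$, which meets the control objective that all large and small enterprises innovate. When $a = b < 2$, the subsidy and the penalty are too weak, and the evolution remains as in the case without government intervention.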
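The index sets $J$ reported for Case 2 can be reproduced by a direct enumeration of pure Nash profiles under the regulated payoff matrix (15). The sketch below is a brute-force check under the example's topology, not the MATLAB computation mentioned in the text; `nash_set` is a hypothetical helper, and ${s_1}$, ${s_2}$ are coded as 0, 1.

```python
import itertools
import numpy as np

A_vw = np.array([[1, 0], [0, 1], [1, 1]])
n1, n2 = A_vw.shape

def nash_set(a, b):
    """1-based indices j such that δ_32^j is a pure Nash profile under Eq. (15)."""
    M1 = np.array([[5 + a, 4 + a], [9 - b, -b]])     # large enterprises
    M2 = np.array([[1 + a, 4 - b], [-1 + a, -b]])    # small enterprises

    def payoff(p, prof):
        x, y = prof[:n1], prof[n1:]
        if p < n1:
            return sum(M1[x[p], y[j]] for j in range(n2) if A_vw[p, j])
        q = p - n1
        return sum(M2[x[i], y[q]] for i in range(n1) if A_vw[i, q])

    result = []
    profiles = itertools.product((0, 1), repeat=n1 + n2)
    for j, prof in enumerate(profiles, start=1):
        stable = True
        for p in range(n1 + n2):
            alt = prof[:p] + (1 - prof[p],) + prof[p + 1:]
            if payoff(p, alt) > payoff(p, prof):     # a profitable unilateral switch
                stable = False
        if stable:
            result.append(j)
    return result

no_control = nash_set(0, 0)          # Case 1
critical = nash_set(2, 2)            # a = b = 2
above = nash_set(2.25, 2.25)         # a = b = 2.25
```

At the critical value the large enterprises are indifferent when all their partners innovate (both choices pay 7 per partner), which is why eight profiles satisfy the Nash condition there and only one remains once $a = b$ exceeds 2.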


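4 Conclusions

In this paper, enterprises are divided by scale into large and small enterprises, and a double-layer coupled network of enterprise innovation is established. Using the semi-tensor product of matrices and taking the "Boxed Pig Game" as the basic game, the strategy of each enterprise at every moment is obtained, so that the evolution of the whole network profile over time can be followed. The optimal stable Nash equilibrium of the whole network is obtained from the payoff functions. Finally, government regulation changes the basic payoff matrix of the game so that the network reaches the optimal stable Nash equilibrium in which all enterprises innovate.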

[1] WU Qingchu, LOU Yijun, ZHU Wenfang. Epidemic outbreak for an SIS model in multiplex networks with immunization[J]. Mathematical biosciences, 2016, 277: 38-46. DOI:10.1016/j.mbs.2016.04.004
[2] WANG Danzhu, LANG M X, SUN Yan. Evolutionary game analysis of co-opetition relationship between regional logistics nodes[J]. Journal of applied research and technology, 2014, 12(2): 251-260. DOI:10.1016/S1665-6423(14)72341-7
[3] ETESAMI S R, BASAR T. Complexity of equilibrium in diffusion games on social networks[C]//Proceedings of 2014 American Control Conference. Portland, USA, 2014: 2065-2070.
[4] YAN Yongyi, CHEN Zengqiang, YUE Jumei. STP approach to controllability of finite state machines[J]. IFAC-PapersOnLine, 2015, 48(28): 138-143. DOI:10.1016/j.ifacol.2015.12.114
[5] YU Binbin, YU Lei. A study on cluster enterprise technology innovation selection based on the evolutionary game[J]. Science research management, 2015, 36(4): 30-38.
[6] ZHOU Qing, FANG Gang, WANG Dongpeng, et al. Research on the robust optimization of the enterprise's decision on the investment to the collaborative innovation: under the risk constraints[J]. Chaos, solitons and fractals, 2016, 89: 284-289. DOI:10.1016/j.chaos.2015.11.021
[7] SHENG Guanghua, ZHANG Zhiyuan. Allowance method's influence on the innovation model choice in evolutionary game[J]. Journal of management sciences in China, 2015, 18(9): 34-45. DOI:10.3969/j.issn.1007-9807.2015.09.004
[8] WANG Jian, ZHAO Kai. Cooperative innovation of enterprises under "Boxed pig Game": a research based on asymmetric evolutionary game[J]. Science and technology and economy, 2016, 29(2): 21-25.
[9] LIU Mengmeng, MA Yinghong, LIU Zhiyuan, et al. An IUR evolutionary game model on the patent cooperate of Shandong China[J]. Physica A: statistical mechanics and its applications, 2017, 475: 11-23. DOI:10.1016/j.physa.2017.01.086
[10] CHENG Daizhan, XU Tingting, QI Hongsheng. Evolutionary stability strategy of networked evolutionary games[J]. IEEE transactions on neural networks and learning systems, 2014, 25(7): 1335-1345. DOI:10.1109/TNNLS.2013.2293149
[11] CHENG Daizhan, HE Fenghua, QI Hongsheng, et al. Modeling, analysis and control of networked evolutionary games[J]. IEEE transactions on automatic control, 2015, 60(9): 2402-2415. DOI:10.1109/TAC.2015.2404471
[12] CHENG Daizhan, QI Hongsheng, HE F, et al. Semi-tensor product approach to networked evolutionary games[J]. Control theory and technology, 2014, 12(2): 198-214. DOI:10.1007/s11768-014-0038-9
[13] CHENG Daizhan, XU Tingting, HE Fenghua, et al. On dynamics and Nash equilibriums of networked games[J]. IEEE/CAA journal of automatica sinica, 2014, 1(1): 10-18. DOI:10.1109/JAS.2014.7004614
[14] ZHAO Yin, GHOSH B K, CHENG Daizhan. Control of large-scale Boolean networks via network aggregation[J]. IEEE transactions on neural networks and learning systems, 2016, 27(7): 1527-1536. DOI:10.1109/TNNLS.2015.2442593
[15] LI Haitao, WANG Yuzhen, XIE Lihua. Output tracking control of Boolean control networks via state feedback: constant reference signal case[J]. Automatica, 2015, 59: 54-59. DOI:10.1016/j.automatica.2015.06.004
[16] LI Haitao, XIE Lihua, WANG Yuzhen. On robust control invariance of Boolean control networks[J]. Automatica, 2016, 68: 392-396. DOI:10.1016/j.automatica.2016.01.075
[17] CHENG Daizhan, ZHAO Yin, XU Tingting. Receding horizon based feedback optimization for mix-valued logical networks[J]. IEEE transactions on automatic control, 2015, 60(12): 3362-3366. DOI:10.1109/TAC.2015.2419874
[18] GUO Peilian, WANG Yuzhen, JIANG Ping. Nash equilibrium, dynamics and control of evolutionary networked games with multi-group[C]//Proceedings of the 35th Chinese Control Conference. Chengdu, China, 2016: 585-590.
[19] GUO Peilian, WANG Yuzhen, LI Haitao. Algebraic formulation and strategy optimization for a class of evolutionary networked games via semi-tensor product method[J]. Automatica, 2013, 49(11): 3384-3389. DOI:10.1016/j.automatica.2013.08.008
[20] ZHAO Guodong, WANG Yuzhen. Formulation and optimization control of a class of networked evolutionary games with switched topologies[J]. Nonlinear analysis: hybrid systems, 2016, 22: 98-107. DOI:10.1016/j.nahs.2016.03.009
[21] ZHU Bing, XIA Xiaohua, WU Zhou. Evolutionary game theoretic demand-side management and control for a class of networked smart grid[J]. Automatica, 2016, 70: 94-100. DOI:10.1016/j.automatica.2016.03.027
[22] FU Shihua, WANG Yuzhen, ZHAO Guodong. A matrix approach to the analysis and control of networked evolutionary games with bankruptcy mechanism[J]. Asian journal of control, 2017, 19(2): 717-727. DOI:10.1002/asjc.1412