License: arXiv.org perpetual non-exclusive license
arXiv:2403.10980v1 [cs.GT] 16 Mar 2024

Inverse learning of black-box aggregator for robust Nash equilibrium

Guanpu Chen, Gehui Xu, Fengxiang He, Dacheng Tao, Thomas Parisini, Karl Henrik Johansson G. Chen , G. Xu, and Karl H. Johansson are with Division of Decision and Control Systems, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, and also with Digital Futures, Stockholm 100 44, Sweden. (e-mail: [email protected], [email protected], [email protected])F. He is with Artificial Intelligence and its Applications Institute, School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, Scotland. (e-mail: [email protected])D. Tao is with the College of Computing & Data Science, Nanyang Technological University, Singapore 639798. (e-mail: [email protected])T. Parisini is with Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK, and also with Department of Engineering and Architecture, University of Trieste, Trieste 34127, Italy. (e-mail: [email protected])
Abstract

In this note, we investigate the robustness of Nash equilibria (NE) in multi-player aggregative games with coupling constraints. There are many algorithms for computing an NE of an aggregative game given a known aggregator. When the coupling parameters are affected by uncertainty, robust NE need to be computed. We consider a scenario where players’ weight in the aggregator is unknown, making the aggregator kind of “a black box”. We pursue a suitable learning approach to estimate the unknown aggregator by proposing an inverse variational inequality-based relationship. We then utilize the counterpart to reconstruct the game and obtain first-order conditions for robust NE in the worst case. Furthermore, we characterize the generalization property of the learning methodology via an upper bound on the violation probability. Simulation experiments show the effectiveness of the proposed inverse learning approach.

1 Introduction

Multi-player game-theoretical models have gained popularity as they offer a comprehensive understanding of interactions in multi-agent systems. Aggregative games play a particularly important role [1, 2, 3] in non-cooperative games, where each player’s payoff is dependent on both its action and an aggregator of all players’ weighted actions. Indeed, more and more aggregative games enjoy widespread applications, such as demand response management [4], congestion communication control [5], and public environmental investigation [6]. In this connection, many algorithms have been developed and deployed to compute a Nash equilibrium (NE) in aggregative games within several different scenarios [7, 8, 9, 10, 11].

In actual application contexts, though, uncertainty inevitably emerges, for example, in electric vehicle charging [12], security resource allocation [13], or moving target defense [14]. Therefore, the robustness of solutions is not an option but is a necessity. Robust game theory[15], where uncertainty arises in players’ payoffs or strategies, draws inspiration from robust optimization [16, 17].

More specifically, robust equilibrium seeking in multi-player games can be divided into two main categories. One consists in enforcing satisfaction of all uncertain feasibility [18, 19, 20]. This viewpoint originates from the deterministic robust optimization [16], in order to reveal the worst-case solution subject to all possible conditions. Usually, such approaches employ robust counterparts to reconstruct the problem against uncertainty. The other category is to address uncertainty with a high probability [21, 22, 23], stemming from scenario programming [24, 17], to randomly extract finite samples from uncertain feasibility and reconstruct the problem under their intersection. This viewpoint usually concerns how to find the supported samples.

Nonetheless, consider a practical scenario where a well-developed algorithm like [7, 8, 9, 10, 11] has been already deployed to compute NE given an aggregative game. The system automatically runs after deployment, and the algorithm returns an NE corresponding to the given parameters. In this setting, players’ weight in the aggregator turns out to be unnecessarily known to the public thus making the aggregator kind of “a black box”. When parameters in the system are affected by uncertainty, the equilibrium-seeking algorithm still returns an NE given a perturbed parameter, but robustness is lost. The internal knowledge of the game model is indispensable to achieve robustness. However, the black-box aggregator prevents us from directly using the existing methods [18, 19, 20, 21, 22, 23] to address uncertainty. A suitable learning approach to “disassemble” the black-box aggregator and estimate players’ weight therein is thus needed to obtain robust NE.

In this note, we focus on approaching robust NE in a class of aggregative games with uncertainty. We first formulate the game model under uncertainty and give the concept of robust NE in the worst case. Then, we consider the situation where the players’ weight in the aggregator is unknown to the public. We propose to address the robustness by recovering the black-box aggregator from data and reconstructing the problem from a robustness perspective.

The main contribution is threefold.

  • A learning method is proposed to estimate players’ weight in the black-box aggregator. By assembling perturbed parameters and computed NE into data, we obtain an inverse variational inequality relationship (Theorem 1). We employ a slack variable as the loss, the minimization of which enables to state an inverse optimization problem.

  • Through the counterpart, the robustness of NE is addressed by transforming the recovered aggregative game with uncertainty into a deterministic worst-case model (Theorem 2). We show first-order conditions of robust NE, making gradient-based approaches usable.

  • To characterize the learning performance, a generalization guarantee of the proposed method is provided by using the violation probability. A generalization bound indicates not only the independence of the uncertainty distribution, but also the exponential convergence as the dataset size increases (Theorem 3).

The note is organized as follows. Section II gives the problem formulation whereas Section III illustrates our learning approach. Section IV addresses the robustness and Section V presents the generalization aspects. Section VI shows the effectiveness of our methodology via extensive numerical results and Section VII gives a few concluding remarks.

2 Problem Formulation

In this section, we show the game model with uncertainty, the robustness of NE, and the problem statement.

2.1 Game Model with Uncertainty

Consider a multi-player aggregative game 𝒢𝒢\mathscr{G}script_G, where players are indexed by ={1,,N}1𝑁\mathcal{I}=\{1,\dots,N\}caligraphic_I = { 1 , … , italic_N }. For player i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I, its strategy is given by the variable xinsubscript𝑥𝑖superscript𝑛x_{i}\in\mathbb{R}^{n}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and the others’ strategies are collected in 𝒙i(N1)×nsubscript𝒙𝑖superscript𝑁1𝑛\bm{x}_{-i}\in\mathbb{R}^{(N-1)\times n}bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_N - 1 ) × italic_n end_POSTSUPERSCRIPT. Let 𝒙Nn𝒙superscript𝑁𝑛\bm{x}\in\mathbb{R}^{Nn}bold_italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_N italic_n end_POSTSUPERSCRIPT stand for all players’ strategies. Player i𝑖iitalic_i has a payoff function fi:Nn:subscript𝑓𝑖superscript𝑁𝑛f_{i}:\mathbb{R}^{Nn}\rightarrow\mathbb{R}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT italic_N italic_n end_POSTSUPERSCRIPT → blackboard_R. The map of an aggregator σ:Nnn:𝜎superscript𝑁𝑛superscript𝑛\sigma:\mathbb{R}^{Nn}\rightarrow\mathbb{R}^{n}italic_σ : blackboard_R start_POSTSUPERSCRIPT italic_N italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is defined by

σ(𝒙)=i=1Nβixi,𝜎𝒙superscriptsubscript𝑖1𝑁subscript𝛽𝑖subscript𝑥𝑖\displaystyle\sigma(\bm{x})=\sum\limits_{i=1}^{N}\beta_{i}x_{i},italic_σ ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , (1)

where βisubscript𝛽𝑖\beta_{i}\in\mathbb{R}italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R corresponds to player i𝑖iitalic_i’s weight in the aggregator. Take 𝜷=(β1,,βN)N𝜷subscript𝛽1subscript𝛽𝑁superscript𝑁\bm{\beta}=(\beta_{1},\dots,\beta_{N})\in\mathbb{R}^{N}bold_italic_β = ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_β start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT. For i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I, let Ji:n×n:subscript𝐽𝑖superscript𝑛superscript𝑛J_{i}:\mathbb{R}^{n}\times\mathbb{R}^{n}\rightarrow\mathbb{R}italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT × blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R be a continuously differentiable function and suppose that Ji(xi,σ(𝒙))subscript𝐽𝑖subscript𝑥𝑖𝜎𝒙J_{i}(x_{i},\sigma(\bm{x}))italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ ( bold_italic_x ) ) is convex in xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. In an aggregative game 𝒢𝒢\mathscr{G}script_G, player i𝑖iitalic_i’s payoff function satisfies fi(xi,𝒙i)=Ji(xi,σ(𝒙))subscript𝑓𝑖subscript𝑥𝑖subscript𝒙𝑖subscript𝐽𝑖subscript𝑥𝑖𝜎𝒙f_{i}(x_{i},\bm{x}_{-i})=J_{i}(x_{i},\sigma(\bm{x}))italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) = italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ ( bold_italic_x ) ).

We introduce the parameter 𝜶=(α1,,αN)𝜶subscript𝛼1subscript𝛼𝑁\bm{\alpha}=(\alpha_{1},\dots,\alpha_{N})bold_italic_α = ( italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_α start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ) with αinsubscript𝛼𝑖superscript𝑛\alpha_{i}\in\mathbb{R}^{n}italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Given 𝒙isubscript𝒙𝑖\bm{x}_{-i}bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT, the constraint set for xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is defined by

Ωi,𝜶(𝒙i)={xi+n:αiTxibji,j=1NαjTxj},subscriptΩ𝑖𝜶subscript𝒙𝑖conditional-setsubscript𝑥𝑖subscriptsuperscript𝑛superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖𝑏superscriptsubscriptformulae-sequence𝑗𝑖𝑗1𝑁superscriptsubscript𝛼𝑗𝑇subscript𝑥𝑗\displaystyle\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})=\{x_{i}\in\mathbb{R}^{n}_{+}:% ~{}\alpha_{i}^{T}x_{i}\leq b-\!\sum_{j\neq i,j=1}^{N}\!\!\!\alpha_{j}^{T}x_{j}\},roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) = { italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT : italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b - ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i , italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } ,

where b𝑏bitalic_b is a scalar parameter. Take 𝐀=i=1NAiNn𝐀superscriptsubscriptproduct𝑖1𝑁subscriptA𝑖superscript𝑁𝑛\bm{\mathrm{A}}=\prod_{i=1}^{N}\mathrm{A}_{i}\subseteq\mathbb{R}^{Nn}bold_A = ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT italic_N italic_n end_POSTSUPERSCRIPT to represent the uncertainty in parameter 𝜶𝜶\bm{\alpha}bold_italic_α. Then, given others’ strategies 𝒙isubscript𝒙𝑖\bm{x}_{-i}bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT, player i𝑖iitalic_i solves the following problem:

minxisubscriptsubscript𝑥𝑖\displaystyle\min_{x_{i}}\quadroman_min start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT Ji(xi,σ(𝒙))subscript𝐽𝑖subscript𝑥𝑖𝜎𝒙\displaystyle J_{i}(x_{i},\sigma(\bm{x}))italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ ( bold_italic_x ) ) (2)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}\quadroman_s . roman_t . xiΩi,𝜶(𝒙i),𝜶𝐀.formulae-sequencesubscript𝑥𝑖subscriptΩ𝑖𝜶subscript𝒙𝑖𝜶𝐀\displaystyle x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i}),~{}{\bm{\alpha}\in% \bm{\mathrm{A}}.}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) , bold_italic_α ∈ bold_A .

The overall coupling constraint can be denoted by 

Ω𝜶={𝒙+Nn:i=1NαiTxib},subscriptΩ𝜶conditional-set𝒙subscriptsuperscript𝑁𝑛superscriptsubscript𝑖1𝑁superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖𝑏\Omega_{\bm{\alpha}}=\{\bm{x}\in\mathbb{R}^{Nn}_{+}:~{}\sum_{i=1}^{N}\alpha_{i% }^{T}x_{i}\leq b\},roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT = { bold_italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_N italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT : ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b } ,

which means that players’ strategies are subject to a coupling constraint as resource allocation [8, 10]. The uncertainty in (2) indicates that the linear inequality in the coupling constraint Ω𝜶subscriptΩ𝜶\Omega_{\bm{\alpha}}roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT should be satisfied for all αiAisubscript𝛼𝑖subscriptA𝑖\alpha_{i}\in{\mathrm{A}_{i}}italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I. Concretely, we investigate a typical uncertain feasibility

Ai={αin:Diαidi},i,formulae-sequencesubscriptA𝑖conditional-setsubscript𝛼𝑖superscript𝑛subscript𝐷𝑖subscript𝛼𝑖subscript𝑑𝑖𝑖\mathrm{A}_{i}=\{\alpha_{i}\in\mathbb{R}^{n}:D_{i}\alpha_{i}\leq d_{i}\},\quad i% \in\mathcal{I},roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } , italic_i ∈ caligraphic_I ,

where Dimi×nsubscript𝐷𝑖superscriptsubscript𝑚𝑖𝑛D_{i}\in\mathbb{R}^{m_{i}\times n}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT × italic_n end_POSTSUPERSCRIPT is a matrix equipped with normalized rows and dimisubscript𝑑𝑖superscriptsubscript𝑚𝑖d_{i}\in\mathbb{R}^{m_{i}}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is a vector. In fact, AisubscriptA𝑖\mathrm{A}_{i}roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a polyhedron enclosed by hyperplanes and the dimension misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT reflects the number of hyperplanes.

Assumption 1

The constraint set Ω𝛂subscriptnormal-Ω𝛂\Omega_{\bm{\alpha}}roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT has a nonempty interior point under all the uncertainty 𝛂𝐀𝛂𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A.

Notice that if Slater’s condition holds for all uncertain feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A, it is also true with a fixed parameter 𝜶𝜶\bm{\alpha}bold_italic_α.

2.2 Robustness of Nash Equilibrium

Given a fixed parameter 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A, minimizing the payoff subject to the coupling constraint xiΩi,𝜶(𝒙i)subscript𝑥𝑖subscriptΩ𝑖𝜶subscript𝒙𝑖x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) turns out to be a deterministic problem. It is already widely studied in the past decade [18, 8, 4]. We take 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT as the deterministic aggregative game under a fixed 𝜶𝜶\bm{\alpha}bold_italic_α. We first revisit the well-known definition of the generalized Nash equilibrium (GNE) in 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT [25].

Definition 1

A strategy profile 𝐱𝛂*superscriptsubscript𝐱𝛂\bm{x}_{\bm{\alpha}}^{*}bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is a GNE of the aggregative game 𝒢𝛂subscript𝒢𝛂\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT if, for all i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I, we have

fi(x𝜶,i*,𝒙𝜶,i*)fi(xi,𝒙𝜶,i*),xiΩi,𝜶(𝒙𝜶,i*).formulae-sequencesubscript𝑓𝑖subscriptsuperscript𝑥𝜶𝑖subscriptsuperscript𝒙𝜶𝑖subscript𝑓𝑖subscript𝑥𝑖subscriptsuperscript𝒙𝜶𝑖for-allsubscript𝑥𝑖subscriptΩ𝑖𝜶superscriptsubscript𝒙𝜶𝑖\displaystyle f_{i}(x^{*}_{\bm{\alpha},i},\bm{x}^{*}_{\bm{\alpha},-i})\leq f_{% i}(x_{i},\bm{x}^{*}_{\bm{\alpha},-i}),~{}\forall x_{i}\in\Omega_{i,\bm{\alpha}% }(\bm{x}_{\bm{\alpha},-i}^{*}).italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α , italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α , - italic_i end_POSTSUBSCRIPT ) ≤ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α , - italic_i end_POSTSUBSCRIPT ) , ∀ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT bold_italic_α , - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) .

Definition 1 indicates that 𝒙𝜶*superscriptsubscript𝒙𝜶\bm{x}_{\bm{\alpha}}^{*}bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is a GNE of 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT if no player can get a better payoff by modifying its strategy unilaterally. Then, the pseudo-gradient can be given by

𝑭(𝒙)𝑭𝒙\displaystyle\bm{F}(\bm{x})bold_italic_F ( bold_italic_x ) =(F1(𝒙)FN(𝒙))=(x1f1(x1,𝒙1)xNfN(xN,𝒙N))absentmatrixsubscript𝐹1𝒙subscript𝐹𝑁𝒙matrixsubscriptsubscript𝑥1subscript𝑓1subscript𝑥1subscript𝒙1subscriptsubscript𝑥𝑁subscript𝑓𝑁subscript𝑥𝑁subscript𝒙𝑁\displaystyle=\begin{pmatrix}F_{1}(\bm{x})\\ \vdots\\ F_{N}(\bm{x})\end{pmatrix}=\begin{pmatrix}\nabla_{x_{1}}f_{1}(x_{1},\bm{x}_{-1% })\\ \vdots\\ \nabla_{x_{N}}f_{N}(x_{N},\bm{x}_{-N})\end{pmatrix}= ( start_ARG start_ROW start_CELL italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_italic_x ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_F start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( bold_italic_x ) end_CELL end_ROW end_ARG ) = ( start_ARG start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_N end_POSTSUBSCRIPT ) end_CELL end_ROW end_ARG ) (9)
=(x1J1(,σ)+β1σJ1(x1,)xNJN(,σ)+βNσJN(xN,)).absentmatrixsubscriptsubscript𝑥1subscript𝐽1𝜎subscript𝛽1subscript𝜎subscript𝐽1subscript𝑥1subscriptsubscript𝑥𝑁subscript𝐽𝑁𝜎subscript𝛽𝑁subscript𝜎subscript𝐽𝑁subscript𝑥𝑁\displaystyle=\begin{pmatrix}\nabla_{x_{1}}J_{1}(\cdot,\sigma)+\beta_{1}\nabla% _{\sigma}J_{1}(x_{1},\cdot)\\ \vdots\\ \nabla_{x_{N}}J_{N}(\cdot,\sigma)+\beta_{N}\nabla_{\sigma}J_{N}(x_{N},\cdot)% \end{pmatrix}.= ( start_ARG start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( ⋅ , italic_σ ) + italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋅ ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL ∇ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( ⋅ , italic_σ ) + italic_β start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , ⋅ ) end_CELL end_ROW end_ARG ) . (13)

With the pseudo-gradient information, we can establish the connection between GNE and the first-order condition in 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT. As we know, there exist various ways to seek a GNE. One of the most accepted approaches is to seek a variational GNE (vGNE), which requires a unified multiplier when deriving the Nash Karush-Kuhn-Tucker (KKT) condition [18, 25, 8]. Technically, the definition of a vGNE can be found as follows.

Definition 2

A strategy profile 𝐱𝛂*subscriptsuperscript𝐱𝛂\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT is a vGNE of 𝒢𝛂subscript𝒢𝛂\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT if and only if there exists λ*superscript𝜆\lambda^{*}\in\mathbb{R}italic_λ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ blackboard_R such that, for all i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I,

𝟎nsubscript0𝑛absent\displaystyle\bm{0}_{n}\inbold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ xiJi(,σ(𝒙𝜶*))+βiσJi(x𝜶,i*,)+λ*αi+𝒩+n(x𝜶,i*),subscriptsubscript𝑥𝑖subscript𝐽𝑖𝜎subscriptsuperscript𝒙𝜶subscript𝛽𝑖subscript𝜎subscript𝐽𝑖subscriptsuperscript𝑥𝜶𝑖superscript𝜆subscript𝛼𝑖subscript𝒩superscriptsubscript𝑛subscriptsuperscript𝑥𝜶𝑖\displaystyle\nabla_{x_{i}}J_{i}(\cdot,\sigma(\bm{x}^{*}_{\bm{\alpha}}))\!+\!% \beta_{i}\nabla_{\sigma}J_{i}(x^{*}_{\bm{\alpha},i},\cdot)\!+\!\lambda^{*}% \alpha_{i}\!+\!\mathcal{N}_{\mathbb{R}_{+}^{n}}(x^{*}_{\bm{\alpha},i}),∇ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ⋅ , italic_σ ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) ) + italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α , italic_i end_POSTSUBSCRIPT , ⋅ ) + italic_λ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + caligraphic_N start_POSTSUBSCRIPT blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α , italic_i end_POSTSUBSCRIPT ) ,
00absent\displaystyle 0\geq0 ≥ i=1NαiTx𝜶,i*bλ*.perpendicular-tosuperscriptsubscript𝑖1𝑁superscriptsubscript𝛼𝑖𝑇subscriptsuperscript𝑥𝜶𝑖𝑏superscript𝜆\displaystyle\sum_{i=1}^{N}\alpha_{i}^{T}x^{*}_{\bm{\alpha},i}-b\perp\lambda^{% *}.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α , italic_i end_POSTSUBSCRIPT - italic_b ⟂ italic_λ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT .

It follows from [25, Theorem 4.8] that with the convexity of payoffs Jisubscript𝐽𝑖J_{i}italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Assumption 1, a vGNE 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT of 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT is equivalent to a solution to a variational inequality (VI) problem VI(Ω𝜶,𝑭)subscriptΩ𝜶𝑭(\Omega_{\bm{\alpha}},\bm{F})( roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT , bold_italic_F ), i.e., to find a vector 𝒙𝜶*Ω𝜶subscriptsuperscript𝒙𝜶subscriptΩ𝜶\bm{x}^{*}_{\bm{\alpha}}\in\Omega_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT such that

𝑭(𝒙𝜶*)T(𝒙𝒙𝜶*)0,𝒙Ω𝜶.formulae-sequence𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙subscriptsuperscript𝒙𝜶0for-all𝒙subscriptΩ𝜶\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}(\bm{x}-\bm{x}^{*}_{\bm{\alpha}})\geq 0,~{% }\forall\bm{x}\in\Omega_{\bm{\alpha}}.bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_italic_x - bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) ≥ 0 , ∀ bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT . (14)

In this view, many works are devoted to the algorithm design for seeking vGNE with coupling constraints [4, 8, 18, 19]. Here we do not restrict the monotonicity of 𝑭𝑭\bm{F}bold_italic_F since we mainly focus on the equivalence between GNE and VIVI\operatorname{VI}roman_VI solutions. Details for existence and uniqueness can be found in [25, 26].

Now recall the uncertainty in game 𝒢𝒢\mathscr{G}script_G. Clearly, different perturbed parameters 𝜶𝜶\bm{\alpha}bold_italic_α from uncertain feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A yield different vGNE 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT of a deterministic game 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT. Therefore, game 𝒢𝒢\mathscr{G}script_G in (2) should be solved under all uncertain feasibility 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A, and we need a solution in the worst case. We introduce the following concept for the robust GNE.

Definition 3

A strategy profile 𝐱*superscript𝐱\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is a robust GNE (rGNE) of the aggregative game 𝒢𝒢\mathscr{G}script_G if, for all i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I, we have

fi(xi*,𝒙i*)fi(xi,𝒙i*),xiΩi,𝜶(𝒙i*),𝜶𝐀.formulae-sequencesubscript𝑓𝑖subscriptsuperscript𝑥𝑖subscriptsuperscript𝒙𝑖subscript𝑓𝑖subscript𝑥𝑖subscriptsuperscript𝒙𝑖formulae-sequencefor-allsubscript𝑥𝑖subscriptΩ𝑖𝜶superscriptsubscript𝒙𝑖for-all𝜶𝐀f_{i}(x^{*}_{i},\bm{x}^{*}_{-i})\leq f_{i}(x_{i},\bm{x}^{*}_{-i}),~{}\forall x% _{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i}^{*}),~{}\forall\bm{\alpha}\in\bm{% \mathrm{A}}.italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) ≤ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) , ∀ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) , ∀ bold_italic_α ∈ bold_A .

It is important to find an rGNE since it gives an acceptable solution for all players against the worst case [9]. There have been several works devoted to robust NE seeking in multi-player games. One viewpoint consists in enforcing satisfaction of all uncertainty [18, 19, 20]. It originates from deterministic robust optimization [16] to reveal the worst-case solution subject to all uncertainty. Another way is to satisfy the uncertainty with a high probability, stemming from scenario programming [24, 17]. Randomly extract finite samples from uncertain feasibility, and reconstruct the problem under their intersection [21, 22, 23]. This viewpoint usually concerns how to find the supported samples.

2.3 Problem Statement

The main goal of this paper is to compute an rGNE of the aggregative game 𝒢𝒢\mathscr{G}script_G in (2) against uncertainty. Since robustness is required, it is significant to logically reconstruct the problem with the internal knowledge of the game model. However, conditions may not be always perfect in reality and we consider the following practical scenario.

Given a fixed parameter 𝜶𝜶\bm{\alpha}bold_italic_α, suppose that a well-developed algorithm like [7, 8, 9, 10, 11] has been already deployed to compute a vGNE of 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT. The system automatically runs after deployment and the algorithm returns a vGNE with the given parameter. Such an input-output process, from the given parameter 𝜶𝜶\bm{\alpha}bold_italic_α to the corresponding vGNE 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT, yields that an outsider does not need the internal knowledge of the system. As a result, some structures of the game model may not be accessible to an outsider. Here we focus on that players’ weight 𝜷𝜷\bm{\beta}bold_italic_β in the aggregator σ(𝒙)=i=1Nβixi𝜎𝒙superscriptsubscript𝑖1𝑁subscript𝛽𝑖subscript𝑥𝑖\sigma(\bm{x})=\sum_{i=1}^{N}\beta_{i}x_{i}italic_σ ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT becomes unknown to the public, which makes the aggregator σ𝜎\sigmaitalic_σ a black box.

When the parameter 𝜶𝜶\bm{\alpha}bold_italic_α suffers uncertainty in 𝐀𝐀\bm{\mathrm{A}}bold_A, the vGNE-seeking algorithm still returns a vGNE according to the given condition but will lose robustness. What an outsider indeed needs is an rGNE, serving as a worst-case solution under all feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A. As mentioned, the knowledge of the game model is indispensable to achieve robustness, since one needs the structure knowledge to reconstruct the problem against uncertainty. However, the black-box aggregator prevents us from directly using the existing methods [18, 19, 20, 21, 22, 23] to finish the job.

Hence, there should be a learning approach to disassemble the black-box aggregator σ𝜎\sigmaitalic_σ and recover players’ weight 𝜷𝜷\bm{\beta}bold_italic_β before investigating the robustness. Recall what we have: perturbed parameters 𝜶𝜶\bm{\alpha}bold_italic_α from uncertainty 𝐀𝐀\bm{\mathrm{A}}bold_A as inputs and corresponding vGNE 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT computed by the deployed solver as outputs. They constitute a data point (𝜶,𝒙𝜶*)𝜶subscriptsuperscript𝒙𝜶(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ). On this basis, a data-driven approach is required to estimate the black-box part before achieving robustness.

The problem to solve in this paper can be stated as follows.

Problem 1

Given data points (𝛂,𝐱𝛂*)𝛂subscriptsuperscript𝐱𝛂(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) composed of the perturbed parameters and the computed vGNE, develop a method to learn the black-box aggregator σ𝜎\sigmaitalic_σ and obtain an rGNE 𝐱*superscript𝐱\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT of game 𝒢𝒢\mathscr{G}script_G in (2) under uncertainty.

In terms of Problem 1, we will address the following three concerns in the sequel: i) to propose a novel learning method based on an inverse VI-based relationship; ii) to solve the robustness with respect to the worst-case situation; iii) to measure the generalization of our learning method.

3 Inverse Learning

In this section, we provide an inverse VI-based learning approach to reveal players’ weight 𝜷𝜷\bm{\beta}bold_italic_β in the black-box aggregator σ(𝒙)𝜎𝒙\sigma(\bm{x})italic_σ ( bold_italic_x ). Recall the data (𝜶,𝒙𝜶*)𝜶superscriptsubscript𝒙𝜶(\bm{\alpha},\bm{x}_{\bm{\alpha}}^{*})( bold_italic_α , bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) composed of a set of parameters 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A and corresponding vGNE 𝒙𝜶*superscriptsubscript𝒙𝜶\bm{x}_{\bm{\alpha}}^{*}bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. In fact, the learning task is to recover the mapping from 𝜶𝜶\bm{\alpha}bold_italic_α to 𝒙𝜶*superscriptsubscript𝒙𝜶\bm{x}_{\bm{\alpha}}^{*}bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT.

3.1 Inverse VI-based Relationship

Consider the VI-based relationship in (14). Given any fixed parameter 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A, a vGNE 𝒙𝜶*superscriptsubscript𝒙𝜶\bm{x}_{\bm{\alpha}}^{*}bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT of the deterministic game 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT serves as a solution to VI(Ω𝜶,𝑭)subscriptΩ𝜶𝑭(\Omega_{\bm{\alpha}},\bm{F})( roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT , bold_italic_F ), that is, 𝑭(𝒙𝜶*)T(𝒙𝒙𝜶*)0,𝒙Ω𝜶.formulae-sequence𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙subscriptsuperscript𝒙𝜶0for-all𝒙subscriptΩ𝜶\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}(\bm{x}-\bm{x}^{*}_{\bm{\alpha}})\geq 0,~{% }\forall\bm{x}\in\Omega_{\bm{\alpha}}.bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_italic_x - bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) ≥ 0 , ∀ bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT . The following theorem indicates an inverse relation which will help construct the learning model.

Theorem 1

Consider the deterministic game 𝒢𝛂subscript𝒢𝛂\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT under a fixed parameter 𝛂𝛂\bm{\alpha}bold_italic_α. Under Assumption 1, 𝐱𝛂*subscriptsuperscript𝐱𝛂\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT is a vGNE of 𝒢𝛂subscript𝒢𝛂\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT if and only if there exists a scalar γ0𝛾0\gamma\leq 0italic_γ ≤ 0 such that

𝑭(𝒙𝜶*)T𝒙𝜶*γb𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇subscriptsuperscript𝒙𝜶𝛾𝑏\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}-\gamma bbold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT - italic_γ italic_b 0,absent0\displaystyle\leq 0,≤ 0 , (15)
Fi(𝒙𝜶*)γαisubscript𝐹𝑖subscriptsuperscript𝒙𝜶𝛾subscript𝛼𝑖\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}})-\gamma\alpha_{i}italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT 𝟎n,i.formulae-sequenceabsentsubscript0𝑛for-all𝑖\displaystyle\geq\bm{0}_{n},~{}\forall i\in\mathcal{I}.≥ bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I .

Proof. It follows from the expression in (14) that

𝑭(𝒙𝜶*)T𝒙𝜶*𝑭(𝒙𝜶*)T𝒙,𝒙Ω𝜶,formulae-sequence𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇subscriptsuperscript𝒙𝜶𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙for-all𝒙subscriptΩ𝜶\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq% \bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x},~{}\forall\bm{x}\in\Omega_{\bm{% \alpha}},bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ≤ bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x , ∀ bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT , (16)

where we notice that 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT is a constant. (16) should be satisfied for all 𝒙Ω𝜶𝒙subscriptΩ𝜶\bm{x}\in\Omega_{\bm{\alpha}}bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT, which means the inequality holds when the right-hand side takes on the minimum.

𝑭(𝒙𝜶*)T𝒙𝜶*min𝒙Ω𝜶𝑭(𝒙𝜶*)T𝒙.𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇subscriptsuperscript𝒙𝜶subscript𝒙subscriptΩ𝜶𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq% \min_{\bm{x}\in\Omega_{\bm{\alpha}}}\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}.bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ≤ roman_min start_POSTSUBSCRIPT bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT end_POSTSUBSCRIPT bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x . (17)

Hence, a new optimization problem arises:

min𝒙Ω𝜶𝑭(𝒙𝜶*)T𝒙.subscript𝒙subscriptΩ𝜶𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙\min_{\bm{x}\in\Omega_{\bm{\alpha}}}\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}.roman_min start_POSTSUBSCRIPT bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT end_POSTSUBSCRIPT bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x . (18)

Then, we investigate (18) from a duality perspective. Recalling the coupling constraint set Ω𝜶={𝒙+Nn:i=1NαiTxib}subscriptΩ𝜶conditional-set𝒙subscriptsuperscript𝑁𝑛superscriptsubscript𝑖1𝑁superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖𝑏\Omega_{\bm{\alpha}}=\{\bm{x}\in\mathbb{R}^{Nn}_{+}:~{}\sum_{i=1}^{N}\alpha_{i% }^{T}x_{i}\leq b\}roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT = { bold_italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_N italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT : ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b }, the Lagrangian function can be designed as follows:

1(𝒙,γ,𝝁)subscript1𝒙𝛾𝝁\displaystyle\mathcal{L}_{1}(\bm{x},\gamma,\bm{\mu})caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_italic_x , italic_γ , bold_italic_μ ) =𝑭(𝒙𝜶*)T𝒙γ(i=1NαiTxib)+i=1NμiTxiabsent𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙𝛾superscriptsubscript𝑖1𝑁superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖𝑏superscriptsubscript𝑖1𝑁superscriptsubscript𝜇𝑖𝑇subscript𝑥𝑖\displaystyle=\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}-\gamma(\sum_{i=1}^{N}% \alpha_{i}^{T}x_{i}-b)+\sum_{i=1}^{N}\mu_{i}^{T}x_{i}= bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x - italic_γ ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_b ) + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
=i=1N(Fi(𝒙𝜶*)TγαiT+μiT)xi+γb,absentsuperscriptsubscript𝑖1𝑁subscript𝐹𝑖superscriptsubscriptsuperscript𝒙𝜶𝑇𝛾superscriptsubscript𝛼𝑖𝑇superscriptsubscript𝜇𝑖𝑇subscript𝑥𝑖𝛾𝑏\displaystyle=\sum_{i=1}^{N}(F_{i}(\bm{x}^{*}_{\bm{\alpha}})^{T}-\gamma\alpha_% {i}^{T}+\mu_{i}^{T})x_{i}+\gamma b,= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_γ italic_b ,

where the multipliers 0γ0𝛾0\geq\gamma\in\mathbb{R}0 ≥ italic_γ ∈ blackboard_R and 𝟎nN𝝁=col{μ1,,μN}nNsubscript0𝑛𝑁𝝁𝑐𝑜𝑙subscript𝜇1subscript𝜇𝑁superscript𝑛𝑁\bm{0}_{nN}\geq\bm{\mu}=col\{\mu_{1},\dots,\mu_{N}\}\in\mathbb{R}^{nN}bold_0 start_POSTSUBSCRIPT italic_n italic_N end_POSTSUBSCRIPT ≥ bold_italic_μ = italic_c italic_o italic_l { italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_μ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } ∈ blackboard_R start_POSTSUPERSCRIPT italic_n italic_N end_POSTSUPERSCRIPT are employed for the inequality constraints i=1NαiTxibsuperscriptsubscript𝑖1𝑁superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖𝑏\sum_{i=1}^{N}\alpha_{i}^{T}x_{i}\leq b∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b and the non-negative orthant 𝒙+Nn𝒙subscriptsuperscript𝑁𝑛\bm{x}\in\mathbb{R}^{Nn}_{+}bold_italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_N italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT, respectively.

Under Assumption 1, given any fixed 𝜶𝜶\bm{\alpha}bold_italic_α, the dual gap of 1subscript1\mathcal{L}_{1}caligraphic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT vanishes. Followed by the duality relation, the minimum of 𝑭(𝒙𝜶*)T𝒙𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x subject to 𝒙Ω𝜶𝒙subscriptΩ𝜶\bm{x}\in\Omega_{\bm{\alpha}}bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT equals to

maxγ,𝝁0γbs.t.Fi(𝒙𝜶*)γαi+μi=𝟎n,i.formulae-sequencesubscript𝛾𝝁0𝛾𝑏𝑠𝑡formulae-sequencesubscript𝐹𝑖subscriptsuperscript𝒙𝜶𝛾subscript𝛼𝑖subscript𝜇𝑖subscript0𝑛for-all𝑖\displaystyle\max_{\gamma,\bm{\mu}\leq 0}~{}\gamma b\quad s.t.~{}F_{i}(\bm{x}^% {*}_{\bm{\alpha}})-\gamma\alpha_{i}+\mu_{i}=\bm{0}_{n},~{}\forall i\in\mathcal% {I}.roman_max start_POSTSUBSCRIPT italic_γ , bold_italic_μ ≤ 0 end_POSTSUBSCRIPT italic_γ italic_b italic_s . italic_t . italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I .

In fact, we can remove the multiplier 𝝁𝝁\bm{\mu}bold_italic_μ and thus simplify the expression of the above optimization as

maxγ0γbs.t.Fi(𝒙𝜶*)γαi𝟎n,i.formulae-sequencesubscript𝛾0𝛾𝑏𝑠𝑡formulae-sequencesubscript𝐹𝑖subscriptsuperscript𝒙𝜶𝛾subscript𝛼𝑖subscript0𝑛for-all𝑖\displaystyle\max_{\gamma\leq 0}~{}\gamma b\quad s.t.~{}F_{i}(\bm{x}^{*}_{\bm{% \alpha}})-\gamma\alpha_{i}\geq\bm{0}_{n},~{}\forall i\in\mathcal{I}.roman_max start_POSTSUBSCRIPT italic_γ ≤ 0 end_POSTSUBSCRIPT italic_γ italic_b italic_s . italic_t . italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I . (19)

So far, we derived the above dual problem (19) corresponding to the optimization min𝒙Ω𝜶𝑭(𝒙𝜶*)T𝒙subscript𝒙subscriptΩ𝜶𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙\min_{\bm{x}\in\Omega_{\bm{\alpha}}}\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}roman_min start_POSTSUBSCRIPT bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT end_POSTSUBSCRIPT bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x, and there is no duality gap due to Assumption 1. Hence, the inequality in (16), which has been equivalently transferred to the inequality in (17), can be further rewritten as

𝑭(𝒙𝜶*)T𝒙𝜶*maxγΓ𝜶γb,𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇subscriptsuperscript𝒙𝜶subscript𝛾subscriptΓ𝜶𝛾𝑏\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq% \max_{\gamma\in\Gamma_{\bm{\alpha}}}\gamma b,bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ≤ roman_max start_POSTSUBSCRIPT italic_γ ∈ roman_Γ start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_γ italic_b , (20)

where Γ𝜶={γ0:Fi(𝒙𝜶*)γαi𝟎n,i}.subscriptΓ𝜶conditional-set𝛾0formulae-sequencesubscript𝐹𝑖subscriptsuperscript𝒙𝜶𝛾subscript𝛼𝑖subscript0𝑛for-all𝑖\Gamma_{\bm{\alpha}}=\{\gamma\leq 0:~{}F_{i}(\bm{x}^{*}_{\bm{\alpha}})-\gamma% \alpha_{i}\geq\bm{0}_{n},~{}\forall i\in\mathcal{I}\}.roman_Γ start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT = { italic_γ ≤ 0 : italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I } . Again, if the inequality in (20) needs to be satisfied with the right-hand side taking on the maximum, then we should merely ensure that there exists at least one feasible γΓ𝜶𝛾subscriptΓ𝜶\gamma\in\Gamma_{\bm{\alpha}}italic_γ ∈ roman_Γ start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT. Hence, the inequality in (20) finally becomes

𝑭(𝒙𝜶*)T𝒙𝜶*γb.𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇subscriptsuperscript𝒙𝜶𝛾𝑏\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq\gamma b.bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ≤ italic_γ italic_b . (21)

We combine the requirement (21) and Γ𝜶subscriptΓ𝜶\Gamma_{\bm{\alpha}}roman_Γ start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT, and obtain that if 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT is a vGNE of game 𝒢𝜶subscript𝒢𝜶\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT satisfying the relation in (16), then there exists γ0𝛾0\gamma\leq 0italic_γ ≤ 0 leading to the consequence.

The reverse result can be proven similarly using weak duality properties since all the aforementioned conversions are equivalently conducted. \square

Most vGNE-seeking solvers compute a vGNE asymptotically since there is rarely a closed-form solution in complicated multi-player games. A numerical vGNE may not satisfy an exact solution, considering the computational errors and other biases. Thus, we relax (14) with a slack coefficient δ0𝛿0\delta\geq 0italic_δ ≥ 0 such that 𝑭(𝒙𝜶*)T(𝒙𝒙𝜶*)+δ0,𝒙Ω𝜶.formulae-sequence𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇𝒙subscriptsuperscript𝒙𝜶𝛿0for-all𝒙subscriptΩ𝜶\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}(\bm{x}-\bm{x}^{*}_{\bm{\alpha}})+\delta% \geq 0,\forall\bm{x}\in\Omega_{\bm{\alpha}}.bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_italic_x - bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) + italic_δ ≥ 0 , ∀ bold_italic_x ∈ roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT . Accordingly, the relation (15) in Theorem 1 can be correspondingly revised. There exists γ0𝛾0\gamma\leq 0italic_γ ≤ 0 such that

𝑭(𝒙𝜶*)T𝒙𝜶*γb𝑭superscriptsubscriptsuperscript𝒙𝜶𝑇subscriptsuperscript𝒙𝜶𝛾𝑏\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}-\gamma bbold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT - italic_γ italic_b δ,absent𝛿\displaystyle\leq\delta,≤ italic_δ , (22)
Fi(𝒙𝜶*)γαisubscript𝐹𝑖subscriptsuperscript𝒙𝜶𝛾subscript𝛼𝑖\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}})-\gamma\alpha_{i}italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT 0,i.formulae-sequenceabsent0for-all𝑖\displaystyle\geq 0,~{}\forall i\in\mathcal{I}.≥ 0 , ∀ italic_i ∈ caligraphic_I .

Similar to the proof in Theorem 1, the modified relation in (22) can also be rigorously guaranteed. Here, if the mapping from 𝜶𝜶\bm{\alpha}bold_italic_α to 𝒙𝜶*superscriptsubscript𝒙𝜶\bm{x}_{\bm{\alpha}}^{*}bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is recovered well enough, then given 𝜶𝜶\bm{\alpha}bold_italic_α, the prediction should also be exactly 𝒙𝜶*superscriptsubscript𝒙𝜶\bm{x}_{\bm{\alpha}}^{*}bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. Hence, δ𝛿\deltaitalic_δ also stands for a role of loss, which means the more accurate the mapping, the smaller the loss between the prediction and the practical equilibrium point.

3.2 Data-driven Optimization

We show how to use the relation in (22) to design a learning approach for the unknown weight 𝜷𝜷\bm{\beta}bold_italic_β in the black-box aggregator. We recall what knowledge we already have: 1) a data point composed of perturbed 𝜶𝜶\bm{\alpha}bold_italic_α and the computed vGNE 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT; 2) a relaxed relation with loss in (22). By the expressions of 𝜷𝜷\bm{\beta}bold_italic_β in the pseudo-gradient 𝑭𝑭\bm{F}bold_italic_F in (9), clearly, 𝑭𝑭\bm{F}bold_italic_F belongs to a parametric family indexed by 𝜷𝜷\bm{\beta}bold_italic_β. Thus, we can rewrite 𝑭(𝒙)𝑭𝒙\bm{F}(\bm{x})bold_italic_F ( bold_italic_x ) as 𝑭(𝒙;𝜷)𝑭𝒙𝜷\bm{F}(\bm{x};\bm{\beta})bold_italic_F ( bold_italic_x ; bold_italic_β ) to describe this dependence, and naturally suppose that 𝑭(𝒙;𝜷)𝑭𝒙𝜷\bm{F}(\bm{x};\bm{\beta})bold_italic_F ( bold_italic_x ; bold_italic_β ) is continuous in 𝜷𝜷\bm{\beta}bold_italic_β.

On this basis, we design an inverse VI-based optimization. Here, the data point is (𝜶,𝒙𝜶*)𝜶subscriptsuperscript𝒙𝜶(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ), variables are the unknown weight 𝜷𝜷\bm{\beta}bold_italic_β, the auxiliary variable γ𝛾\gammaitalic_γ, and the slack variable δ𝛿\deltaitalic_δ as the loss. Hence,

min𝜷,γ,δsubscript𝜷𝛾𝛿\displaystyle\min_{\bm{\beta},\gamma,\delta}\quadroman_min start_POSTSUBSCRIPT bold_italic_β , italic_γ , italic_δ end_POSTSUBSCRIPT |δ|𝛿\displaystyle|\delta|| italic_δ | (23)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}~{}roman_s . roman_t . 𝑭(𝒙𝜶*;𝜷)T𝒙𝜶*γbδ,γ0,formulae-sequence𝑭superscriptsubscriptsuperscript𝒙𝜶𝜷𝑇subscriptsuperscript𝒙𝜶𝛾𝑏𝛿𝛾0\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}};\bm{\beta})^{T}\bm{x}^{*}_{\bm{% \alpha}}-\gamma b\leq\delta,~{}\gamma\leq 0,bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ; bold_italic_β ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT - italic_γ italic_b ≤ italic_δ , italic_γ ≤ 0 ,
Fi(𝒙𝜶*;𝜷)γαi0,i.formulae-sequencesubscript𝐹𝑖subscriptsuperscript𝒙𝜶𝜷𝛾subscript𝛼𝑖0for-all𝑖\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}};\bm{\beta})-\gamma\alpha_{i}\geq 0% ,~{}\forall i\in\mathcal{I}.italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ; bold_italic_β ) - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 , ∀ italic_i ∈ caligraphic_I .

We turn back to robust considerations for the uncertain feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A. Note that (23) is derived under a fixed 𝜶𝜶\bm{\alpha}bold_italic_α, but the problem cannot be well-solved with only one data point (𝜶,𝒙𝜶*)𝜶superscriptsubscript𝒙𝜶(\bm{\alpha},\bm{x}_{\bm{\alpha}}^{*})( bold_italic_α , bold_italic_x start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). Fortunately, uncertainty occurs in the parameter 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A and provides enough data. Once the system is perturbed, there emerges a new-extracted parameter 𝜶𝜶\bm{\alpha}bold_italic_α. Then, a well-deployed vGNE-seeking solver produces a corresponding numerical solution 𝒙𝜶*subscriptsuperscript𝒙𝜶\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT, and (𝜶,𝒙𝜶*)𝜶subscriptsuperscript𝒙𝜶(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) will serve as a new data point satisfying the relation (22).

Therefore, we use index k𝑘kitalic_k to represent the k𝑘kitalic_kth uncertain condition and regard (𝜶[k],𝒙𝜶[k]*)𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ) as the k𝑘kitalic_kth data point. Also, take γ[k]𝛾delimited-[]𝑘\gamma[k]italic_γ [ italic_k ] and δ[k]𝛿delimited-[]𝑘\delta[k]italic_δ [ italic_k ] as the k𝑘kitalic_kth variables to be optimized together according to (23). We take 𝜹=col{δ[1],δ[M]}𝜹𝑐𝑜𝑙𝛿delimited-[]1𝛿delimited-[]𝑀\bm{\delta}=col\{\delta[1],\cdots\,\delta[M]\}bold_italic_δ = italic_c italic_o italic_l { italic_δ [ 1 ] , ⋯ italic_δ [ italic_M ] }, 𝜸=col{γ[1],,γ[M]}𝜸𝑐𝑜𝑙𝛾delimited-[]1𝛾delimited-[]𝑀\bm{\gamma}=col\{\gamma[1],\cdots,\gamma[M]\}bold_italic_γ = italic_c italic_o italic_l { italic_γ [ 1 ] , ⋯ , italic_γ [ italic_M ] }, and 𝜷={β1,,βN}𝜷subscript𝛽1subscript𝛽𝑁\bm{\beta}=\{\beta_{1},\dots,\beta_{N}\}bold_italic_β = { italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_β start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } as all variables. With well-defined data and variables, we can finally design a data-based learning approach:

min𝜷,𝜸,𝜹subscript𝜷𝜸𝜹\displaystyle\min_{\bm{\beta},\bm{\gamma},\bm{\delta}}~{}roman_min start_POSTSUBSCRIPT bold_italic_β , bold_italic_γ , bold_italic_δ end_POSTSUBSCRIPT 𝜹norm𝜹\displaystyle\|\bm{\delta}\|∥ bold_italic_δ ∥ (24)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}~{}roman_s . roman_t . 𝑭(𝒙𝜶[k]*;𝜷)T𝒙𝜶[k]*γ[k]bδ[k],𝑭superscriptsubscriptsuperscript𝒙𝜶delimited-[]𝑘𝜷𝑇subscriptsuperscript𝒙𝜶delimited-[]𝑘𝛾delimited-[]𝑘𝑏𝛿delimited-[]𝑘\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}[k]};\bm{\beta})^{T}\bm{x}^{*}_{\bm% {\alpha}[k]}-\gamma[k]b\leq\delta[k],bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ; bold_italic_β ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT - italic_γ [ italic_k ] italic_b ≤ italic_δ [ italic_k ] ,
Fi(𝒙𝜶[k]*;𝜷)γ[k]αi[k]0,i,formulae-sequencesubscript𝐹𝑖subscriptsuperscript𝒙𝜶delimited-[]𝑘𝜷𝛾delimited-[]𝑘subscript𝛼𝑖delimited-[]𝑘0for-all𝑖\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}[k]};\bm{\beta})-\gamma[k]\alpha_{i% }[k]\geq 0,\quad\forall i\in\mathcal{I},italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ; bold_italic_β ) - italic_γ [ italic_k ] italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT [ italic_k ] ≥ 0 , ∀ italic_i ∈ caligraphic_I ,
γ[k]0,k=1,,M.formulae-sequence𝛾delimited-[]𝑘0𝑘1𝑀\displaystyle\gamma[k]\leq 0,~{}k=1,\dots,M.italic_γ [ italic_k ] ≤ 0 , italic_k = 1 , … , italic_M .
Remark 1

First, we do not request the uniqueness of vGNE 𝐱𝛂*subscriptsuperscript𝐱𝛂\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT in 𝒢𝛂subscript𝒢𝛂\mathscr{G}_{\bm{\alpha}}script_G start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT. The learning still works as long as 𝐱𝛂*subscriptsuperscript𝐱𝛂\bm{x}^{*}_{\bm{\alpha}}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT satisfies the inequality (5), even if the payoffs might yield multiple equilibria [27]. Second, the optimal solution to (24) should exist, but is not necessarily unique. The tie can be broken by selecting the one with the minimal l2subscript𝑙2l_{2}italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT-norm among optimal solutions. Third, the norm in the objective of (24) is not restricted and can be determined by concrete conditions.

Remark 2

If the necessary convexity can be maintained, the learning approach is capable of being extended for nonlinear aggregators in 𝐱𝐱\bm{x}bold_italic_x, that is, σ(𝐱)=i=1Nβigi(xi)𝜎𝐱superscriptsubscript𝑖1𝑁subscript𝛽𝑖subscript𝑔𝑖subscript𝑥𝑖\sigma(\bm{x})=\sum_{i=1}^{N}\beta_{i}g_{i}(x_{i})italic_σ ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). Such a form of an aggregator actually still maintains explicit parametric properties in variable 𝛃𝛃\bm{\beta}bold_italic_β. As for more general cases, for example σ(𝐱)=i=1Ngi(xi)𝜎𝐱superscriptsubscript𝑖1𝑁subscript𝑔𝑖subscript𝑥𝑖\sigma(\bm{x})=\sum_{i=1}^{N}g_{i}(x_{i})italic_σ ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), the learning approach (24) based on parametric estimation may fail. Some non-parametric learning approaches like kernel methods or neural networks would help in learning gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

We provide a typical case study for interpretation.

Example 1

Consider an aggregative game with N𝑁Nitalic_N electricity users in the demand of energy consumption problem [4, 8]. User i𝑖iitalic_i adopts xiΩi,𝛂(𝐱i)subscript𝑥𝑖subscriptnormal-Ω𝑖𝛂subscript𝐱𝑖x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) as the energy consumption and aims to minimize its electricity cost Ji(xi,σ(𝐱))=li(ximi)2+P(σ(𝐱))xisubscript𝐽𝑖subscript𝑥𝑖𝜎𝐱subscript𝑙𝑖superscriptsubscript𝑥𝑖subscript𝑚𝑖2𝑃𝜎𝐱subscript𝑥𝑖J_{i}(x_{i},\sigma(\bm{x}))=l_{i}(x_{i}-m_{i})^{2}+P(\sigma(\bm{x}))x_{i}italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ ( bold_italic_x ) ) = italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_P ( italic_σ ( bold_italic_x ) ) italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where lisubscript𝑙𝑖l_{i}italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are constants of energy consumption, and P=qNσ(𝐱)+p0𝑃𝑞𝑁𝜎𝐱subscript𝑝0P=qN\sigma(\bm{x})+p_{0}italic_P = italic_q italic_N italic_σ ( bold_italic_x ) + italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with σ(𝐱)=i=1Nβixi𝜎𝐱superscriptsubscript𝑖1𝑁subscript𝛽𝑖subscript𝑥𝑖\sigma(\bm{x})=\sum_{i=1}^{N}\beta_{i}x_{i}italic_σ ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Then learning in (24) can be expressed as follows:

min𝜷,𝜸,𝜹subscript𝜷𝜸𝜹\displaystyle\min_{\bm{\beta},\bm{\gamma},\bm{\delta}}~{}roman_min start_POSTSUBSCRIPT bold_italic_β , bold_italic_γ , bold_italic_δ end_POSTSUBSCRIPT 𝜹norm𝜹\displaystyle\|\bm{\delta}\|∥ bold_italic_δ ∥
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}~{}roman_s . roman_t . i(2li(xi,𝜶[k]*mi)+2qNβixi,𝜶[k]*\displaystyle\sum_{i\in\mathcal{I}}\big{(}2l_{i}(x^{*}_{i,\bm{\alpha}[k]}\!-\!% m_{i})\!+\!2qN\beta_{i}x^{*}_{i,\bm{\alpha}[k]}\!∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_I end_POSTSUBSCRIPT ( 2 italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , bold_italic_α [ italic_k ] end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + 2 italic_q italic_N italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , bold_italic_α [ italic_k ] end_POSTSUBSCRIPT
+qNj,jiβjxj,𝜶[k]*+p0)x*i,𝜶[k]γ[k]bδ[k],\displaystyle\quad+\!qN\!\!\!\!\!\sum_{j\in\mathcal{I},j\neq i}\!\!\!\beta_{j}% x^{*}_{j,\bm{\alpha}[k]}\!+\!p_{0}\big{)}x^{*}_{i,\bm{\alpha}[k]}\!-\!\gamma[k% ]b\!\leq\!\!\delta[k],+ italic_q italic_N ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_I , italic_j ≠ italic_i end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j , bold_italic_α [ italic_k ] end_POSTSUBSCRIPT + italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , bold_italic_α [ italic_k ] end_POSTSUBSCRIPT - italic_γ [ italic_k ] italic_b ≤ italic_δ [ italic_k ] ,
2li(xi,𝜶[k]*mi)+2qNβixi,𝜶[k]*2subscript𝑙𝑖subscriptsuperscript𝑥𝑖𝜶delimited-[]𝑘subscript𝑚𝑖2𝑞𝑁subscript𝛽𝑖subscriptsuperscript𝑥𝑖𝜶delimited-[]𝑘\displaystyle 2l_{i}(x^{*}_{i,\bm{\alpha}[k]}\!-\!m_{i})\!+\!2qN\beta_{i}x^{*}% _{i,\bm{\alpha}[k]}2 italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , bold_italic_α [ italic_k ] end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + 2 italic_q italic_N italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , bold_italic_α [ italic_k ] end_POSTSUBSCRIPT
+qNj,jiβjxj,𝜶[k]*+p0γ[k]αi[k]0,i,formulae-sequence𝑞𝑁subscriptformulae-sequence𝑗𝑗𝑖subscript𝛽𝑗subscriptsuperscript𝑥𝑗𝜶delimited-[]𝑘subscript𝑝0𝛾delimited-[]𝑘subscript𝛼𝑖delimited-[]𝑘0for-all𝑖\displaystyle\quad+\!qN\!\!\!\!\!\sum_{j\in\mathcal{I},j\neq i}\!\!\!\!\beta_{% j}x^{*}_{j,\bm{\alpha}[k]}\!+\!p_{0}\!-\!\gamma[k]\alpha_{i}[k]\!\geq\!0,~{}% \forall i\!\in\!\mathcal{I},+ italic_q italic_N ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_I , italic_j ≠ italic_i end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j , bold_italic_α [ italic_k ] end_POSTSUBSCRIPT + italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_γ [ italic_k ] italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT [ italic_k ] ≥ 0 , ∀ italic_i ∈ caligraphic_I ,
γ[k]0,k=1,,M.formulae-sequence𝛾delimited-[]𝑘0𝑘1𝑀\displaystyle\gamma[k]\leq 0,~{}k=1,\dots,M.italic_γ [ italic_k ] ≤ 0 , italic_k = 1 , … , italic_M .

It is a solvable optimization problem with linear constraints.

4 GNE Robustness

In this section, we address the robustness of GNE with the recovered knowledge by considering the worst-case solution. We introduce some new notations after learning. Take σ^(𝒙)=i=1Nβ^ixin^𝜎𝒙superscriptsubscript𝑖1𝑁subscript^𝛽𝑖subscript𝑥𝑖superscript𝑛\hat{\sigma}(\bm{x})=\sum_{i=1}^{N}\hat{\beta}_{i}x_{i}\in\mathbb{R}^{n}over^ start_ARG italic_σ end_ARG ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT for the aggregator, where 𝜷^={β^1,,β^N}^𝜷subscript^𝛽1subscript^𝛽𝑁\hat{\bm{\beta}}=\{\hat{\beta}_{1},\dots,\hat{\beta}_{N}\}over^ start_ARG bold_italic_β end_ARG = { over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } is revealed by (24). Accordingly, take 𝒢^^𝒢\widehat{\mathscr{G}}over^ start_ARG script_G end_ARG to represent the game model with uncertainty after learning, i.e., each player i𝑖iitalic_i has to solve the following problem:

minxiJi(xi,σ^(𝒙))s.t.xiΩi,𝜶(𝒙i),𝜶𝐀.formulae-sequencesubscriptsubscript𝑥𝑖subscript𝐽𝑖subscript𝑥𝑖^𝜎𝒙stformulae-sequencesubscript𝑥𝑖subscriptΩ𝑖𝜶subscript𝒙𝑖𝜶𝐀\displaystyle\min_{x_{i}}~{}J_{i}(x_{i},\hat{\sigma}(\bm{x}))~{}\mathrm{s.t.}~% {}x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i}),~{}{\bm{\alpha}\in\bm{\mathrm{A}% }.}roman_min start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over^ start_ARG italic_σ end_ARG ( bold_italic_x ) ) roman_s . roman_t . italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) , bold_italic_α ∈ bold_A . (25)

Then, we seek an rGNE of game 𝒢^^𝒢\widehat{\mathscr{G}}over^ start_ARG script_G end_ARG in the worst case, that is, the robustness of Nash equilibrium in (25) satisfying all possible uncertainties in the feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A. In this view, we consider transforming the uncertain problem (25) into a deterministic model. By the virtue of deterministic robust optimization, the following theorem shows how to construct a robust counterpart.

Theorem 2

Under Assumption 1, a strategy profile 𝐱*superscript𝐱\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is an rGNE of game 𝒢^normal-^𝒢\widehat{\mathscr{G}}over^ start_ARG script_G end_ARG (25) if and only if there exists 𝐲*=col{y1*,,yN*}superscript𝐲𝑐𝑜𝑙superscriptsubscript𝑦1normal-…superscriptsubscript𝑦𝑁\bm{y}^{*}=col\{y_{1}^{*},\dots,y_{N}^{*}\}bold_italic_y start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_c italic_o italic_l { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT } with yi*misuperscriptsubscript𝑦𝑖superscriptsubscript𝑚𝑖y_{i}^{*}\in\mathbb{R}^{m_{i}}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT such that (𝐱*,𝐲*)superscript𝐱superscript𝐲(\bm{x}^{*},\bm{y}^{*})( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , bold_italic_y start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) is a GNE of the following deterministic game:

minxi+n,yi+misubscriptformulae-sequencesubscript𝑥𝑖subscriptsuperscript𝑛subscript𝑦𝑖superscriptsubscriptsubscript𝑚𝑖\displaystyle\min_{x_{i}\in\mathbb{R}^{n}_{+},y_{i}\in\mathbb{R}_{+}^{m_{i}}}roman_min start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT Ji(xi,σ^(𝒙))subscript𝐽𝑖subscript𝑥𝑖^𝜎𝒙\displaystyle J_{i}(x_{i},\hat{\sigma}(\bm{x}))italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over^ start_ARG italic_σ end_ARG ( bold_italic_x ) ) (26)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}\quadroman_s . roman_t . i=1NyiTdib,DiTyixi=𝟎n,i.formulae-sequencesuperscriptsubscript𝑖1𝑁superscriptsubscript𝑦𝑖𝑇subscript𝑑𝑖𝑏formulae-sequencesuperscriptsubscript𝐷𝑖𝑇subscript𝑦𝑖subscript𝑥𝑖subscript0𝑛for-all𝑖\displaystyle\sum_{i=1}^{N}y_{i}^{T}d_{i}\leq b,~{}D_{i}^{T}y_{i}\!-\!x_{i}=% \bm{0}_{n},~{}\forall i\in\mathcal{I}.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b , italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I .

Proof. Recall the expression of the coupling constraint Ω𝜶subscriptΩ𝜶\Omega_{\bm{\alpha}}roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT under all uncertain feasibility 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A. If all possible constraints hold in (25), then it can be equivalently regarded as

max𝜶𝐀i=1NαiTxi=i=1NmaxαiAiαiTxib.subscript𝜶𝐀superscriptsubscript𝑖1𝑁superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖superscriptsubscript𝑖1𝑁subscriptsubscript𝛼𝑖subscriptA𝑖superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖𝑏\displaystyle\max_{\bm{\alpha}\in\bm{\mathrm{A}}}\sum_{i=1}^{N}\alpha_{i}^{T}x% _{i}=\sum_{i=1}^{N}\max\limits_{\alpha_{i}\in\mathrm{A}_{i}}\alpha_{i}^{T}x_{i% }\leq b.roman_max start_POSTSUBSCRIPT bold_italic_α ∈ bold_A end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_max start_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b . (27)

We find a sub-optimization problem maxαiAiαiTxisubscriptsubscript𝛼𝑖subscriptA𝑖superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖\max\limits_{\alpha_{i}\in\mathrm{A}_{i}}\alpha_{i}^{T}x_{i}roman_max start_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT with the feasibility structure Ai={αin:Diαidi}subscriptA𝑖conditional-setsubscript𝛼𝑖superscript𝑛subscript𝐷𝑖subscript𝛼𝑖subscript𝑑𝑖\mathrm{A}_{i}=\{\alpha_{i}\in\mathbb{R}^{n}:D_{i}\alpha_{i}\leq d_{i}\}roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }.

maxαiαiTxis.t.Diαidi.formulae-sequencesubscriptsubscript𝛼𝑖superscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖stsubscript𝐷𝑖subscript𝛼𝑖subscript𝑑𝑖\displaystyle\max_{\alpha_{i}}\alpha_{i}^{T}x_{i}\quad\mathrm{s.t.}~{}D_{i}% \alpha_{i}\leq d_{i}.roman_max start_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_s . roman_t . italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT . (28)

Notice that here xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT serves as a constant while αisubscript𝛼𝑖\alpha_{i}italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is variable. Accordingly, design a Lagrangian function with an auxiliary multiplier yi+misubscript𝑦𝑖subscriptsuperscriptsubscript𝑚𝑖y_{i}\in\mathbb{R}^{m_{i}}_{+}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT.

2(αi,yi)subscript2subscript𝛼𝑖subscript𝑦𝑖\displaystyle\mathcal{L}_{2}(\alpha_{i},y_{i})caligraphic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) =αiTxi+yiT(Diαidi)absentsuperscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖superscriptsubscript𝑦𝑖𝑇subscript𝐷𝑖subscript𝛼𝑖subscript𝑑𝑖\displaystyle=-\alpha_{i}^{T}x_{i}+y_{i}^{T}(D_{i}\alpha_{i}-d_{i})= - italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )
=(yiTDixiT)αiyiTdi.absentsuperscriptsubscript𝑦𝑖𝑇subscript𝐷𝑖superscriptsubscript𝑥𝑖𝑇subscript𝛼𝑖superscriptsubscript𝑦𝑖𝑇subscript𝑑𝑖\displaystyle=(y_{i}^{T}D_{i}-x_{i}^{T})\alpha_{i}-y_{i}^{T}d_{i}.= ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT .

By Assumption 1, the polyhedron AisubscriptA𝑖\mathrm{A}_{i}roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is nonempty and the Slater’s condition is therefore satisfied in problem (28). Then, the duality gap vanishes in the Lagrangian function 2subscript2\mathcal{L}_{2}caligraphic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. It follows from the duality theory that the maximum in (28) can be equivalently described by the following minimum

minyiyiTdis.t.DiTyixi=𝟎n,yi𝟎mi.formulae-sequencesubscriptsubscript𝑦𝑖superscriptsubscript𝑦𝑖𝑇subscript𝑑𝑖stformulae-sequencesuperscriptsubscript𝐷𝑖𝑇subscript𝑦𝑖subscript𝑥𝑖subscript0𝑛subscript𝑦𝑖subscript0subscript𝑚𝑖\displaystyle\min_{y_{i}}~{}y_{i}^{T}d_{i}\quad\mathrm{s.t.}~{}D_{i}^{T}y_{i}-% x_{i}=\bm{0}_{n},~{}y_{i}\geq\bm{0}_{m_{i}}.roman_min start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_s . roman_t . italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ bold_0 start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (29)

Hence, the inequality relation (27) in the worst case becomes

i=1NminyiYisuperscriptsubscript𝑖1𝑁subscriptsubscript𝑦𝑖subscript𝑌𝑖\displaystyle\sum_{i=1}^{N}\min_{y_{i}\in Y_{i}}~{}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_min start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT yiTdib,superscriptsubscript𝑦𝑖𝑇subscript𝑑𝑖𝑏\displaystyle y_{i}^{T}d_{i}\leq b,italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b , (30)

where the constraint Yi={yi+mi:DiTyixi=𝟎n}subscript𝑌𝑖conditional-setsubscript𝑦𝑖superscriptsubscriptsubscript𝑚𝑖superscriptsubscript𝐷𝑖𝑇subscript𝑦𝑖subscript𝑥𝑖subscript0𝑛Y_{i}=\{y_{i}\in\mathbb{R}_{+}^{m_{i}}:\;D_{i}^{T}y_{i}-x_{i}=\bm{0}_{n}\}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT : italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT }. It follows from [16] that, the minimum on the left-hand side in (30) can be removed. In fact, if there exists at least one qualified profile 𝒚=col{y1,,yN}𝒚𝑐𝑜𝑙subscript𝑦1subscript𝑦𝑁\bm{y}=col\{y_{1},\dots,y_{N}\}bold_italic_y = italic_c italic_o italic_l { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT }, then the minimum will be naturally verified. Hence, (30) can be rewritten as

i=1NyiTdib,yiYi,i.formulae-sequencesuperscriptsubscript𝑖1𝑁superscriptsubscript𝑦𝑖𝑇subscript𝑑𝑖𝑏formulae-sequencesubscript𝑦𝑖subscript𝑌𝑖for-all𝑖\displaystyle\sum_{i=1}^{N}y_{i}^{T}d_{i}\leq b,~{}y_{i}\in Y_{i},~{}\forall i% \in\mathcal{I}.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I . (31)

Thus, together with the definition of set Yisubscript𝑌𝑖Y_{i}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I, the coupling constraint in (25) under all feasibility 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A can be reformulated by the following deterministic constraints:

i=1NyiTdib,DiTyixi=𝟎n,i,formulae-sequencesuperscriptsubscript𝑖1𝑁superscriptsubscript𝑦𝑖𝑇subscript𝑑𝑖𝑏formulae-sequencesuperscriptsubscript𝐷𝑖𝑇subscript𝑦𝑖subscript𝑥𝑖subscript0𝑛for-all𝑖\displaystyle\sum_{i=1}^{N}y_{i}^{T}d_{i}\leq b,~{}D_{i}^{T}y_{i}-x_{i}=\bm{0}% _{n},~{}\forall i\in\mathcal{I},∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_b , italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I , (32)

where xi+nsubscript𝑥𝑖subscriptsuperscript𝑛x_{i}\in\mathbb{R}^{n}_{+}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT and yi+misubscript𝑦𝑖superscriptsubscriptsubscript𝑚𝑖y_{i}\in\mathbb{R}_{+}^{m_{i}}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. That is exactly (26).

The reverse proof can be conducted similarly when (𝒙*,𝒚*)superscript𝒙superscript𝒚(\bm{x}^{*},\bm{y}^{*})( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , bold_italic_y start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) satisfies the deterministic formulation in (26), because all the above procedures are equivalent transformations. \square

Theorem 2 shows that, by introducing the auxiliary variable 𝒚𝒚\bm{y}bold_italic_y, the uncertain game 𝒢^^𝒢\hat{\mathscr{G}}over^ start_ARG script_G end_ARG can be transformed into a deterministic one in (26). On this basis, we can obtain the first-order condition of (26). Technically, a strategy (𝒙*,𝒚*)superscript𝒙superscript𝒚(\bm{x}^{*},\bm{y}^{*})( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , bold_italic_y start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) is a GNE of (26) (an rGNE of 𝒢^^𝒢\hat{\mathscr{G}}over^ start_ARG script_G end_ARG in (25)) if and only if there exists μ*+superscript𝜇subscript\mu^{*}\in\mathbb{R}_{+}italic_μ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT and 𝝎*=col{ωi*}i=1Nmsuperscript𝝎𝑐𝑜𝑙superscriptsubscriptsuperscriptsubscript𝜔𝑖𝑖1𝑁superscript𝑚\bm{\omega}^{*}=col\{\omega_{i}^{*}\}_{i=1}^{N}\in\mathbb{R}^{m}bold_italic_ω start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_c italic_o italic_l { italic_ω start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT such that, for i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I,

𝟎nxiJi(,σ^(𝒙*))+β^iσ^Ji(xi*,)ωi*+𝒩+n(xi*),subscript0𝑛subscriptsubscript𝑥𝑖subscript𝐽𝑖^𝜎superscript𝒙subscript^𝛽𝑖subscript^𝜎subscript𝐽𝑖subscriptsuperscript𝑥𝑖superscriptsubscript𝜔𝑖subscript𝒩superscriptsubscript𝑛subscriptsuperscript𝑥𝑖\displaystyle\bm{0}_{n}\in\nabla_{x_{i}}J_{i}(\cdot,\hat{\sigma}(\bm{x}^{*}))% \!+\!\hat{\beta}_{i}\nabla_{\hat{\sigma}}J_{i}(x^{*}_{i},\cdot)\!-\!\omega_{i}% ^{*}\!+\!\mathcal{N}_{\mathbb{R}_{+}^{n}}(x^{*}_{i}),bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ ∇ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ⋅ , over^ start_ARG italic_σ end_ARG ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ) + over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT over^ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ⋅ ) - italic_ω start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT + caligraphic_N start_POSTSUBSCRIPT blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (33)
𝟎miμ*di+Diωi*+𝒩+mi(yi*),subscript0subscript𝑚𝑖superscript𝜇subscript𝑑𝑖subscript𝐷𝑖superscriptsubscript𝜔𝑖subscript𝒩superscriptsubscriptsubscript𝑚𝑖subscriptsuperscript𝑦𝑖\displaystyle\bm{0}_{m_{i}}\in\mu^{*}d_{i}+D_{i}\omega_{i}^{*}+\mathcal{N}_{% \mathbb{R}_{+}^{m_{i}}}(y^{*}_{i}),bold_0 start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ italic_μ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_ω start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT + caligraphic_N start_POSTSUBSCRIPT blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_y start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ,
0i=1NdiTyi*bμ*,0superscriptsubscript𝑖1𝑁superscriptsubscript𝑑𝑖𝑇subscriptsuperscript𝑦𝑖𝑏perpendicular-tosuperscript𝜇\displaystyle 0\geq\sum_{i=1}^{N}d_{i}^{T}y^{*}_{i}-b\perp\mu^{*},0 ≥ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_b ⟂ italic_μ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ,
𝟎n=DiTyi*xi*.subscript0𝑛superscriptsubscript𝐷𝑖𝑇subscriptsuperscript𝑦𝑖superscriptsubscript𝑥𝑖\displaystyle\bm{0}_{n}=D_{i}^{T}y^{*}_{i}-x_{i}^{*}.bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT .

Now we can say that, after learning the black-box aggregator and deriving the robust counterpart, we are finally able to seek an rGNE of the original game 𝒢𝒢\mathscr{G}script_G in (2) via the above first-order condition. There is no need for a detailed derivation of the solver design since such designs have been already given in many developed works [7, 8, 9, 10, 11].

For the sake of simplicity, we put together all the procedures in Algorithm 1 to compute an rGNE within a black-box aggregative game problem (2).

Algorithm 1

Initialization: aggregator σ(𝒙)𝜎𝒙\sigma(\bm{x})italic_σ ( bold_italic_x ), payoff functions Ji(xi,σ(𝒙))subscript𝐽𝑖subscript𝑥𝑖𝜎𝒙J_{i}(x_{i},\sigma(\bm{x}))italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ ( bold_italic_x ) ), coupling constraint Ω𝜶subscriptΩ𝜶\Omega_{\bm{\alpha}}roman_Ω start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT, all uncertain feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A, and a well-developed vGNE-seeking solver. 1) Data Construction

for k=1,2,,M𝑘12𝑀k=1,2,\dots,Mitalic_k = 1 , 2 , … , italic_M, do

Input: uncertain αi[k]Ai,iformulae-sequencesubscript𝛼𝑖delimited-[]𝑘subscriptA𝑖𝑖\alpha_{i}[k]\in\mathrm{A}_{i},~{}i\in\mathcal{I}italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT [ italic_k ] ∈ roman_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i ∈ caligraphic_I,

Solve: deterministic game 𝒢𝜶[k]subscript𝒢𝜶delimited-[]𝑘\mathscr{G}_{\bm{\alpha}[k]}script_G start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT by the existing vGNE-

seeking solver,

Output: vGNE 𝒙𝜶[k]*subscriptsuperscript𝒙𝜶delimited-[]𝑘\bm{x}^{*}_{\bm{\alpha}[k]}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT of game 𝒢𝜶[k]subscript𝒢𝜶delimited-[]𝑘\mathscr{G}_{\bm{\alpha}[k]}script_G start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT.

end for

2) Inverse learning

Input: all data (𝜶[k],𝒙𝜶[k]*),k=1,2,,Mformulae-sequence𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘𝑘12𝑀(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]}),~{}k=1,2,\dots,M( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ) , italic_k = 1 , 2 , … , italic_M,

Solve: inverse VI-based learning approach in (24),

Return: estimated weight 𝜷^^𝜷\hat{\bm{\beta}}over^ start_ARG bold_italic_β end_ARG in black-box aggregator

3) Robust Counterpart

Solve: the deterministic game 𝒢^^𝒢\hat{\mathscr{G}}over^ start_ARG script_G end_ARG in (26) with auxiliary

variables yi,isubscript𝑦𝑖𝑖y_{i},i\in\mathcal{I}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i ∈ caligraphic_I and estimated weight 𝜷^^𝜷\hat{\bm{\beta}}over^ start_ARG bold_italic_β end_ARG,

Return: rGNE 𝒙*superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT of game 𝒢𝒢\mathscr{G}script_G.

5 Generalization Guarantee

In this section, we address the generalization capabilities of our learning approach (24). In fact, an important aspect of a learning method is its ability to generalize to real data, that is, whether its estimation can perform well on new data.

5.1 Violation Probability

In (24), the generalization guarantee refers to whether the optimal solution with the used data (𝜶[k],𝒙𝜶[k]*)𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ), k=1,2,,M𝑘12𝑀k=1,2,\dots,Mitalic_k = 1 , 2 , … , italic_M, can truly represent all cases under the uncertain feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A. In fact, no matter what sampling (referring to 𝜶𝐀𝜶𝐀\bm{\alpha}\in\bm{\mathrm{A}}bold_italic_α ∈ bold_A) is taken, it is unrealistic to fully represent the entire uncertain domain. Therefore, the optimal solution we learn by (24) may merely be applicable to those given samplings (𝜶[k],𝒙𝜶[k]*)𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ), k=1,2,,M𝑘12𝑀k=1,2,\dots,Mitalic_k = 1 , 2 , … , italic_M, and the optimal solution 𝜷𝜷\bm{\beta}bold_italic_β may not be optimal under all uncertain feasibility. In other terms, there may be a new sampled data point (𝜶[M+1],𝒙𝜶[M+1]*)𝜶delimited-[]𝑀1subscriptsuperscript𝒙𝜶delimited-[]𝑀1(\bm{\alpha}[M+1],\bm{x}^{*}_{\bm{\alpha}[M+1]})( bold_italic_α [ italic_M + 1 ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_M + 1 ] end_POSTSUBSCRIPT ) for which the previously obtained optimal solution by (24) is no longer valid.

Hence, we investigate the above fact using the following violation probability concept [24]. For convenience, we rewrite the constraints in problem (23) as follows.

𝒳(𝜶,𝒙𝜶*){δ,𝜷:γ,s.t.\displaystyle\mathcal{X}(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})\triangleq\{% \delta,\bm{\beta}:~{}\exists\gamma,~{}\mathrm{s.t.}caligraphic_X ( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) ≜ { italic_δ , bold_italic_β : ∃ italic_γ , roman_s . roman_t . 𝑭(𝒙𝜶*;𝜷)T𝒙𝜶*γbδ,𝑭superscriptsubscriptsuperscript𝒙𝜶𝜷𝑇subscriptsuperscript𝒙𝜶𝛾𝑏𝛿\displaystyle~{}\bm{F}(\bm{x}^{*}_{\bm{\alpha}};\bm{\beta})^{T}\bm{x}^{*}_{\bm% {\alpha}}-\gamma b\leq\delta,bold_italic_F ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ; bold_italic_β ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT - italic_γ italic_b ≤ italic_δ ,
Fi(𝒙𝜶*;𝜷)γαi0,i}.\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}};\bm{\beta})-\gamma\alpha_{i}\geq 0% ,~{}\forall i\in\mathcal{I}\}.italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ; bold_italic_β ) - italic_γ italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 , ∀ italic_i ∈ caligraphic_I } .
Definition 4

Given (𝛃,δ)𝛃𝛿(\bm{\beta},\delta)( bold_italic_β , italic_δ ), the violation probability is

𝕍(𝜷,δ){𝜶𝐀,(𝜷,δ)𝒳(𝜶,𝒙𝜶*)}.𝕍𝜷𝛿formulae-sequence𝜶𝐀𝜷𝛿𝒳𝜶subscriptsuperscript𝒙𝜶\displaystyle\mathbb{V}(\bm{\beta},\delta)\triangleq\mathbb{P}\{\bm{\alpha}\in% \bm{\mathrm{A}},~{}(\bm{\beta},\delta)\notin\mathcal{X}(\bm{\alpha},\bm{x}^{*}% _{\bm{\alpha}})\}.blackboard_V ( bold_italic_β , italic_δ ) ≜ blackboard_P { bold_italic_α ∈ bold_A , ( bold_italic_β , italic_δ ) ∉ caligraphic_X ( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α end_POSTSUBSCRIPT ) } . (34)

Then, the generalization guarantee takes on the form: given ϵ(0,1)italic-ϵ01\epsilon\in(0,1)italic_ϵ ∈ ( 0 , 1 ), evaluate the upper bound of M(𝕍(𝜷*,δ*)>ϵ),superscript𝑀𝕍superscript𝜷superscript𝛿italic-ϵ\mathbb{P}^{M}(\mathbb{V}(\bm{\beta}^{*},\delta^{*})>\epsilon),blackboard_P start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ( blackboard_V ( bold_italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_δ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) > italic_ϵ ) , where (𝜷*,δ*)superscript𝜷superscript𝛿(\bm{\beta}^{*},\delta^{*})( bold_italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_δ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) is the optimal solution via the learning approach (24) with data {(𝜶[1],𝒙𝜶[1]*),,(𝜶[M],𝒙𝜶[M]*)}𝜶delimited-[]1subscriptsuperscript𝒙𝜶delimited-[]1𝜶delimited-[]𝑀subscriptsuperscript𝒙𝜶delimited-[]𝑀\{(\bm{\alpha}[1],\bm{x}^{*}_{\bm{\alpha}[1]}),\cdots,(\bm{\alpha}[M],\bm{x}^{% *}_{\bm{\alpha}[M]})\}{ ( bold_italic_α [ 1 ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ 1 ] end_POSTSUBSCRIPT ) , ⋯ , ( bold_italic_α [ italic_M ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_M ] end_POSTSUBSCRIPT ) }. In qualitative forms, if ϵitalic-ϵ\epsilonitalic_ϵ is small enough and the upper bound in \mathbb{P}blackboard_P is also suitably small, then the learning method turns out to be reliable. If so, when we employ the inverse VI-based learning approach (24) to estimate the players’ weight information in the black-box aggregator, the confidence of the learning result is high. Otherwise, the learning solution may overfit and become too dependent on the samplings of each uncertain data point.

Remark 3

Generally, one would focus on the expectation of the loss δ𝛿\deltaitalic_δ for the generalization error. With the definition of violation probability, we learn that 𝕍𝕍\mathbb{V}blackboard_V monotonically approaches zero as the value of δ𝛿\deltaitalic_δ increases. Besides the consensus in the monotonicity of loss, we choose to investigate 𝕍𝕍\mathbb{V}blackboard_V because of its representation on the solution region of β𝛽\betaitalic_β and δ𝛿\deltaitalic_δ, providing a more interpretable result.

5.2 Generalization Bound

With the above statements, we consider that dataset (𝜶[k],𝒙𝜶[k]*)𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ) with k=1,,M𝑘1𝑀k=1,\dots,Mitalic_k = 1 , … , italic_M is considered as random samplings extracted from the uncertain feasibility 𝐀𝐀\bm{\mathrm{A}}bold_A subject to a distribution \mathbb{Q}blackboard_Q, whose exact form does not need to be known. We provide the following main result on the generalization bound of the inverse VI-based learning approach (24).

Theorem 3

Under Assumption 1, suppose that the constraint 𝒳(𝛂,𝐱*)𝒳𝛂superscript𝐱\mathcal{X}(\bm{\alpha},\bm{x}^{*})caligraphic_X ( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) is convex in 𝛃𝛃\bm{\beta}bold_italic_β and the optimal solution (𝛃*,δ*)superscript𝛃superscript𝛿(\bm{\beta}^{*},\delta^{*})( bold_italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_δ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) does exist. Then, for any ϵ(0,1)italic-ϵ01\epsilon\in(0,1)italic_ϵ ∈ ( 0 , 1 ), we have:

M(𝕍(δ*,𝜷*)>ϵ)l=0N(Ml)ϵl(1ϵ)Ml.superscript𝑀𝕍superscript𝛿superscript𝜷italic-ϵsuperscriptsubscript𝑙0𝑁matrix𝑀𝑙superscriptitalic-ϵ𝑙superscript1italic-ϵ𝑀𝑙\displaystyle\mathbb{P}^{M}(\mathbb{V}(\delta^{*},\bm{\beta}^{*})>\epsilon)% \leq\sum_{l=0}^{N}\begin{pmatrix}M\\ l\end{pmatrix}\epsilon^{l}(1-\epsilon)^{M-l}.blackboard_P start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ( blackboard_V ( italic_δ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , bold_italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) > italic_ϵ ) ≤ ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( start_ARG start_ROW start_CELL italic_M end_CELL end_ROW start_ROW start_CELL italic_l end_CELL end_ROW end_ARG ) italic_ϵ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ( 1 - italic_ϵ ) start_POSTSUPERSCRIPT italic_M - italic_l end_POSTSUPERSCRIPT . (37)

Proof. By Remark 1, the objective function in (23) or (24) is not restricted to specific norms. We take 𝜹subscriptnorm𝜹\|\bm{\delta}\|_{\infty}∥ bold_italic_δ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT to convert the optimization (23) or (24) into programmings based on the computation of convex intersections:

minδ0,𝜷δs.t.(δ,𝜷)k=1M𝒳(𝜶[k],𝒙𝜶[k]*).formulae-sequencesubscript𝛿0𝜷𝛿st𝛿𝜷superscriptsubscript𝑘1𝑀𝒳𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘\displaystyle\min_{\delta\geq 0,\bm{\beta}}~{}\delta\quad\mathrm{s.t.}~{}(% \delta,\bm{\beta})\in\bigcap_{k=1}^{M}\mathcal{X}(\bm{\alpha}[k],\bm{x}^{*}_{% \bm{\alpha}[k]}).roman_min start_POSTSUBSCRIPT italic_δ ≥ 0 , bold_italic_β end_POSTSUBSCRIPT italic_δ roman_s . roman_t . ( italic_δ , bold_italic_β ) ∈ ⋂ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_X ( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ) . (38)

We can verify that constraint 𝒳(𝜶,𝒙*)𝒳𝜶superscript𝒙\mathcal{X}(\bm{\alpha},\bm{x}^{*})caligraphic_X ( bold_italic_α , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) is convex in all the variables due to the convexity in 𝜷𝜷\bm{\beta}bold_italic_β. At this point, we have transformed the problem of obtaining the optimal parameters by data points from an uncertain set into a standard form of scenario programming [17].

On the one hand, for random programming (38) with given samplings, the intersection of constraints is feasible and endowed with a nonempty interior point due to Slater’s condition in Assumption 1.2. On the other hand, the uniqueness of the optimal solution is not a mandatory requirement. According to the operation in Remark 1 and [28, Discussion 2.1.5], the tie can be broken by selecting the one with the minimum Euclidean norm among all optimal solutions, which can still maintain the conclusion.

So far, the formulation (38) is in accordance with [28, Theorem 1], and we keep up with all the same assumed conditions. Then, the number of sums in (37) should originally be related to the dimension of all variables δ,𝜷𝛿𝜷\delta,\bm{\beta}italic_δ , bold_italic_β, that is, dim(δ,𝜷)𝑑𝑖𝑚𝛿𝜷dim(\delta,\bm{\beta})italic_d italic_i italic_m ( italic_δ , bold_italic_β ). Note that δ𝛿\deltaitalic_δ is a scalar and the dimension of 𝜷𝜷\bm{\beta}bold_italic_β is the number of players in \mathcal{I}caligraphic_I. The conclusion holds. \square

Theorem 3 tells that the generalization bound in (37) is independent of the sampling distribution \mathbb{Q}blackboard_Q of the uncertain parameter 𝜶𝜶\bm{\alpha}bold_italic_α. Taking Δ=l=0N(Ml)ϵl(1ϵ)MlΔsuperscriptsubscript𝑙0𝑁matrix𝑀𝑙superscriptitalic-ϵ𝑙superscript1italic-ϵ𝑀𝑙\Delta=\sum_{l=0}^{N}\begin{pmatrix}M\\ l\end{pmatrix}\epsilon^{l}(1-\epsilon)^{M-l}roman_Δ = ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( start_ARG start_ROW start_CELL italic_M end_CELL end_ROW start_ROW start_CELL italic_l end_CELL end_ROW end_ARG ) italic_ϵ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ( 1 - italic_ϵ ) start_POSTSUPERSCRIPT italic_M - italic_l end_POSTSUPERSCRIPT, we give some detailed explanations for Theorem 3 from different viewpoints.

i). Generalization perspective: With probability ΔΔ\Deltaroman_Δ for sampling, the violation probability 𝕍𝕍\mathbb{V}blackboard_V of the learning optimal solution (δ*,𝜷*)superscript𝛿superscript𝜷(\delta^{*},\bm{\beta}^{*})( italic_δ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , bold_italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) is at most ϵitalic-ϵ\epsilonitalic_ϵ. We can interpret this statement from “new-data” scenarios. The conclusion answers the following question: for a fixed learning pair (δ*,𝜷*)superscript𝛿superscript𝜷(\delta^{*},\bm{\beta}^{*})( italic_δ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , bold_italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) with M𝑀Mitalic_M data, how much confidence can one hold that the pair is still the optimal solution with the least probability 1ϵ1italic-ϵ1-\epsilon1 - italic_ϵ when a new data (𝜶[M+1],𝒙[M+1]𝜶*)(\bm{\alpha}[M+1],\bm{x}^{*}_{{}_{\bm{\alpha}}[M+1]})( bold_italic_α [ italic_M + 1 ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT start_FLOATSUBSCRIPT bold_italic_α end_FLOATSUBSCRIPT [ italic_M + 1 ] end_POSTSUBSCRIPT ) comes? Then the answer is that one can keep the confidence at least 1Δ1Δ1-\Delta1 - roman_Δ.

ii). Data-size perspective: A direct application of Theorem 3 is to find the smallest data size M𝑀Mitalic_M for given violation parameter ϵitalic-ϵ\epsilonitalic_ϵ and confidence threshold ΔΔ\Deltaroman_Δ. The issue can be handled by solving the equation Δ=l=0N(Ml)ϵl(1ϵ)MlΔsuperscriptsubscript𝑙0𝑁matrix𝑀𝑙superscriptitalic-ϵ𝑙superscript1italic-ϵ𝑀𝑙\Delta=\sum_{l=0}^{N}\begin{pmatrix}M\\ l\end{pmatrix}\epsilon^{l}(1-\epsilon)^{M-l}roman_Δ = ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( start_ARG start_ROW start_CELL italic_M end_CELL end_ROW start_ROW start_CELL italic_l end_CELL end_ROW end_ARG ) italic_ϵ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ( 1 - italic_ϵ ) start_POSTSUPERSCRIPT italic_M - italic_l end_POSTSUPERSCRIPT and regarding M𝑀Mitalic_M as a variable. We will provide Tab.1 in Section 6 to further illustrate the values ΔΔ\Deltaroman_Δ for given ϵitalic-ϵ\epsilonitalic_ϵ, M𝑀Mitalic_M, and N𝑁Nitalic_N. Actually, ΔΔ\Deltaroman_Δ corresponds to the tail probability of a binomial distribution in M𝑀Mitalic_M, which exponentially converges.

Remark 4

Under the assumed convex conditions in Theorem 3, to assess the deviation probability 𝕍𝕍\mathbb{V}blackboard_V, it’s necessary to compute which samples are supporting under the given M𝑀Mitalic_M uncertainties. There is already mature research on how to compute support samples, and the detailed progress can be found in [22, 29]. Moreover, the setup in (2) involves a well-deployed system encountering passive parameter perturbations and needs to recover the black-box part through a learning method with data. If there are potential mechanisms for acquiring more effective data, we believe that the performance of inverse learning approach (24) can be further improved.

6 Numerical Evaluation

We consider an aggregative game with N=4𝑁4N=4italic_N = 4 electricity users in demand of energy consumption [4, 8], as introduced in Example 1. For i=1,2,3,4𝑖1234i=1,2,3,4italic_i = 1 , 2 , 3 , 4,

Ωi,𝜶(𝒙i)={xi+:αiTxi75ji,j=14αjTxj},subscriptΩ𝑖𝜶subscript𝒙𝑖conditional-setsubscript𝑥𝑖subscriptsuperscriptsubscript𝛼𝑖𝑇subscript𝑥𝑖75superscriptsubscriptformulae-sequence𝑗𝑖𝑗14superscriptsubscript𝛼𝑗𝑇subscript𝑥𝑗\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})=\{x_{i}\in\mathbb{R}_{+}:~{}\alpha_{i}^{T}% x_{i}\leq 75-\!\!\!\!\sum_{j\neq i,j=1}^{4}\alpha_{j}^{T}x_{j}\},roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) = { italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT : italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ 75 - ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i , italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } ,

where the uncertain parameter αi[0.1,2]subscript𝛼𝑖0.12\alpha_{i}\in[0.1,2]italic_α start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ [ 0.1 , 2 ]. User i𝑖iitalic_i adopts xiΩi,𝜶(𝒙i)subscript𝑥𝑖subscriptΩ𝑖𝜶subscript𝒙𝑖x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω start_POSTSUBSCRIPT italic_i , bold_italic_α end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) to minimize its electricity cost

Ji(xi,σ(𝒙))=li(xihi)2+P(σ(𝒙))xi,subscript𝐽𝑖subscript𝑥𝑖𝜎𝒙subscript𝑙𝑖superscriptsubscript𝑥𝑖subscript𝑖2𝑃𝜎𝒙subscript𝑥𝑖J_{i}(x_{i},\sigma(\bm{x}))=l_{i}(x_{i}-h_{i})^{2}+P(\sigma(\bm{x}))x_{i},italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ ( bold_italic_x ) ) = italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_P ( italic_σ ( bold_italic_x ) ) italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ,

where P(σ(𝒙))=qNσ(𝒙)+p0𝑃𝜎𝒙𝑞𝑁𝜎𝒙subscript𝑝0P(\sigma(\bm{x}))=qN\sigma(\bm{x})+p_{0}italic_P ( italic_σ ( bold_italic_x ) ) = italic_q italic_N italic_σ ( bold_italic_x ) + italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the system coefficients are set as li=1subscript𝑙𝑖1l_{i}=1italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1, h1=50subscript150h_{1}=50italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 50, h2=55subscript255h_{2}=55italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 55, h3=60subscript360h_{3}=60italic_h start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 60, h4=65subscript465h_{4}=65italic_h start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = 65, q=0.04𝑞0.04q=0.04italic_q = 0.04, and p0=5subscript𝑝05p_{0}=5italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 5. Note that the real value of players’ weight in the aggregator is 𝜷={0.1,0.2,0.3,0.4}𝜷0.10.20.30.4\bm{\beta}=\{0.1,0.2,0.3,0.4\}bold_italic_β = { 0.1 , 0.2 , 0.3 , 0.4 }.

Due to the uncertain system, we first construct the dataset (𝜶[k],𝒙𝜶[k]*)𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ) from k=1,,M𝑘1𝑀k=1,\dots,Mitalic_k = 1 , … , italic_M samplings, where the equilibrium-seeking methods refer to [7, 8, 9, 10, 11]. Then with all data (𝜶[k],𝒙𝜶[k]*)𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ) as input, we employ the inverse VI-based learning approach (24) to estimate players’ weight 𝜷𝜷\bm{\beta}bold_italic_β in the black-box aggregator σ(𝒙)𝜎𝒙\sigma(\bm{x})italic_σ ( bold_italic_x ), as similarly illustrated in Example 1. Finally, based on learning results, we seek rGNE 𝒙*superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT by the first-order condition (33). We set M=4𝑀4M=4italic_M = 4 and Figs. 1, 2 show the learning results. In Fig. 1 the blue bars present the true weight 𝜷𝜷\bm{\beta}bold_italic_β for each player while the red bars present the estimated value 𝜷^^𝜷\hat{\bm{\beta}}over^ start_ARG bold_italic_β end_ARG by the learning approach (24). In detail, we get 𝜷^={0.0893,0.1907,0.2918,0.3926}^𝜷0.08930.19070.29180.3926\hat{\bm{\beta}}=\{0.0893,0.1907,0.2918,0.3926\}over^ start_ARG bold_italic_β end_ARG = { 0.0893 , 0.1907 , 0.2918 , 0.3926 }, which shows a good learning performance close to the true value. This verifies the validity of the VI-based inverse learning approach (24). Afterward, we can continue to compute rGNE 𝒙*superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT with the estimated 𝜷^^𝜷\hat{\bm{\beta}}over^ start_ARG bold_italic_β end_ARG, which is illustrated in Fig. 2.

Refer to caption
Figure 1: Learning performance with data amounts M=4𝑀4M=4italic_M = 4.
Refer to caption
Figure 2: Convergence for seeking rGNE.
Refer to caption
Figure 3: Learning performance with different data sizes.

We further check the performance of the learning approach (24) with different amounts of data points (𝜶[k],𝒙𝜶[k]*)𝜶delimited-[]𝑘subscriptsuperscript𝒙𝜶delimited-[]𝑘(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})( bold_italic_α [ italic_k ] , bold_italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_α [ italic_k ] end_POSTSUBSCRIPT ). We give the following mean estimated error

MEE=1N(i=1Nβi^βi2)1/2,MEE1𝑁superscriptsuperscriptsubscript𝑖1𝑁superscriptnorm^subscript𝛽𝑖subscript𝛽𝑖212\operatorname{MEE}=\frac{1}{N}\Big{(}{\sum_{i=1}^{N}\|\hat{\beta_{i}}-{\beta_{% i}}\|^{2}}\Big{)}^{1/2},roman_MEE = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ∥ over^ start_ARG italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG - italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ,

and take M=2,3,4,,10𝑀23410M=2,3,4,\dots,10italic_M = 2 , 3 , 4 , … , 10 to record MEE of each setting in Fig. 3. The trend of MEE becomes obviously mild and close enough to zero as the amount of data increases.

TABLE I: Data size, learning error, generalization bound
M=10𝑀10M=10italic_M = 10 M=20𝑀20M=20italic_M = 20 M=30𝑀30M=30italic_M = 30 M=40𝑀40M=40italic_M = 40 M=50𝑀50M=50italic_M = 50
MEE 0.0004 0.0001 3.1628*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT 2.4306*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT 2.4253*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT
ΔΔ\Deltaroman_Δ 0.9983 0.9568 0.8245 0.6290 0.4312
M=60𝑀60M=60italic_M = 60 M=70𝑀70M=70italic_M = 70 M=80𝑀80M=80italic_M = 80 M=90𝑀90M=90italic_M = 90 M=100𝑀100M=100italic_M = 100
MEE 2.0139*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT 1.8724*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT 1.5397*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT 1.2256*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT 1.0713*105absentsuperscript105*10^{-5}* 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT
ΔΔ\Deltaroman_Δ 0.1710 0.1588 0.0880 0.0465 0.0237

Finally, we greatly improve the numerical accuracy and provide Tab. I to show some numerical relation between the data amounts, learning errors MEE, and generalization bound ΔΔ\Deltaroman_Δ. Here, take N=4𝑁4N=4italic_N = 4 players, M𝑀Mitalic_M as the variant amount of data, and ϵ=0.1italic-ϵ0.1\epsilon=0.1italic_ϵ = 0.1 for violation probability. Thus, the generalization bound should be Δ=l=04(Ml)0.1l0.9Ml.Δsuperscriptsubscript𝑙04matrix𝑀𝑙superscript0.1𝑙superscript0.9𝑀𝑙\Delta=\sum_{l=0}^{4}\begin{pmatrix}M\\ l\end{pmatrix}{0.1}^{l}\cdot{0.9}^{M-l}.roman_Δ = ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ( start_ARG start_ROW start_CELL italic_M end_CELL end_ROW start_ROW start_CELL italic_l end_CELL end_ROW end_ARG ) 0.1 start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ⋅ 0.9 start_POSTSUPERSCRIPT italic_M - italic_l end_POSTSUPERSCRIPT . We can see from Table I that as the dataset size increases, the value of MEE decreases, while the value of ΔΔ\Deltaroman_Δ goes rapidly (exponentially) to zero. This can also be regarded as a tradeoff between accuracy and confidence. These figures support the results in Theorem 3 and the associated discussions.

7 Conclusions

In this note, we proposed a novel learning scheme to seek the robust equilibrium with players’ unknown weight in a black-box aggregator. We put together the data sets with two parts: perturbed parameters from uncertain feasibility and corresponding NE by developed solvers. We established the learning model by an inverse variational inequality relation. Then, we derived the robust counterpart thus obtaining the first-order conditions for robust generalized Nashe quilibria. Also, we showed a generalization guarantee of the proposed learning approach. The numerical results presented good performances and effectiveness of our methodology.

References

  • [1] D. Paccagnan, B. Gentile, F. Parise, M. Kamgarpour, and J. Lygeros, “Distributed computation of generalized Nash equilibria in quadratic aggregative games with affine coupling constraints,” in 2016 IEEE 55th Conference on Decision and Control (CDC).   IEEE, 2016, pp. 6123–6128.
  • [2] J. Lei, U. V. Shanbhag, and J. Chen, “Distributed computation of Nash equilibria for monotone aggregative games via iterative regularization,” in 2020 59th IEEE Conference on Decision and Control (CDC).   IEEE, 2020, pp. 2285–2290.
  • [3] S. Huang, J. Lei, and Y. Hong, “A linearly convergent distributed Nash equilibrium seeking algorithm for aggregative games,” IEEE Transactions on Automatic Control, vol. 68, no. 3, pp. 1753–1759, 2023.
  • [4] M. Ye and G. Hu, “Game design and analysis for price-based demand response: An aggregate game approach,” IEEE Transactions on Cybernetics, vol. 47, no. 3, pp. 720–730, 2016.
  • [5] J. Barrera and A. Garcia, “Dynamic incentives for congestion control,” IEEE Transactions on Automatic Control, vol. 60, no. 2, pp. 299–310, 2014.
  • [6] R. Cornes, “Aggregative environmental games,” Environmental and Resource Economics, vol. 63, no. 2, pp. 339–365, 2016.
  • [7] J. Koshal, A. Nedić, and U. V. Shanbhag, “Distributed algorithms for aggregative games on graphs,” Operations Research, vol. 64, no. 3, pp. 680–704, 2016.
  • [8] S. Liang, P. Yi, and Y. Hong, “Distributed Nash equilibrium seeking for aggregative games with coupled constraints,” Automatica, vol. 85, pp. 179–185, 2017.
  • [9] F. Fabiani, K. Margellos, and P. J. Goulart, “On the robustness of equilibria in generalized aggregative games,” in 2020 59th IEEE Conference on Decision and Control (CDC).   IEEE, 2020, pp. 3725–3730.
  • [10] G. Belgioioso, A. Nedić, and S. Grammatico, “Distributed generalized Nash equilibrium seeking in aggregative games on time-varying networks,” IEEE Transactions on Automatic Control, vol. 66, no. 5, pp. 2061–2075, 2020.
  • [11] G. Xu, G. Chen, H. Qi, and Y. Hong, “Efficient algorithm for approximating Nash equilibrium of distributed aggregative games,” IEEE Transactions on Cybernetics, vol. 53, no. 7, pp. 4375–4387, 2023.
  • [12] H. Yang, X. Xie, and A. V. Vasilakos, “Noncooperative and cooperative optimization of electric vehicle charging under demand uncertainty: A robust stackelberg game,” IEEE Transactions on Vehicular Technology, vol. 65, no. 3, pp. 1043–1058, 2015.
  • [13] M. E. Nikoofal and J. Zhuang, “Robust allocation of a defensive budget considering an attacker’s private information,” Risk Analysis: An International Journal, vol. 32, no. 5, pp. 930–943, 2012.
  • [14] Z. Cheng, G. Chen, and Y. Hong, “Single-leader-multiple-followers stackelberg security game with hypergame framework,” IEEE Transactions on Information Forensics and Security, vol. 17, pp. 954–969, 2022.
  • [15] M. Aghassi and D. Bertsimas, “Robust game theory,” Mathematical Programming, vol. 107, no. 1-2, pp. 231–273, 2006.
  • [16] D. Bertsimas, D. B. Brown, and C. Caramanis, “Theory and applications of robust optimization,” SIAM Review, vol. 53, no. 3, pp. 464–501, 2011.
  • [17] G. C. Calafiore and M. C. Campi, “The scenario approach to robust control design,” IEEE Transactions on Automatic Control, vol. 51, no. 5, pp. 742–753, 2006.
  • [18] G. Chen, Y. Ming, Y. Hong, and P. Yi, “Distributed algorithm for ε𝜀\varepsilonitalic_ε-generalized Nash equilibria with uncertain coupled constraints,” Automatica, vol. 123, p. 109313, 2021.
  • [19] G. Xu, G. Chen, and H. Qi, “Algorithm design and approximation analysis on distributed robust game,” Journal of Systems Science and Complexity, vol. 36, no. 2, pp. 480–499, 2023.
  • [20] M. Fochesato, F. Fabiani, and J. Lygeros, “Generalized uncertain Nash games: Reformulation and robust equilibrium seeking,” in 2023 European Control Conference (ECC), 2023, pp. 1–6.
  • [21] F. Fele and K. Margellos, “Probably approximately correct Nash equilibrium learning,” IEEE Transactions on Automatic Control, vol. 66, no. 9, pp. 4238–4245, 2020.
  • [22] F. Fabiani, K. Margellos, and P. J. Goulart, “Probabilistic feasibility guarantees for solution sets to uncertain variational inequalities,” Automatica, vol. 137, p. 110120, 2022.
  • [23] G. Pantazis, F. Fele, and K. Margellos, “A priori data-driven robustness guarantees on strategic deviations from generalised Nash equilibria,” arXiv preprint arXiv:2304.05308, 2023.
  • [24] G. Calafiore and M. C. Campi, “Uncertain convex programs: randomized solutions and confidence levels,” Mathematical Programming, vol. 102, pp. 25–46, 2005.
  • [25] F. Facchinei and C. Kanzow, “Generalized Nash equilibrium problems,” Annals of Operations Research, vol. 175, no. 1, pp. 177–211, 2010.
  • [26] F. Facchinei and J.-S. Pang, Finite-Dimensional Variational Inequalities and Complementarity Problems.   Springer Science & Business Media, 2007.
  • [27] D. Bertsimas, V. Gupta, and I. C. Paschalidis, “Data-driven estimation in equilibrium using inverse optimization,” Mathematical Programming, vol. 153, pp. 595–633, 2015.
  • [28] M. C. Campi and S. Garatti, “The exact feasibility of randomized solutions of uncertain convex programs,” SIAM Journal on Optimization, vol. 19, no. 3, pp. 1211–1230, 2008.
  • [29] M. C. Campi, S. Garatti, and F. A. Ramponi, “A general scenario theory for nonconvex optimization and decision making,” IEEE Transactions on Automatic Control, vol. 63, no. 12, pp. 4067–4078, 2018.