Inverse learning of black-box aggregator for robust Nash equilibrium

Guanpu Chen, Gehui Xu, Fengxiang He, Dacheng Tao, Thomas Parisini, Karl Henrik Johansson G. Chen , G. Xu, and Karl H. Johansson are with Division of Decision and Control Systems, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, and also with Digital Futures, Stockholm 100 44, Sweden. (e-mail: [email protected], [email protected], [email protected])F. He is with Artificial Intelligence and its Applications Institute, School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, Scotland. (e-mail: [email protected])D. Tao is with the College of Computing & Data Science, Nanyang Technological University, Singapore 639798. (e-mail: [email protected])T. Parisini is with Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK, and also with Department of Engineering and Architecture, University of Trieste, Trieste 34127, Italy. (e-mail: [email protected])

Abstract

In this note, we investigate the robustness of Nash equilibria (NE) in multi-player aggregative games with coupling constraints. There are many algorithms for computing an NE of an aggregative game given a known aggregator. When the coupling parameters are affected by uncertainty, robust NE need to be computed. We consider a scenario where players’ weight in the aggregator is unknown, making the aggregator kind of “a black box”. We pursue a suitable learning approach to estimate the unknown aggregator by proposing an inverse variational inequality-based relationship. We then utilize the counterpart to reconstruct the game and obtain first-order conditions for robust NE in the worst case. Furthermore, we characterize the generalization property of the learning methodology via an upper bound on the violation probability. Simulation experiments show the effectiveness of the proposed inverse learning approach.

1 Introduction

Multi-player game-theoretical models have gained popularity as they offer a comprehensive understanding of interactions in multi-agent systems. Aggregative games play a particularly important role [1, 2, 3] in non-cooperative games, where each player’s payoff is dependent on both its action and an aggregator of all players’ weighted actions. Indeed, more and more aggregative games enjoy widespread applications, such as demand response management [4], congestion communication control [5], and public environmental investigation [6]. In this connection, many algorithms have been developed and deployed to compute a Nash equilibrium (NE) in aggregative games within several different scenarios [7, 8, 9, 10, 11].

In actual application contexts, though, uncertainty inevitably emerges, for example, in electric vehicle charging [12], security resource allocation [13], or moving target defense [14]. Therefore, the robustness of solutions is not an option but is a necessity. Robust game theory[15], where uncertainty arises in players’ payoffs or strategies, draws inspiration from robust optimization [16, 17].

More specifically, robust equilibrium seeking in multi-player games can be divided into two main categories. One consists in enforcing satisfaction of all uncertain feasibility [18, 19, 20]. This viewpoint originates from the deterministic robust optimization [16], in order to reveal the worst-case solution subject to all possible conditions. Usually, such approaches employ robust counterparts to reconstruct the problem against uncertainty. The other category is to address uncertainty with a high probability [21, 22, 23], stemming from scenario programming [24, 17], to randomly extract finite samples from uncertain feasibility and reconstruct the problem under their intersection. This viewpoint usually concerns how to find the supported samples.

Nonetheless, consider a practical scenario where a well-developed algorithm like [7, 8, 9, 10, 11] has been already deployed to compute NE given an aggregative game. The system automatically runs after deployment, and the algorithm returns an NE corresponding to the given parameters. In this setting, players’ weight in the aggregator turns out to be unnecessarily known to the public thus making the aggregator kind of “a black box”. When parameters in the system are affected by uncertainty, the equilibrium-seeking algorithm still returns an NE given a perturbed parameter, but robustness is lost. The internal knowledge of the game model is indispensable to achieve robustness. However, the black-box aggregator prevents us from directly using the existing methods [18, 19, 20, 21, 22, 23] to address uncertainty. A suitable learning approach to “disassemble” the black-box aggregator and estimate players’ weight therein is thus needed to obtain robust NE.

In this note, we focus on approaching robust NE in a class of aggregative games with uncertainty. We first formulate the game model under uncertainty and give the concept of robust NE in the worst case. Then, we consider the situation where the players’ weight in the aggregator is unknown to the public. We propose to address the robustness by recovering the black-box aggregator from data and reconstructing the problem from a robustness perspective.

The main contribution is threefold.

•

A learning method is proposed to estimate players’ weight in the black-box aggregator. By assembling perturbed parameters and computed NE into data, we obtain an inverse variational inequality relationship (Theorem 1). We employ a slack variable as the loss, the minimization of which enables to state an inverse optimization problem.
•

Through the counterpart, the robustness of NE is addressed by transforming the recovered aggregative game with uncertainty into a deterministic worst-case model (Theorem 2). We show first-order conditions of robust NE, making gradient-based approaches usable.
•

To characterize the learning performance, a generalization guarantee of the proposed method is provided by using the violation probability. A generalization bound indicates not only the independence of the uncertainty distribution, but also the exponential convergence as the dataset size increases (Theorem 3).

The note is organized as follows. Section II gives the problem formulation whereas Section III illustrates our learning approach. Section IV addresses the robustness and Section V presents the generalization aspects. Section VI shows the effectiveness of our methodology via extensive numerical results and Section VII gives a few concluding remarks.

2 Problem Formulation

In this section, we show the game model with uncertainty, the robustness of NE, and the problem statement.

2.1 Game Model with Uncertainty

Consider a multi-player aggregative game $\mathscr{G}$ , where players are indexed by $\mathcal{I}=\{1,\dots,N\}$ . For player $i\in\mathcal{I}$ , its strategy is given by the variable $x_{i}\in\mathbb{R}^{n}$ and the others’ strategies are collected in $\bm{x}_{-i}\in\mathbb{R}^{(N-1)\times n}$ . Let $\bm{x}\in\mathbb{R}^{Nn}$ stand for all players’ strategies. Player $i$ has a payoff function $f_{i}:\mathbb{R}^{Nn}\rightarrow\mathbb{R}$ . The map of an aggregator $\sigma:\mathbb{R}^{Nn}\rightarrow\mathbb{R}^{n}$ is defined by

\displaystyle\sigma(\bm{x})=\sum\limits_{i=1}^{N}\beta_{i}x_{i},

(1)

where $\beta_{i}\in\mathbb{R}$ corresponds to player $i$ ’s weight in the aggregator. Take $\bm{\beta}=(\beta_{1},\dots,\beta_{N})\in\mathbb{R}^{N}$ . For $i\in\mathcal{I}$ , let $J_{i}:\mathbb{R}^{n}\times\mathbb{R}^{n}\rightarrow\mathbb{R}$ be a continuously differentiable function and suppose that $J_{i}(x_{i},\sigma(\bm{x}))$ is convex in $x_{i}$ . In an aggregative game $\mathscr{G}$ , player $i$ ’s payoff function satisfies $f_{i}(x_{i},\bm{x}_{-i})=J_{i}(x_{i},\sigma(\bm{x}))$ .

We introduce the parameter $\bm{\alpha}=(\alpha_{1},\dots,\alpha_{N})$ with $\alpha_{i}\in\mathbb{R}^{n}$ . Given $\bm{x}_{-i}$ , the constraint set for $x_{i}$ is defined by

\displaystyle\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})=\{x_{i}\in\mathbb{R}^{n}_{+}:% ~{}\alpha_{i}^{T}x_{i}\leq b-\!\sum_{j\neq i,j=1}^{N}\!\!\!\alpha_{j}^{T}x_{j}\},

where $b$ is a scalar parameter. Take $\bm{\mathrm{A}}=\prod_{i=1}^{N}\mathrm{A}_{i}\subseteq\mathbb{R}^{Nn}$ to represent the uncertainty in parameter $\bm{\alpha}$ . Then, given others’ strategies $\bm{x}_{-i}$ , player $i$ solves the following problem:

	$\displaystyle\min_{x_{i}}\quad$	$\displaystyle J_{i}(x_{i},\sigma(\bm{x}))$		(2)
	$\displaystyle\mathrm{s.t.}\quad$	$\displaystyle x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i}),~{}{\bm{\alpha}\in% \bm{\mathrm{A}}.}$		(2)

The overall coupling constraint can be denoted by

\Omega_{\bm{\alpha}}=\{\bm{x}\in\mathbb{R}^{Nn}_{+}:~{}\sum_{i=1}^{N}\alpha_{i% }^{T}x_{i}\leq b\},

which means that players’ strategies are subject to a coupling constraint as resource allocation [8, 10]. The uncertainty in (2) indicates that the linear inequality in the coupling constraint $\Omega_{\bm{\alpha}}$ should be satisfied for all $\alpha_{i}\in{\mathrm{A}_{i}}$ , $i\in\mathcal{I}$ . Concretely, we investigate a typical uncertain feasibility

\mathrm{A}_{i}=\{\alpha_{i}\in\mathbb{R}^{n}:D_{i}\alpha_{i}\leq d_{i}\},\quad i% \in\mathcal{I},

where $D_{i}\in\mathbb{R}^{m_{i}\times n}$ is a matrix equipped with normalized rows and $d_{i}\in\mathbb{R}^{m_{i}}$ is a vector. In fact, $\mathrm{A}_{i}$ is a polyhedron enclosed by hyperplanes and the dimension $m_{i}$ reflects the number of hyperplanes.

Assumption 1

The constraint set $\Omega_{\bm{\alpha}}$ has a nonempty interior point under all the uncertainty $\bm{\alpha}\in\bm{\mathrm{A}}$ .

Notice that if Slater’s condition holds for all uncertain feasibility $\bm{\mathrm{A}}$ , it is also true with a fixed parameter $\bm{\alpha}$ .

2.2 Robustness of Nash Equilibrium

Given a fixed parameter $\bm{\alpha}\in\bm{\mathrm{A}}$ , minimizing the payoff subject to the coupling constraint $x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})$ turns out to be a deterministic problem. It is already widely studied in the past decade [18, 8, 4]. We take $\mathscr{G}_{\bm{\alpha}}$ as the deterministic aggregative game under a fixed $\bm{\alpha}$ . We first revisit the well-known definition of the generalized Nash equilibrium (GNE) in $\mathscr{G}_{\bm{\alpha}}$ [25].

Definition 1

A strategy profile $\bm{x}_{\bm{\alpha}}^{*}$ is a GNE of the aggregative game $\mathscr{G}_{\bm{\alpha}}$ if, for all $i\in\mathcal{I}$ , we have

\displaystyle f_{i}(x^{*}_{\bm{\alpha},i},\bm{x}^{*}_{\bm{\alpha},-i})\leq f_{% i}(x_{i},\bm{x}^{*}_{\bm{\alpha},-i}),~{}\forall x_{i}\in\Omega_{i,\bm{\alpha}% }(\bm{x}_{\bm{\alpha},-i}^{*}).

Definition 1 indicates that $\bm{x}_{\bm{\alpha}}^{*}$ is a GNE of $\mathscr{G}_{\bm{\alpha}}$ if no player can get a better payoff by modifying its strategy unilaterally. Then, the pseudo-gradient can be given by

	$\displaystyle\bm{F}(\bm{x})$	$\displaystyle=\begin{pmatrix}F_{1}(\bm{x})\\ \vdots\\ F_{N}(\bm{x})\end{pmatrix}=\begin{pmatrix}\nabla_{x_{1}}f_{1}(x_{1},\bm{x}_{-1% })\\ \vdots\\ \nabla_{x_{N}}f_{N}(x_{N},\bm{x}_{-N})\end{pmatrix}$		(9)
		$\displaystyle=\begin{pmatrix}\nabla_{x_{1}}J_{1}(\cdot,\sigma)+\beta_{1}\nabla% _{\sigma}J_{1}(x_{1},\cdot)\\ \vdots\\ \nabla_{x_{N}}J_{N}(\cdot,\sigma)+\beta_{N}\nabla_{\sigma}J_{N}(x_{N},\cdot)% \end{pmatrix}.$		(13)

With the pseudo-gradient information, we can establish the connection between GNE and the first-order condition in $\mathscr{G}_{\bm{\alpha}}$ . As we know, there exist various ways to seek a GNE. One of the most accepted approaches is to seek a variational GNE (vGNE), which requires a unified multiplier when deriving the Nash Karush-Kuhn-Tucker (KKT) condition [18, 25, 8]. Technically, the definition of a vGNE can be found as follows.

Definition 2

A strategy profile $\bm{x}^{*}_{\bm{\alpha}}$ is a vGNE of $\mathscr{G}_{\bm{\alpha}}$ if and only if there exists $\lambda^{*}\in\mathbb{R}$ such that, for all $i\in\mathcal{I}$ ,

	$\displaystyle\bm{0}_{n}\in$	$\displaystyle\nabla_{x_{i}}J_{i}(\cdot,\sigma(\bm{x}^{}_{\bm{\alpha}}))\!+\!% \beta_{i}\nabla_{\sigma}J_{i}(x^{}_{\bm{\alpha},i},\cdot)\!+\!\lambda^{}% \alpha_{i}\!+\!\mathcal{N}_{\mathbb{R}_{+}^{n}}(x^{}_{\bm{\alpha},i}),$
	$\displaystyle 0\geq$	$\displaystyle\sum_{i=1}^{N}\alpha_{i}^{T}x^{}_{\bm{\alpha},i}-b\perp\lambda^{% }.$

It follows from [25, Theorem 4.8] that with the convexity of payoffs $J_{i}$ and Assumption 1, a vGNE $\bm{x}^{*}_{\bm{\alpha}}$ of $\mathscr{G}_{\bm{\alpha}}$ is equivalent to a solution to a variational inequality (VI) problem VI $(\Omega_{\bm{\alpha}},\bm{F})$ , i.e., to find a vector $\bm{x}^{*}_{\bm{\alpha}}\in\Omega_{\bm{\alpha}}$ such that

\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}(\bm{x}-\bm{x}^{*}_{\bm{\alpha}})\geq 0,~{% }\forall\bm{x}\in\Omega_{\bm{\alpha}}.

(14)

In this view, many works are devoted to the algorithm design for seeking vGNE with coupling constraints [4, 8, 18, 19]. Here we do not restrict the monotonicity of $\bm{F}$ since we mainly focus on the equivalence between GNE and $\operatorname{VI}$ solutions. Details for existence and uniqueness can be found in [25, 26].

Now recall the uncertainty in game $\mathscr{G}$ . Clearly, different perturbed parameters $\bm{\alpha}$ from uncertain feasibility $\bm{\mathrm{A}}$ yield different vGNE $\bm{x}^{*}_{\bm{\alpha}}$ of a deterministic game $\mathscr{G}_{\bm{\alpha}}$ . Therefore, game $\mathscr{G}$ in (2) should be solved under all uncertain feasibility $\bm{\alpha}\in\bm{\mathrm{A}}$ , and we need a solution in the worst case. We introduce the following concept for the robust GNE.

Definition 3

A strategy profile $\bm{x}^{*}$ is a robust GNE (rGNE) of the aggregative game $\mathscr{G}$ if, for all $i\in\mathcal{I}$ , we have

f_{i}(x^{*}_{i},\bm{x}^{*}_{-i})\leq f_{i}(x_{i},\bm{x}^{*}_{-i}),~{}\forall x% _{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i}^{*}),~{}\forall\bm{\alpha}\in\bm{% \mathrm{A}}.

It is important to find an rGNE since it gives an acceptable solution for all players against the worst case [9]. There have been several works devoted to robust NE seeking in multi-player games. One viewpoint consists in enforcing satisfaction of all uncertainty [18, 19, 20]. It originates from deterministic robust optimization [16] to reveal the worst-case solution subject to all uncertainty. Another way is to satisfy the uncertainty with a high probability, stemming from scenario programming [24, 17]. Randomly extract finite samples from uncertain feasibility, and reconstruct the problem under their intersection [21, 22, 23]. This viewpoint usually concerns how to find the supported samples.

2.3 Problem Statement

The main goal of this paper is to compute an rGNE of the aggregative game $\mathscr{G}$ in (2) against uncertainty. Since robustness is required, it is significant to logically reconstruct the problem with the internal knowledge of the game model. However, conditions may not be always perfect in reality and we consider the following practical scenario.

Given a fixed parameter $\bm{\alpha}$ , suppose that a well-developed algorithm like [7, 8, 9, 10, 11] has been already deployed to compute a vGNE of $\mathscr{G}_{\bm{\alpha}}$ . The system automatically runs after deployment and the algorithm returns a vGNE with the given parameter. Such an input-output process, from the given parameter $\bm{\alpha}$ to the corresponding vGNE $\bm{x}^{*}_{\bm{\alpha}}$ , yields that an outsider does not need the internal knowledge of the system. As a result, some structures of the game model may not be accessible to an outsider. Here we focus on that players’ weight $\bm{\beta}$ in the aggregator $\sigma(\bm{x})=\sum_{i=1}^{N}\beta_{i}x_{i}$ becomes unknown to the public, which makes the aggregator $\sigma$ a black box.

When the parameter $\bm{\alpha}$ suffers uncertainty in $\bm{\mathrm{A}}$ , the vGNE-seeking algorithm still returns a vGNE according to the given condition but will lose robustness. What an outsider indeed needs is an rGNE, serving as a worst-case solution under all feasibility $\bm{\mathrm{A}}$ . As mentioned, the knowledge of the game model is indispensable to achieve robustness, since one needs the structure knowledge to reconstruct the problem against uncertainty. However, the black-box aggregator prevents us from directly using the existing methods [18, 19, 20, 21, 22, 23] to finish the job.

Hence, there should be a learning approach to disassemble the black-box aggregator $\sigma$ and recover players’ weight $\bm{\beta}$ before investigating the robustness. Recall what we have: perturbed parameters $\bm{\alpha}$ from uncertainty $\bm{\mathrm{A}}$ as inputs and corresponding vGNE $\bm{x}^{*}_{\bm{\alpha}}$ computed by the deployed solver as outputs. They constitute a data point $(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})$ . On this basis, a data-driven approach is required to estimate the black-box part before achieving robustness.

The problem to solve in this paper can be stated as follows.

Problem 1

Given data points $(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})$ composed of the perturbed parameters and the computed vGNE, develop a method to learn the black-box aggregator $\sigma$ and obtain an rGNE $\bm{x}^{*}$ of game $\mathscr{G}$ in (2) under uncertainty.

In terms of Problem 1, we will address the following three concerns in the sequel: i) to propose a novel learning method based on an inverse VI-based relationship; ii) to solve the robustness with respect to the worst-case situation; iii) to measure the generalization of our learning method.

3 Inverse Learning

In this section, we provide an inverse VI-based learning approach to reveal players’ weight $\bm{\beta}$ in the black-box aggregator $\sigma(\bm{x})$ . Recall the data $(\bm{\alpha},\bm{x}_{\bm{\alpha}}^{*})$ composed of a set of parameters $\bm{\alpha}\in\bm{\mathrm{A}}$ and corresponding vGNE $\bm{x}_{\bm{\alpha}}^{*}$ . In fact, the learning task is to recover the mapping from $\bm{\alpha}$ to $\bm{x}_{\bm{\alpha}}^{*}$ .

3.1 Inverse VI-based Relationship

Consider the VI-based relationship in (14). Given any fixed parameter $\bm{\alpha}\in\bm{\mathrm{A}}$ , a vGNE $\bm{x}_{\bm{\alpha}}^{*}$ of the deterministic game $\mathscr{G}_{\bm{\alpha}}$ serves as a solution to VI $(\Omega_{\bm{\alpha}},\bm{F})$ , that is, $\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}(\bm{x}-\bm{x}^{*}_{\bm{\alpha}})\geq 0,~{% }\forall\bm{x}\in\Omega_{\bm{\alpha}}.$ The following theorem indicates an inverse relation which will help construct the learning model.

Theorem 1

Consider the deterministic game $\mathscr{G}_{\bm{\alpha}}$ under a fixed parameter $\bm{\alpha}$ . Under Assumption 1, $\bm{x}^{*}_{\bm{\alpha}}$ is a vGNE of $\mathscr{G}_{\bm{\alpha}}$ if and only if there exists a scalar $\gamma\leq 0$ such that

	$\displaystyle\bm{F}(\bm{x}^{}_{\bm{\alpha}})^{T}\bm{x}^{}_{\bm{\alpha}}-\gamma b$	$\displaystyle\leq 0,$		(15)
	$\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}})-\gamma\alpha_{i}$	$\displaystyle\geq\bm{0}_{n},~{}\forall i\in\mathcal{I}.$		(15)

Proof. It follows from the expression in (14) that

\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq% \bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x},~{}\forall\bm{x}\in\Omega_{\bm{% \alpha}},

(16)

where we notice that $\bm{x}^{*}_{\bm{\alpha}}$ is a constant. (16) should be satisfied for all $\bm{x}\in\Omega_{\bm{\alpha}}$ , which means the inequality holds when the right-hand side takes on the minimum.

\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq% \min_{\bm{x}\in\Omega_{\bm{\alpha}}}\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}.

(17)

Hence, a new optimization problem arises:

\min_{\bm{x}\in\Omega_{\bm{\alpha}}}\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}.

(18)

Then, we investigate (18) from a duality perspective. Recalling the coupling constraint set $\Omega_{\bm{\alpha}}=\{\bm{x}\in\mathbb{R}^{Nn}_{+}:~{}\sum_{i=1}^{N}\alpha_{i% }^{T}x_{i}\leq b\}$ , the Lagrangian function can be designed as follows:

	$\displaystyle\mathcal{L}_{1}(\bm{x},\gamma,\bm{\mu})$	$\displaystyle=\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}-\gamma(\sum_{i=1}^{N}% \alpha_{i}^{T}x_{i}-b)+\sum_{i=1}^{N}\mu_{i}^{T}x_{i}$
		$\displaystyle=\sum_{i=1}^{N}(F_{i}(\bm{x}^{*}_{\bm{\alpha}})^{T}-\gamma\alpha_% {i}^{T}+\mu_{i}^{T})x_{i}+\gamma b,$

where the multipliers $0\geq\gamma\in\mathbb{R}$ and $\bm{0}_{nN}\geq\bm{\mu}=col\{\mu_{1},\dots,\mu_{N}\}\in\mathbb{R}^{nN}$ are employed for the inequality constraints $\sum_{i=1}^{N}\alpha_{i}^{T}x_{i}\leq b$ and the non-negative orthant $\bm{x}\in\mathbb{R}^{Nn}_{+}$ , respectively.

Under Assumption 1, given any fixed $\bm{\alpha}$ , the dual gap of $\mathcal{L}_{1}$ vanishes. Followed by the duality relation, the minimum of $\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}$ subject to $\bm{x}\in\Omega_{\bm{\alpha}}$ equals to

\displaystyle\max_{\gamma,\bm{\mu}\leq 0}~{}\gamma b\quad s.t.~{}F_{i}(\bm{x}^% {*}_{\bm{\alpha}})-\gamma\alpha_{i}+\mu_{i}=\bm{0}_{n},~{}\forall i\in\mathcal% {I}.

In fact, we can remove the multiplier $\bm{\mu}$ and thus simplify the expression of the above optimization as

\displaystyle\max_{\gamma\leq 0}~{}\gamma b\quad s.t.~{}F_{i}(\bm{x}^{*}_{\bm{% \alpha}})-\gamma\alpha_{i}\geq\bm{0}_{n},~{}\forall i\in\mathcal{I}.

(19)

So far, we derived the above dual problem (19) corresponding to the optimization $\min_{\bm{x}\in\Omega_{\bm{\alpha}}}\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}$ , and there is no duality gap due to Assumption 1. Hence, the inequality in (16), which has been equivalently transferred to the inequality in (17), can be further rewritten as

\displaystyle\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq% \max_{\gamma\in\Gamma_{\bm{\alpha}}}\gamma b,

(20)

where $\Gamma_{\bm{\alpha}}=\{\gamma\leq 0:~{}F_{i}(\bm{x}^{*}_{\bm{\alpha}})-\gamma% \alpha_{i}\geq\bm{0}_{n},~{}\forall i\in\mathcal{I}\}.$ Again, if the inequality in (20) needs to be satisfied with the right-hand side taking on the maximum, then we should merely ensure that there exists at least one feasible $\gamma\in\Gamma_{\bm{\alpha}}$ . Hence, the inequality in (20) finally becomes

\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}\bm{x}^{*}_{\bm{\alpha}}\leq\gamma b.

(21)

We combine the requirement (21) and $\Gamma_{\bm{\alpha}}$ , and obtain that if $\bm{x}^{*}_{\bm{\alpha}}$ is a vGNE of game $\mathscr{G}_{\bm{\alpha}}$ satisfying the relation in (16), then there exists $\gamma\leq 0$ leading to the consequence.

The reverse result can be proven similarly using weak duality properties since all the aforementioned conversions are equivalently conducted. $\square$

Most vGNE-seeking solvers compute a vGNE asymptotically since there is rarely a closed-form solution in complicated multi-player games. A numerical vGNE may not satisfy an exact solution, considering the computational errors and other biases. Thus, we relax (14) with a slack coefficient $\delta\geq 0$ such that $\bm{F}(\bm{x}^{*}_{\bm{\alpha}})^{T}(\bm{x}-\bm{x}^{*}_{\bm{\alpha}})+\delta% \geq 0,\forall\bm{x}\in\Omega_{\bm{\alpha}}.$ Accordingly, the relation (15) in Theorem 1 can be correspondingly revised. There exists $\gamma\leq 0$ such that

	$\displaystyle\bm{F}(\bm{x}^{}_{\bm{\alpha}})^{T}\bm{x}^{}_{\bm{\alpha}}-\gamma b$	$\displaystyle\leq\delta,$		(22)
	$\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}})-\gamma\alpha_{i}$	$\displaystyle\geq 0,~{}\forall i\in\mathcal{I}.$		(22)

Similar to the proof in Theorem 1, the modified relation in (22) can also be rigorously guaranteed. Here, if the mapping from $\bm{\alpha}$ to $\bm{x}_{\bm{\alpha}}^{*}$ is recovered well enough, then given $\bm{\alpha}$ , the prediction should also be exactly $\bm{x}_{\bm{\alpha}}^{*}$ . Hence, $\delta$ also stands for a role of loss, which means the more accurate the mapping, the smaller the loss between the prediction and the practical equilibrium point.

3.2 Data-driven Optimization

We show how to use the relation in (22) to design a learning approach for the unknown weight $\bm{\beta}$ in the black-box aggregator. We recall what knowledge we already have: 1) a data point composed of perturbed $\bm{\alpha}$ and the computed vGNE $\bm{x}^{*}_{\bm{\alpha}}$ ; 2) a relaxed relation with loss in (22). By the expressions of $\bm{\beta}$ in the pseudo-gradient $\bm{F}$ in (9), clearly, $\bm{F}$ belongs to a parametric family indexed by $\bm{\beta}$ . Thus, we can rewrite $\bm{F}(\bm{x})$ as $\bm{F}(\bm{x};\bm{\beta})$ to describe this dependence, and naturally suppose that $\bm{F}(\bm{x};\bm{\beta})$ is continuous in $\bm{\beta}$ .

On this basis, we design an inverse VI-based optimization. Here, the data point is $(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})$ , variables are the unknown weight $\bm{\beta}$ , the auxiliary variable $\gamma$ , and the slack variable $\delta$ as the loss. Hence,

$\displaystyle\min_{\bm{\beta},\gamma,\delta}\quad$	$\displaystyle\|\delta\|$	(23)
$\displaystyle\mathrm{s.t.}~{}$	$\displaystyle\bm{F}(\bm{x}^{}_{\bm{\alpha}};\bm{\beta})^{T}\bm{x}^{}_{\bm{% \alpha}}-\gamma b\leq\delta,~{}\gamma\leq 0,$
	$\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}};\bm{\beta})-\gamma\alpha_{i}\geq 0% ,~{}\forall i\in\mathcal{I}.$

We turn back to robust considerations for the uncertain feasibility $\bm{\mathrm{A}}$ . Note that (23) is derived under a fixed $\bm{\alpha}$ , but the problem cannot be well-solved with only one data point $(\bm{\alpha},\bm{x}_{\bm{\alpha}}^{*})$ . Fortunately, uncertainty occurs in the parameter $\bm{\alpha}\in\bm{\mathrm{A}}$ and provides enough data. Once the system is perturbed, there emerges a new-extracted parameter $\bm{\alpha}$ . Then, a well-deployed vGNE-seeking solver produces a corresponding numerical solution $\bm{x}^{*}_{\bm{\alpha}}$ , and $(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})$ will serve as a new data point satisfying the relation (22).

Therefore, we use index $k$ to represent the $k$ th uncertain condition and regard $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})$ as the $k$ th data point. Also, take $\gamma[k]$ and $\delta[k]$ as the $k$ th variables to be optimized together according to (23). We take $\bm{\delta}=col\{\delta[1],\cdots\,\delta[M]\}$ , $\bm{\gamma}=col\{\gamma[1],\cdots,\gamma[M]\}$ , and $\bm{\beta}=\{\beta_{1},\dots,\beta_{N}\}$ as all variables. With well-defined data and variables, we can finally design a data-based learning approach:

$\displaystyle\min_{\bm{\beta},\bm{\gamma},\bm{\delta}}~{}$	$\displaystyle\\|\bm{\delta}\\|$	(24)
$\displaystyle\mathrm{s.t.}~{}$	$\displaystyle\bm{F}(\bm{x}^{}_{\bm{\alpha}[k]};\bm{\beta})^{T}\bm{x}^{}_{\bm% {\alpha}[k]}-\gamma[k]b\leq\delta[k],$
	$\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}[k]};\bm{\beta})-\gamma[k]\alpha_{i% }[k]\geq 0,\quad\forall i\in\mathcal{I},$
	$\displaystyle\gamma[k]\leq 0,~{}k=1,\dots,M.$

Remark 1

First, we do not request the uniqueness of vGNE $\bm{x}^{*}_{\bm{\alpha}}$ in $\mathscr{G}_{\bm{\alpha}}$ . The learning still works as long as $\bm{x}^{*}_{\bm{\alpha}}$ satisfies the inequality (5), even if the payoffs might yield multiple equilibria [27]. Second, the optimal solution to (24) should exist, but is not necessarily unique. The tie can be broken by selecting the one with the minimal $l_{2}$ -norm among optimal solutions. Third, the norm in the objective of (24) is not restricted and can be determined by concrete conditions.

Remark 2

If the necessary convexity can be maintained, the learning approach is capable of being extended for nonlinear aggregators in $\bm{x}$ , that is, $\sigma(\bm{x})=\sum_{i=1}^{N}\beta_{i}g_{i}(x_{i})$ . Such a form of an aggregator actually still maintains explicit parametric properties in variable $\bm{\beta}$ . As for more general cases, for example $\sigma(\bm{x})=\sum_{i=1}^{N}g_{i}(x_{i})$ , the learning approach (24) based on parametric estimation may fail. Some non-parametric learning approaches like kernel methods or neural networks would help in learning $g_{i}$ .

We provide a typical case study for interpretation.

Example 1

Consider an aggregative game with $N$ electricity users in the demand of energy consumption problem [4, 8]. User $i$ adopts $x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})$ as the energy consumption and aims to minimize its electricity cost $J_{i}(x_{i},\sigma(\bm{x}))=l_{i}(x_{i}-m_{i})^{2}+P(\sigma(\bm{x}))x_{i}$ , where $l_{i}$ and $m_{i}$ are constants of energy consumption, and $P=qN\sigma(\bm{x})+p_{0}$ with $\sigma(\bm{x})=\sum_{i=1}^{N}\beta_{i}x_{i}$ . Then learning in (24) can be expressed as follows:

	$\displaystyle\min_{\bm{\beta},\bm{\gamma},\bm{\delta}}~{}$	$\displaystyle\\|\bm{\delta}\\|$
	$\displaystyle\mathrm{s.t.}~{}$	$\displaystyle\sum_{i\in\mathcal{I}}\big{(}2l_{i}(x^{}_{i,\bm{\alpha}[k]}\!-\!% m_{i})\!+\!2qN\beta_{i}x^{}_{i,\bm{\alpha}[k]}\!$
		$\displaystyle\quad+\!qN\!\!\!\!\!\sum_{j\in\mathcal{I},j\neq i}\!\!\!\beta_{j}% x^{}_{j,\bm{\alpha}[k]}\!+\!p_{0}\big{)}x^{}_{i,\bm{\alpha}[k]}\!-\!\gamma[k% ]b\!\leq\!\!\delta[k],$
		$\displaystyle 2l_{i}(x^{}_{i,\bm{\alpha}[k]}\!-\!m_{i})\!+\!2qN\beta_{i}x^{}% _{i,\bm{\alpha}[k]}$
		$\displaystyle\quad+\!qN\!\!\!\!\!\sum_{j\in\mathcal{I},j\neq i}\!\!\!\!\beta_{% j}x^{*}_{j,\bm{\alpha}[k]}\!+\!p_{0}\!-\!\gamma[k]\alpha_{i}[k]\!\geq\!0,~{}% \forall i\!\in\!\mathcal{I},$
		$\displaystyle\gamma[k]\leq 0,~{}k=1,\dots,M.$

It is a solvable optimization problem with linear constraints.

4 GNE Robustness

In this section, we address the robustness of GNE with the recovered knowledge by considering the worst-case solution. We introduce some new notations after learning. Take $\hat{\sigma}(\bm{x})=\sum_{i=1}^{N}\hat{\beta}_{i}x_{i}\in\mathbb{R}^{n}$ for the aggregator, where $\hat{\bm{\beta}}=\{\hat{\beta}_{1},\dots,\hat{\beta}_{N}\}$ is revealed by (24). Accordingly, take $\widehat{\mathscr{G}}$ to represent the game model with uncertainty after learning, i.e., each player $i$ has to solve the following problem:

\displaystyle\min_{x_{i}}~{}J_{i}(x_{i},\hat{\sigma}(\bm{x}))~{}\mathrm{s.t.}~% {}x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i}),~{}{\bm{\alpha}\in\bm{\mathrm{A}% }.}

(25)

Then, we seek an rGNE of game $\widehat{\mathscr{G}}$ in the worst case, that is, the robustness of Nash equilibrium in (25) satisfying all possible uncertainties in the feasibility $\bm{\mathrm{A}}$ . In this view, we consider transforming the uncertain problem (25) into a deterministic model. By the virtue of deterministic robust optimization, the following theorem shows how to construct a robust counterpart.

Theorem 2

Under Assumption 1, a strategy profile $\bm{x}^{*}$ is an rGNE of game $\widehat{\mathscr{G}}$ (25) if and only if there exists $\bm{y}^{*}=col\{y_{1}^{*},\dots,y_{N}^{*}\}$ with $y_{i}^{*}\in\mathbb{R}^{m_{i}}$ such that $(\bm{x}^{*},\bm{y}^{*})$ is a GNE of the following deterministic game:

	$\displaystyle\min_{x_{i}\in\mathbb{R}^{n}_{+},y_{i}\in\mathbb{R}_{+}^{m_{i}}}$	$\displaystyle J_{i}(x_{i},\hat{\sigma}(\bm{x}))$		(26)
	$\displaystyle\mathrm{s.t.}\quad$	$\displaystyle\sum_{i=1}^{N}y_{i}^{T}d_{i}\leq b,~{}D_{i}^{T}y_{i}\!-\!x_{i}=% \bm{0}_{n},~{}\forall i\in\mathcal{I}.$		(26)

Proof. Recall the expression of the coupling constraint $\Omega_{\bm{\alpha}}$ under all uncertain feasibility $\bm{\alpha}\in\bm{\mathrm{A}}$ . If all possible constraints hold in (25), then it can be equivalently regarded as

\displaystyle\max_{\bm{\alpha}\in\bm{\mathrm{A}}}\sum_{i=1}^{N}\alpha_{i}^{T}x% _{i}=\sum_{i=1}^{N}\max\limits_{\alpha_{i}\in\mathrm{A}_{i}}\alpha_{i}^{T}x_{i% }\leq b.

(27)

We find a sub-optimization problem $\max\limits_{\alpha_{i}\in\mathrm{A}_{i}}\alpha_{i}^{T}x_{i}$ with the feasibility structure $\mathrm{A}_{i}=\{\alpha_{i}\in\mathbb{R}^{n}:D_{i}\alpha_{i}\leq d_{i}\}$ .

\displaystyle\max_{\alpha_{i}}\alpha_{i}^{T}x_{i}\quad\mathrm{s.t.}~{}D_{i}% \alpha_{i}\leq d_{i}.

(28)

Notice that here $x_{i}$ serves as a constant while $\alpha_{i}$ is variable. Accordingly, design a Lagrangian function with an auxiliary multiplier $y_{i}\in\mathbb{R}^{m_{i}}_{+}$ .

	$\displaystyle\mathcal{L}_{2}(\alpha_{i},y_{i})$	$\displaystyle=-\alpha_{i}^{T}x_{i}+y_{i}^{T}(D_{i}\alpha_{i}-d_{i})$
		$\displaystyle=(y_{i}^{T}D_{i}-x_{i}^{T})\alpha_{i}-y_{i}^{T}d_{i}.$

By Assumption 1, the polyhedron $\mathrm{A}_{i}$ is nonempty and the Slater’s condition is therefore satisfied in problem (28). Then, the duality gap vanishes in the Lagrangian function $\mathcal{L}_{2}$ . It follows from the duality theory that the maximum in (28) can be equivalently described by the following minimum

\displaystyle\min_{y_{i}}~{}y_{i}^{T}d_{i}\quad\mathrm{s.t.}~{}D_{i}^{T}y_{i}-% x_{i}=\bm{0}_{n},~{}y_{i}\geq\bm{0}_{m_{i}}.

(29)

Hence, the inequality relation (27) in the worst case becomes

\displaystyle\sum_{i=1}^{N}\min_{y_{i}\in Y_{i}}~{}

\displaystyle y_{i}^{T}d_{i}\leq b,

(30)

where the constraint $Y_{i}=\{y_{i}\in\mathbb{R}_{+}^{m_{i}}:\;D_{i}^{T}y_{i}-x_{i}=\bm{0}_{n}\}$ . It follows from [16] that, the minimum on the left-hand side in (30) can be removed. In fact, if there exists at least one qualified profile $\bm{y}=col\{y_{1},\dots,y_{N}\}$ , then the minimum will be naturally verified. Hence, (30) can be rewritten as

\displaystyle\sum_{i=1}^{N}y_{i}^{T}d_{i}\leq b,~{}y_{i}\in Y_{i},~{}\forall i% \in\mathcal{I}.

(31)

Thus, together with the definition of set $Y_{i}$ for $i\in\mathcal{I}$ , the coupling constraint in (25) under all feasibility $\bm{\alpha}\in\bm{\mathrm{A}}$ can be reformulated by the following deterministic constraints:

\displaystyle\sum_{i=1}^{N}y_{i}^{T}d_{i}\leq b,~{}D_{i}^{T}y_{i}-x_{i}=\bm{0}% _{n},~{}\forall i\in\mathcal{I},

(32)

where $x_{i}\in\mathbb{R}^{n}_{+}$ and $y_{i}\in\mathbb{R}_{+}^{m_{i}}$ . That is exactly (26).

The reverse proof can be conducted similarly when $(\bm{x}^{*},\bm{y}^{*})$ satisfies the deterministic formulation in (26), because all the above procedures are equivalent transformations. $\square$

Theorem 2 shows that, by introducing the auxiliary variable $\bm{y}$ , the uncertain game $\hat{\mathscr{G}}$ can be transformed into a deterministic one in (26). On this basis, we can obtain the first-order condition of (26). Technically, a strategy $(\bm{x}^{*},\bm{y}^{*})$ is a GNE of (26) (an rGNE of $\hat{\mathscr{G}}$ in (25)) if and only if there exists $\mu^{*}\in\mathbb{R}_{+}$ and $\bm{\omega}^{*}=col\{\omega_{i}^{*}\}_{i=1}^{N}\in\mathbb{R}^{m}$ such that, for $i\in\mathcal{I}$ ,

		$\displaystyle\bm{0}_{n}\in\nabla_{x_{i}}J_{i}(\cdot,\hat{\sigma}(\bm{x}^{}))% \!+\!\hat{\beta}_{i}\nabla_{\hat{\sigma}}J_{i}(x^{}_{i},\cdot)\!-\!\omega_{i}% ^{}\!+\!\mathcal{N}_{\mathbb{R}_{+}^{n}}(x^{}_{i}),$		(33)
		$\displaystyle\bm{0}_{m_{i}}\in\mu^{}d_{i}+D_{i}\omega_{i}^{}+\mathcal{N}_{% \mathbb{R}_{+}^{m_{i}}}(y^{*}_{i}),$
		$\displaystyle 0\geq\sum_{i=1}^{N}d_{i}^{T}y^{}_{i}-b\perp\mu^{},$
		$\displaystyle\bm{0}_{n}=D_{i}^{T}y^{}_{i}-x_{i}^{}.$

Now we can say that, after learning the black-box aggregator and deriving the robust counterpart, we are finally able to seek an rGNE of the original game $\mathscr{G}$ in (2) via the above first-order condition. There is no need for a detailed derivation of the solver design since such designs have been already given in many developed works [7, 8, 9, 10, 11].

For the sake of simplicity, we put together all the procedures in Algorithm 1 to compute an rGNE within a black-box aggregative game problem (2).

Algorithm 1

Initialization: aggregator $\sigma(\bm{x})$ , payoff functions $J_{i}(x_{i},\sigma(\bm{x}))$ , coupling constraint $\Omega_{\bm{\alpha}}$ , all uncertain feasibility $\bm{\mathrm{A}}$ , and a well-developed vGNE-seeking solver. 1) Data Construction

for $k=1,2,\dots,M$ , do

Input: uncertain $\alpha_{i}[k]\in\mathrm{A}_{i},~{}i\in\mathcal{I}$ ,

Solve: deterministic game $\mathscr{G}_{\bm{\alpha}[k]}$ by the existing vGNE-

seeking solver,

Output: vGNE $\bm{x}^{*}_{\bm{\alpha}[k]}$ of game $\mathscr{G}_{\bm{\alpha}[k]}$ .

end for

2) Inverse learning

Input: all data $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]}),~{}k=1,2,\dots,M$ ,

Solve: inverse VI-based learning approach in (24),

Return: estimated weight $\hat{\bm{\beta}}$ in black-box aggregator

3) Robust Counterpart

Solve: the deterministic game $\hat{\mathscr{G}}$ in (26) with auxiliary

variables $y_{i},i\in\mathcal{I}$ and estimated weight $\hat{\bm{\beta}}$ ,

Return: rGNE $\bm{x}^{*}$ of game $\mathscr{G}$ .

5 Generalization Guarantee

In this section, we address the generalization capabilities of our learning approach (24). In fact, an important aspect of a learning method is its ability to generalize to real data, that is, whether its estimation can perform well on new data.

5.1 Violation Probability

In (24), the generalization guarantee refers to whether the optimal solution with the used data $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})$ , $k=1,2,\dots,M$ , can truly represent all cases under the uncertain feasibility $\bm{\mathrm{A}}$ . In fact, no matter what sampling (referring to $\bm{\alpha}\in\bm{\mathrm{A}}$ ) is taken, it is unrealistic to fully represent the entire uncertain domain. Therefore, the optimal solution we learn by (24) may merely be applicable to those given samplings $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})$ , $k=1,2,\dots,M$ , and the optimal solution $\bm{\beta}$ may not be optimal under all uncertain feasibility. In other terms, there may be a new sampled data point $(\bm{\alpha}[M+1],\bm{x}^{*}_{\bm{\alpha}[M+1]})$ for which the previously obtained optimal solution by (24) is no longer valid.

Hence, we investigate the above fact using the following violation probability concept [24]. For convenience, we rewrite the constraints in problem (23) as follows.

	$\displaystyle\mathcal{X}(\bm{\alpha},\bm{x}^{*}_{\bm{\alpha}})\triangleq\{% \delta,\bm{\beta}:~{}\exists\gamma,~{}\mathrm{s.t.}$	$\displaystyle~{}\bm{F}(\bm{x}^{}_{\bm{\alpha}};\bm{\beta})^{T}\bm{x}^{}_{\bm% {\alpha}}-\gamma b\leq\delta,$
		$\displaystyle F_{i}(\bm{x}^{*}_{\bm{\alpha}};\bm{\beta})-\gamma\alpha_{i}\geq 0% ,~{}\forall i\in\mathcal{I}\}.$

Definition 4

Given $(\bm{\beta},\delta)$ , the violation probability is

\displaystyle\mathbb{V}(\bm{\beta},\delta)\triangleq\mathbb{P}\{\bm{\alpha}\in% \bm{\mathrm{A}},~{}(\bm{\beta},\delta)\notin\mathcal{X}(\bm{\alpha},\bm{x}^{*}% _{\bm{\alpha}})\}.

(34)

Then, the generalization guarantee takes on the form: given $\epsilon\in(0,1)$ , evaluate the upper bound of $\mathbb{P}^{M}(\mathbb{V}(\bm{\beta}^{*},\delta^{*})>\epsilon),$ where $(\bm{\beta}^{*},\delta^{*})$ is the optimal solution via the learning approach (24) with data $\{(\bm{\alpha}[1],\bm{x}^{*}_{\bm{\alpha}[1]}),\cdots,(\bm{\alpha}[M],\bm{x}^{% *}_{\bm{\alpha}[M]})\}$ . In qualitative forms, if $\epsilon$ is small enough and the upper bound in $\mathbb{P}$ is also suitably small, then the learning method turns out to be reliable. If so, when we employ the inverse VI-based learning approach (24) to estimate the players’ weight information in the black-box aggregator, the confidence of the learning result is high. Otherwise, the learning solution may overfit and become too dependent on the samplings of each uncertain data point.

Remark 3

Generally, one would focus on the expectation of the loss $\delta$ for the generalization error. With the definition of violation probability, we learn that $\mathbb{V}$ monotonically approaches zero as the value of $\delta$ increases. Besides the consensus in the monotonicity of loss, we choose to investigate $\mathbb{V}$ because of its representation on the solution region of $\beta$ and $\delta$ , providing a more interpretable result.

5.2 Generalization Bound

With the above statements, we consider that dataset $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})$ with $k=1,\dots,M$ is considered as random samplings extracted from the uncertain feasibility $\bm{\mathrm{A}}$ subject to a distribution $\mathbb{Q}$ , whose exact form does not need to be known. We provide the following main result on the generalization bound of the inverse VI-based learning approach (24).

Theorem 3

Under Assumption 1, suppose that the constraint $\mathcal{X}(\bm{\alpha},\bm{x}^{*})$ is convex in $\bm{\beta}$ and the optimal solution $(\bm{\beta}^{*},\delta^{*})$ does exist. Then, for any $\epsilon\in(0,1)$ , we have:

\displaystyle\mathbb{P}^{M}(\mathbb{V}(\delta^{*},\bm{\beta}^{*})>\epsilon)% \leq\sum_{l=0}^{N}\begin{pmatrix}M\\ l\end{pmatrix}\epsilon^{l}(1-\epsilon)^{M-l}.

(37)

Proof. By Remark 1, the objective function in (23) or (24) is not restricted to specific norms. We take $\|\bm{\delta}\|_{\infty}$ to convert the optimization (23) or (24) into programmings based on the computation of convex intersections:

\displaystyle\min_{\delta\geq 0,\bm{\beta}}~{}\delta\quad\mathrm{s.t.}~{}(% \delta,\bm{\beta})\in\bigcap_{k=1}^{M}\mathcal{X}(\bm{\alpha}[k],\bm{x}^{*}_{% \bm{\alpha}[k]}).

(38)

We can verify that constraint $\mathcal{X}(\bm{\alpha},\bm{x}^{*})$ is convex in all the variables due to the convexity in $\bm{\beta}$ . At this point, we have transformed the problem of obtaining the optimal parameters by data points from an uncertain set into a standard form of scenario programming [17].

On the one hand, for random programming (38) with given samplings, the intersection of constraints is feasible and endowed with a nonempty interior point due to Slater’s condition in Assumption 1.2. On the other hand, the uniqueness of the optimal solution is not a mandatory requirement. According to the operation in Remark 1 and [28, Discussion 2.1.5], the tie can be broken by selecting the one with the minimum Euclidean norm among all optimal solutions, which can still maintain the conclusion.

So far, the formulation (38) is in accordance with [28, Theorem 1], and we keep up with all the same assumed conditions. Then, the number of sums in (37) should originally be related to the dimension of all variables $\delta,\bm{\beta}$ , that is, $dim(\delta,\bm{\beta})$ . Note that $\delta$ is a scalar and the dimension of $\bm{\beta}$ is the number of players in $\mathcal{I}$ . The conclusion holds. $\square$

Theorem 3 tells that the generalization bound in (37) is independent of the sampling distribution $\mathbb{Q}$ of the uncertain parameter $\bm{\alpha}$ . Taking $\Delta=\sum_{l=0}^{N}\begin{pmatrix}M\\ l\end{pmatrix}\epsilon^{l}(1-\epsilon)^{M-l}$ , we give some detailed explanations for Theorem 3 from different viewpoints.

i). Generalization perspective: With probability $\Delta$ for sampling, the violation probability $\mathbb{V}$ of the learning optimal solution $(\delta^{*},\bm{\beta}^{*})$ is at most $\epsilon$ . We can interpret this statement from “new-data” scenarios. The conclusion answers the following question: for a fixed learning pair $(\delta^{*},\bm{\beta}^{*})$ with $M$ data, how much confidence can one hold that the pair is still the optimal solution with the least probability $1-\epsilon$ when a new data $(\bm{\alpha}[M+1],\bm{x}^{*}_{{}_{\bm{\alpha}}[M+1]})$ comes? Then the answer is that one can keep the confidence at least $1-\Delta$ .

ii). Data-size perspective: A direct application of Theorem 3 is to find the smallest data size $M$ for given violation parameter $\epsilon$ and confidence threshold $\Delta$ . The issue can be handled by solving the equation $\Delta=\sum_{l=0}^{N}\begin{pmatrix}M\\ l\end{pmatrix}\epsilon^{l}(1-\epsilon)^{M-l}$ and regarding $M$ as a variable. We will provide Tab.1 in Section 6 to further illustrate the values $\Delta$ for given $\epsilon$ , $M$ , and $N$ . Actually, $\Delta$ corresponds to the tail probability of a binomial distribution in $M$ , which exponentially converges.

Remark 4

Under the assumed convex conditions in Theorem 3, to assess the deviation probability $\mathbb{V}$ , it’s necessary to compute which samples are supporting under the given $M$ uncertainties. There is already mature research on how to compute support samples, and the detailed progress can be found in [22, 29]. Moreover, the setup in (2) involves a well-deployed system encountering passive parameter perturbations and needs to recover the black-box part through a learning method with data. If there are potential mechanisms for acquiring more effective data, we believe that the performance of inverse learning approach (24) can be further improved.

6 Numerical Evaluation

We consider an aggregative game with $N=4$ electricity users in demand of energy consumption [4, 8], as introduced in Example 1. For $i=1,2,3,4$ ,

\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})=\{x_{i}\in\mathbb{R}_{+}:~{}\alpha_{i}^{T}% x_{i}\leq 75-\!\!\!\!\sum_{j\neq i,j=1}^{4}\alpha_{j}^{T}x_{j}\},

where the uncertain parameter $\alpha_{i}\in[0.1,2]$ . User $i$ adopts $x_{i}\in\Omega_{i,\bm{\alpha}}(\bm{x}_{-i})$ to minimize its electricity cost

J_{i}(x_{i},\sigma(\bm{x}))=l_{i}(x_{i}-h_{i})^{2}+P(\sigma(\bm{x}))x_{i},

where $P(\sigma(\bm{x}))=qN\sigma(\bm{x})+p_{0}$ and the system coefficients are set as $l_{i}=1$ , $h_{1}=50$ , $h_{2}=55$ , $h_{3}=60$ , $h_{4}=65$ , $q=0.04$ , and $p_{0}=5$ . Note that the real value of players’ weight in the aggregator is $\bm{\beta}=\{0.1,0.2,0.3,0.4\}$ .

Due to the uncertain system, we first construct the dataset $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})$ from $k=1,\dots,M$ samplings, where the equilibrium-seeking methods refer to [7, 8, 9, 10, 11]. Then with all data $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})$ as input, we employ the inverse VI-based learning approach (24) to estimate players’ weight $\bm{\beta}$ in the black-box aggregator $\sigma(\bm{x})$ , as similarly illustrated in Example 1. Finally, based on learning results, we seek rGNE $\bm{x}^{*}$ by the first-order condition (33). We set $M=4$ and Figs. 1, 2 show the learning results. In Fig. 1 the blue bars present the true weight $\bm{\beta}$ for each player while the red bars present the estimated value $\hat{\bm{\beta}}$ by the learning approach (24). In detail, we get $\hat{\bm{\beta}}=\{0.0893,0.1907,0.2918,0.3926\}$ , which shows a good learning performance close to the true value. This verifies the validity of the VI-based inverse learning approach (24). Afterward, we can continue to compute rGNE $\bm{x}^{*}$ with the estimated $\hat{\bm{\beta}}$ , which is illustrated in Fig. 2.

Refer to caption — Figure 1: Learning performance with data amounts $M=4$ .

We further check the performance of the learning approach (24) with different amounts of data points $(\bm{\alpha}[k],\bm{x}^{*}_{\bm{\alpha}[k]})$ . We give the following mean estimated error

\operatorname{MEE}=\frac{1}{N}\Big{(}{\sum_{i=1}^{N}\|\hat{\beta_{i}}-{\beta_{% i}}\|^{2}}\Big{)}^{1/2},

and take $M=2,3,4,\dots,10$ to record MEE of each setting in Fig. 3. The trend of MEE becomes obviously mild and close enough to zero as the amount of data increases.

TABLE I: Data size, learning error, generalization bound

	$M=10$	$M=20$	$M=30$	$M=40$	$M=50$
MEE	0.0004	0.0001	3.1628 $*10^{-5}$	2.4306 $*10^{-5}$	2.4253 $*10^{-5}$
$\Delta$	0.9983	0.9568	0.8245	0.6290	0.4312
	$M=60$	$M=70$	$M=80$	$M=90$	$M=100$
MEE	2.0139 $*10^{-5}$	1.8724 $*10^{-5}$	1.5397 $*10^{-5}$	1.2256 $*10^{-5}$	1.0713 $*10^{-5}$
$\Delta$	0.1710	0.1588	0.0880	0.0465	0.0237

Finally, we greatly improve the numerical accuracy and provide Tab. I to show some numerical relation between the data amounts, learning errors MEE, and generalization bound $\Delta$ . Here, take $N=4$ players, $M$ as the variant amount of data, and $\epsilon=0.1$ for violation probability. Thus, the generalization bound should be $\Delta=\sum_{l=0}^{4}\begin{pmatrix}M\\ l\end{pmatrix}{0.1}^{l}\cdot{0.9}^{M-l}.$ We can see from Table I that as the dataset size increases, the value of MEE decreases, while the value of $\Delta$ goes rapidly (exponentially) to zero. This can also be regarded as a tradeoff between accuracy and confidence. These figures support the results in Theorem 3 and the associated discussions.

7 Conclusions

In this note, we proposed a novel learning scheme to seek the robust equilibrium with players’ unknown weight in a black-box aggregator. We put together the data sets with two parts: perturbed parameters from uncertain feasibility and corresponding NE by developed solvers. We established the learning model by an inverse variational inequality relation. Then, we derived the robust counterpart thus obtaining the first-order conditions for robust generalized Nashe quilibria. Also, we showed a generalization guarantee of the proposed learning approach. The numerical results presented good performances and effectiveness of our methodology.

References

[1] D. Paccagnan, B. Gentile, F. Parise, M. Kamgarpour, and J. Lygeros, “Distributed computation of generalized Nash equilibria in quadratic aggregative games with affine coupling constraints,” in 2016 IEEE 55th Conference on Decision and Control (CDC). IEEE, 2016, pp. 6123–6128.
[2] J. Lei, U. V. Shanbhag, and J. Chen, “Distributed computation of Nash equilibria for monotone aggregative games via iterative regularization,” in 2020 59th IEEE Conference on Decision and Control (CDC). IEEE, 2020, pp. 2285–2290.
[3] S. Huang, J. Lei, and Y. Hong, “A linearly convergent distributed Nash equilibrium seeking algorithm for aggregative games,” IEEE Transactions on Automatic Control, vol. 68, no. 3, pp. 1753–1759, 2023.
[4] M. Ye and G. Hu, “Game design and analysis for price-based demand response: An aggregate game approach,” IEEE Transactions on Cybernetics, vol. 47, no. 3, pp. 720–730, 2016.
[5] J. Barrera and A. Garcia, “Dynamic incentives for congestion control,” IEEE Transactions on Automatic Control, vol. 60, no. 2, pp. 299–310, 2014.
[6] R. Cornes, “Aggregative environmental games,” Environmental and Resource Economics, vol. 63, no. 2, pp. 339–365, 2016.
[7] J. Koshal, A. Nedić, and U. V. Shanbhag, “Distributed algorithms for aggregative games on graphs,” Operations Research, vol. 64, no. 3, pp. 680–704, 2016.
[8] S. Liang, P. Yi, and Y. Hong, “Distributed Nash equilibrium seeking for aggregative games with coupled constraints,” Automatica, vol. 85, pp. 179–185, 2017.
[9] F. Fabiani, K. Margellos, and P. J. Goulart, “On the robustness of equilibria in generalized aggregative games,” in 2020 59th IEEE Conference on Decision and Control (CDC). IEEE, 2020, pp. 3725–3730.
[10] G. Belgioioso, A. Nedić, and S. Grammatico, “Distributed generalized Nash equilibrium seeking in aggregative games on time-varying networks,” IEEE Transactions on Automatic Control, vol. 66, no. 5, pp. 2061–2075, 2020.
[11] G. Xu, G. Chen, H. Qi, and Y. Hong, “Efficient algorithm for approximating Nash equilibrium of distributed aggregative games,” IEEE Transactions on Cybernetics, vol. 53, no. 7, pp. 4375–4387, 2023.
[12] H. Yang, X. Xie, and A. V. Vasilakos, “Noncooperative and cooperative optimization of electric vehicle charging under demand uncertainty: A robust stackelberg game,” IEEE Transactions on Vehicular Technology, vol. 65, no. 3, pp. 1043–1058, 2015.
[13] M. E. Nikoofal and J. Zhuang, “Robust allocation of a defensive budget considering an attacker’s private information,” Risk Analysis: An International Journal, vol. 32, no. 5, pp. 930–943, 2012.
[14] Z. Cheng, G. Chen, and Y. Hong, “Single-leader-multiple-followers stackelberg security game with hypergame framework,” IEEE Transactions on Information Forensics and Security, vol. 17, pp. 954–969, 2022.
[15] M. Aghassi and D. Bertsimas, “Robust game theory,” Mathematical Programming, vol. 107, no. 1-2, pp. 231–273, 2006.
[16] D. Bertsimas, D. B. Brown, and C. Caramanis, “Theory and applications of robust optimization,” SIAM Review, vol. 53, no. 3, pp. 464–501, 2011.
[17] G. C. Calafiore and M. C. Campi, “The scenario approach to robust control design,” IEEE Transactions on Automatic Control, vol. 51, no. 5, pp. 742–753, 2006.
[18] G. Chen, Y. Ming, Y. Hong, and P. Yi, “Distributed algorithm for $\varepsilon$ -generalized Nash equilibria with uncertain coupled constraints,” Automatica, vol. 123, p. 109313, 2021.
[19] G. Xu, G. Chen, and H. Qi, “Algorithm design and approximation analysis on distributed robust game,” Journal of Systems Science and Complexity, vol. 36, no. 2, pp. 480–499, 2023.
[20] M. Fochesato, F. Fabiani, and J. Lygeros, “Generalized uncertain Nash games: Reformulation and robust equilibrium seeking,” in 2023 European Control Conference (ECC), 2023, pp. 1–6.
[21] F. Fele and K. Margellos, “Probably approximately correct Nash equilibrium learning,” IEEE Transactions on Automatic Control, vol. 66, no. 9, pp. 4238–4245, 2020.
[22] F. Fabiani, K. Margellos, and P. J. Goulart, “Probabilistic feasibility guarantees for solution sets to uncertain variational inequalities,” Automatica, vol. 137, p. 110120, 2022.
[23] G. Pantazis, F. Fele, and K. Margellos, “A priori data-driven robustness guarantees on strategic deviations from generalised Nash equilibria,” arXiv preprint arXiv:2304.05308, 2023.
[24] G. Calafiore and M. C. Campi, “Uncertain convex programs: randomized solutions and confidence levels,” Mathematical Programming, vol. 102, pp. 25–46, 2005.
[25] F. Facchinei and C. Kanzow, “Generalized Nash equilibrium problems,” Annals of Operations Research, vol. 175, no. 1, pp. 177–211, 2010.
[26] F. Facchinei and J.-S. Pang, Finite-Dimensional Variational Inequalities and Complementarity Problems. Springer Science & Business Media, 2007.
[27] D. Bertsimas, V. Gupta, and I. C. Paschalidis, “Data-driven estimation in equilibrium using inverse optimization,” Mathematical Programming, vol. 153, pp. 595–633, 2015.
[28] M. C. Campi and S. Garatti, “The exact feasibility of randomized solutions of uncertain convex programs,” SIAM Journal on Optimization, vol. 19, no. 3, pp. 1211–1230, 2008.
[29] M. C. Campi, S. Garatti, and F. A. Ramponi, “A general scenario theory for nonconvex optimization and decision making,” IEEE Transactions on Automatic Control, vol. 63, no. 12, pp. 4067–4078, 2018.

		$\displaystyle\bm{0}_{n}\in\nabla_{x_{i}}J_{i}(\cdot,\hat{\sigma}(\bm{x}^{}))% \!+\!\hat{\beta}_{i}\nabla_{\hat{\sigma}}J_{i}(x^{}_{i},\cdot)\!-\!\omega_{i}% ^{}\!+\!\mathcal{N}_{\mathbb{R}_{+}^{n}}(x^{}_{i}),$		(33)
		$\displaystyle\bm{0}_{m_{i}}\in\mu^{}d_{i}+D_{i}\omega_{i}^{}+\mathcal{N}_{% \mathbb{R}_{+}^{m_{i}}}(y^{*}_{i}),$
		$\displaystyle 0\geq\sum_{i=1}^{N}d_{i}^{T}y^{}_{i}-b\perp\mu^{},$
		$\displaystyle\bm{0}_{n}=D_{i}^{T}y^{}_{i}-x_{i}^{}.$