- Research
- Open access
- Published:
Mean-field optimal control in a multi-agent interaction model for prevention of maritime crime
Advances in Continuous and Discrete Models volume 2023, Article number: 24 (2023)
Abstract
We study a multi-agent system for the modeling maritime crime. The model involves three interacting populations of ships: commercial ships, pirate ships, and coast guard ships. Commercial ships follow commercial routes, are subject to traffic congestion, and are repelled by pirate ships. Pirate ships travel stochastically, are attracted by commercial ships and repelled by coast guard ships. Coast guard ships are controlled. We prove well-posedness of the model and existence of optimal controls that minimize dangerous contacts. Then we study, in a two-step procedure, the mean-field limit as the number of commercial ships and pirate ships is large, deriving a mean-field PDE/PDE/ODE model. Via Γ-convergence, we study the limit of the corresponding optimal control problems.
1 Introduction
Systems featuring interactions among multi-agents have attracted much attention of the scientific community in recent years as they find applications in various fields. They are a proper tool to study, e.g., biological aggregation as in flocks, swarms, or fish schools [14, 19, 37], crowd dynamics [2], emergent economic behaviors [16, 21], consensus in collective decision-making [13, 29], coordination and cooperation in robotics [17, 34]. In this framework, mathematical analysis has played a role in the proof of well-posedness of the models, in the derivation of mean-field limit, and in the analysis of optimal control problems for this kind of models [1, 3–7, 12, 24, 25, 30].
In this paper, we exploit the tools developed for the analysis of multi-agent systems to study optimal control in a model for the prediction of maritime crime. The majority of world’s goods is carried by sea [22], but the freedom of navigation is affected by the presence of modern maritime piracy, which poses serious threats to international traffic and individual safety. It is a priority to prevent crimes and suppress them [23].
To face this problem, we devise a model featuring three populations of agents, representing the types of ships. Our model is inspired by the macroscopic model (i.e., with a large number of ships) introduced in [18], but it differs from it in that our derivation starts from a microscopic model (i.e., with a finite number of ships). We briefly outline it in this Introduction, referring to Sect. 3 for the precise description of all the features and assumptions on the model.
We consider three populations: N commercial ships with trajectories \(X_{1}, \dots, X_{N}\), M pirate (criminal) ships with trajectories \(Y_{1},\dots, Y_{M}\), and L coast guard (patrol) ships with trajectories \(Z_{1},\dots,Z_{L}\). The trajectory of each ship evolves in a time interval \([0,T]\) according to a specific dynamical law based on its type and on the presence of other surrounding ships, as we illustrate now.
Commercial ships tend to follow commercial routes, but their motion is affected by traffic congestion: a commercial ship obstructed by a high density of commercial ships travels slower than the one with free space. Moreover, in the presence of pirate ships, commercial ships are repelled by them and adjust their trajectory to travel far from danger. Hence, the nth commercial ship evolves according to
where \(v^{N}_{n}\) is a suitable function depending on all the other commercial ships needed for the congestion phenomenon, r is the vector field indicating the commercial route, and \(K^{\mathrm{cp}}\) is the term due to the repulsion from pirate ships that adjusts the direction of the trajectory.
Pirate ships are attracted by commercial ships and are repelled by coast guard ships. Moreover, in the absence of other ships, they travel randomly in search of targets. Hence, the mth pirate ship evolves stochastically according to
where \(K^{\mathrm{pg}}\) and \(K^{\mathrm{pc}}\) are the repulsion and attraction terms with coast guard ships and commercial ships, respectively. The term \((W_{m}(t))_{t \in [0,T]}\) is a Brownian motion accounting for the stochastic behavior mentioned above. Its effect is a white noise with coefficient \(\sqrt{2 \kappa}\) added to the velocity of \(Y_{m}\).
Finally, for coast guard ships, we only impose that they are repelled by each other and that their trajectory is controllable, at a cost. Hence, the ℓth coast guard ship evolves according to
where \(K^{\mathrm{gg}}\) is the repulsion term among coast guard ships and the \(u_{\ell}\)s are the control.
The search of coast guard ships for dangerous contacts between commercial and pirate ships will be driven by the optimal control of the system based on the cost defined as follows. The cost of a control \(u = (u_{1},\dots,u_{L})\) takes into account the effort in modifying the trajectories of coast guard ships (it can be thought as the cost of fuel) and the total number of dangerous contacts among commercial and pirate ships
where \(H^{\mathrm{d}}\) is a compactly supported convolution kernel used for counting dangerous contacts and \(\mathbb{E}\) denotes the expected value. We study the problem of finding a control that minimizes \(\mathcal{J}_{N,M}\).
In Sect. 4 we prove well-posedness of the model that describes the evolution and we prove the existence of an optimal control.
Next, we proceed with the derivation of the mean-field limit of the optimal control problem. We carry out this analysis in two steps: first, we let \(M \to +\infty \) (a large number of pirate ships), and then \(N \to +\infty \) (a large number of commercial ships). The reason thereof is that the limit as \(M \to +\infty \) is interesting per se, as we explain forthwith.
Under suitable conditions, in Sect. 7 (see Theorem 7.1 and Proposition 7.2) we show that, as \(M \to +\infty \), the mean-field behavior of pirate ships is described by a probability distribution \(\bar{\mu}^{\mathrm{p}}\). The trajectories of commercial ships \(\bar{X}_{n}\) in this mean-field model satisfy
which corresponds to (1.1) with the trajectories of pirate ships replaced by their mean-field behavior. The probability distribution \(\bar{\mu}^{\mathrm{p}}\) of pirate ships solves the diffusive PDE
This mean-field model is interesting per se when the precise location of pirate ships is not known, but one can only predict the probability of finding them in certain regions of the sea. Proving convergence of solutions of the original model to the mean-field model as \(M \to +\infty \) requires some technical steps, mainly done following the guidelines in [9]. First, in Sect. 5 we introduce an auxiliary averaged model where the evolution of pirate ships is replaced by a single stochastic process evolving according to the same dynamics of (1.2), i.e.,
where \(\bar{X}_{n}\) evolves according to (1.4), \(\bar{\mu}^{\mathrm{p}}\) being the law of the stochastic process \((\bar{Y}(t))_{t \in [0,T]}\). In Sect. 5 we prove well-posedness for this averaged model using a fixed point argument. Solutions to the original model converge, as \(M \to +\infty \), to solutions of this auxiliary averaged model. To see this, in Propositions 6.1–6.2 we rely on a propagation of the chaos principle [15], from which we deduce that the solutions to (1.2) are independent and identically distributed stochastic processes if so are the initial conditions. Then, a Glivenko–Cantelli-type result allows us to deduce convergence of the empirical measures of the \(Y_{m}\)s to their common law \(\bar{\mu}^{\mathrm{p}}\). The parabolic PDE (1.5) is then the Fokker–Planck equation for pirate ships, as shown in Proposition 7.2.
After deriving the mean-field limit as \(M\to +\infty \), in Theorem 7.3 we show that the costs \(\mathcal{J}_{N,M}\) defined in (1.3) Γ-converge, as \(M \to +\infty \), to the cost for the limit problem
As a consequence, optimal controls for the original problem converge as \(M \to +\infty \) to optimal controls for the limit problem, see Proposition 7.4. This concludes the analysis as \(M \to +\infty \).
The next step is to study the mean-field limit as the number of commercial ships is large, i.e., when \(N \to +\infty \). In Theorem 8.1 and Proposition 8.3, we show that the mean-field limit of commercial ships is described in terms of their distribution \(\mu ^{\mathrm{c}}\), which solves a scalar conservation law with a nonlocal flux, apt to describe traffic flow in sea. More precisely, \(\mu ^{\mathrm{c}}\) is a solution to the PDE
where \(v ( \eta *_{2} \mu ^{\mathrm{c}} )\) arises from the limit of the congestion velocities and \(\mu ^{\mathrm{p}}\) is the probability distribution of pirate ships, evolving according to the parabolic PDE
Under suitable assumptions, in Theorem 8.4 we prove the uniqueness of solutions to this PDE system and, as observed in Remark 8.5, that the measures are absolutely continuous, i.e., \(\mu ^{\mathrm{c}}= \rho ^{\mathrm{c}} \,\mathrm{d}x\) and \(\mu ^{\mathrm{p}}= \rho ^{\mathrm{p}} \,\mathrm{d}y\).
We conclude the paper by finding in Theorem 8.6 the Γ-limit of the costs \(\mathcal{J}_{N}\) defined in (1.6) as \(N \to +\infty \). It is given by the cost for the following mean-field system:
Also in this case, we deduce convergence of optimal controls as \(N \to +\infty \), see Proposition 8.7. The limit problem is an optimal control problem with a finite number of coast guard ships driving the densities of commercial and criminal ships.
2 Notation and preliminary results
2.1 Basic notation and preliminary results
Given a matrix A, we let \(|A|\) its Frobenius norm. We shall often consider matrices of the form \(A \in \mathbb{R}^{2 {\times }d}\). By writing \(A = (A_{1},\dots,A_{d})\), we make explicit its columns \(A_{i} \in \mathbb{R}^{2}\).
If Ω, \(\Omega '\) are measurable spaces, μ is a measure on Ω, and \(\psi \colon \Omega \to \Omega '\) is a measurable map, then the push-forward \(\psi _{\#} \mu \) is the measure on \(\Omega '\) satisfying \(\int _{\Omega '} \phi (\omega ') \,\mathrm{d}\psi _{\#} \mu ( \omega '):= \int _{\Omega} \phi (\psi (\omega )) \,\mathrm{d}\mu ( \omega )\) for every measurable function ϕ.
Throughout the paper, we systematically apply Grönwall’s inequality. We recall that if \(u,v, w \colon [0,T] \to \mathbb{R}\) are continuous and nonnegative functions satisfying
then
cf. [32, Theorem 1.3.2]. If, in addition, \(w \colon [0,T] \to \mathbb{R}\) is continuous, positive, and nondecreasing, then
cf. [32, Theorem 1.3.1]
If not specified otherwise, we let C denote a constant that might change from line to line. We make precise the dependence of C on other constants when it is relevant for the discussion.
2.2 Stochastic processes and Brownian motion
For the theory of stochastic processes and stochastic differential equations, we refer to the monographs [27, 28, 31]. Here we recall some basic facts and definitions used in the paper.
We fix a probability space \((\Omega, \mathcal{F}, \mathbb{P})\) used throughout the paper. By a.s. (almost surely) we mean \(\mathbb{P}\)-almost everywhere. We let \(\mathbb{E}\) denote the expectation.
A filtration on \((\Omega, \mathcal{F}, \mathbb{P})\) is a collection of σ-algebras \((\mathcal{F}_{t})_{t \in [0,T]}\) increasing in t, i.e., \(\mathcal{F}_{s} \subset \mathcal{F}_{t}\) for \(s \leq t\). When \((\Omega, \mathcal{F}, \mathbb{P})\) is a complete probability space, \((\mathcal{F}_{t})_{t \in [0,T]}\) is said to satisfy the usual conditions if it is right-continuous (i.e., \(\mathcal{F}_{s} = \bigcap_{t > s} \mathcal{F}_{t}\) for all s) and if \(\mathcal{N}_{\mathbb{P}} \subset \mathcal{F}_{0}\), where \(\mathcal{N}_{\mathbb{P}} = \{A \subset \Omega \text{ s.t. } A \subset B \text{ with } B \in \mathcal{F}\text{ and } \mathbb{P}(B) = 0 \}\) (if \((\Omega, \mathcal{F}, \mathbb{P})\) is complete, this means that \(\mathcal{F}_{0}\) contains \(\mathbb{P}\)-null sets).
A stochastic process is a parametrized collection of random variables \((S(t))_{t \in [0,T]}\) defined on \((\Omega,\mathcal{F},\mathbb{P})\) and assuming values in \(\mathbb{R}^{d}\) (equipped with σ-algebra of Borel sets). Given \(t \in [0,T]\) and \(\omega \in \Omega \), we will write \(S(t,\omega ) = S(t)(\omega )\) for the realization of the random variable \(S(t)\) at ω. A path of the stochastic process is a curve in \(\mathbb{R}^{d}\) obtained as the realization \(t \mapsto S(t,\omega )\) for some \(\omega \in \Omega \). A stochastic process \((S(t))_{t \in [0,T]}\) is adapted to a filtration \((\mathcal{F}_{t})_{t \in [0,T]}\) if \(S(t)\) is \(\mathcal{F}_{t}\)-measurable for every \(t \in [0,T]\).
Let \((\mathcal{F}_{t})_{t \in [0,T]}\) be a filtration. A d-dimensional Brownian motion (or Wiener process) is an \(\mathbb{R}^{d}\)-valued stochastic process \((W(t))_{t \in [0,T]}\), adapted to \((\mathcal{F}_{t})_{t \in [0,T]}\), a.s. with continuous paths such that: \(W(0) = 0\) a.s.; \(W(t) - W(s) \sim \mathcal{N}(0,(t-s)\mathrm{Id}_{d})\); \(W(t) - W(s)\) is independent of \(\mathcal{F}_{s}\) for \(t \geq s\).Footnote 1 Equivalently, it has components \(W(t) = (W_{1}(t), \dots, W_{d}(t))\) with \((W_{1}(t))_{t \in [0,T]}, \dots, (W_{d}(t))_{t \in [0,T]}\) independent 1-dimensional Brownian motions.
2.3 Stochastic differential equation
For the general theory about SDEs, we refer to [27, 28, 31]. We recall here some basic facts. Let \((\mathcal{F}_{t})_{t \in [0,T]}\) be a filtration satisfying the usual conditions, let \((W(t))_{t \in [0,T]}\) be a d-dimensional Brownian motion, and let us consider an initial datum \(S^{0}\) given by an \(\mathcal{F}_{0}\)-measurable random variable.Footnote 2
However, in this paper we are only interested in a specific class of SDEs, i.e., those with a constant dispersion matrix of the form
A stochastic process \((S(t))_{t \in [0,T]}\) is a strong solution to (2.1) if \((S(t))_{t \in [0,T]}\) has a.s. continuous paths, it is adapted to the filtration \((\mathcal{F}_{t})_{t \in [0,T]}\), satisfies a.s. \(\int _{0}^{T} |b(t,S(t))| \,\mathrm{d}t < \infty \), and for every \(t \in [0,T]\)
For this class of SDEs, it is well known that the well-posedness theory is simpler [27, Equation (2.34)] and requires weaker assumptions on the initial datum \(S^{0}\) than those usually stated in general theorems. For the reader’s convenience, we state and prove the result in the form needed in this paper, as we did not find a precise reference in the literature. Besides, some of the tools used in the proof will be exploited later in the paper. The result is stated with the Euclidean norm \(|\cdot |\) on \(\mathbb{R}^{d}\), but we remark that it holds true when replacing it with any equivalent norm, e.g., also \(\max_{h} |S_{h}|\), as long as the assumptions on b are satisfied with that norm.
Proposition 2.1
Let \(b \colon [0,T] {\times }\mathbb{R}^{d} \mapsto \mathbb{R}^{d}\) be a Carathéodory function satisfying:
-
\(|b(t,S)| \leq C_{b}(1+|S|)\) for every \(t \in [0,T]\) and \(S \in \mathbb{R}^{d}\);
-
For every \(R > 0\), there exists \(C_{R}\) such that \(|b(t,S) - b(t,S')| \leq \mathrm{Lip}_{R}(b)|S - S'|\) for all \(t \in [0,T]\) and \(S,S'\in \mathbb{R}^{d}\) such that \(|S|,|S'|\leq R\).
Let \(\sigma \in \mathbb{R}^{d {\times }d}\). Let \((W(t))_{t \in [0,T]}\) be an \(\mathbb{R}^{d}\)-valued Brownian motion, and let \(S^{0}\) be a random variable such that a.s. \(|S^{0}| < +\infty \). Then there exists a unique strong solution \((S(t))_{t \in [0,T]}\) to (2.1). Moreover, if \(\mathbb{E}(|S^{0}|) < +\infty \), then \(\mathbb{E}(\|S\|_{\infty}) \leq C(1+\mathbb{E}(|S^{0}|) )\), where the constant C depends on \(C_{b}\), T, and W.
Proof
The scheme of the proof is the classical one, see [28, Theorem 3.3].
Let us fix \(\omega \in \Omega \) such that \(|S^{0}(\omega )| < +\infty \) and \(t \mapsto W(t,\omega )\) is continuous, which occurs almost surely. We consider the Picard iterations
Note that the curve \(t \mapsto \widetilde{S}^{j}(t,\omega )\) is continuous. First of all, let us prove that for all j and for all \(t \in [0,T]\)
where \(C_{b}\) is the constant appearing in \(|b(t,S)| \leq C_{b}(1+|S|)\). For \(j = 0\), (2.4) is trivially satisfied. Assume that (2.4) is true for j. Then, by (2.3) and by the linear growth of b,
which proves (2.4). In particular,
Since b is locally Lipschitz, there exists a constant \(\mathrm{Lip}_{R(\omega )}(b)\) such that \(|b(t,S) - b(t,S')| \leq \mathrm{Lip}_{R(\omega )}(b)|S - S'|\) for all \(t \in [0,T]\) and \(S,S'\in \mathbb{R}^{d}\) such that \(|S|,|S'|\leq R(\omega )\). Thanks to this, we show that
for a suitable constant \(C(\omega )\) depending on ω. Indeed, for \(j = 0\), by the linear growth of b, we have that for every \(s \in [0,T]\)
hence
Moreover, by the local Lipschitz continuity of b, we have that for every \(s \in [0,T]\)
Assuming (2.6) true for \(j-1\), we have that
This implies that \(\widetilde{S}^{j}(\cdot, \omega )\) is a Cauchy sequence in the uniform norm since for \(j \geq i\)
Thus there exists a continuous curve \(S(\cdot,\omega )\) such that
We have constructed \(S(\cdot,\omega )\) for a.e. \(\omega \in \Omega \). The stochastic processes \((\widetilde{S}^{j}(t))_{t \in [0,T]}\) are adapted to the filtration \((\mathcal{F}_{t})_{t \in [0,T]}\) and have a.s. continuous paths. This implies that the limit \((S(t))_{t \in [0,T]}\) is a stochastic process adapted to the filtration \((\mathcal{F}_{t})_{t \in [0,T]}\) and has a.s. continuous paths. Moreover, passing to the limit in (2.2) for a.e. \(\omega \in \Omega \), it is a strong solution to (2.1).
Uniqueness is proven in a more general setting in [27, Theorem 2.5] via a stopping time argument.
Assume now \(\mathbb{E}(|S^{0}|) < + \infty \) and let us prove the estimate on \(\mathbb{E}(\|S\|_{\infty})\). Passing to the limit in (2.5), we get that for a.e. \(\omega \in \Omega \)
By Doob’s maximal inequality [27, Chap. 1, Theorem 3.8-(iv)] we have that
and thus, by Hölder’s inequality,
Hence, taking the expectation in (2.9),
which concludes the proof. □
Remark 2.2
A comment about the Picard iterations used in the proof of Proposition 2.1 is in order. If b is globally Lipschitz, i.e., \(|b(t,S) - b(t,S')| \leq \mathrm{Lip}(b) |S - S'|\) and \(\mathbb{E}(|S^{0}|) < +\infty \), then the convergence of the Picard iterations can be improved. Indeed, \(\mathbb{E}(|S^{0}|) < +\infty \) and (2.10) yield \(\mathbb{E}(C(\omega )) < +\infty \), where \(C(\omega )\) is the constant defined in (2.7). Then, taking the expectation in (2.8) and replacing \(\mathrm{Lip}_{R(\omega )}(b)\) with the global Lipschitz constant \(\mathrm{Lip}(b)\), we deduce that \(\mathbb{E}(\|\widetilde{S}^{j} - S\|_{\infty}) \to 0\).
2.4 Wasserstein space
Given a complete metric space \((B,d)\), we let \(\mathcal{P}_{1}(B)\) denote the 1-Wasserstein space, i.e., the space of Borel probability measures \(\mu \in \mathcal{P}(B)\) such that
where \(x_{0} \in B\) is fixed. The 1-Wasserstein space is equipped with the 1-Wasserstein distance defined for every \(\mu _{1}, \mu _{2} \in \mathcal{P}_{1}(B)\) by (see [39, Definition 6.1])
where the infimum is taken over all transport plans \(\gamma \in \mathcal{P}(B {\times }B)\) with marginals \(\pi ^{1}_{\#} \gamma = \mu _{1}\) and \(\pi ^{2}_{\#} \gamma = \mu _{2}\), where \(\pi ^{i}\) is the projection on the ith component.
We shall often exploit the dual formulation of the 1-Wasserstein distance. By Kantorovich’s duality [39, Theorem 5.10], we have that
where \(\psi ^{d}\) is the d-transform \(\psi ^{d}(x') = \inf_{x \in B} (\psi (x) + d(x,x'))\). Since d is a distance on a metric space, a d-convex function ψ is a Lipschitz function with Lipschitz constant 1 and it coincides with its d-transform, cf. [39, Particular Case 5.4]. Hence, if ψ is a Lipschitz function with Lipschitz constant \(\mathrm{Lip}(\psi )\), then we have that
When in this paper we refer to Kantorovich’s duality, we apply this inequality. Note that the condition \(\psi \in L^{1}(\mu _{1}) \cap L^{1}(\mu _{2})\) is satisfied since \(|\psi (x)| \leq |\psi (0)| + \mathrm{Lip}(\psi ) |x|\) and \(\mu _{1}, \mu _{2} \in \mathcal{P}_{1}(B)\).
2.5 Wiener space
Given an interval \([0,T]\), we shall consider the so-called Wiener space of \(\mathbb{R}^{d}\)-valued continuous functions \(C^{0}([0,T];\mathbb{R}^{d})\) equipped with the uniform norm. Given \(t \in [0,T]\), we consider the evaluation function \(\mathrm{ev}_{t} \colon C^{0}([0,T];\mathbb{R}^{d}) \to \mathbb{R}^{d}\) defined by \(\mathrm{ev}_{t}(\varphi ):= \varphi (t)\) for every \(\varphi \in C^{0}([0,T];\mathbb{R}^{d})\). The family of evaluation functions \(\{\mathrm{ev}_{t}\}_{t \in [0,T]}\) generates a σ-algebra on \(C^{0}([0,T];\mathbb{R}^{d})\), which coincides with the Borel σ-algebra with respect to the uniform norm in \(C^{0}([0,T];\mathbb{R}^{d})\).Footnote 3 This is generated by cylindrical sets of the form \(\{\varphi \in C^{0}([0,T];\mathbb{R}^{d}): \varphi (t_{1}) \in A_{1} , \dots, \varphi (t_{k}) \in A_{k}\}\), where \(A_{1}, \dots, A_{k} \subset \mathbb{R}^{d}\) are Borel sets.
Let \((S(t))_{t \in [0,T]}\) be an \(\mathbb{R}^{d}\)-valued stochastic process a.s. with continuous paths. This means that there exists an event \(E \in \mathcal{F}\) such that \(\mathbb{P}(E) = 1\) and \(t \mapsto S(t,\omega )\) is continuous for all \(\omega \in E\). We can redefine \(S(t,\omega ) = 0\) for all \(t \in [0,T]\) when \(\omega \in \Omega \setminus E\). This new stochastic process is indistinguishable from the previous one and satisfies \(S(\cdot,\omega ) \in C^{0}([0,T];\mathbb{R}^{d})\) for all \(\omega \in \Omega \). The stochastic process \((S(t))_{t \in [0,T]}\) can be regarded as the random variable \(S \colon \Omega \to C^{0}([0,T];\mathbb{R}^{d})\) such that \(\omega \mapsto S(\cdot, \omega )\).
The σ-algebra generated by this random variable is the σ-algebra generated by sets of the form \(S^{-1}(A)\) where \(A \subset C^{0}([0,T];\mathbb{R}^{d})\) is a cylindrical Borel set. This means that \(A = \{\varphi \in C^{0}([0,T];\mathbb{R}^{d}): \varphi (t_{1}) \in A_{1}, \dots, \varphi (t_{k}) \in A_{k}\}\), where \(A_{1}, \dots, A_{k} \subset \mathbb{R}^{d}\) are Borel sets. For these sets, we have that
and thus the σ-algebra generated by \(S \colon \Omega \to C^{0}([0,T];\mathbb{R}^{d})\) coincides with the σ-algebra generated by the family \(\{S(t)\}_{t \in [0,T]}\) of random variables \(S(t,\cdot ) \colon \Omega \to \mathbb{R}^{d}\), i.e., the σ-algebra generated by the stochastic process.
In particular, if \((S_{1}(t))_{t \in [0,T]}, \dots, (S_{K}(t))_{t \in [0,T]}\) are stochastic processes a.s. with continuous paths, then they are independent as stochastic processes if and only if they are independent as random variables \(S_{1}, \dots, S_{K} \colon \Omega \to C^{0}([0,T];\mathbb{R}^{d})\).
Finally, we remark that a random variable \(S \colon \Omega \to C^{0}([0,T];\mathbb{R}^{d})\) induces the probability measure \(S_{\#} \mathbb{P}\) on the space \(C^{0}([0,T];\mathbb{R}^{d})\). We let \(\mathrm{Law}(S):= S_{\#} \mathbb{P}\in \mathcal{P} ( C^{0}([0,T]; \mathbb{R}^{d}) )\).Footnote 4
If \(\mu \in \mathcal{P} ( C^{0}([0,T];\mathbb{R}^{d}) )\), then we let \(\mu (t):= (\mathrm{ev}_{t})_{\#} \mu \in \mathcal{P}(\mathbb{R}^{d})\).
2.6 Empirical measures
Given random variables \(X_{1},\dots, X_{K} \colon \Omega \to \mathbb{R}^{d}\) with \(\mathbb{E}(|X_{k}|) < +\infty \), we define their empirical measure as the random measureFootnote 5\(\mu _{K} \colon \Omega \to \mathcal{P}_{1}(\mathbb{R}^{d})\) given by
for a.e. \(\omega \in \Omega \). Note that indeed \(\mu _{K} \in \mathcal{P}_{1}(\mathbb{R}^{d})\) a.s. since
Empirical measures of independent samples from a law approximate the law itself. More precisely, let us fix a law \(\mu \in \mathcal{P}_{1}(\mathbb{R}^{d})\) and \((X_{k})_{k \in \mathbb{N}}\) a sequence of i.i.d. random variables with law μ (which thus satisfy \(\mathbb{E}(|X_{k}|) < +\infty \)). Let \(\mu _{K}\) be the empirical measure of \(X_{1}, \dots, X_{K}\). Then \(\mathbb{E}(\mathcal{W}_{1}(\mu _{K},\mu )) \to 0\) as \(K \to + \infty \), see, e.g., [33, Lemma 4.7.1]. In fact, also precise rates of convergence are available in the literature, see [26, Theorem 1].
2.7 Γ-convergence
For the theory of Γ-convergence, we refer to the monograph [20]. In this paper it will be used to find the limits of optimal control problems.
3 Description of the model
To better describe the phenomena that we aim to capture, we introduce all the ingredients that enter in the model step by step. For the reader’s convenience, all the unknowns, the parameters, and the initial data of the model are summarized in Tables 1–5.
The model is an evolutionary system analyzed in a fixed time interval \([0,T]\).
Ships. The system describes the evolution of N commercial ships, M pirate (criminal) ships, and L coast guard (patrol) ships, whose trajectories are curves \(X_{n} \colon [0,T] \to \mathbb{R}^{2}\) for \(n \in \{1,\dots, N\}\), \(Y_{m} \colon [0,T] \to \mathbb{R}^{2}\) for \(m \in \{1,\dots, M\}\), and \(Z_{\ell }\colon [0,T] \to \mathbb{R}^{2}\) for \(\ell \in \{1,\dots, L\}\), respectively.
We shall often collect the trajectories based on their type by considering the matrix-valued curves \(X = (X_{1},\dots,X_{N}) \colon [0,T] \to \mathbb{R}^{2 {\times }N}\), \(Y = (Y_{1},\dots,Y_{M}) \colon [0,T] \to \mathbb{R}^{2 {\times }M}\), and \(Z = (Z_{1},\dots,Z_{M}) \colon [0,T] \to \mathbb{R}^{2 {\times }L}\). The letters X, Y, Z will unambiguously indicate the type of ship, even when decorated, e.g., as X̄, X̃, or with superscripts and subscripts.
Hereafter, whenever a variable is related to commercial, pirate, or guard ships, it is indexed with the superscript c, p, or g, respectively.
Evolution of commercial ships.
Step 1. We start by describing the evolution of commercial ships in safe waters (absence of pirate ships) and in the absence of congestion in the traffic. We assume that there is a vector field \(\mathbf{r}\colon \mathbb{R}^{2} \to \mathbb{R}^{2}\) indicating safe commercial routes. In this ideal setting, commercial ships evolve according to the ODEs
where \(X^{0} = (X_{1}^{0},\dots, X_{N}^{0}) \in \mathbb{R}^{2 {\times }N}\) is the initial position of commercial ships.
We shall assume that r is globally Lipschitz continuous.
Step 2. To include congestion in the model, we introduce \(v^{N} = (v^{N}_{1},\dots,v^{N}_{N}) \colon \mathbb{R}^{2{\times }N} \to [0,v_{\mathrm{max}}]^{N}\). The component \(v^{N}_{n}\) weighs the speed of the trajectory of the nth commercial ship according to the presence of all the other commercial ships:
The assumptions on \(v^{N}\) needed throughout the paper are the following: \(v^{N}\) is Lipschitz continuous with respect to the max norm with a Lipschitz constant independent of N, i.e., \(|v^{N}(X) - v^{N}(X')| \leq C \max_{n} |X_{n} - X'_{n}|\).
For \(v^{N}\), we have in mind a precise expression that will be used in Sect. 8. We consider a globally Lipschitz smooth convolution kernel \(\eta \colon \mathbb{R}^{2} {\times }\mathbb{R}^{2} \to [0,1]\) satisfying \(\eta (X,0) = 0\). The quantity
suitably countsFootnote 6 the number of commercial ships around the nth commercial ship at time t. Hence, the quantity
can be regarded as the density of commercial ships around the nth commercial ship at time t. The precise expression of the scaling factor \(\frac{1}{N-1}\) is relevant only to interpret the previous expression as a density and can, in fact, be replaced by a sequence converging to zero with the same rate of \(\frac{1}{N}\). Given a Lipschitz function \(v \colon [0,1] \to [0,v_{\mathrm{max}}]\), the corrected speed of the nth commercial ship depends on the density of its surrounding ships as follows:
To model congestion, v must be assumed to be nonincreasing in the density.
Step 3. Eventually, let us modify the dynamics of commercial ships in the presence of pirate ships. We consider a globally Lipschitz vector-valued interaction kernel \(K^{\mathrm{cp}}\colon \mathbb{R}^{2} \to \mathbb{R}^{2}\) (here cp stands for commercial-pirate). To model repulsion of the nth commercial ship from the pirate ships, we modify the direction of the trajectory \(X_{n}(t)\) by averaging the vectors \(K^{\mathrm{cp}}(X_{n}(t) - Y_{m}(t))\), i.e.,
For \(K^{\mathrm{cp}}\), we have in mind the following expression:
where \(H^{\mathrm{cp}}\) has compact support with a radius given by the length for which the presence of a pirate ship at \(Y_{m}(t)\) affects the trajectory \(X_{n}(t)\). An example of \(H^{\mathrm{cp}}\) is \(H^{\mathrm{cp}}(w) = \frac{h(|w|)}{|w|}\), where h is compactly supported in \((0,+\infty )\) so that \(K^{\mathrm{cp}}(X_{n}(t) - Y_{m}(t)) = h(|X_{n}(t) - Y_{m}(t)|) \frac{X_{n}(t) - Y_{m}(t)}{|X_{n}(t) - Y_{m}(t)|}\) and \(\frac{X_{n}(t) - Y_{m}(t)}{|X_{n}(t) - Y_{m}(t)|}\) is, for \(X_{n}(t)\), the direction pointing opposite to \(Y_{m}(t)\).
Evolution of pirate ships.
Step 1. Pirate ships are repelled by guard ships and are attracted by commercial ships. To model this, we consider globally Lipschitz vector-valued interaction kernels \(K^{\mathrm{pg}}\colon \mathbb{R}^{2} \to \mathbb{R}^{2}\) and \(K^{\mathrm{pc}}\colon \mathbb{R}^{2} \to \mathbb{R}^{2}\). Then
where \(Y^{0} = (Y_{1}^{0},\dots, Y_{M}^{0}) \in \mathbb{R}^{2 {\times }M}\) is the initial position of pirate ships.
For the precise form of \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\), see the analogous discussion for commercial ships done after (3.1).
Step 2. In the absence of commercial and guard ships, pirate ships explore the environment in search of targets by navigating randomly. To model this, we add a stochastic term in the evolution of pirate ships by considering M Brownian motions \((W_{1}(t))_{t \in [0,T]}, \dots, (W_{M}(t))_{t \in [0,T]}\). The pirate ships then evolve according to the following SDEs:
where \(\kappa > 0\).
Evolution of guard ships. The last part of the system describes guard ships. In the absence of other ships, guard ships tend to repel each other. To model this, we consider globally Lipschitz vector-valued interaction kernel \(K^{\mathrm{gg}}\colon \mathbb{R}^{2} \to \mathbb{R}^{2}\). In this setting, the guard ships evolve according to
where \(Z^{0} = (Z_{1}^{0},\dots, Z_{L}^{0}) \in \mathbb{R}^{2 {\times }L}\) is the initial position of guard ships. We do not require more on the dynamics of guard ships as we want the global dynamics of the system to be governed by the optimal control policy for guard ships.
Controls. We consider a set of admissible controls \(\mathcal{U}\subset \mathbb{R}^{2{\times }L}\). We assume \(\mathcal{U}\) to be compact. A fixed control \(u = (u_{1}, \dots, u_{L}) \in L^{\infty}([0,T];\mathcal{U})\) drives the evolution of guard ships as follows:
Full model. In conclusion, we are interested in the following ODE/SDE/ODE model:
(The first equation is expressed as an SDE to stress that the solution X is a stochastic process. However, given a trajectory Y, the first equation is, in fact, an ODE.)
We prove well-posedness for (3.2) in Sect. 4.1.
Initial data. The initial data in (3.2) will be given by \(X^{0} = (X^{0}_{1},\dots,X^{0}_{N}) \in \mathbb{R}^{2 {\times }N}\) with \(|X^{0}_{n}| \leq R_{0}\) for some \(R_{0} > 0\); \(\mathbb{R}^{2}\)-valued i.i.d. random variables \(Y^{0}_{1},\dots,Y^{0}_{M}\); \(Z^{0} = (Z^{0}_{1},\dots,Z^{0}_{L}) \in \mathbb{R}^{2 {\times }L}\).
Optimal control. As previously mentioned, the dynamics of guard ships will be driven by an optimal control. To define the cost, we consider a bounded and globally Lipschitz function \(H^{\mathrm{d}}\colon \mathbb{R}^{2} \to \mathbb{R}\). If the quantity \(H^{\mathrm{d}}(X_{n}(t) - Y_{m}(t))\) is significantly different from zero when \(Y_{m}(t)\) is close to \(X_{n}(t)\) and is small when \(Y_{m}(t)\) is far from \(X_{n}(t)\) (e.g., when \(H^{\mathrm{d}}\) is compactly supported), then this function can be used to count contacts between commercial and pirate ships (the superscript d stands for “danger”). Hence we consider the cost functional \(\mathcal{J}_{N,M} \colon L^{\infty}([0,T];\mathcal{U}) \to \mathbb{R}\) defined for every control \(u \in L^{\infty}([0,T];\mathcal{U})\) by
where the stochastic processes \((X(t))_{t \in [0,T]} = (X_{1}(t),\dots, X_{N}(t)))_{t \in [0,T]}\) and \((Y(t))_{t \in [0,T]} = (Y_{1}(t), \dots, Y_{M}(t)))_{t \in [0,T]}\) are given by the unique strong solutions to (3.2) corresponding to the control u obtained in Proposition 4.1.
The objective is to minimize the cost \(\mathcal{J}_{N,M}\).
4 Well-posedness of the ODE/SDE/ODE model
4.1 Well-posedness of the ODE/SDE/ODE model for a fixed control
In this section we prove well-posedness for the model presented in (3.2).
We remark that the solutions depend on N and M. Not to overburden the notation, in this section we drop the dependence on N and M as we will not consider limits as \(N \to + \infty \) or \(M \to + \infty \).
Proposition 4.1
Assume the following:
-
Let \((W_{1}(t))_{t \in [0,T]}, \dots, (W_{M}(t))_{t \in [0,T]}\) be independent Brownian motions;
-
Let \(X^{0} = (X^{0}_{1},\dots,X^{0}_{N}) \in \mathbb{R}^{2{\times }N}\);
-
Let \(Y^{0}_{1},\dots,Y^{0}_{M}\) be \(\mathbb{R}^{2}\)-valued random variables with \(|Y^{0}_{m}| < +\infty \) a.s. for \(m=1,\dots,M\);
-
Let \(Z^{0} = (Z^{0}_{1},\dots,Z^{0}_{L}) \in \mathbb{R}^{2 {\times }L}\);
-
Let \(u \in L^{\infty}([0,T];\mathcal{U})\).
Then there exists a unique strong solution to (3.2), \((X(t))_{t \in [0,T]} = (X_{1}(t),\dots,X_{N}(t))_{t \in [0,T]}\), \((Y(t))_{t \in [0,T]} = (Y_{1}(t),\dots, Y_{M}(t))_{t \in [0,T]}\), and \(Z = (Z_{1},\dots,Z_{L})\). Moreover, if \(\mathbb{E}(|Y^{0}_{m}|) < + \infty \) for \(m=1,\dots,M\), then \(\mathbb{E}(\max_{m} \|Y_{m}\|_{\infty}) < +\infty \).
Proof
We start by noticing that the ODEs involving the variables \(Z_{\ell}\) are decoupled from the equations involving \(X_{n}\) and \(Y_{m}\). Given a control \(u = (u_{1},\dots,u_{L}) \in L^{\infty}([0,T];\mathcal{U})\), we solve
We observe that there exists a unique solution for all times \(t \in [0,T]\) to the previous ODE system. Too see this, we introduce the function \(f = f_{u} = (f_{u,1},\dots,f_{u,L}) \colon [0,T] {\times } \mathbb{R}^{2 {\times }L} \to \mathbb{R}^{2{\times }L}\) (we drop the dependence on u for ease of notation) defined by
and we notice that the system reads
where \(Z = (Z_{1},\dots,Z_{L})\). The right-hand side \(f(t,Z)\) is a Carathéodory function, globally Lipschitz continuous in the Z variable (with Lipschitz constant independent of t). These properties are sufficient for the well-posedness of the ODE.Footnote 7 We remark that solutions to (4.1) are bounded. Indeed,
hence
and, by Grönwall’s inequality, for \(t \in [0,T]\),
where the constant C depends on \(K^{\mathrm{gg}}\) and \(\mathcal{U}\) (compact).
We exploit the solution \(Z(t)\) to solve the ODE/SDE/ODE system, which now we write in a more compact way. Let us introduce the \(\mathbb{R}^{2 {\times }(M+N)}\)-valued stochastic process \((S(t))_{t\in [0,T]}\) defined by
(we put the components \(Y_{1}(t), \dots, Y_{M}(t)\) in the first block for consistency later). We consider the drift vector \(b_{Z} = b = (b_{1}, \dots, b_{M+N}) \colon [0,T] {\times } \mathbb{R}^{2 {\times }(M+N)} \to \mathbb{R}^{2 {\times }(M+N)}\) (we drop the dependence on Z for the ease of notation) defined for every \(S = (S_{1},\dots, S_{M+N}) \in \mathbb{R}^{2 {\times }(M+N)}\) by
for \(i = 1, \dots, M\) and
for \(i = M+1, \dots, M+N\). Moreover, let \(\sigma \in \mathbb{R}^{(2 {\times }2) {\times }(M+N)}\) be the constant dispersion tensor given by the collection \(\sigma = (\sigma _{1},\dots,\sigma _{M+N})\) of the matrices \(\sigma _{i} \colon \mathbb{R}^{2 {\times }(M+N)} \to \mathbb{R}^{2 { \times }2}\) defined by
and \(\sigma _{i}:= 0\) for \(i = M+1,\dots,M+N\). For \(W = (W_{1},\dots,W_{M+N})\in \mathbb{R}^{2{\times }(M+N)}\), we adopt the short-hand notation σW to denote the element in \(\mathbb{R}^{2{\times }(M+N)}\) with columns \((\sigma W)_{1}, \dots, (\sigma W)_{M+N} \in \mathbb{R}^{2}\) given by \((\sigma W)_{i} = \sigma _{i} W_{i}\).
By setting \(S^{0}:= (Y_{1}^{0}, \dots, Y_{M}^{0}, X_{1}^{0}, \dots, X_{N}^{0})\), the system reads
where \((W(t))_{t \in [0,T]}\) is an \(\mathbb{R}^{2{\times }(M+N)}\)-valued Brownian motion. Note that \(W_{1}(t), \dots, W_{M}(t)\) correspond to the M independent \(\mathbb{R}^{2}\)-valued Brownian motions already introduced for (3.2). This is the reason why we chose to put the \(Y_{m}\)s in place of the \(X_{n}\)s in the first block of S.
We are now left to check that the conditions for the existence and uniqueness stated in Proposition 2.1 are satisfied by (4.6). By the continuity of \(Z(t)\), the function \(t \mapsto b(t,S)\) is continuous for every S. Let \(i \in \{1,\dots,M\}\), so that \(b_{i}\) is given by (4.4). By the Lipschitz continuity of \(K^{\mathrm{pg}}\), we have that
Reasoning analogously for \(K^{\mathrm{pc}}\), it follows that
where we used the continuity, and thus boundedness, of \(Z_{\ell}(t)\) for \(t \in [0,T]\). Let us check the Lipschitz continuity condition. By the Lipschitz continuity of \(K^{\mathrm{pg}}\) and \(K^{\mathrm{pc}}\), we have that
where the constant C depends on \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\). (In fact, \(b_{i}\) is even globally Lipschitz continuous for \(i \in \{1,\dots,M\}\)).
Let now \(i \in \{M+1,\dots,M+N\}\), so that \(b_{i}\) is given by (4.5). By the boundedness of \(v^{N}\), by the bound \(\mathbf{r}(x) \leq C(1+|x|)\), and reasoning for \(K^{\mathrm{cp}}\) as in (4.7), we have that
To check the local Lipschitz continuity of \(b_{i}\), let us fix \(R > 0\). For \(t \in [0,T]\) and \(\max_{h} |S_{h}| \leq R\), \(\max_{h}|S'_{h}| \leq R\). By the boundedness and the Lipschitz property of \(v^{N}\) (recall that it has a Lipschitz constant independent of N) and by the Lipschitz continuity of r and \(K^{\mathrm{cp}}\), we have that
where the constant C depends on \(v^{N}\), r, and \(K^{\mathrm{cp}}\) (independent of N). Choosing \(C_{R} = C(1+R)\), we get the desired inequality.
Applying Proposition 2.1, we conclude the proof of existence and uniqueness. Moreover, we also get \(\mathbb{E}(\max_{h} \|S_{h}\|_{\infty}) < +\infty \) and, in particular, \(\mathbb{E}(\max_{m} \|Y_{m}\|_{\infty}) < +\infty \). □
4.2 Existence of an optimal control for the ODE/SDE/ODE model
Let \(\mathcal{J}_{N,M}\) be the cost defined in (3.3). We have the following result concerning the existence of optimal controls.
Proposition 4.2
Under the assumptions of Proposition 4.1, there exists an optimal control \(u^{*} \in L^{\infty}([0,T];\mathcal{U})\), i.e.,
Proof
The result is obtained via the direct method in the calculus of variations. We divide the proof into steps for the sake of presentation.
Step 1. (Preliminary steps) Let \(u^{j} \in L^{\infty}([0,T];\mathcal{U})\) be a minimizing sequence, i.e., \(\mathcal{J}_{N,M}(u^{j}) \to \min \mathcal{J}_{N,M}\) as \(j \to +\infty \). Since \(u^{j}\) is bounded in \(L^{\infty}([0,T];\mathcal{U})\), there exist \(u^{*} \in L^{\infty}([0,T];\mathcal{U})\) and a subsequence (not relabeled) such that \(u^{j} \stackrel{*}{\rightharpoonup }u^{*}\) weakly-* in \(L^{\infty}([0,T];\mathcal{U})\). We claim that \(u^{*}\) is an optimal control.
To prove the claim, let us fix \((X^{j}(t))_{t \in [0,T]} = (X^{j}_{1}(t),\dots,X^{j}_{N}(t))_{t \in [0,T]}\), \((Y^{j}(t))_{t \in [0,T]} = (Y^{j}_{1}(t),\dots,Y^{j}_{M}(t))_{t \in [0,T]}\), and \(Z^{j} = (Z^{j}_{1},\dots,Z^{j}_{L})\), the strong solutions to (3.2) corresponding to the controls \(u^{j}\) obtained in Proposition 4.1. We adopt the notation of the proof of Proposition 4.1 and let \(S = (Y_{1}, \dots, Y_{M}, X_{1}, \dots, X_{N})\). In this way, for every j, we have that
(we stress the dependence of \(f_{u^{j}}\) on the controls \(u^{j}\)) and
(we stress the dependence of the drift vector \(\mathbb{R}^{2{\times }L}\) on the trajectories \(Z^{j}\)).
Step 2. (Identifying the limit of \(Z^{j}\)) We remark that (4.3) yields \(\|Z^{j}\|_{\infty }\leq C\) for every j, where C depends on \(Z^{0}\), T, \(K^{\mathrm{gg}}\), and \(\mathcal{U}\). Let us check that the \(Z^{j}\)s are also equicontinuous. By (4.2), for every j and for \(s \leq t\), we have that
where C depends on \(Z^{0}\), T, \(K^{\mathrm{gg}}\), and \(\mathcal{U}\) (compact). By Arzelà–Ascoli’s theorem, we obtain \(Z^{*} \in C^{0}([0,T];\mathbb{R}^{2})\) such that \(\| Z^{j} - Z^{*}\|_{\infty }\to 0\) up to a subsequence that we do not relabel. This together with the convergence \(u^{j} \stackrel{*}{\rightharpoonup }u^{*}\) and
yields, letting \(j \to +\infty \),
i.e., \(Z^{*}\) is the solution to
Step 3. (Identifying the limit of \(S^{j}\)) We let \((S^{*}(t))_{t \in [0,T]}\) be the \(\mathbb{R}^{2}\)-valued stochastic process obtained as the strong solution to
We claim that a.s. \(\max_{h} \|S^{j}_{h} - S^{*}_{h}\|_{\infty } \to 0\) as \(j \to +\infty \). We start by observing that a.s. for \(0 \leq s \leq t \leq T\) and \(i=1,\dots,M+N\)
We estimate the former integrand by exploiting the Lipschitz property of \(b_{Z^{j},i}\) obtained in (4.9) and (4.11)
where the constant C depends on \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\), \(K^{\mathrm{cp}}\), \(v^{N}\), and r (independent of N). To estimate the latter integrand in (4.12), we resort to the definition of \(b_{Z}\). By (4.4), for \(i=1,\dots,M\), we get that
where the constant C depends on \(K^{\mathrm{pg}}\). For \(i=M+1,\dots,M+N\), by (4.5) we have instead that \(|b_{Z^{j}, i}(r,S^{*}(r)) - b_{Z^{*}, i}(r,S^{*}(r))| = 0\). We observe that Proposition 2.1 also gives us that \(\mathbb{E}(\max_{h} \|S^{*}_{h} \|_{\infty}) < C(1+\mathbb{E}(\max_{h} |S^{0}_{h}|))\), thus a.s. \(\max_{h} \|S^{*}_{h}\|_{\infty }< + \infty \).
We are now in a position to prove that a.s. \(\max_{h} \|S^{j}_{h} - S^{*}_{h} \|_{\infty }\to 0\). For \(k \geq 1\), let us consider the events
We remark that \(\mathbb{P} ( \bigcup_{k} A_{k} ) = 1\) since a.s. \(\max_{h} \|S^{*}_{h}\|_{\infty }< +\infty \). Let us fix \(\omega \in A_{k}\) and such that (4.12)–(4.14) hold true. Then we have that
Integrating on \(A_{k}\), we get that
By Grönwall’s inequality, we deduce that
and, in particular,
By Step 2 we have that \(\| Z^{j} - Z^{*}\|_{\infty }\to 0\) as \(j \to +\infty \), and thus \(\max_{h} \| S^{j}_{h} (\cdot,\omega ) - S^{*}_{h} (\cdot,\omega ) \|_{\infty }\to 0\) for a.e. \(\omega \in A_{k}\). Since \(\mathbb{P} ( \bigcup_{k} A_{k} ) = 1\), we conclude that a.s. \(\max_{h} \|S^{j}_{h} - S^{*}_{h}\|_{\infty }\to 0\).
Step 4. (Limit of the cost) Let us show that
Since \(u^{j}\) is a minimizing sequence, this will be sufficient to conclude that \(\mathcal{J}_{N,M}(u^{*}) = \min_{u} \mathcal{J}_{N,M}(u)\).
By sequential weak semicontinuity of the \(L^{2}\)-norm, we get that
From Step 3 we have that a.s. \(\max_{h} \|S^{j}_{h} - S^{*}_{h} \|_{\infty }\to 0\), thus a.s. \(\max_{m} \|Y^{j}_{m} - Y^{*}_{m}\|_{ \infty }\to 0\) and \(\max_{n} \|X^{j}_{n} - X^{*}_{n}\|_{\infty }\to 0\) (recall that \(S = (Y_{1},\dots,Y_{M}, X_{1},\dots,X_{N})\)). Then, using the fact that \(H^{\mathrm{d}}\) is bounded, by the dominated convergence theorem
as \(j \to +\infty \). By the superadditivity of the lim inf, we conclude the proof. □
5 An averaged ODE/SDE/ODE system
5.1 Introducing the averaged ODE/SDE/ODE system
To study the mean-field limit of (3.2) as \(M \to +\infty \), we consider an averaged ODE/SDE/ODE system, where the trajectories \(Y_{m}(t)\) are replaced by a single trajectory \(\bar{Y}(t)\) interacting with the other agents via its probability distribution. More precisely, let \((W(t))_{t \in [0,T]}\) be an \(\mathbb{R}^{2}\)-valued Brownian motion and consider the problem
We start by giving a precise definition for the notion of solutions to the previous system.
Definition 5.1
A strong solution to (5.1) is given by a curve \(\bar{X} = (\bar{X}_{1}, \dots, \bar{X}_{N}) \in C^{0}([0,T]; \mathbb{R}^{2 {\times }N})\), an \(\mathbb{R}^{2}\)-valued stochastic process \((\bar{Y}(t))_{t \in [0, T]}\) a.s. with continuous paths, and a curve \(Z = (Z_{1}, \dots, Z_{L}) \in C^{0}([0,T]; \mathbb{R}^{2{\times }L})\) such that
-
(1)
a.s. for all \(t \in [0,T]\),
$$\begin{aligned} \bar{Y}(t) = {}&\bar{Y}^{0} + \int _{0}^{t} \Biggl(\frac{1}{L} \sum _{\ell =1}^{L} K^{\mathrm{pg}}\bigl(\bar{Y}(s) - Z_{\ell}(s)\bigr) - \frac{1}{N} \sum _{n=1}^{N} K^{\mathrm{pc}}\bigl(\bar{Y}(s) - \bar{X}_{n}(s)\bigr) \Biggr) \,\mathrm{d}t\\ &{} + \sqrt{2 \kappa} W(t); \end{aligned}$$ -
(2)
setting \(\bar{\mu}^{\mathrm{p}}:= \mathrm{Law}(\bar{Y}) \in \mathcal{P} (C^{0}([0,T]; \mathbb{R}^{2}) )\), the curves X̄ and Z satisfy
$$ \bar{X}_{n}(t) = X_{n}^{0} + \int _{0}^{t} v^{N}_{n}\bigl( \bar{X}(s)\bigr) \bigl( \mathbf{r}\bigl(\bar{X}_{n}(s)\bigr) + K^{\mathrm{cp}}*\bar{\mu}^{\mathrm{p}}(t) \bigl( \bar{X}_{n}(s) \bigr) \bigr)\,\mathrm{d}s $$and
$$ Z_{\ell}(t) = Z_{\ell}^{0} + \int _{0}^{t} \Biggl( \frac{1}{L}\sum _{ \ell '=1}^{L} K^{\mathrm{gg}} \bigl(Z_{\ell}(s) - Z_{\ell '}(s)\bigr) + u_{\ell}(s) \Biggr) \,\mathrm{d}s $$for all \(t \in [0,T]\).
5.2 Well-posedness of the averaged ODE/SDE/ODE system
Let us prove the following well-posedness result.
Proposition 5.2
Assume the following:
-
Let \((W(t))_{t \in [0,T]}\) be a Brownian motion;
-
Let \(X^{0} = (X^{0}_{1},\dots,X^{0}_{N}) \in \mathbb{R}^{2{\times }N}\);
-
Let \(\bar{Y}^{0}\) be a random variable with \(\mathbb{E}(|\bar{Y}^{0}|) < +\infty \);
-
Let \(Z^{0} = (Z^{0}_{1},\dots,Z^{0}_{L}) \in \mathbb{R}^{2 {\times }L}\);
-
Let \(u \in L^{\infty}([0,T];\mathcal{U})\).
Then there exists a unique strong solution to (5.1). Moreover, \(\mathbb{E}(\|\bar{Y}\|_{\infty}) < +\infty \) and \(\bar{\mu}^{\mathrm{p}}\in \mathcal{P}_{1} (C^{0}([0,T];\mathbb{R}^{2}) )\).
Proof
As recalled in the proof of Proposition 4.1, for every control \(u = (u_{1},\dots, u_{L})\in L^{\infty}([0,T];\mathcal{U})\), there exists a unique continuous solution to
hence \(Z_{\ell}(t)\) will be treated as fixed in the following.
The proof now mainly follows the lines of [9, Theorem 3.1]. For the sake of brevity, we let \(C^{0} = C^{0}([0,T];\mathbb{R}^{2})\).
Step 1. (Decoupling the system) Let us fix \(\mu \in \mathcal{P}_{1}(C^{0})\) (μ plays the role of \(\bar{\mu}^{\mathrm{p}}\) in the equation and is used to apply a fixed point argument). Let us consider the decoupled system
where the \(Z_{\ell}(t)\) are obtained in (5.2).
Substep 1.1. We start by commenting about the existence (and uniqueness) of continuous curves \(\widetilde{X} = (\widetilde{X}_{1}, \dots, \widetilde{X}_{N}) \in C^{0}([0,T]; \mathbb{R}^{2{\times }N})\) solutions to (5.3). For this, we need to check the conditions for well-posedness of ODE systems. Let us consider the function \(g_{\mu }= (g_{\mu,1},\dots,g_{\mu,N}) \colon [0,T] {\times } \mathbb{R}^{2 {\times }N} \to \mathbb{R}^{2 {\times }N}\) defined by
for \(n = 1, \dots, N\). The system then reads
The dependence of \(g_{\mu}\) on the time variable t is only due to the terms
which are continuous in t. This follows from, e.g., the dominated convergence theorem by observing that the Lipschitz continuity of \(K^{\mathrm{cp}}\) yields
and \(\int _{C^{0}} \|\varphi \|_{\infty }\,\mathrm{d}\mu (\varphi ) < + \infty \) since \(\mu \in \mathcal{P}_{1}(C^{0})\). The functions \(g_{\mu,n}\) are locally Lipschitz in X, i.e., given \(R > 0\), there exists \(C_{R} > 0\) such that for \(t \in [0,T]\) and \(\max_{n} |X_{n}| \leq R\), \(\max_{n} |X'_{n}| \leq R\) it holds that
The computations are analogous to those in (4.11), the only difference being in the term
In conclusion, \(g_{\mu}(t,X)\) is continuous in t and locally Lipschitz in X with respect to the max norm. By Picard–Lindelhöf’s theorem, the ODE system (5.6) admits a unique solution for small times. For the existence for all times, with computation analogous to those in (4.10), we observe that we have linear growth for \(g_{\mu}\), i.e.,
the constant C above depending on \(\|v^{N}\|_{\infty}\), r, and \(K^{\mathrm{cp}}\). This upper bound allows for a Grönwall inequality. Indeed,
which yields
and, in particular, the boundedness of solutions in terms of the initial datum \(X^{0}\) and final time T (in addition to \(\|v^{N}\|_{\infty}\), r, and \(K^{\mathrm{cp}}\)). This is enough to deduce global existence in time.
Substep 1.2. Given the continuous curves X̃ and Z obtained previously, we consider SDE (5.4). We rewrite this SDE by introducing the drift vector \(b_{\widetilde{X}} \colon [0,T] {\times }\mathbb{R}^{2} \to \mathbb{R}^{2}\) (depending on X̃)
and by considering the constant dispersion matrix \(\sigma = \sqrt{2 \kappa} \mathrm{Id}_{2}\), so that the SDE reads
For the existence and uniqueness of a strong solution to this SDE, we check that the assumptions of Proposition 2.1 are satisfied. The drift \(b_{\widetilde{X}}\) is continuous in t: it follows from the continuity of the curves \(\widetilde{X}_{n}\) and \(Z_{\ell}\). The drift \(b_{\widetilde{X}}\) is globally Lipschitz continuous in Y. Indeed, we have that
the constant C only depending on the Lipschitz constants of \(K^{\mathrm{pg}}\) and \(K^{\mathrm{pc}}\). Finally, \(b_{\widetilde{X}}\) satisfies the linear growth condition. This follows from (4.7) and the analogous condition for \(K^{\mathrm{pc}}\), which yield
where the constant C depends on \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\), \(\|Z\|_{\infty}\), and \(\max_{n} \|\widetilde{X}_{n}\|_{\infty}\) and we used the boundedness of X̃ obtained in (5.8).
We are in a position to apply Proposition 2.1, which also gives us that
This implies that \(\mathrm{Law}(\widetilde{Y}) \in \mathcal{P}_{1}(C^{0})\). Indeed,
Step 2. (Fixed-point argument) Let us implement the machinery to carry out a fixed point argument.
Substep 2.1. (Definition of Picard operator) We consider the functional \(\mathcal{L}\colon \mathcal{P}_{1}(C^{0}) \to \mathcal{P}_{1}(C^{0})\) defined as follows: given \(\mu \in \mathcal{P}_{1}(C^{0})\), we let \(\widetilde{X} =(\widetilde{X}_{1},\dots,\widetilde{X}_{N})\) and \((\widetilde{Y}(t))_{t \in [0,T]}\) be the unique solution to (5.3)–(5.4) obtained as explained in the previous step. Then we set \(\mathcal{L}(\mu ):= \mathrm{Law}(\widetilde{Y})\), which belongs to \(\mathcal{P}_{1}(C^{0})\), as explained in the previous step. We shall show that \(\mathcal{L}\) is a contraction with respect to a suitable auxiliary distance on \(\mathcal{P}_{1}(C^{0})\) to deduce the existence of a fixed point.
Substep 2.2. (Definition of equivalent Wasserstein distance) The auxiliary distance we consider on \(\mathcal{P}_{1}(C^{0})\) is defined as follows. We let \(\alpha > 0\) (its choice is made precise later in (5.22)) and we define on \(C^{0}\) the norm
Then we define the auxiliary distance on \(\mathcal{P}_{1}(C^{0})\) by
where the infimum is taken over all transport plans \(\gamma \in \mathcal{P}(C^{0} {\times }C^{0})\) with marginals \(\pi ^{1}_{\#} \gamma = \mu _{1}\) and \(\pi ^{2}_{\#} \gamma = \mu _{2}\), where \(\pi ^{i}\) is the projection on the ith component. Since the norm \({\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }\cdot {\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }_{\alpha}\) is equivalent to the usual uniform norm \(\|\cdot \|_{\infty}\) on \(C^{0}\), the distance \(\mathcal{W}_{1,\alpha}\) is equivalent to the usual 1-Wasserstein distance \(\mathcal{W}_{1}\) on \(\mathcal{P}_{1}(C^{0})\).
Substep 2.3. (Start of the proof of the contraction property) Given \(\mu, \mu ' \in \mathcal{P}_{1}(C^{0})\), let us estimate \(\mathcal{W}_{1,\alpha} ( \mathcal{L}(\mu ), \mathcal{L}(\mu ') )\). Let \(\widetilde{X} = (\widetilde{X}_{1}, \dots, \widetilde{X}_{N})\), Ỹ and \(\widetilde{X}' = (\widetilde{X}'_{1}, \dots, \widetilde{X}'_{N})\), \(\widetilde{Y}'\) be solutions obtained in Step 1 corresponding to μ and \(\mu '\), respectively. By Kantorovich’s duality, there exists a functional \(\Psi \colon C^{0} \to C^{0}\) Lipschitz continuous with respect to \({\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }\cdot {\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }_{\alpha}\) with Lipschitz constant 1 such that, using the fact that \(\mathcal{L}(\mu ) = \mathrm{Law}(\widetilde{Y})\) and \(\mathcal{L}(\mu ') = \mathrm{Law}(\widetilde{Y}')\),
The following substeps show how to estimate the term \(\mathbb{E} ( {\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert } \widetilde{Y} - \widetilde{Y}' {\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }_{\alpha} )\).
Substep 2.4. (Estimate of \(|\widetilde{Y}(t) - \widetilde{Y}'(t) |\)) We start by observing that from (5.10), from the definition of \(b_{\widetilde{X}}\) in (5.9), by the Lipschitz continuity of \(K^{\mathrm{pc}}\), and by the Lipschitz continuity of \(b_{\widetilde{X}'}\) obtained in (5.11), we have that a.s.
the constant C depending only on the Lipschitz constants of \(K^{\mathrm{pg}}\) and \(K^{\mathrm{pc}}\).
Substep 2.5. (Estimate of \(|\widetilde{X}_{n}(s) - \widetilde{X}'_{n}(s) |\)) The curves X̃ and \(\widetilde{X}'\) are solutions to (5.6). As obtained in (5.8), they are bounded by a constant \(R > 0\) depending on the initial datum \(X^{0}\), the final time T, and the parameters of the problem (\(\|v\|_{\infty}\), r, and \(K^{\mathrm{cp}}\)), i.e., \(\max_{n} \|\widetilde{X}_{n}\|_{\infty }\leq R\), \(\max_{n} \|\widetilde{X}'_{n}\|_{\infty }\leq R\). We recall that \(g_{\mu}\) and \(g_{\mu '}\) are locally Lipschitz, hence there exists \(C > 0\) (depending on R) such that (5.7) is satisfied. It follows that for \(n=1,\dots,N\)
Let us now apply the definition of \(g_{\mu}\) and \(g_{\mu '}\) in (5.5) to estimate for \(n = 1, \dots, N\) and \(r \in [0,s]\)
where the constant C depends on \(\|v\|_{\infty}\). We observe that by the Lipschitz continuity of \(x \mapsto K^{\mathrm{cp}}(\widetilde{X}'_{n}(r) - x)\) and by Kantorovich’s duality,
where C is the Lipschitz constant of \(K^{\mathrm{cp}}\). To bound this term, let us fix an optimal plan \(\gamma \in \mathcal{P}(C^{0} {\times }C^{0})\) with marginals \(\pi ^{1}_{\#} \gamma = \mu \), \(\pi ^{2}_{\#} \gamma = \mu '\) and satisfying
We remark that \(\gamma (r) = (\mathrm{ev}_{r})_{\#} \gamma \in \mathcal{P}( \mathbb{R}^{2} {\times }\mathbb{R}^{2})\) has marginals \(\pi ^{1}_{\#} (\mathrm{ev}_{r})_{\#} \gamma = (\mathrm{ev}_{r})_{\#} \pi ^{1}_{\#} \gamma = \mu (r)\) and \(\pi ^{2}_{\#} (\mathrm{ev}_{r})_{\#} \gamma = (\mathrm{ev}_{r})_{\#} \pi ^{2}_{\#} \gamma = \mu '(r)\), hence, by the optimality of \(\mathcal{W}_{1}\) and by the definition of \({\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }\cdot {\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }_{\alpha}\) in (5.13), we obtain for \(r \in [0,s]\)
Integrating in r, we get that
Putting together (5.16)–(5.19), we conclude that
By Grönwall’s inequality, we conclude that
To sum up, the constant C in the previous formula depends on \(X^{0}\), T, \(\|v\|_{\infty}\), r, and \(K^{\mathrm{cp}}\).
Substep 2.6. (Concluding the estimate of \(|\widetilde{Y}(t) - \widetilde{Y}'(t) |\)) Substituting (5.20) in (5.15), we obtain that
Multiplying both sides by \(e^{-\alpha s}\) and using that \(e^{- \alpha s} \leq e^{-\alpha r}\), we get that a.s. for \(s \in [0,t]\)
Taking the supremum for \(s \in [0,t]\) and the expectation, we deduce that
and thus, by Grönwall’s inequality,
which for \(t = T\) yields
Keeping track of the constant C, it depends on \(X^{0}\), T, \(\|v\|_{\infty}\), r, \(K^{\mathrm{cp}}\), \(K^{\mathrm{pg}}\), and \(K^{\mathrm{pc}}\).
Substep 2.7. (Choice of α and end of the proof of the contraction property) We choose \(\alpha > 0\) in such a way that
where C is the constant obtained in (5.21). In this way, by (5.14) and (5.21) we conclude that
i.e., \(\mathcal{L}\colon \mathcal{P}_{1}(C^{0}) \to \mathcal{P}_{1}(C^{0})\) is a contraction with respect to the equivalent Wasserstein distance \(\mathcal{W}_{1,\alpha}\). As such, it admits a unique fixed point \(\bar{\mu}^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0})\).
Step 3. Given the fixed point \(\bar{\mu}^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0})\) of \(\mathcal{L}\), we define \(\bar{X} = (\bar{X}_{1}, \dots, \bar{X}_{N})\) as the solution to (5.3) corresponding to \(\bar{\mu}^{\mathrm{p}}\), and then we let Ȳ be the solution to (5.4) corresponding to X̄. Since \(\bar{\mu}^{\mathrm{p}}\) is a fixed point, we have that \(\mathcal{L}(\bar{\mu}^{\mathrm{p}}) = \bar{\mu}^{\mathrm{p}}\), i.e., \(\mathrm{Law}(\bar{Y}) = \bar{\mu}^{\mathrm{p}}\). Hence we found the unique strong solution to the coupled system. This concludes the proof. □
Remark 5.3
By (5.8), it follows that \(\max_{n} \|\bar{X}_{n}\|_{\infty}\) is bounded by a constant depending on the initial datum \(X^{0}\), the final time T, \(\|v^{N}\|_{\infty}\), r, and \(K^{\mathrm{cp}}\).
By (5.12), it follows that \(\mathbb{E}(\|\bar{Y}\|_{\infty}) \leq C(1+\mathbb{E}(|\bar{Y}^{0}|))\), where the constant C depends on \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\), \(\|Z\|_{\infty}\), \(\max_{n} \|\bar{X}_{n}\|_{\infty}\), T, and W.
6 Propagation of chaos
Proposition 6.1
Assume the following:
-
Let \((W_{1}(t))_{t \in [0,T]}\) and \((W_{2}(t))_{t \in [0,T]}\) be two \(\mathbb{R}^{2}\)-valued Brownian motions.
-
Let \(X^{0} = (X_{1}^{0},\dots,X_{N}^{0}) \in \mathbb{R}^{2{\times }N}\);
-
Let \(Y_{1}^{0}, Y_{2}^{0}\) be identically distributed \(\mathbb{R}^{2}\)-valued random variables with \(\mathbb{E}(|Y_{m}^{0}|) < +\infty \);
-
Let \(Z^{0} = (Z_{1}^{0},\dots,Z_{L}^{0}) \in \mathbb{R}^{2{\times }L}\);
-
Let \(u \in L^{\infty}([0,T];\mathcal{U})\).
For \(m = 1, 2\), let \(\bar{X}_{m} = (\bar{X}_{m,1},\dots,\bar{X}_{m,N})\), \((\bar{Y}_{m}(t))_{t \in [0,T]}\), \(Z = (Z_{1}, \dots, Z_{L})\) be the unique strong solution toFootnote 8
Then the stochastic processes \((\bar{Y}_{1}(t))_{t \in [0,T]}\) and \((\bar{Y}_{2}(t))_{t \in [0,T]}\) are identically distributed and \(\bar{X}_{1}(t) = \bar{X}_{2}(t)\) for \(t \in [0,T]\).
Proof
We fix \(Z = (Z_{1},\dots,Z_{L})\) as the solution to
which is independent of m since it is decoupled from the first two sets of equations.
Let \(m \in \{1,2\}\). We resort to some tools already considered in Step 2 in the proof of Proposition 5.2. As in that proof, we set \(C^{0}:= C^{0}([0,T];\mathbb{R}^{2})\).
Step 1. (Exploiting the decoupled system) Given \(\mu \in \mathcal{P}_{1}(C^{0})\), we let \(\widetilde{X} = (\widetilde{X}_{1},\dots,\widetilde{X}_{N})\) and \((\widetilde{Y}_{m}(t))_{t \in [0,T]}\) be the unique solution to the decoupled system
obtained as explained in Step 1 in the proof of Proposition 5.2. We claim that
Using the short-hand notation introduced in (5.9), we have a.s. for \(t \in [0,T]\)
Substep 1.1. (Proof of claim (6.4) for Picard iterations \(\widetilde{Y}_{m}^{j}\)) We consider the Picard iterations (used in the proof of Proposition 2.1) constructed as follows for \(m=1,2\): for \(\omega \in \Omega \),
We observe that \(\mathrm{Law}(\widetilde{Y}_{1}^{0}) = \mathrm{Law}(\widetilde{Y}_{2}^{0})\), as by (6.5) they coincide with the common law of the identically distributed random variables given by the initial data \(Y_{1}^{0}\), \(Y_{2}^{0}\). This is the base step of an induction argument. Let \(j \geq 1\) and assume \(\mathrm{Law}(\widetilde{Y}_{1}^{j-1}) = \mathrm{Law}(\widetilde{Y}_{2}^{j-1})\). Let \(\Psi \colon \mathbb{R}^{2} {\times }C^{0} {\times }C^{0} \to C^{0}\) be the continuous map defined by
With this notation, (6.6) reads \(Y_{m}^{j}(\cdot,\omega ) = \Psi (Y_{m}^{0}(\omega ), \widetilde{Y}^{j-1}_{m}( \cdot, \omega ), W_{m}(\cdot,\omega ))\) for \(\omega \in \Omega \) such that \(W_{m}(\cdot,\omega )\) is a continuous path (this occurs a.s.). Then we have that
Since \(Y^{0}_{1}\), \(Y^{0}_{2}\) are identically distributed, by the induction assumption, and since \(\mathrm{Law}(W_{1}) = \mathrm{Law}(W_{2})\) (it is the Wiener measure), we have that \((Y_{1}^{0}, \widetilde{Y}^{j-1}_{1}, W_{1})_{\#}\mathbb{P}= (Y_{2}^{0}, \widetilde{Y}^{j-1}_{2}, W_{2})_{\#}\mathbb{P}\). Thus, repeating backward the same computations for \(\widetilde{Y}_{2}^{j}\), we conclude that \(\mathrm{Law}(\widetilde{Y}_{1}^{j}) = \mathrm{Law}(\widetilde{Y}_{2}^{j})\).
Substep 1.2. (Convergence of Picard iterations to \(\widetilde{Y}_{m}\)) By Remark 2.2 we have that \(\mathbb{E}(\| \widetilde{Y}^{j}_{m} - \widetilde{Y}_{m}\|_{\infty}) \to 0\). (Note that \(b_{\widetilde{X}}\) is globally Lipschitz continuous, as proven in (5.11).)
Substep 1.3. (Proof of claim (6.4)) The convergence \(\mathbb{E}(\|\widetilde{Y}_{m}^{j} - \widetilde{Y}_{m}\|_{\infty}) \to 0\) implies that \(\widetilde{Y}_{m}^{j} \to \widetilde{Y}_{m}\) in law, hence \(\mathrm{Law}(\widetilde{Y}_{1}) = \mathrm{Law}(\widetilde{Y}_{2})\), which is our claim (6.4).
Step 2. (Exploiting the fixed point) For \(m=1,2\), we consider the functionals \(\mathcal{L}_{m} = \mathcal{L}_{Y^{0}_{m}, W_{m}}\colon \mathcal{P}_{1}(C^{0}) \to \mathcal{P}_{1}(C^{0})\) defined as in Step 2 in the proof of Proposition 5.2 (we stress here the dependence on m to keep track of the dependence on the initial datum \(Y^{0}_{m}\) and the Brownian motion \(W_{m}\)). Given \(\mu \in \mathcal{P}_{1}(C^{0})\), we let \(\widetilde{X} = (\widetilde{X}_{1},\dots,\widetilde{X}_{N})\) and \((\widetilde{Y}_{m}(t))_{t \in [0,T]}\) be the unique solution to the decoupled system (6.2)–(6.3). Then we set \(\mathcal{L}_{m}(\mu ):= \mathrm{Law}(\widetilde{Y}_{m})\). By the discussion in Step 1, we have that \(\mathcal{L}_{1}(\mu ) = \mathcal{L}_{2}(\mu )\).
Let us now fix an initial guess for the law μ, e.g., \(\mu = \delta _{0} \in \mathcal{P}_{1}(C^{0})\) (it is enough that it satisfies \(\mathcal{W}_{1}(\mu,\bar{\mu}^{\mathrm{p}}_{m}) < +\infty \)). We apply iteratively \(\mathcal{L}_{m}^{0}(\mu ) = \mu \), \(\mathcal{L}_{m}^{j}(\mu ) = \mathcal{L}_{m}(\mathcal{L}^{j-1}_{m}( \mu ))\). Since \(\mathcal{L}_{m}\) is a contraction with respect to the modified 1-Wasserstein distance \(\mathcal{W}_{1,\alpha}\), \(\mathcal{L}_{m}^{j}(\mu ) \to \bar{\mu}^{\mathrm{p}}_{m}\) as \(j \to +\infty \), where \(\bar{\mu}^{\mathrm{p}}_{m}\) is the unique fixed point \(\bar{\mu}^{\mathrm{p}}_{m} = \mathcal{L}_{m}(\bar{\mu}^{\mathrm{p}}_{m})\). Since \(\mathcal{L}_{1}(\mu ) = \mathcal{L}_{2}(\mu )\), we conclude that \(\bar{\mu}^{\mathrm{p}}_{1} = \bar{\mu}^{\mathrm{p}}_{2}\), i.e., the law given by the solution \(\bar{Y}_{m}\) to (6.1) does not depend on m. In conclusion, \(\bar{Y}_{1}, \bar{Y}_{2}\) are identically distributed. We let \(\bar{\mu}^{\mathrm{p}}\) denote their common law.
The solution \(\bar{X}_{m} = (\bar{X}_{m,1},\dots,\bar{X}_{m,N})\) is then obtained as the solution to (6.2) corresponding to \(\bar{\mu}^{\mathrm{p}}\). Thus it does not depend on m, yielding \(\bar{X}_{1} = \bar{X}_{2}\). □
Proposition 6.2
Assume the following:
-
Let \((W_{m}(t))_{t \in [0,T]}\), \(m = 1, \dots,M\) be M independent \(\mathbb{R}^{2}\)-valued Brownian motions;
-
Let \(X^{0} = (X_{1}^{0},\dots,X_{N}^{0}) \in \mathbb{R}^{2{\times }N}\);
-
Let \(Y_{1}^{0}, \dots, Y_{M}^{0}\) be i.i.d. \(\mathbb{R}^{2}\)-valued random variables with \(\mathbb{E}(|Y_{m}^{0}|) < +\infty \) and independent of the Brownian motions \((W_{m}(t))_{t \in [0,T]}\);
-
Let \(Z^{0} = (Z_{1}^{0},\dots,Z_{L}^{0}) \in \mathbb{R}^{2 {\times }L}\);
-
Let \(u \in L^{\infty}([0,T];\mathcal{U})\).
For every \(m = 1, \dots, M\), let \(\bar{X} = (\bar{X}_{1},\dots,\bar{X}_{N})\), \((\bar{Y}_{m}(t))_{t \in [0,T]}\), \(Z = (Z_{1}, \dots, Z_{L})\) be the unique strong solution toFootnote 9
Then the stochastic processes \((\bar{Y}_{1}(t))_{t \in [0,T]}, \dots, (\bar{Y}_{M}(t))_{t \in [0,T]}\) are independent.
Proof
The leading idea of the proof is to write \(\bar{Y}_{m}\) in terms of the initial datum \(Y_{m}^{0}\) and the Brownian motion \(W_{m}\).
We consider the solution operator \(\mathcal{S}\colon \mathbb{R}^{2} {\times }C^{0} \to C^{0}\) defined by \(\mathcal{S}(\xi,w):= \varphi \), where φ is the unique solution to the integral equation
The fact that there exists a unique solution to the previous problem follows from the fact that the operator \(\Psi \colon \mathbb{R}^{2} {\times }C^{0} {\times }C^{0} \to C^{0}\) defined by
is such that \(\Psi (\xi, \cdot, w) \colon C^{0} \to C^{0}\) is a contraction with respect to the auxiliary norm \({\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }\varphi {\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }_{\alpha }:= \sup_{t \in [0,T]} (e^{-\alpha t} |\varphi (t)|)\) for suitable \(\alpha > 0\). Indeed, by the Lipschitz continuity of \(b_{\bar{X}}\),
hence, choosing \(\alpha > 0\) such that \(C_{\alpha }= \frac{C}{\alpha} < 1\),
and thus it has a unique fixed point.
We now observe that the solution operator \(\mathcal{S}\colon \mathbb{R}^{2} {\times }C^{0} \to C^{0}\) is continuous. Indeed, it is Lipschitz with respect to both variables. Letting \(\varphi _{1} = \mathcal{S}(\xi _{1},w)\) and \(\varphi _{2} = \mathcal{S}(\xi _{2}, w)\), by the Lipschitz continuity of \(b_{\bar{X}}\), we have that
thus, by Grönwall’s inequality,
Analogously, letting \(\varphi _{1} = \mathcal{S}(\xi,w_{1})\) and \(\varphi _{2} = \mathcal{S}(\xi, w_{2})\), by the Lipschitz continuity of \(b_{\bar{X}}\), we have that
thus, by Grönwall’s inequality,
We are now in a position to write the stochastic processes \((\bar{Y}_{m}(t))_{t \in [0,T]}\) as \(Y_{m}(\cdot, \omega ) = \mathcal{S}(Y_{m}^{0}(\omega ), W_{m}( \cdot, \omega ))\) for a.e. \(\omega \in \Omega \). Note that \(Y_{1}^{0}, \dots, Y_{M}^{0} \colon \Omega \to \mathbb{R}^{2}\) and \(W_{1}, \dots, W_{M} \colon \Omega \to C^{0}\) are independent random variables. It follows that \((\bar{Y}_{1}(t))_{t \in [0,T]}, \dots, (\bar{Y}_{M}(t))_{t \in [0,T]}\) are independent stochastic processes. This concludes the proof. □
7 Mean-field limit for a large number of pirate ships
In this section we study the limit of the problem as \(M \to +\infty \). For this reason we will stress the dependence of initial data and solutions on M. Still, we do not stress dependence on N, not to overburden the notation.
7.1 Mean-field ODE/SDE/ODE limit model as \(M\to +\infty \)
In the following theorem we shall describe convergence of solutions in terms of empirical measures. Given stochastic processes \((S_{1}(t))_{t\in [0,T]}, \dots, (S_{M}(t))_{t\in [0,T]}\) a.s. with continuous paths, we associate the empirical measureFootnote 10\(\nu _{M} \colon \Omega \to \mathcal{P}(C^{0}([0,T];\mathbb{R}^{2}))\) defined for a.e. \(\omega \in \Omega \) by
(The first placeholder is kept free for the time variable.) If \(\max_{m}\mathbb{E}(\|S_{m}\|_{\infty}) < +\infty \), then a.s. \(\nu _{M} \in \mathcal{P}_{1}(C^{0}([0,T];\mathbb{R}^{2}))\). Indeed,
We set \(\nu _{M}(t,\omega ):= (\mathrm{ev}_{t})_{\#} \nu _{M}(\cdot, \omega )\) for all \(\omega \in \Omega \) and \(t \in [0,T]\). With a slight abuse of notation, we let \(\nu _{M}(t)\) denote the random measure \(\nu _{M}(t) \colon \Omega \to \mathcal{P}(\mathbb{R}^{2})\).
Theorem 7.1
Assume the following:
-
Let \((W_{m}(t))_{t \in [0,T]}\), \(m \geq 1\), be a sequence of independent \(\mathbb{R}^{2}\)-valued Brownian motions;
-
Let \(X^{0} = (X_{1}^{0},\dots,X_{N}^{0}) \in \mathbb{R}^{2{\times }N}\);
-
Let \(Y^{0} =(Y_{1}^{0}, \dots, Y_{M}^{0})\), where \(Y_{1}^{0}, \dots, Y_{M}^{0}\) are i.i.d. \(\mathbb{R}^{2}\)-valued random variables with \(\mathbb{E}(|Y^{0}_{m}|) < +\infty \) and independent of the Brownian motions \((W_{m}(t))_{t \in [0,T]}\);
-
Let \(Z^{0} = (Z_{1}^{0},\dots,Z_{L}^{0}) \in \mathbb{R}^{2 {\times }L}\);
-
Let \((W(t))_{t \in [0,T]}\) be a Brownian motion;
-
Let \(\bar{Y}^{0}\) be an \(\mathbb{R}^{2}\)-valued random variable identically distributed to \(Y_{1}^{0}, \dots, Y_{M}^{0}\).
Let \(u^{M}, u \in L^{\infty}([0,T];\mathcal{U})\) be such that \(u^{M} \stackrel{*}{\rightharpoonup }u\) weakly* in \(L^{\infty}([0,T];\mathcal{U})\).Footnote 11Let \((X^{M}(t))_{t \in [0,T]} = (X^{M}_{1}(t),\dots,X^{M}_{N}(t))_{t \in [0,T]}\), \((Y^{M}(t))_{t \in [0,T]} = (Y^{M}_{1}(t), \dots, Y^{M}_{M}(t))_{t \in [0,T]}\), and \(Z^{M} = (Z^{M}_{1},\dots,Z^{M}_{L})\) be the unique strong solution toFootnote 12
Let \(\nu ^{\mathrm{p}}_{M}\) be the empirical measures associated with \((Y^{M}_{1}(t))_{t \in [0,T]}, \dots, (Y^{M}_{M}(t))_{t \in [0,T]}\). Then there exist \(\bar{X} = (\bar{X}_{1},\dots, \bar{X}_{N})\), \((\bar{Y}(t))_{t \in [0,T]}\), and \(Z = (Z_{1},\dots,Z_{L})\) such that
Moreover, \(\bar{X} = (\bar{X}_{1},\dots, \bar{X}_{N})\), \((\bar{Y}(t))_{t \in [0,T]}\), and \(Z = (Z_{1},\dots,Z_{L})\) are the unique strong solution to (5.1).Footnote 13
Proof
To prove the result, we need to exploit an intermediate problem. For \(m=1,\dots, M\), let \(\bar{X} = (\bar{X}_{1},\dots, \bar{X}_{N})\), \((\bar{Y}^{M}_{m}(t))_{t \in [0,T]}\), and \(Z = (Z_{1},\dots,Z_{L})\) be the unique strong solution toFootnote 14
Our first task is to prove that
from which (7.3) will follow as shown in Step 5.
As in the previous proofs, let \(C^{0}:= C^{0}([0,T];\mathbb{R}^{2})\). Let us also consider the empirical measuresFootnote 15\(\bar{\nu}^{\mathrm{p}}_{M} \colon \Omega \to \mathcal{P}(C^{0})\) associated with \((\bar{Y}^{M}_{1}(t))_{t \in [0,T]}, \dots, (\bar{Y}^{M}_{M}(t))_{t \in [0,T]}\). To be precise, we have that for a.e. \(\omega \in \Omega \)
(The first placeholder is kept free for the time variable.) Notice that, in fact, a.s. \(\nu ^{\mathrm{p}}_{M} \in \mathcal{P}_{1}(C^{0})\) and \(\bar{\nu}^{\mathrm{p}}_{M} \in \mathcal{P}_{1}(C^{0})\) by (7.1) and since by Proposition 4.1 and Proposition 5.2 we have that \(\mathbb{E}(\max_{m} \|Y^{M}_{m}\|_{\infty}) < +\infty \) and \(\mathbb{E}(\max_{m} \|\bar{Y}^{M}_{m}\|_{\infty}) < +\infty \), respectively.
Step 1. (Estimate of \(|Y^{M}_{m} - \bar{Y}^{M}_{m}|\)) Using the fact that \(Y^{M}_{m}\) and \(\bar{Y}^{M}_{m}\) are strong solutions to (7.2) and (7.4), respectively, and by the Lipschitz continuity of \(K^{\mathrm{pg}}\) and \(K^{\mathrm{pc}}\), we have that a.s. for \(0\leq s \leq t\) and \(m=1,\dots,M\)
the constant C depending on \(K^{\mathrm{pg}}\) and \(K^{\mathrm{pc}}\). Taking the supremum in \(s \in [0,t]\), the maximum in m and then the expectation, we obtain that for every \(t \in [0,T]\)
Step 2. (Estimate of \(|X^{M}_{n} - \bar{X}_{n}|\)) To estimate \(|X^{M}_{n}(s) - \bar{X}_{n}(s)|\), we rewrite
Then we exploit the properties of \(v^{N}\), r, and \(K^{\mathrm{cp}}\) and (7.7) to get from (7.2) that a.s. for \(0\leq s \leq t\) and \(n=1,\dots,N\)
To estimate the term involving \(|K^{\mathrm{cp}}*\nu ^{\mathrm{p}}_{M}(r)(X^{M}_{n}(r)) - K^{ \mathrm{cp}}*\bar{\mu}^{\mathrm{p}}(r)(\bar{X}_{n}(r))|\) in (7.8), we exploit Kantorovich’s duality and the Lipschitz continuity of \(K^{\mathrm{cp}}\) to get that a.s.
We bound \(\mathcal{W}_{1}(\nu ^{\mathrm{p}}_{M}(r), \bar{\nu}^{\mathrm{p}}_{M}(r))\) using for a.e. \(\omega \in \Omega \) as an admissible transport plan the diagonal transport \(\gamma (\omega ) = \frac{1}{N} \sum_{n=1}^{N} \delta _{(Y^{M}_{n}(r, \omega ), \bar{Y}^{M}_{n}(r, \omega ))}\) to obtain that a.s.
To estimate the term involving \(K^{\mathrm{cp}}*\bar{\mu}^{\mathrm{p}}(r)(\bar{X}_{n}(r))\) in (7.8), we use the fact that \(|K^{\mathrm{cp}}(z)| \leq |K^{\mathrm{cp}}(0)| + C|z|\) to get that
where in the last inequality we used that
which is finite since \(\bar{\mu}^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0})\) by Proposition 5.2. By Remark 5.3 we recall that \(\max_{n} \|\bar{X}_{n} \|_{\infty}\) is bounded by a constant depending on \(\max_{n} \|X^{0}_{n} \|_{\infty}\), T, \(\|v^{N}\|_{\infty}\), r, and \(K^{\mathrm{cp}}\). Hence
Then we can proceed with the estimate in (7.8): By (7.9)–(7.11) and by exploiting also the Lipschitz continuity of r and \(v^{N}\), we obtain that
Taking the supremum in s, the maximum in n, and then the expectation, we obtain that for every \(t \in [0,T]\)
Step 3. (Grönwall’s inequality) Putting together (7.6) and (7.12), we have that for every \(t \in [0,T]\)
By Grönwall’s inequality, we deduce that for every \(t \in [0,T]\)
In particular,
where the constant depends additionally on T.
Step 4. (Convergence to zero of \(\alpha (M)\)) To conclude the proof, we show that \(\alpha (M) \to 0\) as \(M\to +\infty \).
Substep 4.1. Let us show that \(\|Z^{M} - Z\|_{\infty }\to 0\) as \(M \to +\infty \). We start by observing that by (7.2) and (7.4)
where the constant C depends on \(K^{\mathrm{gg}}\). By Grönwall’s inequality, it follows that
where \(R_{M}(t) = | \int _{0}^{t} ( u^{M}(s) - u(s) ) \,\mathrm{d}s |\), hence
Since \(u^{M} \stackrel{*}{\rightharpoonup }u\) weakly* in \(L^{\infty}([0,T];\mathcal{U})\), we have that \(R_{M}(t) \to 0\) for every \(t \in [0,T]\). Moreover, by the boundedness of \(\mathcal{U}\), \(R_{M}(t)\) are equibounded and equi-Lipschitz. It follows that \(\|R_{M}\|_{\infty }\to 0\), thus \(\|Z^{M} - Z\|_{\infty }\to 0\).
Substep 4.2. Let us show that \(\int _{0}^{T} \mathbb{E} ( \mathcal{W}_{1}(\bar{\nu}^{\mathrm{p}}_{M}(s), \bar{\mu}^{\mathrm{p}}(s)) ) \,\mathrm{d}s \to 0\) as \(M \to +\infty \).
To show this, we apply the discussion in Sect. 2.6 about the approximation of a law (here played by \(\bar{\mu}^{\mathrm{p}}(s)\)) with empirical measures on independent samples of the law (here played by \(\bar{\nu}^{\mathrm{p}}_{M}(s)\)). Let us check that all the assumptions hold true. For every \(s \in [0,T]\), we have that \(\bar{\mu}^{\mathrm{p}}(s) \in \mathcal{P}_{1}(\mathbb{R}^{d})\). This follows from the fact that, by Proposition 6.1, \(\bar{\mu}^{\mathrm{p}}= \mathrm{Law}(\bar{Y}^{M}_{1}) = \cdots = \mathrm{Law}(\bar{Y}^{M}_{M}) = \mathrm{Law}(\bar{Y})\), thus
where the finiteness of \(\mathbb{E}(\|\bar{Y}\|_{\infty})\) follows from Proposition 5.2. Moreover, the random variables \(\bar{Y}^{M}_{1}(s), \dots, \bar{Y}^{M}_{M}(s)\) are i.d. with law \((\bar{Y}^{M}_{m}(s,\cdot ))_{\#} \mathbb{P}= (\mathrm{ev}_{s})_{\#} ( \bar{Y}^{M}_{m})_{\#} \mathbb{P}= (\mathrm{ev}_{s})_{\#} \bar{\mu}^{ \mathrm{p}}= \bar{\mu}^{\mathrm{p}}(s)\). Finally, by Proposition 6.2 we have that \((\bar{Y}^{M}_{1}(t))_{t \in [0,T]}, \dots, (\bar{Y}^{M}_{M}(t))_{t \in [0,T]}\) are independent stochastic processes, thus, in particular, \(\bar{Y}^{M}_{1}(s), \dots, \bar{Y}^{M}_{M}(s)\) are independent random variables. By [33, Lemma 4.7.1] we conclude that
as \(M \to +\infty \). Let us now show that \(s \mapsto \mathbb{E} (\mathcal{W}_{1}(\bar{\nu}^{\mathrm{p}}_{M}(s), \bar{\mu}^{\mathrm{p}}(s)) )\) is dominated. Indeed, since \(\bar{Y}_{1}(s), \dots, \bar{Y}_{M}(s)\) are identically distributed and by (7.13), for every \(s \in [0,T]\), we have that
where the finiteness of the last term follows from Proposition 5.2. We conclude that
as \(M \to +\infty \). This concludes the proof of (7.5).
Step 5. (Conclusion with the proof of (7.3)) By (7.10), we have that
Combining this with (7.14) and (7.5), we obtain (7.3) and we conclude the proof. □
Proposition 7.2
Under the assumptions of Theorem 7.1, the curve \(\bar{X} = (\bar{X}_{1},\dots,\bar{X}_{N})\), the law \(\bar{\mu}^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0}([0,T];\mathbb{R}^{2}))\), and the curve \(Z = (Z_{1},\dots,Z_{L})\) from (7.4) are solutions to the ODE/PDE/ODE system
where the parabolic PDE is understood in the sense of distributions.Footnote 16
Proof
We exploit the fact that \(\bar{\mu}^{\mathrm{p}}\) is the law of stochastic processes \((\bar{Y}(t))_{t \in [0,T]}\), where \((\bar{Y}(t))_{t \in [0,T]}\) solves the SDE
Let us fix a test function \(\xi \in C^{\infty}_{c}((-\infty,T) {\times }\mathbb{R}^{2})\). By Itô’s formula [28, Theorem 6.4], we have that \((\xi (t,\bar{Y}(t)))_{t \in [0,T]}\) is an Itô process solving the SDE
with initial datum \(\xi (0,\bar{Y}^{0})\). This means that a.s. for every \(t \in [0,T]\)
By [28, Lemma 5.4] we have that
Thus, taking the expectation, we obtain in particular that
Using the fact that \(\bar{\mu}^{\mathrm{p}}(t) = \mathrm{Law}(\bar{Y}(t))\) and \(\xi (T,\cdot ) \equiv 0\), we get that
This concludes the proof. □
7.2 Limit of optimal control problems as \(M \to + \infty \)
Let us consider the following cost functional for the limit problem obtained in (7.1). Let \(\mathcal{J}_{N} \colon L^{\infty}([0,T]; \mathcal{U}) \to \mathbb{R}\) be defined for every \(u \in L^{\infty}([0,T];\mathcal{U})\) by
where \(\bar{X} = (\bar{X}_{1}, \dots, \bar{X}_{N})\) and \((\bar{Y}(t))_{t \in [0,T]}\) are the unique strong solutions to (5.1) provided by Proposition 5.2. Notice that the definition of \(\mathcal{J}_{N}\) does not depend on the particular initial random datum \(\bar{Y}^{0}\) but only on its law since this is also the case for \(\bar{\mu}^{\mathrm{p}}\) by Proposition 6.1.
Theorem 7.3
Let us fix \(N \geq 1\). Under the assumptions of Theorem 7.1, the sequence of functionals \((\mathcal{J}_{N,M})_{M \geq 1}\) Γ-converges to \(\mathcal{J}_{N}\) as \(M \to +\infty \) with respect to the weak* topology in \(L^{\infty}([0,T];\mathcal{U})\).Footnote 17
Proof
Step 1. (Asymptotic lower bound). Let us fix a sequence of controls \((u^{M})_{M \geq 1}\), \(u^{M} \in L^{\infty}([0,T];\mathcal{U})\) such that \(u^{M} \stackrel{*}{\rightharpoonup }u\) weakly* in \(L^{\infty}([0,T];\mathcal{U})\) as \(M \to +\infty \). Let us show that
On the one hand, by Definition (3.3), we have that
where the stochastic processes \((X^{M}(t))_{t \in [0,T]} = (X^{M}_{1}(t),\dots,X^{M}_{N}(t))_{t \in [0,T]}\), \((Y^{M}(t))_{t \in [0,T]} = (Y^{M}_{1}(t), \dots, Y^{M}_{M}(t))_{t \in [0,T]}\) (and the curve \(Z^{M} = (Z^{M}_{1},\dots,Z^{M}_{L})\)) are the unique strong solution to (7.2). On the other hand, we have that
where the curve \(\bar{X} = (\bar{X}_{1}, \dots, \bar{X}_{N})\), the stochastic process \((\bar{Y}(t))_{t \in [0,T]}\) with law \(\bar{\mu}^{\mathrm{p}}\) (and the curve \(Z = (Z_{1},\dots,Z_{L})\)) are the unique strong solution to (5.1).
By the weak sequential lower semicontinuity of the \(L^{2}\)-norm, we have that
Let us prove the convergence
as \(M \to +\infty \). This will conclude the proof of (7.17).
We exploit the equality
to deduce that
We estimate the first term on the right-hand side of (7.19) by using the fact that, by the Lipschitz continuity of \(H^{\mathrm{d}}\), a.s. for every \(t \in [0,T]\)
We estimate the second term on the right-hand side of (7.19) by Kantorovich’s duality, which by the Lipschitz continuity of \(H^{\mathrm{d}}(\bar{X}_{n}(t) - \cdot )\) yields a.s. for every \(t \in [0,T]\)
Putting together the previous inequalities, we conclude that
Step 2. (Asymptotic upper bound). Let us fix \(u \in L^{\infty}([0,T];\mathcal{U})\). For every \(M \geq 1\), let us set \(u^{M} = u\). As in Step 1, we have that
where the stochastic processes \((X^{M}(t))_{t \in [0,T]}\), \((Y^{M}(t))_{t \in [0,T]}\) (and the curve \(Z^{M}\)) are the unique strong solution to (7.2) corresponding to the control \(u^{M} = u\) and
where the curve X̄, the stochastic process \((\bar{Y}(t))_{t \in [0,T]}\) with law \(\bar{\mu}^{\mathrm{p}}\) (and the curve Z) are the unique strong solution to (5.1). Trivially, we have \(u^{M} \stackrel{*}{\rightharpoonup }u\), hence we deduce (7.18) once again and, in particular, the asymptotic upper bound
This concludes the proof. □
As a byproduct, we obtain the following result.
Proposition 7.4
Under the assumptions of Proposition 5.2, there exists an optimal control \(u^{*} \in L^{\infty}([0,T];\mathcal{U})\), i.e.,
Proof
The proof is standard in the theory of Γ-convergence. Let us consider a sequence of independent Brownian motions \((W_{m}(t))_{t \in [0,T]}\), \(m \geq 1\) and \(Y^{0}_{1}, \dots, Y^{0}_{M}\) i.i.d. random variable with the same law of \(\bar{Y}^{0}\). Let \((u^{M})_{M \geq 1}\) be a sequence such that \(\mathcal{J}_{N,M}(u^{M}) = \inf \mathcal{J}_{N,M}\). Since \((u^{M})_{M \geq 1}\) is bounded in \(L^{\infty}([0,T];\mathcal{U})\), there exist \(u^{*}\) and a subsequence (not relabeled) such that \(u^{M} \stackrel{*}{\rightharpoonup }u^{*}\) weakly-* in \(L^{\infty}([0,T];\mathcal{U})\). By Theorem 7.3 we have that
(Here we used the fact that the recovery sequence for \(u^{*}\) is the constant sequence given by \(u^{*}\), see the proof of Theorem 7.3.) □
8 Mean-field limit for a large number of commercial ships
In this section we study the limit of the problem as \(N \to +\infty \). For this reason we will stress the dependence of initial data and solutions on N.
8.1 Mean-field limit as \(N \to +\infty \)
In this section, we will use the explicit formula for the velocity correction
where we set
In what follows, we shall use the symbol ∗2 to indicate that the convolution is done with respect to the second variable, i.e., \(\eta *_{2} \nu (x) = \int _{\mathbb{R}^{2}} \eta (x,x-x') \,\mathrm{d}\nu (x')\).
Theorem 8.1
Assume the following:
-
Let \((W(t))_{t \in [0,T]}\) be an \(\mathbb{R}^{2}\)-valued Brownian motion;
-
Let \(X^{N,0} = (X_{1}^{0},\dots,X_{N}^{0}) \in \mathbb{R}^{2{\times }N}\) and assume that \(\max_{n} \|X^{N,0}_{n}\|_{\infty }\leq R_{0}\) with \(R_{0}\) independent of N;
-
Let \(\bar{Y}^{0}\) be an \(\mathbb{R}^{2}\)-valued random variable with \(\mathbb{E}(|\bar{Y}^{0}|) < + \infty \);
-
Let \(Z^{0} = (Z_{1}^{0},\dots,Z_{L}^{0}) \in \mathbb{R}^{2 {\times }L}\);
-
Let \(\mu ^{\mathrm{c}}_{0} \in \mathcal{P}_{1}(\mathbb{R}^{2})\) with \(\operatorname{supp}(\mu ^{\mathrm{c}}_{0}) \subset \bar{B}_{R_{0}}\) be such that \(\mathcal{W}_{1}(\frac{1}{N} \sum_{n=1}^{N} \delta _{X_{n}^{0}}, \mu ^{\mathrm{c}}_{0}) \to 0\) as \(N \to +\infty \);
Let \(u^{N}, u \in L^{\infty}([0,T];\mathcal{U})\) be such that \(u^{N} \stackrel{*}{\rightharpoonup }u\) weakly* in \(L^{\infty}([0,T];\mathcal{U})\).Footnote 18Let \(\bar{X}^{N} = (\bar{X}^{N}_{1},\dots, \bar{X}^{N}_{N})\), \((\bar{Y}^{N}(t))_{t \in [0,T]}\), and \(Z^{N} = (Z^{N}_{1},\dots, Z^{N}_{L})\) be the unique strong solution toFootnote 19
Let us consider the measures
Then there exist \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \((\bar{Y}(t))_{t \in [0,T]}\), and \(Z=(Z_{1},\dots,Z_{L})\) such that
Moreover, \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \((\bar{Y}(t))_{t \in [0,T]}\), and \(Z=(Z_{1},\dots,Z_{L})\) provide the unique solution to
Proof
Step 1. (PDE solved by the empirical measures) In terms of \(\nu ^{\mathrm{c}}_{N}(t)\), \(v^{N}_{n}(\bar{X}^{N}(t))\) reads
Let us derive the PDE solved by \(\nu ^{\mathrm{c}}_{N}(t)\) in the sense of distributions.Footnote 20 Let us fix \(\xi \in C^{\infty}_{c}((-\infty,T) {\times }\mathbb{R}^{2})\). By (8.2) and (8.5) we have that
This means that \(\nu ^{\mathrm{c}}_{N}\) is a distributional solution to
Step 2. (Convergence of empirical measures \(\nu ^{\mathrm{c}}_{N}\)) To show the compactness of the sequence of curves \(\nu ^{\mathrm{c}}_{N} \in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), we rely on the Arzelà–Ascoli theorem for metric-valued functions. We split the proof in substeps.
Substep 2.1. (Equiboundedness of supports) By Remark 5.3, we have that \(\max_{n} \|\bar{X}^{N}_{n} \|_{\infty }\leq R\), where the constant R depending on the initial datum \(X^{0}\), the final time T, \(\|v^{N}\|_{\infty}\), r, and \(K^{\mathrm{cp}}\). This implies that \(\operatorname{supp} (\nu ^{\mathrm{c}}_{N}(t) )\) are contained in the closed ball \(\bar{B}_{R}\) for every \(t \in [0,T]\).
Substep 2.2. (Equicontinuity) Let us prove that \(\nu ^{\mathrm{c}}_{N} \in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\) are equicontinuous.
We observe that the sequence \(\|Z^{N}\|_{\infty}\) is bounded. Indeed, by (8.2),
the constant C depending on \(K^{\mathrm{gg}}\) and the set of admissible controls \(\mathcal{U}\) (bounded). Taking the norm of \(Z^{N}\) and by Grönwall’s inequality, we obtain that
where the constant \(R'\) depends on \(K^{\mathrm{gg}}\), \(\mathcal{U}\), and T.
By Remark 5.3, for every \(r \in [0,T]\), we have that
where the constant C depends on \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\), \(\|Z^{N}\|_{\infty}\) (bounded by \(R'\)), \(\max_{n} \|\bar{X}^{N}_{n} \|_{\infty}\) (bounded by R), T, and W. Then the Lipschitz continuity of \(K^{\mathrm{cp}}\) and (8.8) yield
where the constant C additionally depends on \(\mathbb{E}(|\bar{Y}^{0}|)\).
By (8.2) and (8.9), for \(s \leq t\) and \(n=1,\dots,N\), we have that
where the constant C depends on the constant obtained in (8.9) and additionally on \(\|v^{N}\|_{\infty}\) and r. Using as a transport plan between \(\nu ^{\mathrm{c}}_{N}(s)\) and \(\nu ^{\mathrm{c}}_{N}(t)\) the measure \(\gamma = \frac{1}{N} \sum_{n=1}^{N} \delta _{(\bar{X}^{N}_{n}(s), \bar{X}^{N}_{n}(t))}\), we obtain that
i.e., the curves \(\nu ^{\mathrm{c}}_{N} \in C^{0}([0,T]; \mathcal{P}_{1}(\bar{B}_{R}))\) are equi-Lipschitz with respect to the 1-Wasserstein distance.
Substep 2.3. (Compactness) Since the ball \(\bar{B}_{R}\) is compact, the Wasserstein space \(\mathcal{P}_{1}(\bar{B}_{R})\) is compact too [39, Remark 6.19].Footnote 21 Hence the Arzelà–Ascoli theorem for continuous functions with values in a metric space guarantees the existence of a curve \(\mu ^{\mathrm{c}}\in C^{0}([0,T]; \mathcal{P}_{1}(\bar{B}_{R}))\) and a subsequence \(N_{k}\) such that
Without loss of generality, we do not relabel this subsequence and denote it simply by N. This does not affect the proof as in Theorem 8.2 we shall prove the uniqueness of solutions for the limit problem.
Step 3. (Convergence of \(Z^{N}\)) We let \(Z = (Z_{1},\dots, Z_{L})\) be the unique solution to
As in Substep 4.1, in the proof of Theorem 7.1 we get that
Step 4. (Convergence of \(\bar{Y}^{N}\)) Let us consider the SDE
We will show that \(\bar{Y}^{N}\) converges to Ȳ.
Substep 4.1. (Well-posedness of (8.12)) There exists a unique strong solution to (8.12). Indeed, let us consider the drift
and the constant dispersion matrix \(\sigma = \sqrt{2\kappa} \mathrm{Id}_{2}\), so that
Let us observe that b is continuous in t and Lipschitz continuous in Y (with Lipschitz constant independent of t). Indeed, Z is a continuous curve, while by Kantorovich’s duality
and \(t \mapsto \mu ^{\mathrm{c}}(t)\) is a continuous curve in the Wasserstein space \(\mathcal{P}_{1}(\mathbb{R}^{2})\). Moreover, the function \(Y \mapsto K^{\mathrm{pg}}( Y - Z_{\ell}(t))\) is Lipschitz continuous, and so is \(Y \mapsto K^{\mathrm{pc}}*\mu ^{\mathrm{c}}(t)(Y)\) since
Moreover, we have that
and
where the last inequality follows from the fact that Z is bounded and \(\mu ^{\mathrm{c}}(t)\) has support in the ball \(\bar{B}_{R}(0)\) for every \(t \in [0,T]\). We conclude that
where the constant C depends on \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\), \(\|Z\|_{\infty}\), R. Thus the assumptions of Proposition 2.1 are satisfied. Proposition 2.1 also gives us that
where the constant C depends on \(K^{\mathrm{pg}}\), \(K^{\mathrm{pc}}\), \(\|Z\|_{\infty}\), R, \(\bar{Y}^{0}\), T, and W.
Substep 4.2. (Convergence of \(\bar{Y}^{N}\) to Ȳ) Let us prove that
We start by noticing that
Hence, by (8.2), (8.12), (8.13), and by Kantorovich’s duality, we have a.s. for \(0 \leq s \leq t \leq T\)
Taking the supremum and the expectation, we deduce that
and, by Grönwall’s inequality,
In particular,
the constant C depending also on T. By (8.11) and (8.10), we obtain (8.16).
Step 5. (Limit problem) With (8.10), (8.11), and (8.16) at hand, we are in a position to pass to the limit as \(N \to +\infty \) in (8.7) and prove that \(\mu ^{\mathrm{c}}\) is a distributional solution to
i.e.,
We divide the proof in substeps.
Substep 5.1. (Convergence of initial datum term) By the Lipschitz continuity of \(x \mapsto \xi (0,x)\) and by Kantorovich’s duality, we have that
By the assumption on the initial data, we have that \(\mathcal{W}_{1}(\frac{1}{N} \sum_{n=1}^{N} \delta _{X_{n}^{0}}, \mu ^{\mathrm{c}}_{0}) \to 0\), hence
Substep 5.2. (Convergence of time-derivative term) Since \(x \mapsto \partial _{t} \xi (t,x)\) is Lipschitz continuous with a Lipschitz constant independent of t, by Kantorovich’s duality we have that
for every t. By (8.10) it follows that \(\int _{\mathbb{R}^{2}} \partial _{t} \xi (t,x) \,\mathrm{d}\nu ^{ \mathrm{c}}_{N}(t)(x) \to \int _{\mathbb{R}^{2}} \partial _{t} \xi (t,x) \,\mathrm{d}\mu ^{\mathrm{c}}(t)(x)\) as \(N \to +\infty \) uniformly in t, thus
Substep 5.3. (Convergence of divergence term – I) Let us show that
We start by splitting
By the Lipschitz continuity of v, by (8.1), by the Lipschitz continuity of η, and by Kantorovich’s duality, we have that for every \(x \in \mathbb{R}^{2}\) and \(t \in [0,T]\)
where the constant C depends on v and η. Integrating in time and space and using the fact that \(|\mathbf{r}(x)| \leq C(1+|x|)\), thus it is bounded on the compact support of ξ, we obtain that
where the constant C depends on v, η, r, ξ, and T.
Moreover, the function \(x \mapsto v ( \eta *_{2} \mu ^{\mathrm{c}}(t) (x) ) \mathbf{r}(x) \cdot \nabla _{x} \xi (t,x)\) is Lipschitz continuous with a Lipschitz constant independent of t and depending on v, η, r, and ξ. For \(x \mapsto v ( \eta *_{2} \mu ^{\mathrm{c}}(t) (x) )\) satisfies the latter property since
where the constant C depends on v and η. As above, r is bounded on the support of ξ. By the Lipschitz continuity of r and \(\nabla _{x} \xi \), we conclude that the product \(x \mapsto v ( \eta *_{2} \mu ^{\mathrm{c}}(t) (x) ) \mathbf{r}(x) \cdot \nabla _{x} \xi (t,x)\) is also Lipschitz continuous. Thus by Kantorovich’s duality we obtain that for every \(t \in [0,T]\)
where C depends on v, η, r, ξ. Combining (8.22), (8.24), and (8.26), by (8.10) it follows that
where the constant C depends on v, η, r, ξ, and T.
Substep 5.4. (Convergence of divergence term – II) Let us prove that
We start by splitting
For the first term on the right-hand side of (8.28), we argue analogously to (8.24) to obtain that
where the constant C depends on v, η, \(K^{\mathrm{cp}}\), ξ, and T. The only difference consists in the fact that we have \(K^{\mathrm{cp}}*\bar{\mu}^{\mathrm{p}}_{N}(t)(x)\) in place of \(\mathbf{r}(x)\). For this, we need to observe that
In the last inequality, we used the fact that, since \(\bar{\mu}^{\mathrm{p}}_{N}(t)\) is the law of \(\bar{Y}^{N}(t)\),
where the boundedness follows from the convergence (8.16).
For the second term on the right-hand side of (8.28), we start by observing that \(K^{\mathrm{cp}}\) is Lipschitz, thus we have for every \(x \in \mathbb{R}^{2}\) and \(t \in [0,T]\)
By (8.16), it follows that
For the third term on the right-hand side of (8.28), we observe that the function \(x \mapsto v ( \eta *_{2} \mu ^{\mathrm{c}}(t) (x) ) K^{ \mathrm{cp}}*\mu ^{\mathrm{p}}(t)(x) \cdot \nabla _{x} \xi (t,x)\) is Lipschitz continuous with a Lipschitz constant independent of t and depending on v, η, \(K^{\mathrm{cp}}\), \(\mu ^{\mathrm{p}}\), and ξ. This follows from (8.25), from the fact that ξ is compactly supported, and the inequality
obtained as in (8.29). By Kantorovich’s duality,
by (8.10).
Substep 5.5. (Conclusion) Combining (8.6), (8.19), (8.20), (8.21), and (8.27), we conclude the proof of (8.18).
We prove the uniqueness of the solution in Theorem 8.2. □
Theorem 8.2
Under the assumptions of Theorem 8.1, the solution \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \((\bar{Y}(t))_{t \in [0,T]}\), and \(Z=(Z_{1},\dots,Z_{L})\) to (8.4) is unique.
Proof
The uniqueness of Z is direct as the ODE for Z is decoupled from the first two equations.
Assume now that \(\mu ^{\mathrm{c}}_{i} \in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \((\bar{Y}_{i}(t))_{t \in [0,T]}\) for \(i=1,2\) are solutions to (8.4) with the same initial data, i.e.,
where \(\operatorname{supp}(\mu ^{\mathrm{c}}_{i}(0)) = \operatorname{supp}( \mu ^{\mathrm{c}}_{0}) \subset \bar{B}_{R}\). As customary in uniqueness proofs for evolutionary problems, we will temporary neglect the assumption that the initial data \(\bar{Y}_{1}(0)\) and \(\bar{Y}_{2}(0)\) are a.s. equal and \(\mu ^{\mathrm{c}}_{1}(0)\) and \(\mu ^{\mathrm{c}}_{2}(0)\) are equal in order to carry out a Grönwall-type argument to deduce stability with respect to the initial data. The objective is to prove the following pair of estimates:
These two inequalities provide uniqueness when combined. Indeed, if \(\bar{Y}_{1}(0) = \bar{Y}^{0} = \bar{Y}_{2}(0)\) a.s. and \(\mu ^{\mathrm{c}}_{1}(0) = \mu ^{\mathrm{c}}_{0} = \mu ^{\mathrm{c}}_{2}(0)\), then (8.31) simply reads
Substituting into (8.32), we get that
which by Grönwall’s inequality yields \(\mathcal{W}_{1}(\mu ^{\mathrm{c}}_{1}(t), \mu ^{\mathrm{c}}_{2}(t)) = 0\) for all \(t \in [0,T]\). Then (8.31) gives \(\mathbb{E}( \|\bar{Y}_{1} - \bar{Y}_{2}\|_{\infty }) = 0\).
We divide the proof of (8.31)–(8.32) into several steps.
Step 1. (Estimate of \(\mathbb{E}(\|\bar{Y}_{1} - \bar{Y}_{2} \|_{\infty})\)) By (8.30) we have that a.s. for \(0 \leq s \leq t \leq T\)
The first integrand in (8.33) is bounded using the Lipschitz continuity of \(K^{\mathrm{pg}}\) by
The second integrand in (8.33) is estimated using the Lipschitz continuity of \(K^{\mathrm{pc}}\) as follows:
The third integrand in (8.33) is estimated using the Lipschitz continuity of \(K^{\mathrm{pc}}\) by Kantorovich’s duality
Combining (8.33)–(8.36), we infer that a.s. for \(0 \leq s \leq t \leq T\)
Taking the supremum in s and the expectation, we deduce that
By Grönwall’s inequality,
for \(t \in [0,T]\) and, in particular,
for \(t \in [0,T]\), where the constant C also depends on T.
Step 2. (Introducing the flow for the transport equation) Following an idea in [35, 36], we prove uniqueness by regarding the solutions of the transport equation from a Lagrangian point of view. Let us consider for every \(x \in \operatorname{supp}( \mu ^{\mathrm{c}}_{i}(0))\) the flow
Then \(\mu ^{\mathrm{c}}_{i}(t) = \Phi _{i}(t,\cdot )_{\#} \mu ^{\mathrm{c}}_{i}(0)\), see [38, Theorem 5.34].
Let us show that the flows \(\Phi _{i}\) are bounded. We notice that
where we used bound (8.15). By (8.37) and by the estimate \(|r(X)| \leq C(1+|X|)\), we deduce that for every \(x \in \bar{B}_{R}\)
By Grönwall’s inequality and since \(x \in \bar{B}_{R}\), we obtain that
where the constant C depends on \(\|v\|_{\infty}\), r, \(K^{\mathrm{cp}}\), R, and T (in addition to the constant in (8.15)).
In what follows we will show that
for x, \(x' \in \bar{B}_{R}\) and \(t \in [0,T]\).
We start by observing that
for every x, \(x' \in \mathbb{R}^{2}\) and \(t \in [0,T]\).
Step 3. (Estimate of \(|\Phi _{1}(t,x) - \Phi _{2}(t,x)|\)) We estimate the first term on the right-hand side of (8.40) as follows:
In the following substeps we estimate the five terms on the right-hand side of (8.41).
Substep 3.1. Let us estimate the first term on the right-hand side of (8.41). For \(x \in \operatorname{supp}( \mu ^{\mathrm{c}}_{i}(0))\) and \(s \in [0,T]\), we split
We exploit the Lipschitz continuity of v and η to obtain that
Moreover, we use the Lipschitz continuity of \(x' \mapsto \eta (\Phi _{2}(s,x), \Phi _{2}(s,x) - x')\) and Kantorovich’s duality to deduce that
By (8.38) we have that for \(x \in \bar{B}_{R}\) and \(t \in [0,T]\)
By (8.42)–(8.45) we get that for every \(x \in \bar{B}_{R}\) and \(t \in [0,T]\)
Substep 3.2. Let us estimate the second term on the right-hand side of (8.41). By the Lipschitz continuity of \(K^{\mathrm{cp}}\), we observe that for \(x \in \bar{B}_{R}\) and \(s \in [0,T]\)
where we used (8.15) and (8.38). Then, as for (8.46), we have that
Substep 3.3. Let us estimate the third term on the right-hand side of (8.41). By the Lipschitz continuity of r, we get that
Substep 3.4. Let us estimate the fourth term on the right-hand side of (8.41). We exploit the Lipschitz continuity of \(K^{\mathrm{cp}}\) to deduce that
from which it follows that
Substep 3.5. Let us estimate the fifth term on the right-hand side of (8.41). By the Lipschitz continuity of \(y \mapsto K^{\mathrm{cp}}(\Phi _{2}(s,x) - y)\), we have that
from which we infer that
the constant C depending also on T.
Substep 3.6. Combining (8.41), (8.46), (8.47), (8.48), (8.49), and (8.50), we obtain that
By Grönwall’s inequality this yields
for \(t \in [0,T]\) and, in particular,
for \(t \in [0,T]\) with C depending also on T.
Step 4. (Estimate of \(|\Phi _{2}(t,x) - \Phi _{2}(t,x')|\)) Let us estimate the second term on the right-hand side of (8.40). By (8.37), we have that
Reasoning similarly to Step 2 (i.e., exploiting the Lipschitz continuity of v, η, r, and \(K^{\mathrm{cp}}\)), one shows that for x, \(x' \in \bar{B}_{R}\) and \(t \in [0,T]\)
which by Grönwall’s inequality yields \(|\Phi _{2}(t,x) - \Phi _{2}(t,x')| \leq |x - x'| e^{Ct}\) for x, \(x' \in \bar{B}_{R}\) and \(t \in [0,T]\) and, in particular,
for x, \(x' \in \bar{B}_{R}\), where the constant C depends also on T.
Putting together (8.40), (8.51), and (8.52), we conclude that (8.39) holds true.
Step 5. (Estimate of \(\mathcal{W}_{1}(\mu ^{\mathrm{c}}_{1}(t), \mu ^{\mathrm{c}}_{2}(t))\)) Let \(\gamma \in \mathcal{P}(\mathbb{R}^{2}{\times }\mathbb{R}^{2})\) be an optimal transport plan satisfying \(\pi ^{i}_{\#} \gamma = \mu ^{\mathrm{c}}_{i}(0)\) and
We observe that since \(\mu ^{\mathrm{c}}_{1}(0)\) and \(\mu ^{\mathrm{c}}_{2}(0)\) have both supports contained in the closed ball \(\bar{B}_{R}\), the measure γ is also concentrated on a set contained in \(\bar{B}_{R} {\times }\bar{B}_{R}\), see [39, Theorem 5.10-(ii)-(e)]. We consider the map \((x,x') \mapsto (\Phi _{1}(t,\pi ^{1}(x,x')),\Phi _{2}(t,\pi ^{2}(x,x')))\) and the transport plan \((\Phi _{1}(t,\pi ^{1}),\Phi _{2}(t,\pi ^{2}))_{\#} \gamma \), observing that it has marginals \(\mu ^{\mathrm{c}}_{1}(t)\) and \(\mu ^{\mathrm{c}}_{2}(t)\) since we have that \(\pi ^{i}_{\#} (\Phi _{1}(t,\pi ^{1}),\Phi _{2}(t,\pi ^{2}))_{\#} \gamma = \Phi _{i}(t,\cdot )_{\#} \pi ^{i}_{\#} \gamma = \Phi _{i}(t, \cdot )_{\#} \mu ^{\mathrm{c}}_{i}(0) = \mu ^{\mathrm{c}}_{i}(t)\). From (8.39) and (8.53) it follows that
By Grönwall’s inequality
for \(t \in [0,T]\), where the constant C also depends on T. This concludes the proof of (8.32) and of the theorem. □
Proposition 8.3
Under the assumptions of Theorem 8.1, the curve \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), the law \(\mu ^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0}([0,T];\mathbb{R}^{2}))\), and the curve \(Z = (Z_{1},\dots,Z_{L})\) from (8.4) are solutions to the ODE/PDE/ODE system
where the PDEs are understood in the sense of distributions and \(\mu ^{\mathrm{p}}_{0}\) is the law of \(\bar{Y}^{0}\).
Proof
The proof consists in deriving the PDE solved by \(\mu ^{\mathrm{p}}\) using Itô’s lemma and is obtained as in the proof of Proposition 7.2mutatis mutandis. □
Theorem 8.4
Under the assumptions of Theorem 8.1and further assuming that:
-
\(\mu ^{\mathrm{p}}_{0} \in \mathcal{P}_{2}(\mathbb{R}^{2})\);
-
\(\mu ^{\mathrm{p}}_{0}\) has finite entropy, i.e., \(\mu ^{\mathrm{p}}_{0} = \rho ^{\mathrm{p}}_{0}(x) \,\mathrm{d}x\) for some function \(\rho ^{\mathrm{p}}_{0} \in L^{1}(\mathbb{R}^{2})\) satisfying \(\int _{\mathbb{R}^{2}} \rho ^{\mathrm{p}}_{0}(x) \log (\rho ^{ \mathrm{p}}_{0}(x)) \,\mathrm{d}x < +\infty \);
-
\(\mu ^{\mathrm{p}}_{0} = \mathrm{Law}(\widetilde{Y}^{0})\);
the solution to (8.54) is unique.
Proof
Step 1. Let us fix \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\) and \(Z = (Z_{1},\dots, Z_{L}) \in C^{0}([0,T];\mathbb{R}^{2})\) solution to the ODE in (8.54) involving Z. We start by observing that the PDE
has at most one solution \(\mu ^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0}([0,T];\bar{B}_{R}))\), where \(\bar{B}_{R}\) is a closed ball of radius \(R>0\). As done in [9, Theorem 3.7], we apply the result [10, Theorem 3.3]. To check all the conditions, let us write the PDE using the same notation of [10]. Let \(A(t,y) = \mathrm{Id}_{2}\) and \(b(t,y) = \frac{1}{L} \sum_{\ell =1}^{L} K^{\mathrm{pg}}(y - Z_{\ell}(t)) - K^{\mathrm{pc}}*\mu ^{\mathrm{c}}(t)(y)\). Let us define the operator \(\mathscr{L} \xi = \partial _{t} \xi + \mathrm{tr}(A\nabla ^{2} \xi ) + b \cdot \nabla \xi \). If \(\mu ^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0}([0,T];\mathbb{R}^{2}))\) solves (8.55), then it is a Radon measureFootnote 22 on \((0,T) {\times }\mathbb{R}^{2}\) solving \(\mathscr{L}^{*} \mu ^{\mathrm{p}}= 0\), i.e., \(\int _{(0,T){\times }\mathbb{R}^{2}} \mathscr{L} \xi \,\mathrm{d} \mu ^{\mathrm{p}}= 0\) for every \(\xi \in C^{\infty}_{c}((0,T) {\times }\mathbb{R}^{2})\). Trivially, A is bounded and Lipschitz in the y variable. By the Lipschitz continuity of \(K^{\mathrm{pg}}\) and the boundedness of Z,
Moreover, by the Lipschitz continuity of \(K^{\mathrm{pc}}\),
where the constant C also depends on R. In conclusion,
By the assumption \(\mu ^{\mathrm{p}}_{0} \in \mathcal{P}_{2}(\mathbb{R}^{2})\), we have that \(\int _{\mathbb{R}^{2}} |y|^{2} \,\mathrm{d}\mu ^{\mathrm{p}}_{0}(y) < +\infty \). Finally, the initial condition is attained also in the sense
since \(t \in [0,T] \mapsto \int _{\mathbb{R}^{2}} \xi (y) \,\mathrm{d}\mu ^{ \mathrm{p}}(t)(y)\) is a continuous function, cf. also Footnote 16.
By [10, Theorem 3.3] we conclude that there is at most one family \((\mu ^{\mathrm{p}}(t))_{t \in [0,T]}\) that solves (8.55).
Step 2. Let now \(\mu ^{\mathrm{c}}_{i} \in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \(\mu ^{\mathrm{p}}_{i} \in \mathcal{P}_{1}(C^{0}([0,T];\mathbb{R}^{2}))\), \(i = 1,2\) (and \(Z = (Z_{1},\dots,Z_{L}) \in C^{0}([0,T];\mathbb{R}^{2})\)) be two solutions of (8.54), i.e., for \(i = 1,2\)
Let \(\bar{Y}^{0}\) be an \(\mathbb{R}^{2}\)-valued random variable with law \(\mu ^{\mathrm{p}}_{1}(0) = \mu ^{\mathrm{p}}_{2}(0) = \mu ^{ \mathrm{p}}_{0}\), and let us consider for \(i=1,2\) the solutions to
which exist and are unique by Substep 4.1 in the proof of Theorem 8.1. Let us set \(\bar{\mu}^{\mathrm{p}}_{i} = \mathrm{Law}(\bar{Y}_{i})\). (Notice the temporary difference between \(\bar{\mu}^{\mathrm{p}}_{i}\) and \(\mu ^{\mathrm{p}}_{i}\).) By deriving the PDE solved by the law \(\bar{\mu}^{\mathrm{p}}_{i}\) using Itô’s lemma (see the proof of Proposition 7.2), we obtain that for \(i=1,2\)
However, by (8.56) and by the uniqueness proven in Step 1, we deduce that \(\bar{\mu}^{\mathrm{p}}_{i} = \mu ^{\mathrm{p}}_{i}\) for \(i = 1,2\). Combining (8.56) and (8.57), we obtain that, for \(i=1,2\),
By Theorem 8.2, the problem above has a unique solution, hence \(\mu ^{\mathrm{c}}_{1} = \mu ^{\mathrm{c}}_{2}\) and \(\bar{Y}_{1} = \bar{Y}_{2}\) a.s., yielding \(\mu ^{\mathrm{p}}_{1} = \mathrm{Law}(\bar{Y}_{1}) = \mathrm{Law}( \bar{Y}_{2}) = \mu ^{\mathrm{p}}_{2}\). This concludes the proof. □
Remark 8.5
Thanks to Theorem 8.2, if there exist absolutely continuous solutions to (8.54), then \(\mu ^{\mathrm{c}}\) and \(\mu ^{\mathrm{p}}\) are a fortiori absolutely continuous by uniqueness. Under suitable conditions, the solutions are, in fact, absolutely continuous.
If \(\mu ^{\mathrm{c}}(0) = \rho ^{\mathrm{c}}_{0} \,\mathrm{d}x\), then by [36, Theorem 2] the measure \(\mu ^{\mathrm{c}}(t)\) is absolutely continuous for all \(t \in [0,T]\), hence there exists a density \(\rho ^{\mathrm{c}}(t) \in L^{1}(\mathbb{R}^{2})\) such that \(\mu ^{\mathrm{c}}(t) = \rho ^{\mathrm{c}}(t) \,\mathrm{d}x\). This is a consequence of the Lagrangian representation of the solution to the nonlocal transport equation.
By [10, Theorem 2.5] (see also [11, Corollary 3.9]) there exists a function \(\rho ^{\mathrm{p}}\) locally Hölder continuous on \((0,T) {\times }\mathbb{R}^{2}\) such that \(\mu ^{\mathrm{p}}= \rho ^{\mathrm{p}}(t,y) \,\mathrm{d}t \,\mathrm{d}y\).
In conclusion,
8.2 Limit of optimal control problems as \(N \to + \infty \)
Let us consider the following cost functional for the limit problem obtained in (8.4). Let \(\mathcal{J}\colon L^{\infty}([0,T]; \mathcal{U}) \to \mathbb{R}\) be defined for every \(u \in L^{\infty}([0,T];\mathcal{U})\) by
where \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \((\bar{Y}(t))_{t \in [0,T]}\) are obtained by Theorem 8.1 as the unique solution to (8.4) and \(\mu ^{\mathrm{p}}\) is the law of \((\bar{Y}(t))_{t \in [0,T]}\).
Theorem 8.6
Under the assumptions of Theorem 8.1, the sequence of functionals \((\mathcal{J}_{N})_{N \geq 1}\) Γ-converges to \(\mathcal{J}\) as \(N \to +\infty \) with respect to the weak* topology in \(L^{\infty}([0,T];\mathcal{U})\).Footnote 23
Proof
Step 1. (Asymptotic lower bound). Let us fix a sequence of controls \((u^{N})_{N \geq 1}\), \(u^{N} \in L^{\infty}([0,T];\mathcal{U})\) such that \(u^{N} \stackrel{*}{\rightharpoonup }u\) weakly* in \(L^{\infty}([0,T];\mathcal{U})\) as \(N \to +\infty \). Let us show that
On the one hand, by Definition (7.16), we have that
where \(\bar{X}^{N} = (\bar{X}^{N}_{1},\dots, \bar{X}^{N}_{N})\), \((\bar{Y}^{N}(t))_{t \in [0,T]}\), and \(Z^{N} = (Z^{N}_{1},\dots, Z^{N}_{L})\) are the unique strong solution to (8.2) and \(\bar{\mu}^{\mathrm{p}}_{N}\) is the law of \((\bar{Y}^{N}(t))_{t \in [0,T]}\). On the other hand,
where \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \((\bar{Y}(t))_{t \in [0,T]}\) are obtained by the unique solution to (8.4) and \(\mu ^{\mathrm{p}}\) is the law of \((\bar{Y}(t))_{t \in [0,T]}\).
By the weak sequential lower semicontinuity of the \(L^{2}\)-norm, we have that
Let us prove the convergence
as \(N \to +\infty \). This will conclude the proof of (8.59).
Setting \(\check{H}^{\mathrm{d}}(w) = H^{\mathrm{d}}(-w)\) and considering the empirical measures in (8.3), we get that
Moreover, by Fubini’s theorem
These equations allow us to estimate
We estimate the first term on the right-hand side of (8.61) using the Lipschitz continuity of \(\check{H}^{\mathrm{d}}(y - \cdot )\) and Kantorovich’s duality by
For the second term on the right-hand side of (8.61), we observe that \(\check{H}^{\mathrm{d}}*\mu ^{\mathrm{c}}(t)\) is Lipschitz continuous as
Hence, since \(\bar{\mu}^{\mathrm{p}}_{N}(t)\) is the law of \(\bar{Y}^{N}(t)\) and \(\mu ^{\mathrm{p}}(t)\) is the law of \(\bar{Y}(t)\),
Combining the previous inequalities, (8.61) reads
where the convergence follows from Theorem 8.1. This proves (8.60).
Step 2. (Asymptotic upper bound). Let us fix \(u \in L^{\infty}([0,T];\mathcal{U})\). For every \(N \geq 1\), let us set \(u^{N} = u\). As in Step 1, we have that
where \(\bar{X}^{N} = (\bar{X}^{N}_{1},\dots, \bar{X}^{N}_{N})\), \((\bar{Y}^{N}(t))_{t \in [0,T]}\), and \(Z^{N} = (Z^{N}_{1},\dots, Z^{N}_{L})\) are the unique strong solution to (8.2) corresponding to the control \(u^{N} = u\) and \(\bar{\mu}^{\mathrm{p}}_{N}\) is the law of \((\bar{Y}^{N}(t))_{t \in [0,T]}\). Moreover,
where \(\mu ^{\mathrm{c}}\in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\), \((\bar{Y}(t))_{t \in [0,T]}\) are obtained by the unique solution to (8.4) and \(\mu ^{\mathrm{p}}\) is the law of \((\bar{Y}(t))_{t \in [0,T]}\). Trivially, we have \(u^{N} \stackrel{*}{\rightharpoonup }u\), hence we deduce (8.60) once again and, in particular, the asymptotic upper bound
concluding the proof. □
Proposition 8.7
Under the assumptions of Theorem 8.1, there exists an optimal control \(u^{*} \in L^{\infty}([0,T];\mathcal{U})\), i.e.,
Proof
The proof is completely analogous to the proof of Proposition 7.4, as it is a general result about Γ-convergence. □
Availability of data and materials
Not applicable.
Notes
One can speak of a Brownian motion without introducing the filtration \((\mathcal{F}_{t})_{t \in [0,T]}\) by replacing the condition that \(W(t) - W(s)\) is independent of \(\mathcal{F}_{s}\) with the requirement that it has independent increments. In this case, one implicitly considers a filtration constructed from \((W(t))_{t \in [0,T]}\) by letting \(\mathcal{F}^{W}_{t}\) be σ-algebra generated by \(\{W(s) | s \leq t\}\). If the filtration needs to satisfy the usual conditions, then \(\mathcal{F}^{W}_{t}\) is modified with the augmentation \(\mathcal{F}_{t}\) defined as the σ-algebra generated by \(\mathcal{F}^{W}_{t}\) and \(\mathcal{N}_{\mathbb{P}}\), see [28, p. 16] or [27, Proposition 2.7.7].
If this is not the case, then the construction explained in Footnote 1 is modified by considering the σ-algebra generated by \(S^{0}\), \(\{W(s) | s \leq t\}\), and \(\mathcal{N}_{\mathbb{P}}\).
The reason for this is that the evaluation maps \(\mathrm{ev}_{t}\) are continuous with respect to the uniform norm, thus Borel measurable; conversely, open balls in the Wiener space (which is separable) are measurable with respect to the σ-algebra generated by \(\{\mathrm{ev}_{t}\}_{t \in [0,T]}\) since \(\|\varphi \|_{\infty }= \sup_{t \in [0,T] \cap \mathbb{Q}} | \mathrm{ev}_{t}(\varphi )|\).
This discussion applies, in particular, to a Brownian motion \((W(t))_{t \in [0,T]}\). The probability measure \(\mathrm{Law}(W)\) is known as Wiener measure on \(C^{0}([0,T];\mathbb{R}^{d})\).
The map \(\mu _{K} \colon \Omega \to \mathcal{P}_{1}(\mathbb{R}^{d})\) is indeed measurable with respect to the Borel σ-algebra on the 1-Wasserstein space \(\mathcal{P}_{1}(\mathbb{R}^{d})\). To see this, we observe that \(\mathcal{P}_{1}(\mathbb{R}^{d})\) endowed with the 1-Wasserstein distance is separable, see, e.g., [39, Theorem 6.18], hence the Borel σ-algebra is generated by balls \(\{\mu \in \mathcal{P}_{1}(\mathbb{R}^{d}): \mathcal{W}_{1}(\mu, \mu _{0}) < r \}\). The pre-image of such a ball through \(\mu _{K}\) is the event \(\{\omega \in \Omega: \mathcal{W}_{1}(\frac{1}{K} \sum_{k=1}^{K} \delta _{X_{k}(\omega )},\mu _{0}) < r\}\). This is measurable since the function \((x_{1},\dots,x_{K}) \mapsto \mathcal{W}_{1}(\frac{1}{K} \sum_{k=1}^{K} \delta _{x_{k}},\mu _{0})\) is Lipschitz continuous.
For example, let \(\hat{\eta}\in C^{\infty}_{c}(\mathbb{R}^{2})\) be supported in a ball \(B_{2 \delta}\) of radius 2δ with \(\hat{\eta}= 1\) on \(B_{\delta}\). If \(\eta (X,X') = \hat{\eta}(X - X')\), then \(\sum_{n'=1}^{N} \hat{\eta}( X_{n}(t) - X_{n'}(t))\) (approximately) counts the number of ships in a δ-neighborhood of \(X_{n}(t)\) (around all directions). Instead, if \(\eta (X,X') = \hat{\eta}(X - X' - \delta \mathbf{r}(X))\), then \(\sum_{n'=1}^{N} \hat{\eta}( X_{n}(t) - X_{n'}(t) - \delta \mathbf{r}(X_{n}(t)) )\) (approximately) counts the number of commercial ships obstructing the commercial route in front of \(X_{n}(t)\).
The result is classical: one considers the Picard operator \(\mathcal{S}\colon C^{0}([0,T];\mathbb{R}^{2 {\times }L}) \to C^{0}([0,T]; \mathbb{R}^{2 {\times }L})\) defined by \(\mathcal{S}(Z)(t):= Z^{0} + \int _{0}^{t} f(s,Z(s)) \,\mathrm{d}s\), which is a contraction with respect to the norm (equivalent to the uniform norm) \({\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }\varphi {\vert \kern - 0.25 ex\vert \kern - 0.25 ex\vert }_{\alpha}:= \sup_{t \in [0,T]} (e^{-\alpha t} |\varphi (t)| )\) for suitable \(\alpha > 0\) (depending on the Lipschitz constant of f).
This corresponds to the averaged ODE/SDE/ODE system (5.1) with initial data \(X^{0}\), \(Y_{m}^{0}\), \(Z^{0}\), with Brownian motion \(W_{m}\), and with control u. The solution is provided by Proposition 5.2. Note that we applied Proposition 6.1 to deduce that \((\bar{Y}_{1}(t))_{t \in [0,T]}, \dots, (\bar{Y}_{M}(t))_{t \in [0,T]}\) are identically distributed with common law \(\bar{\mu}^{\mathrm{p}}\) and the curve X̄ is independent of m.
The measurability of these random variables is proven with an argument analogous to the one in Footnote 5, keeping in mind the separability of \(C^{0}([0,T];\mathbb{R}^{2})\).
In fact, by the boundedness of \(\mathcal{U}\), this is equivalent to requiring that \(u^{M} \rightharpoonup u\) weakly in \(L^{1}([0,T];\mathcal{U})\).
Corresponding to the initial data \(X^{0}\), \(\bar{Y}^{0}\), \(Z^{0}\) with Brownian motion W and control u. We recall that the solution is provided by Proposition 5.2.
This corresponds to the averaged ODE/SDE/ODE system (5.1) with initial data \(X^{0}\), \(Y^{0}_{m}\), \(Z^{0}\), with Brownian motion \(W_{m}\), and with control u. The solution is provided by Proposition 5.2. Note that we applied Proposition 6.1 to deduce that \((\bar{Y}^{M}_{1}(t))_{t \in [0,T]}, \dots, (\bar{Y}^{M}_{M}(t))_{t \in [0,T]}\) are identically distributed with common law \(\bar{\mu}^{\mathrm{p}}\) and the curve X̄ does not depend on m and M.
The random measure \(\bar{\nu}^{\mathrm{p}}_{M} \colon \Omega \to \mathcal{P}(C^{0})\) (empirical measure of \(\bar{Y}^{M}_{1}, \dots, \bar{Y}^{M}_{M}\)) must not be confused with \(\nu ^{\mathrm{p}}_{M} \colon \Omega \to \mathcal{P}(C^{0})\) (empirical measure of \(Y^{M}_{1}, \dots, Y^{M}_{M}\)) or \(\bar{\mu}^{\mathrm{p}}\in \mathcal{P}(C^{0})\) (common law of the stochastic processes \((\bar{Y}^{M}_{1}(t))_{t \in [0,T]}, \dots, (\bar{Y}^{M}_{M}(t))_{t \in [0,T]}\), and \((\bar{Y}(t))_{t \in [0,T]}\)).
To be precise, we regard \(\bar{\mu}^{\mathrm{p}}\in \mathcal{P}_{1}(C^{0}([0,T];\mathbb{R}^{2}))\) as the distribution defined by the duality
$$ \int _{-\infty}^{0} \int _{\mathbb{R}^{2}} \xi (t,y) \,\mathrm{d} \bar{\mu}^{\mathrm{p}}(0) (y) \,\mathrm{d}t + \int _{0}^{T} \int _{ \mathbb{R}^{2}} \xi (t,y) \,\mathrm{d}\bar{\mu}^{\mathrm{p}}(t) (y) \,\mathrm{d}t \quad\text{for every } \xi \in C^{\infty}_{c} \bigl((- \infty,T) {\times }\mathbb{R}^{2}\bigr)). $$Note that the function \(t \mapsto \int _{\mathbb{R}^{2}} \xi (t,y) \,\mathrm{d}\bar{\mu}^{ \mathrm{p}}(t)(y) = \int _{C^{0}([0,T];\mathbb{R}^{2})} \xi (t, \varphi (t)) \,\mathrm{d}\mu (\varphi )\) is continuous in t, e.g., by the dominated convergence theorem. A solution to the PDE in the sense of distributions satisfies
$$ \begin{aligned} & \int _{\mathbb{R}^{2}} \xi (0,y) \,\mathrm{d} \mathrm{Law}\bigl(\bar{Y}^{0}_{1}\bigr) (y) + \int _{0}^{T} \int _{\mathbb{R}^{2}} \Biggl[ \partial _{t} \xi (t, y) + \kappa \Delta _{y} \xi (t,y) \\ &\quad{} + \Biggl( \frac{1}{L} \sum_{\ell =1}^{L} K^{\mathrm{pg}}\bigl( y - Z_{\ell}(t) \bigr) - \frac{1}{N} \sum _{n=1}^{N} K^{\mathrm{pc}}\bigl(y - \bar{X}_{n}(t)\bigr) \Biggr) \cdot \nabla _{y} \xi (t, y) \Biggr] \,\mathrm{d} \bar{\mu}^{\mathrm{p}}(t) (y) \,\mathrm{d}t = 0 \end{aligned} $$for every \(\xi \in C^{\infty}_{c}((-\infty,T) {\times }\mathbb{R}^{2}))\).
Note that the weak* convergence in \(L^{\infty}([0,T];\mathcal{U})\) is metrizable since \(\mathcal{U}\) is bounded, hence we can use the sequential characterization of Γ-limits, cf. [20, Proposition 8.1].
In fact, by the boundedness of \(\mathcal{U}\), this is equivalent to requiring that \(u^{N} \rightharpoonup u\) weakly in \(L^{1}([0,T];\mathcal{U})\).
We use here the duality introduced in Footnote 16.
In fact, the curves \(\nu ^{\mathrm{c}}_{N} \in C^{0}([0,T];\mathcal{P}_{1}(\mathbb{R}^{2}))\) take values in a compact set of \(\mathcal{P}_{1}(\mathbb{R}^{2})\) independent of N even under weaker assumptions. This is the case, e.g., when q-moments of \(\nu ^{\mathrm{c}}_{N}(t)\) with \(q>1\) are uniformly bounded, i.e., \(\sup_{N} \sup_{t} \int _{\mathbb{R}^{2}} |x|^{q} \,\mathrm{d}\nu ^{ \mathrm{c}}_{N}(t)(x) < +\infty \) for some \(q>1\) (this can be proven based on [39, Theorem 6.9]. A uniform bound on the q-moments follows from the analogous assumption on the distribution of initial data by a Grönwall inequality.
Indeed, \(\mu ^{\mathrm{p}}\) can be seen as a Radon measure on \((0,T){\times }\mathbb{R}^{2}\) since the duality \(\xi \in C^{0}_{c}((0,T){\times }\mathbb{R}^{2}) \mapsto \int _{0}^{T} \int _{\mathbb{R}^{2}} \xi (t,y) \mu ^{\mathrm{p}}(t)(y) \,\mathrm{d}t\) is a linear and continuous operator, cf. [8, Corollary 1.55].
cf. Footnote 17.
References
Albi, G., Almi, S., Morandotti, M., Solombrino, F.: Mean-field selective optimal control via transient leadership. Appl. Math. Optim. 85, Article ID 22 (2022)
Albi, G., Bongini, M., Cristiani, E., Kalise, D.: Invisible control of self-organizing agents leaving unknown environments. SIAM J. Appl. Math. 76, 1683–1710 (2016)
Albi, G., Bongini, M., Rossi, F., Solombrino, F.: Leader formation with mean-field birth and death models. Math. Models Methods Appl. Sci. 29, 633–679 (2019)
Albi, G., Choi, Y., Fornasier, M., Kalise, D.: Mean field control hierarchy. Appl. Math. Optim. 76, 93–135 (2017)
Almi, S., D’Eramo, C., Morandotti, M., Solombrino, F.: Mean-field limits for entropic multi-population dynamical systems. Milan J. Math. 91, 175–212 (2023)
Almi, S., Morandotti, M., Solombrino, F.: A multi-step Lagrangian scheme for spatially inhomogeneous evolutionary games. J. Evol. Equ. 21, 2691–2733 (2021)
Ambrosio, L., Fornasier, M., Morandotti, M., Savaré, G.: Spatially inhomogeneous evolutionary games. Commun. Pure Appl. Math. 74, 1353–1402 (2021)
Ambrosio, L., Fusco, N., Pallara, D.: Functions of Bounded Variations and Free Discontinuity Problems. Oxford Mathematical Monographs. Oxford University Press, New York (2000)
Ascione, G., Castorina, D., Solombrino, F.: Mean field sparse optimal control of systems with additive white noise (2022). arXiv:2204.02431. Preprint
Bogachev, V., Da Prato, G., Röckner, M., Stannat, W.: Uniqueness of solutions to weak parabolic equations for measures. Bull. Lond. Math. Soc. 38, 631–640 (2007)
Bogachev, V., Krylov, N., Röckner, M.: On regularity of transition probabilities and invariant measures of singular diffusions under minimal conditions. Commun. Partial Differ. Equ. 26, 2037–2080 (2001)
Bongini, M., Fornasier, M., Rossi, F., Solombrino, F.: Mean-field Pontryagin maximum principle. J. Optim. Theory Appl. 175, 2037–2080 (2017)
Carbone, G., Giannoccaro, I.: Model of human collective decision-making in complex environments. Eur. Phys. J. B 88, Article ID 339 (2015)
Carrillo, J.A., Fornasier, M., Rosado, J., Toscani, G.: Asymptotic flocking dynamics for the kinetic Cucker–Smale model. SIAM J. Math. Anal. 42, 218–236 (2010)
Chaintron, L.-P., Diez, A.: Propagation of chaos: a review of models, methods and applications. I. Models and methods. Kinet. Relat. Models 15, 895–1015 (2022)
Chakraborti, A.: Distributions of money in models of market economy. Int. J. Mod. Phys. C 13, 1315–1321 (2002)
Chuang, Y.-L., Huang, Y., D’Orsogna, M.R., Bertozzi, A.: Multi-vehicle flocking: scalability of cooperative control algorithms using pairwise potentials. In: IEEE International Conference on Robotics and Automation, pp. 2292–2299 (2007)
Coclite, G., Garavello, M., Spinolo, L.: A mathematical model for piracy control through police response. NoDEA Nonlinear Differ. Equ. Appl. 24, Article ID 48 (2017)
Cucker, F., Smale, S.: Emergent behavior in flocks. IEEE Trans. Autom. Control 52, 852–862 (2007)
Dal Maso, G.: An Introduction to Γ-Convergence, Progress in Nonlinear Differential Equations and Their Applications. Birkhäuser, Boston (1993)
Dragulescu, A., Yakovenko, V.: Statistical mechanics of money. Eur. Phys. J. B 17, 723–729 (2000)
EU Commission: Transport modes, maritime, international cooperation and coordination. https://transport.ec.europa.eu/transport-modes/maritime/international-cooperation-and-coordination_en
EU Commission: Transport modes, maritime, maritime security. https://transport.ec.europa.eu/transport-modes/maritime/maritime-security_en
Fagioli, S., Kaufmann, A., Radici, E.: Optimal control problems of nonlocal interaction equations (2022). arXiv:2205.08921. Preprint
Fornasier, M., Solombrino, F.: Mean-field optimal control. ESAIM Control Optim. Calc. Var. 20, 1123–1152 (2014)
Fournier, N., Guillin, A.: On the rate of convergence in Wasserstein distance of the empirical measure. Probab. Theory Relat. Fields 162, 707–738 (2015)
Karatzas, I., Shreve, S.E.: Brownian Motion and Stochastic Calculus. Graduate Texts in Mathematics. Springer, New York (1998)
Mao, X.: Stochastic Differential Equations and Applications. Woodhead Publishing Limited (2008)
Massari, G.F., Giannoccaro, I., Carbone, G.: Are distrust relationships beneficial for group performance? The influence of the scope of distrust on the emergence of collective intelligence. Int. J. Prod. Econ. 208, 343–355 (2019)
Morandotti, M., Solombrino, F.: Mean-field analysis of multi-population dynamics with label switching. SIAM J. Math. Anal. 52, 1427–1462 (2020)
Øksendal, B.: Stochastic Differential Equations, Universitext. Springer, Berlin (2003)
Pachpatte, B.: Inequalities for Differential and Integral Equations. Academic Press, San Diego (1998)
Panaretos, V.M., Zemel, Y.: An Invitation to Statistics in Wasserstein Space, SpringerBriefs in Probability and Mathematical Statistics. Springer, Cham (2020)
Perea, L., Gómez, G., Elosegui, P.: Extension of the Cucker–Smale control law to space flight formations. J. Guid. Control Dyn. 32, 527–537 (2009)
Piccoli, B., Rossi, F.: Transport equation with nonlocal velocity in Wasserstein spaces: convergence of numerical schemes. Acta Appl. Math. 124, 73–105 (2013)
Piccoli, B., Rossi, F., Trélat, E.: Control to flocking of the kinetic Cucker–Smale model. SIAM J. Math. Anal. 47, 4685–4719 (2015)
Topaz, C., Bertozzi, A.: Swarming patterns in a two-dimensional kinematic model for biological groups. SIAM J. Appl. Math. 65, 152–174 (2004)
Villani, C.: Topics in Optimal Transportation. Graduate Studies in Mathematics, vol. 58. AMS, Providence (2003)
Villani, C.: Optimal Transport: Old and New. Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 338. Springer, Berlin (2009)
Acknowledgements
The author thanks Giuseppe Maria Coclite for suggesting the problem and for interesting discussions about the model.
Funding
The author has been supported by the project “Approccio integrato e predittivo per il controllo della criminalità marittima” in the program “Research for Innovation” (REFIN)—POR Puglia FESR FSE 2014-2020, Codice CUP: D94I20001410008. He is a member of Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni (GNAMPA) of the Istituto Nazionale di Alta Matematica (INdAM) and has received funding from the GNAMPA 2022 project “Approccio multiscala all’analisi di modelli di interazione”, Codice CUP: E55F22000270001. This work was supported by the Italian Ministry of University and Research under the Programme “Department of Excellence” Legge 232/2016 (Grant No. CUP—D93C23000100001).
Author information
Authors and Affiliations
Contributions
GO is the unique author of the manuscript. GO prepared the manuscript initially and performed all the steps of the proofs in this research. GO read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The author declares that he has no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Orlando, G. Mean-field optimal control in a multi-agent interaction model for prevention of maritime crime. Adv Cont Discr Mod 2023, 24 (2023). https://doi.org/10.1186/s13662-023-03771-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13662-023-03771-7