title: Forward utilities and Mean-field games under relative performance concerns
authors: Reis, Goncalo dos; Platonov, Vadim
date: 2020-05-16

Abstract. We introduce the concept of mean-field games for agents using forward utilities to study a family of portfolio management problems under relative performance concerns. Under asset specialization of the fund managers, we solve the forward-utility finite-player game and the forward-utility mean-field game. We study best-response and equilibrium strategies in the single common stock setting and in the asset-specialization setting with common noise. As an application, we draw on the core features of the forward utility paradigm and discuss a problem of time-consistent mean-field dynamic model selection in sequential time-horizons.

This work focuses on bringing the concept of forward utilities to the mean-field game setting in the limelight of competitive optimal portfolio management of agents under relative performance criteria, and on the analysis of the associated finite-player game. There exists a very rich literature on portfolio management for agents with utility preferences and under performance concerns, to which this short introduction cannot possibly do justice. For a literature perspective on the financial setting, including an in-depth discussion of agents with performance concerns and their impact on the utility maximization framework, we refer to [12, 13, 4] and references therein. Additionally, we point the reader to the beautiful introductions of [19, 18], where those concepts are brought to the framework of mean-field games. Further, those works also make for an excellent review of mean-field games in the context of the Merton problem, which is the framework underlying our work.

In short, mean-field games (MFG), stochastic or not, gained renewed interest due to their modelling power in crucially reducing the dimensionality of the underlying problem under the assumption of statistically equivalent populations [17, 5, 6]. In other words, as long as the actions of a single agent do not affect the average interaction of the agents as a whole, then, in principle, the MFG framework stands to be more tractable than the n-agent game. See [19, 18].

The novelty of our work is the conceptualization and analysis, simplified here, of the formulation of mean-field games within the so-called forward utilities framework. Further, we juxtapose our construction with the related finite-player game. The classical and ubiquitous approach of utility preferences, found throughout the literature ([12, 13, 4]), is that each agent, at an initial time, specifies their risk preferences for some future time T and proceeds to optimize their investment from that initial time up to T. This backward approach lacks the flexibility to handle mid-time changes of risk preferences by the agents, or to allow an update of the underlying model: having in mind Covid-19, if the fund manager made investments in early 2019 to mature in the later part of 2020, how would one update the underlying stock model to the change of parameters? These problems feature an inherently forward-in-time nature of investment, a view that is particularly clear for (competitive) fund managers updating their investment preferences frequently depending on market behavior.
To cope with the limitation of the backward-in-time view induced by the classical utility optimization formulation, and to better address this forward view, the mathematical tool of forward utilities was developed. It was initially introduced for the analysis of portfolio management problems in [20, 21, 22] and subsequently expanded in [25, 1, 7] and [11, 9, 10], the latter dealing with general forward utility Itô random fields and with applications to longevity risk. Our approach builds from [14], where the first forward-utility definition under competition appeared (for finite-player games); we additionally refer the reader to the forthcoming [2] (which also builds from [14]). In essence, the concept of forward utility reflects that the utility map must be adaptive and adjusted to the information flow. The forward dynamic utility map is built to be consistent with respect to the given investment universe, and the approach we discuss here is based on the martingale optimality principle (see Section 2.1). In the MFG context, the closest work to ours that we have found is the forward-forward MFG concept of [16].

Organization of the paper. In Section 2 we introduce the financial market. In Sections 3 and 4 we study the finite-agent and the mean-field game respectively. We study forward utilities of time-monotone type. In Section 4.4 we discuss the mean-field investment problem with dynamic model selection in large time-horizons. We conclude in Section 5 with a discussion of open questions and future research.

2 Asset specialization, Forward utilities and CARA preferences

The market. We consider a market environment with one riskless asset and n risky securities, which serve as proxies for distinct asset classes. We assume their prices to be of log-normal type, each driven by two independent Brownian motions. More precisely, the price (S^i_t)_{t>=0} of the stock i traded by the i-th agent solves
$$\frac{dS^i_t}{S^i_t} = \mu_i\,dt + \nu_i\,dW^i_t + \sigma_i\,dB_t, \qquad (1)$$
with constant parameters mu_i > 0, sigma_i >= 0 and nu_i >= 0 with sigma_i + nu_i > 0. The one-dimensional standard Brownian motions B, W^1, ..., W^n are independent. When sigma_i > 0, the process B induces a correlation between the stocks, and thus we call B the common noise and W^i an idiosyncratic noise. The independent Brownian motions B, W^1, ..., W^n are defined on a probability space (Omega, F, F, P) endowed with the natural filtration F = (F_t)_{t>=0} generated by them, satisfying the usual conditions.

We recall the case of the single common stock, where for all i = 1, ..., n, (mu_i, sigma_i) = (mu, sigma) and nu_i = 0, for some mu, sigma > 0 independent of i. The single common stock case has been explored in great generality in [12, 13, 4], incorporating portfolio constraints, general stock price dynamics and risk-sharing mechanisms.

Agents' wealth. Each agent i = 1, ..., n trades using a self-financing strategy, (pi^i_t)_{t>=0}, which represents the (discounted by the bond) amount invested in the i-th stock. The i-th agent's wealth (X^i_t)_{t>=0} then solves
$$dX^i_t = \pi^i_t\big(\mu_i\,dt + \nu_i\,dW^i_t + \sigma_i\,dB_t\big). \qquad (2)$$
A portfolio strategy is said admissible if it belongs to the set A^i, which consists of self-financing F-progressively measurable real-valued processes (pi^i_t)_{t>=0} satisfying the square-integrability condition E[int_0^T |pi^i_t|^2 dt] < infinity for all T in [0, infinity).
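To make the preceding dynamics concrete, the following is a minimal simulation sketch, not part of the paper's formal development: an Euler scheme for the stock and wealth equations above, with illustrative placeholder parameters and constant strategies.

    import numpy as np

    # Illustrative Euler simulation of the log-normal stocks and the wealth processes above.
    # All parameter values and the constant strategies are placeholders chosen for illustration.
    rng = np.random.default_rng(0)
    n, T, dt = 5, 1.0, 1e-3
    steps = int(T / dt)
    mu = rng.uniform(0.05, 0.15, n)      # drifts mu_i > 0
    nu = rng.uniform(0.10, 0.30, n)      # idiosyncratic volatilities nu_i >= 0
    sigma = rng.uniform(0.10, 0.30, n)   # common-noise volatilities sigma_i >= 0
    pi = rng.uniform(0.5, 1.5, n)        # constant admissible strategies (amounts invested)

    S = np.ones(n)    # stock prices, S^i_0 = 1
    X = np.zeros(n)   # discounted wealths, initial wealth normalised to 0 here
    for _ in range(steps):
        dB = np.sqrt(dt) * rng.standard_normal()    # common noise increment
        dW = np.sqrt(dt) * rng.standard_normal(n)   # idiosyncratic increments
        dR = mu * dt + nu * dW + sigma * dB         # returns dS^i_t / S^i_t
        S += S * dR
        X += pi * dR                                # dX^i_t = pi^i_t dS^i_t / S^i_t
    print("terminal prices:", np.round(S, 3))
    print("terminal wealths:", np.round(X, 3))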
The agents' social interaction. Each manager measures the performance of her strategy taking into account the policies of the others. Each agent engages in a form of social interaction that affects that agent's perception of wealth, all in an additive fashion modeled through the arithmetic average wealth of all agents (this model is largely inspired by [12, 13, 4, 19]). Namely, the relative performance metric of manager i in {1, ..., n}, denoted $\widetilde X^i$, is defined to be
$$\widetilde X^i_t := X^i_t - \theta_i \overline X_t, \qquad \overline X_t := \frac1n\sum_{j=1}^n X^j_t. \qquad (3)$$
We easily obtain dynamics for $\overline X$ and $\widetilde X^i$, namely
$$d\overline X_t = (\overline{\pi\mu})_t\,dt + \frac1n\sum_{j=1}^n \pi^j_t\nu_j\,dW^j_t + (\overline{\pi\sigma})_t\,dB_t, \qquad d\widetilde X^i_t = dX^i_t - \theta_i\,d\overline X_t,$$
where $\bar x_0$, $\overline{\pi\mu}$ and $\overline{\pi\sigma}$ are identified as averages (passing from the individual dynamics to the averaged one). Similarly to [19, Remark 2.5], it is natural to replace the average wealth $\overline X$ in (3) by the average over all other agents. With that in mind we define for convenience $\overline X^{(-i)} := \frac1{n-1}\sum_{j\neq i} X^j$. This leads us to recast (3) as (5); the corresponding dynamics of the recast metric is (6), and we easily obtain dynamics for $\overline X$ and $\overline X^{(-i)}$. We also define the quantities
$$\overline{\pi\sigma} := \frac1n\sum_{j=1}^n\pi^j\sigma_j,\qquad \overline{\pi\sigma}^{(-i)} := \frac1{n-1}\sum_{j\neq i}\pi^j\sigma_j,\qquad \widetilde{\pi\sigma}^{(-i)} := \frac1{n}\sum_{j\neq i}\pi^j\sigma_j,$$
for which we have the relations
$$\overline{\pi\sigma}^{(-i)} = \frac{n}{n-1}\,\overline{\pi\sigma} - \frac1{n-1}\,\pi^i\sigma_i \qquad\text{and}\qquad \widetilde{\pi\sigma}^{(-i)} = \overline{\pi\sigma} - \frac1n\,\pi^i\sigma_i. \qquad (7)$$
We do not write it explicitly, but we extend the same notation and relations to $\overline{\pi\mu}^{(-i)}$, $\widetilde{\pi\mu}^{(-i)}$ and $\overline{\pi\mu}$.

We recall, for reference, the classic forward utility formulation. We define forward dynamic utilities in the context of the probability space (Omega, F, F, P). We denote by u_0 : R -> R the initial data. The forward utility is constructed based on the martingale optimality principle.

Definition 1 (Forward dynamic utilities). Let U : Omega x R x [0, infinity) -> R be an F-progressively measurable random field. U is a forward dynamic utility if:
- for all t >= 0 the map x -> U(x, t) is increasing and concave;
- it satisfies U(x, 0) = u_0(x);
- for all T >= t and each self-financing strategy, represented by pi, the associated discounted wealth process X^pi satisfies the supermartingale property E[U(X^pi_T, T) | F_t] <= U(X^pi_t, t);
- for all T >= t there exists a self-financing strategy, represented by pi^*, for which the associated discounted wealth X^* satisfies the martingale property E[U(X^*_T, T) | F_t] = U(X^*_t, t).

The above definition assumes the optimizer is attained. This is a somewhat strong assumption, which is discussed in [25, 1]; there it is argued that such a constraint is not necessary for the forward utility construction in certain contexts.

3 Forward relative performance criteria

Each manager measures the output of her relative performance metric using a forward relative criterion, modeled by an F_t-progressively measurable random field U^i(x, t), i in {1, ..., n}. The criteria below follow those proposed in [14]. The main idea is a formulation inspired by the first step in the usual strategy for solving a Nash game, namely the best response of an agent to the actions of all other agents. Take manager i and assume all other agents j != i have acted with investment policies pi^j; then for any strategy pi^i in A^i the process U^i(X~^i_t, t) is a (local) supermartingale, and there exists pi^{i,*} in A^i such that U^i(X~^{i,*}_t, t) is a (local) martingale, where X~^i and X~^{i,*} solve (5) with strategies pi^i and pi^{i,*} respectively. This version of a relative criterion is (implicitly and) exogenously parametrized by the policies of all other managers j != i, over which there is no assumption of optimality. In Nash-game language, we solve the so-called best response.

Definition 2 (Forward relative performance for the manager). Each manager i in {1, ..., n} satisfies the following. Let pi^j in A^j, for all j != i, be arbitrary but fixed admissible policies for the other managers. An F-progressively measurable random field U^i(x, t) is a forward relative performance for manager i if, for all t >= 0, the following conditions hold:
i) the mapping x -> U^i(x, t) is strictly increasing and strictly concave;
ii) for each pi^i in A^i, U^i(X~^i_t, t) is a local supermartingale, where X~^i is the relative performance metric given in (5);
iii) there exists pi^{i,*} in A^i such that U^i(X~^{i,*}_t, t) is a local martingale, where X~^{i,*} solves (5) with the strategy pi^{i,*} being used.
In the above definition we do not make explicit reference to the initial conditions U^i(x, 0), but we assume that admissible initial data exist such that the above definition is viable.
Contrary to the classical expected utility case, the forward volatility process is an investor-specific input. Once it is chosen, the supermartingale and martingale properties impose conditions on the drift of the process. Under enough regularity, these conditions lead to the forward performance SPDE (see [24]). Since we are working in a log-normal market, it suffices to study smooth relative performance criteria of zero volatility (of the forward utility map). Such processes are extensively analyzed in [23] in the absence of relative performance concerns. There, a concise characterization of the forward criteria is given, along with (necessary and sufficient) conditions for their existence and uniqueness. In that setting, the zero-volatility forward processes are always time-decreasing processes. We point out to the reader that this does not have to be the case when relative performance concerns are present (see also [14]).

We assume that the Itô decomposition of the forward utility map is of zero volatility, dU^i(x, t) = U^i_t(x, t) dt, which we label (8). We next derive a stochastic PDE and an optimal investment strategy for a smooth relative performance criterion of zero volatility of some agent i, assuming that all other agents j != i have made their investment decisions.

Proposition 1 (Best responses). Fix i in {1, ..., n} and the agent's initial preference, and assume that for an admissible initial condition the random field U^i solves the consistency condition (9) with candidate optimal strategy pi^{i,*} given by (11), where X~^{i,*} solves (6) with pi^{i,*} being used. If pi^{i,*} in A^i and X~^{i,*} are well-defined, then U^i(x, t) is a forward utility performance process. Moreover, the policy pi^{i,*} is optimal (in the sense of Definition 2).

Using the language of [22, Section 5], define the local risk tolerance function r^i(x, t) := -U^i_x(x, t)/U^i_xx(x, t). Then, by direct inspection of the expression for pi^{i,*}, one sees that if the local risk tolerance function satisfies r^i(x, t) = r^i = const for all t > 0 (e.g. CARA utilities), then the optimal strategy is constant throughout time provided, additionally, all other agents also choose constant strategies. In other words: assume that all agents j != i invest according to constant strategies alpha^j in R and that the local risk tolerance function r^i is constant; then pi^{i,*} is constant.

We now prove the "best responses" proposition above.

Proof (of Proposition 1). From (5) we have the dynamics of X~^i (and hence that of X^i - theta_i X-bar^{(-i)}). We apply the Itô formula to U^i(X~^i_t, t), with initial value U^i(X~^i_0, 0), obtaining (10); here we use that B and the W^j are all independent. By Definition 2, the process U^i(X~^i_t, t) becomes a martingale at the optimum pi. Direct computations using first-order conditions (d/dpi^i of the drift equal to zero) yield (11). Injecting the expression for pi^i_t into the drift term of (10) and simplifying, we arrive at the consistency condition (9). We do not carry out this step explicitly; nonetheless, using that U^i solves (9), equation (10) simplifies to (12) (exact calculations are carried out in Section 6). The concavity assumption on U^i(x, t) implies that the drift term is non-positive and vanishes when (11) holds. We conclude that, if pi^{i,*}_t = pi^i_t in A^i and the associated process X~^{i,*} is well-defined (solution to (6) with pi^{i,*}), the process U^i(X~^{i,*}_t, t) is a local martingale; otherwise it is a local supermartingale. This concludes the proof.
Example 1 (The classic CARA case - exponential case). The exponential criterion takes as initial condition the map U(x, 0), x in R, defined as u_0(x) = -e^{-x/delta} for a risk tolerance delta > 0. In this case, the local risk tolerance function is constant, r(x, t) = delta.

In our case, accounting for social interaction between agents in the form of performance concerns, the i-th agent's utility is a function U^i : Omega x R x R x [0, infinity) -> R of both her individual wealth x and the average wealth of all agents, m. The initial/starting utility map is of the form
$$U^i(x, m, 0) = -\exp\big(-(x - \theta_i m)/\delta_i\big), \qquad (13)$$
where we refer to the constants delta_i > 0 and theta_i in [0, 1] as personal risk tolerance and competition weight parameters, respectively.

Example 2 (The time-monotone forward utility with exponential initial condition). For i in {1, ..., n}, let the dynamics of U^i be given by (8) and assume U^i(x, 0) = -e^{-eta x} with eta > 0. Then the solution of the SPDE (9) is of the form
$$U^i(x, t) = -e^{-\eta x + f_i(t)},$$
where (f_i(t))_{t>=0} is a random map, independent of x, satisfying f_i(0) = 0, sufficiently integrable and with t -> f_i(t) differentiable. Note that in this case the local risk tolerance function satisfies r^i(x, t) = 1/eta. Injecting this form into (9) yields an ODE for f_i (we omit the time variable). In particular, if all coefficients and strategies are constant, then (with a slight abuse of notation) f_i(t) = t lambda_i for a constant lambda_i given by the RHS of that ODE.

Example 3 (No performance concerns: theta_i = 0). We continue to work under the time-monotone forward utility case of the previous example. Without performance concerns, i.e. theta_i = 0, lambda_i is given in terms of the Sharpe ratio, lambda_i = mu_i^2/(2(nu_i^2 + sigma_i^2)), and we recover known results. We have from Proposition 1 that pi^{i,*} = delta_i mu_i/(nu_i^2 + sigma_i^2).

In view of the best responses discussed in Proposition 1, we now investigate the simultaneous best responses so as to establish the existence of a Nash equilibrium.

Definition 3 (Forward Nash equilibrium). A forward Nash equilibrium consists of n pairs of F_t-adapted maps (U^i, pi^{i,*}) such that for any t >= 0 the following conditions hold.
- For all i in {1, ..., n}, pi^{i,*} in A^i;
- For each player i in {1, ..., n}: given the strategies pi^{j,*} in A^j (any j != i), the process U^i(X~^i_t(pi^{*,-i}), t) is a local supermartingale, where X~^i(pi^{*,-i}) solves (6) with all managers j != i acting according to pi^{j,*};
- For each player i in {1, ..., n}: the process U^i(X~^{i,*}_t(pi^{*,-i}), t) is a local martingale, where X~^{i,*}(pi^{*,-i}) solves (6) with all managers j acting according to pi^{j,*}.
If all the optimal strategies are constant we say we have a constant forward Nash equilibrium.

Under appropriate integrability conditions, the martingale/supermartingale characterizations yield, for each agent i and any admissible strategy, that the expected utility of her relative performance metric at the equilibrium dominates that obtained by any unilateral deviation. As expected, no manager can increase the expected utility of her relative performance metric by unilateral decision.

The solvability of the general forward Nash equilibrium seems very difficult for general forward criteria, as one needs to solve a coupled system for the pi^{i,*} (see Proposition 1, in particular (11)) and the corresponding SPDEs for the U^i, i in {1, ..., n}; we label this system (15). In order to obtain explicit results we focus on the time-monotone case presented in Example 2, for which U^i_x/U^i_xx = -delta_i. More notably, at the level at which we have formulated our problem, we can easily recover the results of [19, Theorem 2.3], for which one has U^i_x/U^i_xx = -delta_i for all t (note their Remark 2.5).
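As a sanity check of Examples 2-3, and assuming constant coefficients, a constant strategy and no performance concerns (theta_i = 0), one can verify the (super)martingale requirements directly from the ansatz U^i(x,t) = u_0(x)e^{t lambda_i}; the short computation below is a sketch, not a restatement of the paper's displays.
$$dU^i(X^i_t,t) = U^i(X^i_t,t)\Big(\lambda_i - \frac{\pi\mu_i}{\delta_i} + \frac{\pi^2(\nu_i^2+\sigma_i^2)}{2\delta_i^2}\Big)\,dt + \text{(local martingale)},$$
using $U^i(x,t) = -e^{-x/\delta_i + t\lambda_i}$ and $dX^i_t = \pi(\mu_i\,dt + \nu_i\,dW^i_t + \sigma_i\,dB_t)$ for a constant $\pi$. Since $U^i<0$, the drift is non-positive precisely when the bracket is non-negative. The bracket is a convex quadratic in $\pi$, minimized at $\pi^{i,*} = \delta_i\mu_i/(\nu_i^2+\sigma_i^2)$ with minimum value $\lambda_i - \mu_i^2/(2(\nu_i^2+\sigma_i^2))$; choosing $\lambda_i = \mu_i^2/(2(\nu_i^2+\sigma_i^2))$ therefore makes $U^i(X^i_t,t)$ a supermartingale for every constant strategy and a martingale at $\pi^{i,*}$, in line with Example 3 and Definition 1.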
Theorem 1. Let the market and preference parameters (mu_i, nu_i, sigma_i, delta_i, theta_i) be as above for each i in {1, ..., n}, and assume furthermore that the agents have time-monotone forward utilities U^i with initial condition (13). Define the quantities phi^sigma_n and psi^sigma_n by (16). Then, if psi^sigma_n != 1, a constant forward Nash equilibrium exists, with the constant optimal strategies pi^{i,*} given by (17). The forward Nash equilibrium is given by the n pairs {(U^{i,*}, pi^{i,*})}_{i=1,...,n}, where U^{i,*} is the solution of (9) (see Example 2) under the optimal constant strategies. The term lambda_i (see Example 2) is, at equilibrium, given explicitly in terms of the averages (pi sigma), (pi mu) and (pi nu)^2, whose relevant expressions are given below in (18), (19) and (20).

Remark 1. We note that we solve not the same problem as that studied in [19], but an equivalent one, the two being related by a scaling factor; the corresponding scaling factors converge to 1 as n -> infinity (see Section 4).

Proof. Injecting the condition U^i_x/U^i_xx = -delta_i into (15), we obtain the system to be solved, across i in {1, ..., n}, in order to ascertain the Nash equilibrium. The final line of that system yields the expression for pi^{i,*} as a function of the unknown average $\overline{\pi\sigma}$. To determine the latter, multiply both sides by sigma_i and average over i in {1, ..., n}; this yields the solvability condition
$$\overline{\pi\sigma} = \overline{\pi\sigma}\,\psi^\sigma_n + \varphi^\sigma_n \quad\Longleftrightarrow\quad \overline{\pi\sigma} = \frac{\varphi^\sigma_n}{1 - \psi^\sigma_n},$$
as long as psi^sigma_n != 1. Plugging the expression for $\overline{\pi\sigma}$ into that for pi^{i,*} yields the result. That the optimal strategies are constant is now obvious. It remains to derive the expressions for the lambda_i's. Just as for $\overline{\pi\sigma}$, we obtain an expression for $\overline{\pi\mu}$ by multiplying pi^{i,*} by mu_i and averaging on both sides; here we use (7), and the quantities phi^mu_n, psi^mu_n are defined analogously to (16). Similarly one defines $\overline{(\pi\nu)^2}$ and, similarly to (7), we have
$$\overline{(\pi\nu)^2}^{(-i)} = \frac{n}{n-1}\,\overline{(\pi\nu)^2} - \frac{1}{n-1}\,(\pi^i\nu_i)^2.$$
Replacing these expressions in that for lambda_i in Example 2, the expression in the statement follows.

From the forward utility machinery one can easily recover the classical case of utility optimization, where one prescribes the utility map at the horizon time T and then proceeds to optimize.

Example 4 (Recovering the classical utility problem from the forward one). If one starts the forward utility with a suitably chosen initial condition (for some 0 < T < infinity), then computations like those presented above yield the forward utility map U(x, t) explicitly, and in particular U(x, T) = -e^{-x/delta_i}. In other words, our forward utility recovers as a particular case the classical exponential utility maximization problem (discussed in [19]).

In the single common stock case one obtains the corresponding corollary: if psi^sigma_n != 1, then a constant forward Nash equilibrium exists, with the constant optimal strategies pi^{i,*} given by the analogue of (17).

By inspection of Theorem 1 one sees that the optimal strategy and forward utility map of a given agent depend on that agent's specific parameters (model parameters, initial wealth, risk tolerance and performance concern) and on certain averages of the parameters of all agents. This makes a case for an MFG approach to the game.

In this section, and inspired by the results in the previous one, we formalize the concept of a forward mean-field Nash game. We use the concept of type distributions introduced in [17] and [19], and follow the construction presented in the latter. We focus on initial forward utilities at time t = 0 that are of exponential type (13), where we refer to the constants delta_i > 0 and theta_i in [0, 1] as personal risk tolerance and competition weight parameters, respectively.
For the n-agent game, we define for each agent i = 1, ..., n the type vector zeta_i := (xi_i, delta_i, theta_i, mu_i, nu_i, sigma_i). These type vectors induce an empirical measure, called the type distribution, which is the probability measure on the type space Z_e given by
$$m_n(A) := \frac1n\sum_{i=1}^n 1_{\{\zeta_i \in A\}}, \qquad A \subset Z_e \text{ Borel}.$$
Assume now that, as the number of agents becomes large, n -> infinity, the above empirical measure m_n has a weak limit m, in the sense that int_{Z_e} f dm_n -> int_{Z_e} f dm for every bounded continuous function f on Z_e. For example, this holds almost surely if the zeta_i's are i.i.d. samples from m. Let zeta = (xi, delta, theta, mu, nu, sigma) denote a random variable with this limiting distribution m.

The mean-field game (MFG) defined next allows us to derive the limiting strategy as the outcome of a self-contained equilibrium problem, which intuitively represents a game with a continuum of agents with type distribution m. Rather than directly modeling a continuum of agents, we follow the MFG paradigm of modeling a single representative agent, whom we view as randomly selected from the population. The probability measure m represents the distribution of type parameters among the continuum of agents; equivalently, the representative agent's type vector is a random variable with law m. Heuristically, each agent in the continuum trades in a single stock driven by two Brownian motions, one of which is unique to this agent and one of which is common to all agents. We extend the forward Nash equilibrium of Definition 3 to the MFG setting below.

To formulate the MFG, we now assume that the probability space (Omega, F, P) supports yet another independent (one-dimensional) Brownian motion, W, as well as a random variable zeta = (xi, delta, theta, mu, nu, sigma), independent of W and B, and with values in the space Z_e defined in (22). This random variable zeta is called the type vector, and its distribution is called the type distribution. The representative agent's wealth process X^pi solves (23), where the portfolio strategy must belong to the admissible set A^MF of self-financing F^MF-progressively measurable real-valued processes (pi_t)_{t>=0} satisfying the square-integrability condition E[int_0^T |pi_t|^2 dt] < infinity for any T in [0, infinity). The random variable xi is the initial wealth of the representative agent, whereas (mu, nu, sigma) are the market parameters. In the sequel, the parameters delta and theta will affect the risk preferences of the representative agent. Note that each agent among the continuum may still have different preference parameters, captured by the fact that delta and theta are random.

The formulation of the forward Nash game of Section 3 drives the formulation of the mean-field game we discuss here. Recall that in the MFG formulation the generic agent has no influence on the average wealth of the continuum of agents, being but one agent amid a continuum. We next introduce the concepts of mean-field (MF) forward relative performance, of the MF-equilibrium strategy pi^*, and of the main object of interest, the MF-forward relative performance equilibrium. We recall the framework: we assume that the Itô decomposition of the forward utility map (without noise) is dU(x, t) = U_t(x, t) dt, with initial condition (13), where the derivatives U_t(x, t), U_x(x, t) and U_xx(x, t) exist for t >= 0. Given the market setup we developed so far, we next define our concept of equilibrium.
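Before stating the definition, the conditional average X-bar_t := E^{P(x)m}[X^pi_t | F^B_t] appearing in it can be illustrated numerically: fix one common-noise path B, sample a large pool of agents (types and idiosyncratic noises) from a placeholder type law, and average their wealths. The snippet below is an illustrative sketch only; the type distribution and the constant strategies used are assumptions, not taken from the paper.

    import numpy as np

    # Monte Carlo illustration of Xbar_t = E^{P (x) m}[X^pi_t | F^B_t]:
    # one fixed common-noise path B, many agents with independent types and noises.
    rng = np.random.default_rng(1)
    N, T, dt = 5_000, 1.0, 1e-2        # N = size of the simulated continuum
    steps = int(T / dt)

    # sample (part of) the type vectors zeta = (xi, delta, theta, mu, nu, sigma) from a toy law m
    xi = rng.normal(1.0, 0.1, N)        # initial wealths
    mu = rng.uniform(0.05, 0.15, N)
    nu = rng.uniform(0.10, 0.30, N)
    sigma = rng.uniform(0.10, 0.30, N)
    pi = rng.uniform(0.5, 1.5, N)       # constant strategies, one per agent

    X = xi.copy()
    B_incr = np.sqrt(dt) * rng.standard_normal(steps)   # one common-noise path
    for k in range(steps):
        dW = np.sqrt(dt) * rng.standard_normal(N)        # idiosyncratic noises
        X += pi * (mu * dt + nu * dW + sigma * B_incr[k])
    # by the (conditional) law of large numbers the empirical mean approximates Xbar_T
    print("estimated Xbar_T for this common-noise path:", X.mean())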
Definition 4 (MF-forward relative performance equilibrium (for the generic manager)). Let pi in A^MF and let X^pi solve (23) with pi; to (pi, X^pi) we associate the F^B-adapted square-integrable stochastic process (X-bar_t)_{t>=0}, representing the average wealth of the continuum of agents, defined as X-bar_t := E^{P(x)m}[X^pi_t | F^B_t] for all t >= 0. The F^MF-progressively measurable random field (U(x, t))_{t>=0} is an MF-forward relative performance for the generic manager if, for all t >= 0, the following conditions hold:
i) the mapping x -> U(x, t) is strictly increasing and strictly concave;
ii) for each pi in A^MF, U(X^pi_t - theta X-bar_t, t) is a P-local supermartingale, where X^pi is the generic agent's wealth process solving (23) for the strategy pi;
iii) there exists pi^* in A^MF such that U(X^*_t - theta X-bar_t, t) is a P-local martingale, where X^* solves (23) with pi^* plugged in as the strategy;
iv) we call pi^* an MF-equilibrium if X-bar_t = E^{P(x)m}[X^*_t | F^B_t] for all t >= 0, where X^* solves (23) with pi^* plugged in as the strategy.
We denote the triplet (U, pi^*, X-bar) satisfying i)-iv) the MF-forward relative performance equilibrium. An MF-equilibrium is constant if there exists an F^MF_0-measurable random variable pi^* such that pi_t = pi^*, for all t >= 0.

The last point can be understood as a fixed-point argument which creates a compatibility condition between the generic agent and the continuum of agents. In fact, conditionally on the Brownian motion B, each agent faces an independent noise W and an independent type vector zeta. As in the MFG setting of [19], conditionally on B, all agents face i.i.d. copies of the same optimization problem. The law of large numbers suggests that the average wealth of the whole population should be E^{P(x)m}[X^*_t | F^B_t]. Our construction allows us to identify E^{P(x)m}[X^*_t | F^B_t] with a certain dynamics and, in turn, treat this component as an additional uncontrolled state process. This avoids altogether the conceptualization of the master equation for models with different types of agents; the latter is left for future research.

We now present the main result of this section, which is the existence of an MF-forward relative performance equilibrium for the generic manager, according to Definition 4, within the context of time-monotone forward utilities. From the methodological point of view, the problem is solved as before: apply Itô-Wentzell to U(Z^pi_t, t), determine the optimal strategy pi^* and the consistency condition (the SPDE) for U such that the first three conditions of Definition 4 hold. The last condition, showing that pi^* is indeed the MF-forward equilibrium, follows by construction, as we will see.

Theorem 2. Assume that, m-a.s., delta > 0, theta in [0, 1], mu > 0, sigma >= 0, nu >= 0 with sigma^2 + nu^2 > 0. Assume that the constants phi^sigma, psi^sigma (and their mu-counterparts, see the proof below) are finite and that psi^sigma != 1. Then there exists a unique constant MF-forward relative performance equilibrium in the sense of Definition 4. The constant MF-equilibrium strategy pi^* is given by (24), constrained to a consistency identity. The MF-forward CARA relative performance utility map is the solution of (25). When the initial condition is U(x, 0) = u_0(x) = -e^{-x/delta}, i.e. exponential preferences, U is given explicitly by U(x, t) = u_0(x)e^{t lambda}, with lambda given explicitly below (see (32)), where the averages sigma-alpha and mu-alpha are given by (29) and (30) respectively. If psi^sigma = 1, then there exists no constant MF-equilibrium.

By comparing the statements of Theorem 1 and Theorem 2 (and the same happens for the respective single (common) stock corollaries) one easily sees that, as n -> infinity, the strategies, weights (phi^._n and psi^._n) and forward utility map in Theorem 1 converge to the respective quantities appearing in Theorem 2. In contrast with Remark 1, here we recover the result of [19, Theorem 2.10], as the scaling factors converge to 1 (as n -> infinity). Hence, due to space constraints, we refer the reader to [19, Section 2.3] for the discussion of the equilibria.
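For the reader's convenience, the following records the equilibrium quantities implied by Step 2 of the proof below; it is a sketch consistent with that derivation (the paper's displays (24), (29) and (30) are not reproduced verbatim here, so the exact presentation may differ):
$$\psi^\sigma := E_m\Big[\frac{\theta\sigma^2}{\nu^2+\sigma^2}\Big],\qquad \varphi^\sigma := E_m\Big[\frac{\delta\mu\sigma}{\nu^2+\sigma^2}\Big],\qquad \overline{\sigma\alpha} = \frac{\varphi^\sigma}{1-\psi^\sigma}\quad(\psi^\sigma\neq 1),$$
$$\pi^* = \frac{\delta\mu + \theta\,\sigma\,\overline{\sigma\alpha}}{\nu^2+\sigma^2},$$
an F^MF_0-measurable (type-dependent) random variable, constant in time.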
Proof. We proceed in several steps in order to construct the constant MF-equilibrium. To that end we must solve ii)-iii) in Definition 4 for a given X-bar process associated to pi in A^MF. Condition iv), for the MF-equilibrium, allows us to focus only on processes X-bar associated to constant strategies alpha (Steps 0 and 1 below construct X-bar and the candidate optimal strategy).

Step 2. The optimality of the strategy. The argument is similar to that in [19]. The constant strategy alpha is an MF-equilibrium if and only if, for all t >= 0, X-bar_t = E^{P(x)m}[X^*_t | F^B_t] a.s., that is,
$$\bar\xi + \overline{\mu\alpha}\,t + \overline{\sigma\alpha}\,B_t = \bar\xi + \overline{\mu\pi^*}\,t + \overline{\sigma\pi^*}\,B_t \quad \text{a.s.}$$
Taking expectations on both sides implies that alpha is an MF-equilibrium if and only if the following two conditions hold: $\overline{\mu\alpha} = \overline{\mu\pi^*}$ and $\overline{\sigma\alpha} = \overline{\sigma\pi^*}$. Using (28) with U_x/U_xx = -delta and the expressions for phi^sigma, psi^sigma, one derives that
$$\sigma\pi^* = \frac{\theta\sigma^2}{\nu^2+\sigma^2}\,\overline{\sigma\alpha} + \frac{\delta\mu\sigma}{\nu^2+\sigma^2} \;\;\Rightarrow\;\; \overline{\sigma\pi^*} = \overline{\sigma\alpha}\,\psi^\sigma + \varphi^\sigma,$$
where the implication follows by taking E_m on both sides, with $\psi^\sigma := E_m[\theta\sigma^2/(\nu^2+\sigma^2)]$ and $\varphi^\sigma := E_m[\delta\mu\sigma/(\nu^2+\sigma^2)]$. Using that $\overline{\sigma\alpha} = \overline{\sigma\pi^*}$ yields solvability provided psi^sigma != 1. The same procedure deals with the condition $\overline{\mu\alpha} = \overline{\mu\pi^*}$. We then have
$$\overline{\sigma\alpha} = \frac{\varphi^\sigma}{1-\psi^\sigma}$$
and the analogous expression for $\overline{\mu\alpha}$. Injecting these identities in the expression for pi^* we find (24). For the non-solvability statement: if psi^sigma = 1 and phi^sigma != 0, then the above fixed-point equation for $\overline{\sigma\alpha}$ has no solution and hence no constant MF-equilibrium exists. The case psi^sigma = 1 and phi^sigma = 0 is impossible: since mu > 0 and delta > 0 by assumption, phi^sigma = 0 implies that sigma = 0 m-a.s. and hence that psi^sigma = 0, contradicting the condition psi^sigma = 1.

Step 3. Finding the consistency SPDE and the utility map. We do not carry out this step explicitly; nonetheless, injecting the expressions for pi^*, $\overline{\sigma\alpha}$ and $\overline{\mu\alpha}$ in the drift term of (27) and simplifying, we find the necessary equation (25), i.e. the consistency condition the random field U must satisfy so that the required properties in Definition 4 hold. Just as in Example 2, the time-monotone forward utility equation (25) can be solved, and indeed one has a simplified version: we have U(x, t) = u_0(x)e^{t lambda}, where the F^MF_0-measurable random variable lambda is given by (32) (using (29) and (30)).

We now discuss the mean-field investment problem with dynamic model selection in sequential time-horizons. Over a first time interval [T_0, T_1] the generic agent works with a fixed model specification; lambda_0 denotes the version of (32) for the type of the agent over the time interval [T_0, T_1], where all the coefficients correspond to a type zeta_0, i.e. lambda(zeta_0) = lambda_0. At time T_1, the generic agent assesses the previous model specification and chooses new coefficients (leading to a change in type, say from zeta_0 to zeta_1). The agent then carries out the optimization program over t in [T_1, T_2], but starting from the initial utility U(x, T_1). Under the assumption of constant coefficients, Theorem 2 yields
$$U(x, t) = U(x, T_1)\,e^{(t-T_1)\lambda_1}, \qquad t \in [T_1, T_2],$$
where lambda_1 = lambda(zeta_1) (given by (35)) depends only on information at time T_1. Quick calculations generalize to any time horizon T_j. Assume we work on the time interval [T_j, T_{j+1}]. Stemming from the previous calculations, it is easy to see that the initial condition for the forward utility problem is (with the convention that the empty product equals one)
$$U(x, T_j) = u_0(x)\prod_{k=1}^{j} e^{(T_k - T_{k-1})\lambda_{k-1}},$$
and the MFG forward utility is, for all t in [T_j, T_{j+1}], j >= 1, and using that lambda_j = lambda(zeta_j),
$$U(x,t) = U(x, T_j)\,e^{(t-T_j)\lambda_j} = u_0(x)\prod_{k=1}^{j} e^{(T_k -T_{k-1})\lambda_{k-1}}\cdot e^{(t-T_j)\lambda_j} = u_0(x)\exp\!\Big(T_1(\lambda_0-\lambda_1) + T_2(\lambda_1-\lambda_2) + \cdots + T_j(\lambda_{j-1}-\lambda_j)\Big)\,e^{t\lambda_j}.$$
There are two points to highlight. Firstly, the agent needs to carry the information of what happened in the past in order to have time-consistency at the present time. Secondly, this construction also allows the agents to change not just the model specification (mu, nu, sigma) but also their type, including the risk parameter delta and the performance-concern level theta. The initial wealth is fixed from the previous time interval.
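As a numerical companion to Theorem 2, the constant MF-equilibrium can be approximated by Monte Carlo over the type distribution, using the weights psi^sigma = E_m[theta sigma^2/(nu^2+sigma^2)] and phi^sigma = E_m[delta mu sigma/(nu^2+sigma^2)] from Step 2 of the proof. The snippet below is a sketch under an assumed (placeholder) type law; the function name pi_star and all parameter values are illustrative.

    import numpy as np

    # Monte Carlo approximation of the constant MF-equilibrium of Theorem 2.
    # The type law m is a placeholder; psi/phi are the weights from Step 2 of the proof.
    rng = np.random.default_rng(2)
    M = 200_000                                   # samples from the type distribution m
    delta = rng.uniform(0.5, 2.0, M)              # personal risk tolerances, delta > 0
    theta = rng.uniform(0.0, 1.0, M)              # competition weights, theta in [0, 1]
    mu = rng.uniform(0.05, 0.15, M)
    nu = rng.uniform(0.10, 0.30, M)
    sigma = rng.uniform(0.10, 0.30, M)

    psi_sigma = np.mean(theta * sigma**2 / (nu**2 + sigma**2))   # must differ from 1
    phi_sigma = np.mean(delta * mu * sigma / (nu**2 + sigma**2))
    sigma_alpha = phi_sigma / (1.0 - psi_sigma)                  # fixed point for the average sigma*alpha

    def pi_star(delta_i, theta_i, mu_i, nu_i, sigma_i):
        # equilibrium strategy of an agent of a given type (constant in time)
        return (delta_i * mu_i + theta_i * sigma_i * sigma_alpha) / (nu_i**2 + sigma_i**2)

    print("psi^sigma =", round(psi_sigma, 4), " phi^sigma =", round(phi_sigma, 4))
    print("pi* for type (delta=1, theta=0.5, mu=0.1, nu=0.2, sigma=0.2):",
          round(pi_star(1.0, 0.5, 0.1, 0.2, 0.2), 4))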
In this work we considered two optimal portfolio management problems under forward utility performance concerns. We presented a simplified setting allowing for explicit calculations of the optimal control value function and strategies, and an intuitive validation that the finite-player game reaches the mean-field game in the limit. This work provides a proof of concept for the forward mean-field utility construction, leaving open many questions. Generalizing the dynamics of the forward utility (8) to fully general Itô dynamics and stochastic strategies is also open; a crucial tool for this would be a general Itô-Wentzell-Lions chain rule as developed in [8], and such an approach would also require [25, 11]. Here we addressed only the exponential utilities (CARA) and left the power case (CRRA) open. Even within (8), one can build towards the CRRA case of [19] or include the consumption problem [18]; for the general forward utility case see [9]. Also open is the so-called mean-field aggregation problem, where different agents use utility maps from different families, e.g. CRRA and CARA: [10] would be a starting point for the finite-player case, while the mean-field case would require the multi-class approach of [3, Section 8] together with the parameterization technique of our Section 4. Many other questions can be posed in this context of mean-field forward utilities, ranging from possible non-solvability [13], to risk-sharing [4], ergodic problems [7] and associated numerics [15].

References
[1] Forward exponential performances: pricing and optimal risk sharing
[2] Competitive investment strategies under relative forward performance criteria. Forthcoming
[3] Mean field games and mean field type control theory
[4] Equilibrium pricing under relative performance concerns
[5] The master equation and the convergence problem in mean field games
[6] Mean field games of timing and models for bank runs
[7] An ergodic BSDE approach to forward entropic risk measures: representation and large-maturity behavior
[8] Itô-Wentzell-Lions formula for measure dependent random fields under full and conditional measure flows
[9] Consistent utility of investment and consumption: a forward/backward SPDE viewpoint
[10] Construction of an aggregate consistent utility, without Pareto optimality. Application to long-term yield curve modeling
[11] An exact connection between two solvable SDEs and a nonlinear utility stochastic PDE
[12] Optimal Investment under Relative Performance Concerns
[13] A financial market with interacting investors: does an equilibrium exist?
[14] Passive and competitive investment strategies under relative forward performance criteria. Available at SSRN 2870040
[15] Convergence rate of strong approximations of compound random maps, application to SPDEs
[16] One-dimensional forward-forward mean-field games
[17] Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle
[18] Many-player games of optimal consumption and investment under relative performance criteria
[19] Mean field and n-agent games for optimal investment under relative performance criteria
[20] Investment and valuation under backward and forward dynamic exponential utilities in a stochastic factor model
[21] Optimal asset allocation under forward exponential performance criteria
[22] Portfolio choice under dynamic investment performance criteria
[23] Portfolio choice under space-time monotone performance criteria
[24] Stochastic partial differential equations and portfolio choice
[25] A dual characterization of self-generation and exponential forward performances

We now present the remaining steps of the proof of Theorem 2, followed by the exact calculations referred to as Section 6. Recall from the proof above that we restrict to processes X-bar of the form X-bar_t = E^{P(x)m}[X^alpha_t | F^B_t], where X^alpha solves (23) for a constant (i.e. F^MF_0-measurable) strategy alpha satisfying E_m[alpha^2] < infinity.

Step 0. The dynamics of the average wealth process.
To solve the above problem, given (X-bar_t)_{t>=0}, it suffices to restrict ourselves to processes (X-bar_t)_{t>=0} satisfying X-bar_t = E^{P(x)m}[X^alpha_t | F^B_t], P(x)m-a.s. We then have, P(x)m-a.s.,
$$\overline X_t = \bar\xi + \overline{\mu\alpha}\,t + \overline{\sigma\alpha}\,B_t,$$
where, for consistency of notation with respect to the previous section, we denote $\bar\xi := E_m[\xi]$, $\overline{\mu\alpha} := E_m[\mu\alpha]$ and $\overline{\sigma\alpha} := E_m[\sigma\alpha]$. Hence, for pi in A^MF and as in the previous section, we can define the dynamics of the process Z^pi_t := X^pi_t - theta X-bar_t and solve the MFG forward utility problem in Definition 4 with its help. Applying Itô-Wentzell to U(Z^pi_t, t) yields (27), with U(Z^pi_0, 0) = U(xi - theta xi-bar, 0) = -exp{-(xi - theta xi-bar)/delta}, where we used that B and W are independent. Exact calculations deriving (27) are presented in Section 6.

Step 1. Finding the candidate optimal strategy pi^*. As before, the process U(Z^pi_t, t) becomes a martingale at the optimum pi. Direct computations using first-order conditions (d/dpi of the drift equal to zero) yield (28), where we injected the CARA constraint U_x/U_xx = -delta for all t. By inspection it is clear that pi^* is an F^MF_0-measurable random variable which is independent of time and is well-defined as long as $\overline{\sigma\alpha}$ is finite.

Step 4. The MFG forward utility dynamics. Injecting the consistency SPDE (25) into the expression for dU(Z^pi_t, t) given in (27) yields the equilibrium utility dynamics. We close with a corollary regarding the single common stock case: if psi != 1, then a constant MF-equilibrium exists, with the constant optimal strategy pi^* given by the single-stock analogue of (24).

Over the time interval [0, infinity), our generic agent selects a sequence of horizon times (T_j)_{j in N_0} (such that T_0 = 0, T_{j+1} - T_j > 0 and lim_j T_j = infinity) at which the agent assesses and updates the market model by adjusting the model's coefficients. Comparing with (23), the agent models the stock, over [T_j, T_{j+1}], with coefficients carrying the index j, where the index j represents the model specification at time T_j; the associated wealth process of the generic agent is defined accordingly. Following the earlier constructions of this section, assume that at time T_0 = 0 the agent starts with the initial utility u_0(x) = -e^{-x/delta}. Then, using the results of Theorem 2, the agent's forward utility map is given by the expressions derived above.

Proof (of Proposition 1). We recall that the optimal strategy is given by (11), where we define the relevant averaged quantities as above. The drift of (10) becomes (we omit the arguments in U_t, U_x, U_xx and use $\bar\sigma := \overline{\pi\sigma}$) an expression which, after simplification, results in (12).

Proof (of the derivation of (27)). We take up the drift of (27) and, just by re-organizing the terms and recalling the optimal strategy given by (28), we complete the square inside the U_xx term in the SPDE above; we have
$$0 = U_t + U_x\Big(\frac{\mu\,\theta\sigma\,\overline{\sigma\pi}_t}{\nu^2+\sigma^2} - \theta\,\overline{\mu\pi}_t\Big) + \frac12\,U_{xx}\,\theta^2(\overline{\sigma\pi}_t)^2\Big(1-\frac{\sigma^2}{\nu^2+\sigma^2}\Big) - \frac12\,\frac{U_x^2}{U_{xx}}\,\frac{\mu^2}{\nu^2+\sigma^2}.$$
Under the CARA condition U_x/U_xx = -delta and the choice of the optimal strategy, the remaining drift must zero out. We then have
$$U_t = -\frac{U_x}{\nu^2+\sigma^2}\Big(\mu\theta\sigma\,\overline{\sigma\pi}_t + \frac{\delta\mu^2}{2}\Big) + U_{xx}\,\frac{(\theta\sigma\,\overline{\sigma\pi}_t)^2}{2(\nu^2+\sigma^2)} - \frac12\,U_{xx}\,(\theta\,\overline{\sigma\pi}_t)^2 + U_x\,\theta\,\overline{\mu\pi}_t.$$