key: cord-0829290-3ht6mjcy
authors: Sharp, Jesse A.; Burrage, Kevin; Simpson, Matthew J.
title: Implementation and acceleration of optimal control for systems biology
date: 2021-08-25
journal: Journal of the Royal Society, Interface
DOI: 10.1098/rsif.2021.0241
sha: b68f8ca470c460a7b3e136410bf2c7907de5fe0a
doc_id: 829290
cord_uid: 3ht6mjcy

Optimal control theory provides insight into complex resource allocation decisions. The forward–backward sweep method (FBSM) is an iterative technique commonly implemented to solve two-point boundary value problems arising from the application of Pontryagin’s maximum principle (PMP) in optimal control. The FBSM is popular in systems biology as it scales well with system size and is straightforward to implement. In this review, we discuss the PMP approach to optimal control and the implementation of the FBSM. By conceptualizing the FBSM as a fixed point iteration process, we leverage and adapt existing acceleration techniques to improve its rate of convergence. We show that convergence improvement is attainable without prohibitively costly tuning of the acceleration techniques. Furthermore, we demonstrate that these methods can induce convergence where the underlying FBSM fails to converge. All code used in this work to implement the FBSM and acceleration techniques is available on GitHub at https://github.com/Jesse-Sharp/Sharp2021.

Across the life sciences, we encounter systems over which we wish to exert control. Whether we consider outbreak control in epidemiology [1, 2] , chemotherapy in oncology [3] [4] [5] , muscle contraction and gait regulation in biomechanics [6] [7] [8] , engineering cellular processes in synthetic biology [9, 10] , cell population growth in tissue engineering [11, 12] , or biodiversity and invasive species management in ecology [13] [14] [15] , we face decisions around how a particular intervention should be applied to best achieve desired outcomes. Using mathematical models of such systems, optimal control theory provides insight into these resource allocation decisions.

Optimal control is a science of trade-offs; between competing objectives, or in weighing up the benefits of control measures against their costs. We illustrate some key concepts of optimal control in figure 1. Suppose that without intervention, a crop yield will double, from x 0 to 2x 0 , between now and harvest time. We might consider applying a control, such as fertilizer, to increase the growth rate of the crop; thereby increasing the yield at harvest to 3x 0 . Of course, applying fertilizer comes at a cost, and this must be considered against the increase in crop yield. As such, it is not immediately apparent how much fertilizer should be applied, and over what time period. This depends entirely on our characterization of optimality: the pay-off . Depending on the pay-off, the optimal control may be continuous; whereby the strength can be readily and continuously adjusted throughout time, or bang-bang (discontinuous); whereby the control is applied at either a lower or upper bound with finitely many discrete switches between the two. The pay-off determines the objective(s) of control; which in our stylized example may be to maximize profits after cost of fertilizing is considered, or achieve a specific yield, for example 3x 0 , using the minimum amount of fertilizer.

Much of modern day optimal control theory stems from the seminal works of Pontryagin, through the Pontryagin maximum principle (PMP) [16] , and Bellman, through the advent of dynamic programming and the Hamilton-Jacobi-Bellman equation [17] , in the 1950s and 1960s. These foundations of optimal control are built upon centuries of development in the calculus of variations [18] . For brief but broad expositions of the theoretical roots of optimal control and developments following these seminal works, we direct the reader to articles such as [19, 20] .

Often we are unable to solve optimal control problems analytically, so we pursue computational approaches. Broadly, the numerical methods for optimal control can be classed as either indirect or direct methods; for indirect methods optimality conditions are derived in the calculus of variations fashion via the PMP, leading to a two-point boundary value problem (TPBVP), while for direct methods the control problem is discretized and reformulated as a nonlinear programming problem [21] . For an early history of numerical methods in optimal control, including gradient and conjugate gradient methods, Newton-Raphson methods, quasi-linearization, feasible direction algorithms and feedback solutions we suggest [22] . Surveys [20, 21] give an excellent overview of more recent developments in relevant numerical methods, including the forward-backward sweep method (FBSM), multiple-shooting methods, control parameterization, collocation and pseudospectral methods and complete discretization into finite-dimensional nonlinear programming problems.

Optimal control methodology and numerical solution techniques are continually being developed and improved. The growing popularity of artificial intelligence, machine learning and related disciplines has precipitated significant advances in computational techniques for handling large-scale systems with many variables, and related infinite-dimensional optimization problems. Nonlinear approximators, including neural networks, can be used to reduce infinite-dimensional optimization problems to finite-dimensional nonlinear programming problems. This approach is presented in [23] , alongside other techniques that arise through unifying aspects of decision science, dynamic optimization, statistical and deterministic machine learning, nonlinear approximation theory and other fields. One example of control paired with machine learning arises in autonomous vehicles, where machine learning techniques can accelerate obtaining approximately optimal controls where computational power on-board is limited and controls satisfying strict safety constraints must be obtained rapidly [24] . Reinforcement learning, a technique from artificial intelligence resembling a model-free analogue of dynamic programming, has shown promising simulation results for the control of multi-species microbial communities in bioreactors [25] .

Formulation and approximate solutions of fractional optimal control problems (FOCP)-optimal control of systems involving fractional derivatives-has also garnered wide interest recently within the control, numerical methods and applied mathematics communities; resulting in the development of new numerical approaches such as the non-standard two-step Lagrange interpolation method [26, 27] ; and amalgamations of new and existing techniques, such as pairing predictor-corrector methods for solving fractional differential equations with the FBSM for optimal control [28, 29] . Applications involving FOCPs arise in areas of systems biology including epidemiology, where the incorporation of memory effects through fractional time derivatives may better describe disease transmission, by modelling the capacity for the population to learn from past outbreaks [26, 30] ; and in cancer therapy for determining optimal chemotherapeutic and immunotherapeutic treatment regimens [31, 32] .

The field of optimal control has historically focused on determining optimal interventions to apply to systems to meet specified objectives. More recently, however, optimal control techniques have been applied in a systems biology context to further our understanding of the underlying mechanisms or processes involved in a given system; for example via inverse optimal control, whereby exhibited behaviour observed in a system is used to elicit the underlying optimality Figure 1 . A pictorial example of optimal control for a growing crop. Suppose that initially, the crop yield is x 0 . We want to grow this crop to increase the yield, represented by the green line, come harvest time. Actions taken to increase the growth rate of the crop, such as applying fertilizer, are the controls, represented in black dash. Scenarios are presented for (a) no control, (b) continuous control and (c) bang-bang control. Optimal control theory helps us determine how best to apply these controls. Illustrations adapted from ilyakalinin/iStock/Getty Images, johavel/iStock/Getty Images. royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241 principles that may guide the system [33] . Optimality principles have been employed to investigate mechanisms in metabolism; for example, in [34] , where optimal control techniques provide rationalization for experimentally and numerically observed sequential activation of metabolic pathways; in [35] where optimal control techniques predict enzyme activation times and metabolite concentrations; and in other work reviewed in [36] , where further insights are gained regarding metabolic pathway activation and regulation. Optimal control has also provided insight into the emergence of persister cells in the presence of environmental volatility [37] .

The FBSM is an iterative method for solving the TPBVPs that arise from the indirect PMP approach to optimal control. In systems biology, the FBSM for optimal control is very popular, owing particularly to its straightforward scalability to large systems, and to its moderate computational cost and mathematical complexity [38] . In this work, we review the implementation of the FBSM to solve optimal control problems, and investigate means of accelerating the convergence. To contextualize our discussion of the FBSM, we first consider the more familiar technique of successive over-relaxation (SOR). SOR is a generalization of the Gauss-Seidel method, and is widely applied in numerical linear algebra to accelerate convergence when solving linear systems iteratively [39] . Essentially, the process of SOR involves specifying an acceleration or relaxation parameter, β ∈ (0, 2); a weighting factor that serves to reduce the spectral radius of the iterative matrix operator [40] . The error and rate of convergence of SOR is sensitive to this ( problem dependent) choice of β, prompting investigation into theoretical convergence results and methods of determining β [40] [41] [42] . Despite challenges in identifying the optimal β, the SOR has historically been widely applied and studied in the literature due to the ease with which it can be implemented, and the rapid convergence it can deliver; even without identifying the optimal β [43, 44] .

This narrative closely parallels that of the FBSM in optimal control, where a weighting factor ω can be applied when updating the control between iterations to aid convergence. The optimal choice of ω is problem dependent, and significantly impacts the rate of convergence, or whether the FBSM converges at all. Nonetheless, the FBSM is frequently used in applied optimal control work as it is relatively straightforward to implement, and can still converge in absence of the optimal ω. Theoretical convergence results of the FBSM are available in the literature [45, 46] , although the focus is on the FBSM without weighted updating, with no consideration for choosing ω. Using regularization techniques, the FBSM is modified in [47] to improve convergence properties for large systems in a continuous setting, with a view to training deep neural networks in machine learning. These convergence results have recently been extended to the numerically discretized setting through symplectic Runge-Kutta discretization; taking advantage of the variational structure of optimal control problems [48] . The authors also demonstrate that the rate of convergence of the regularized FBSM with symplectic discretization can be improved with Anderson acceleration, an iterative acceleration technique. Although promising, this regularization introduces a regularization parameter, ρ. Similar to ω, the choice of ρ impacts convergence, and its choice is problem dependent. Understanding and implementing the regularization and symplectic techniques is not trivial, and introduces conceptual complexity beyond what is necessary for many applied optimal control problems. As such, the standard FBSM remains an attractive choice for practitioners.

To this end, we aim to review acceleration techniques that can be paired with the standard FBSM. We implement such techniques alongside the FBSM with the goals of: (1) increasing the rate and frequency of convergence and (2) reducing the importance of, and challenges associated with, selecting ω. A graphical overview of the optimal control process we employ in this work, including the incorporation of acceleration methods, is presented in figure 2 . We note that all code used to implement the algorithms presented in this review, the FBSM and the Wegstein, Aitken-Steffensen and Anderson acceleration methods, is available on GitHub (https://github.com/Jesse-Sharp/Sharp2021).

Throughout this work, we consider optimal control in the systems biology context. However, we note that optimal control is relevant to a wide variety of fields including chemical engineering [49] , aeronautics and astronautics [21] , management science and economics [50] . The FBSM, and by extension, the acceleration techniques we consider in this work, can be readily applied in any of these areas.

In §2, we review the PMP approach to optimal control, and the implementation of the FBSM. We provide a singlevariable linear model, and a multi-variable nonlinear model in §3; and pose and solve example continuous, bang-bang (discontinuous), and fixed endpoint control problems. We review potential iterative acceleration methods in §4, and present the results of selected techniques in §5. We discuss the performance of these techniques in §6, and identify opportunities for application and further investigation.

In an optimal control problem with one state variable, x(t), one control, u(t), over a fixed time interval, t ∈ [t 0 , t N ], such as the crop growth example presented in figure 1, we seek the optimal control u*(t) that minimizes or maximizes a specified pay-off function, J, subject to the dynamics of the state. In this section, we briefly review the PMP approach to such an optimal control problem, and the standard implementation of royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241 the FBSM for solving the resulting two-point boundary value problem. The FBSM is readily extended to problems with multiple state variables, multiple controls, state constraints and free end-times [5, 38, 46, 51] ; however for this overview, we restrict ourselves to the single variable, single control, fixed end-time case for clarity. The pay-off typically comprises a cost function Lðt, xðtÞ, uðtÞÞ integrated over the time interval, and/or a function, ϕ, of the state at final time: ϕ(x(t N )). As such, we seek to minimize or maximize J, subject to

Lðt, xðtÞ, uðtÞÞ dt ð2:1Þ

and dx dt ¼ f ðxðtÞ, uðtÞ, tÞ, xðt 0 Þ ¼ x 0 : ð2:2Þ

Applying the PMP, we construct the Hamiltonian Hðt, xðtÞ, uðtÞ, lðtÞÞ ¼ Lðt, xðtÞ, uðtÞÞ þ lðtÞf ðxðtÞ, uðtÞ, tÞ, where λ(t) is the co-state variable linking our state to our pay-off. The necessary conditions for optimal control are obtained from the Hamiltonian:

(1) The optimal control, u*(t), is obtained by minimizing the Hamiltonian @H @u ¼ 0: ð2:3Þ

(2) The co-state is found by setting

satisfying the transversality condition

ð2:4Þ

Following these steps yields a TPBVP to solve for x(t), λ(t), subject to x(t 0 ) = x 0 , and λ(t N ) = λ N . To solve this numerically, we discretize t into N + 1 time points separated by a step-size dt = (t N − t 0 )/N; t = [t 0 , t 0 + dt, …, t 0 + Ndt] = [t 0 , t 1 , …, t N ]. Here, we consider a uniform discretization in time; although this is not strictly necessary, as discussed in §3. Using superscripts to denote the iteration number, provide an initial guess of the control at each t; u ð0Þ ¼ ½u 3), generate a temporary update for the control,û ð1Þ . The next iteration begins with an updated guess for the control, u (1) . These steps are repeated until a convergence condition is satisfied. The algorithm for the FBSM is summarized in §1 of the electronic supplementary material.

In some instances, directly updating the control, such that u ðkÞ ¼û ðkÞ , k ¼ 1, 2, . . . ð2:5Þ

is sufficient; however more commonly a weighted update is performed [5, 38] , such that in the (k + 1)th iteration,

This weighted updating is also referred to as applying a relaxation factor, similar to SOR as discussed in §1. An appropriate choice of ω in equation (2.6) can accelerate convergence relative to equation (2.5), or in some cases induce convergence where equation (2.5) leads to divergence. The weighting parameter, ω, can be held constant between iterations, although faster convergence may by achieved by updating ω. For example, by reducing ω as the system approaches convergence, a greater portion of the updated control is maintained relative to the control from the previous iteration [38] , possibly accelerating convergence. A challenge commonly faced in implementing this control updating scheme is that the best choice for ω is problem dependent, and often is determined heuristically in practice. We address the extent to which the proposed acceleration algorithms address this issue in §4.

To facilitate the following discussion regarding acceleration, we note that the FBSM can be thought of as a generalized fixed point iteration [46] , where each iteration comprises a forward and backward sweep and a control update. As such, for a control problem with one control, discretized into N + 1 time points, each iteration of the FBSM can be thought of as the application of a nonlinear operator, F , of dimension N + 1, such that u ðkþ1Þ ¼ F ðu ðkÞ Þ, or: Importantly, we use the term function evaluation in this work to refer to the process of solving the system of ODEs for the state forward in time and the system of ODEs for the co-state backwards in time, once. This aligns with a single iteration of the standard FBSM. The function evaluation nomenclature becomes convenient when discussing the FBSM in the context of acceleration algorithms that typically focus on reducing the number of times expensive functions are evaluated. Producing numerical solutions to the ODE systems is by far the most computationally expensive component of the FBSM. This computational expense increases with the size and complexity of the systems; reducing the number of times these systems must be solved becomes more advantageous as the size and complexity of the systems increases. The function evaluation description also facilitates comparison between acceleration methods that require solving the ODE systems a different number of times per iteration. Throughout this work, we use N to denote the total number of function evaluations a given method takes to achieve convergence.

The FBSM can be extended to handle problems where we aim not only to minimize or maximize a given quantity over time but also ensure that a specific state is reached at final time. This aligns with the crop growth example from figure 1 if the objective is to achieve a specific yield of 3x 0 at harvest, rather than to maximize yield. In this case, we may have an integral term in the pay-off as described in equation (2.1); royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241 however, the function of the final state, ϕ(x(t N )), is redundant in a control problem with a prescribed final state. Equation (2.2) is also modified to incorporate the additional constraint

Lðt, xðtÞ, uðtÞÞ dt, subject to dx dt ¼ f ðxðtÞ, uðtÞ, tÞ,

Here, x N is the specified state that must be reached at final time. Since we have introduced an additional boundary value to the system, we no longer obtain the transversality condition from equation (2.4) . Instead, we seek the final time condition on the co-state, λ N , and associated optimal control that satisfies equation (2.7). We proceed by considering an adapted FBSM that takes as an input a guess for this final time condition,l N , and solves the corresponding control problem. If we denote this application of the FBSM as the function Vðl N Þ, and the corresponding final value of the state,x N , then the adapted FBSM is an iterative process that solves for the root of Vðl N Þ; the value ofl N for which x N Àx N ¼ 0. This outer iterative process can be solved using standard techniques such as the bisection method or secant method; the former converging more reliably provided that the initial guesses forl N form an interval that brackets the root, the latter converging in fewer iterations [38] . Each of these outer iterations necessitates solving a boundary value problem to convergence, often involving numerous iterations of the FBSM. In this work, we apply the secant algorithm as presented in [38] without modification, for the adapted FBSM. The acceleration techniques described in §4 are applied only to the inner FBSM processes, reducing N for each internal FBSM problem, leaving the outer secant iterations unchanged. Using N ðkÞ to denote the number of function evaluations in the kth internal FBSM problem, we can express the cumulative function evaluations required for convergence of the adapted FBSM as S, such that

The adapted FBSM can also be used to solve control problems with isoperimetric constraints; integral constraints of the form ð tN t0 hðt, xðtÞ, uðtÞÞ dt ¼ K,

where K is a prescribed constant. For example, if h(t, x(t), u(t)) = u(t), then K represents a specific and known amount of control that must be applied. The approach to solve problems with isoperimetric constraints, as outlined in [38] , is to introduce an additional state variable, z, with

This transforms the problem with an isoperimetric constraint into a problem with a fixed endpoint, that can be solved using the adapted FBSM as described.

To investigate the robustness and effectiveness of the iterative acceleration techniques that we will discuss in §4, we consider two distinct systems, and for each system we study three example control problems. The first system is a single species linear differential equation subject to a control. We later demonstrate that under certain conditions we are able to obtain exact solutions for control problems applied to this model. The second system is a three species model for acute myeloid leukaemia (AML) governed by a coupled nonlinear system of differential equations, subject to a control.

We construct the linear model to examine the behaviour of the acceleration techniques as applied to a simple idealized set of control problems. We include the AML model, variations upon which have been considered in the literature [5, 51, 52] , to examine how the acceleration techniques perform when applied to problems more reflective of those considered in applied optimal control. For each model, we consider three distinct control problems: continuous control, bang-bang control and continuous control with fixed endpoint. For all control results presented in this work, convergence is deemed to be achieved when the error, ε, measured as the Euclidean norm of the difference between subsequent controls, falls below a tolerance of 1 × 10 −10 . Numerical solutions to ODEs are obtained using a fourth-order Runge-Kutta method [53] with constant time-stepping. A uniform time discretization is sufficient for all control problems considered in this work. However, the FBSM and acceleration methods readily generalize to a non-uniform discretization. If the desired discretization for the state equations differs from that of the co-state equations, it is necessary to perform interpolation within each iteration of the FBSM to obtain values at corresponding time points. This can be computationally expensive and introduce an additional source of error. Where the desired discretizations for the state and co-state differ, numerical schemes with internal interpolation such as continuous Runge-Kutta methods may be appropriate [54, 55] .

The linear model is a single species model for the growth of x(t), subject to control u(t) that increases the growth rate. This model could represent our stylized crop growth example presented in §1. We suppress the explicit time dependence of the state and co-state variables and the control in the following equations for notational convenience. For numerical results, we solve the linear problems on the domain 0 ≤ t ≤ 1, with time-step dt = 3.91 × 10 −3 , giving N = 257 time points.

In the absence of control, u(t) ≡ 0, this model admits the solution x(t) = x 0 e γt , describing exponential growth.

We seek to maximize a quadratic cost function J, subject to

Following the standard PMP approach for solving optimal control problems, we form the Hamiltonian and derive the co-state equation, transversality condition and optimality condition. The Hamiltonian is given by

The co-state equation is

royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241 with transversality condition λ(1) = 0. In this case, the optimality condition is

such that the optimal control is given by

ð3:5Þ

For model parameter γ = 0.5 and pay-off weightings a = b = 1, with initial condition x 0 = 1, we are able to solve the control problem analytically using standard techniques for linear systems with complex eigenvalues [56] . The process is laborious so we present the approach and analytical solution in §2 of the electronic supplementary material. In the electronic supplementary material, we also plot the analytical results against the numerical results to demonstrate the excellent agreement. The numerical solution to the linear continuous control problem is presented in figure 3 . Convergence via the FBSM requires N ¼ 57 iterations.

For the bang-bang control, we consider the same state equation as in equation (3.1), and incorporate bounds on the control.

We seek to maximize a cost function J that is linear in u,

We form the Hamiltonian and derive the co-state equation and transversality condition

The co-state equation is

In seeking the optimality condition, we find

As equation (3.6) does not depend on u, we define a switching function

and produce an expression for the control, based on the bounds on u and the sign of the switching function:

If ψ(t) is zero over any finite interval excluding isolated points, the optimal control is singular rather than bang-bang. Over such intervals, minimization of the Hamiltonian does not provide sufficient information to determine the optimal control, and further conditions must be considered [38, 57] . We restrict our focus in this work to non-singular bang-bang optimal control problems. The numerical solution to the linear bang-bang control problem is presented in figure 4 . Convergence to this solution via the FBSM required N ¼ 8 iterations.

For the fixed endpoint problem, we proceed with the same state equation; however we now impose a terminal condition on x.

We seek to maximize the same quadratic cost function J, as considered in equation (3.2) . As such, we form the same Hamiltonian given in equation (3. 3) and derive the same co-state, equation (3.4) , and expression for the control, equation (3.5) . Note however that we do not prescribe a final time condition on the co-state equation via the transversality condition; as the system already has two boundary conditions, doing so would cause it to be overdetermined. royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241 Instead, we make two guesses for λ(1); for example, λ (0) (1) = −10 and λ (1) (1) = 10. We proceed by applying the adapted FBSM outlined in §2, using these guesses to initialize the secant method. Numerical results for the linear fixed endpoint control problem are presented in figure 5 . Convergence of the adapted FBSM is achieved after S ¼ 177 iterations.

The AML model is a nonlinear coupled multi-species model describing the interactions between progenitor blood cells, A(t), and leukaemic stem cells, L(t), that occupy the same niche in the bone marrow, thereby competing for space and resources. Haematopoietic stem cells, S(t), act as upstream production of A(t). These dynamics have been explored in the literature both experimentally [58, 59] , and through mathematical modelling [52, 60] . We subject the model to a chemotherapy-like control, u(t), that acts as an additional death term for L(t). The state can be expressed as

As there are now three state equations, we require three co-state equations:

We suppress the explicit time dependence of the state and co-state variables and the control in the following equations for notational convenience:

For each control problem associated with the AML model, we use initial conditions that yield a coexisting steady state in the absence of control (all three species non-zero): 

For the AML continuous control problem, we seek to minimize a quadratic cost function J that accounts for both the cost of applying the control and the cost of the leukaemic burden, subject to

We form the Hamiltonian and derive the co-state equation, transversality condition and optimality condition. The Hamiltonian is given by

The co-state equations are

with transversality conditions λ 1 (10) = λ 2 (10) = λ 3 (10) = 0, obtained in the usual way. In this case, the optimality condition is

such that the optimal control is given by

Numerical solutions for the AML continuous control problem are presented in figure 6 . These solutions are obtained via the FBSM, requiring N ¼ 38 iterations with ω = 0.55. This choice of ω minimizes N for the AML continuous control problem solved with the FBSM without acceleration techniques. We discuss the choice of ω further in §5.

For the bang-bang AML problem, we consider the same states as in equation (3.8), and incorporate bounds, 0 ≤ u ≤ 0.3, on the control. We seek to minimize a cost function J that is linear in the control and the state variable L:

We form the Hamiltonian and derive the co-state equations, transversality conditions and optimality condition. The Hamiltonian is given by

royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241

The co-state equations are

with transversality conditions λ 1 (10) = λ 2 (10) = λ 3 (10) = 0. In this case, the switching function is

such that the optimal control is given by

Note that the correspondence between the sign of ψ(t) and the chosen bound is reversed in equation (3.17) relative to equation (3.7) as we are now performing minimization rather than maximization. Numerical solutions for the AML bang-bang control problem are presented in figure 7 . These solutions are obtained via the FBSM, requiring N ¼ 34 iterations with ω = 0.4. This choice of ω minimizes N for the AML bang-bang control problem solved with the FBSM without acceleration techniques. We discuss the choice of ω further in §5.

For the fixed endpoint problem, we proceed with the same state equations as for the AML continuous control problem given in equation (3.8); however we now impose a terminal condition on the leukaemic population: L(10) = 0.05. We seek to minimize the same quadratic cost function J, as considered in equation (3.9) . We form the same Hamiltonian given in equation (3.10) and derive the same co-state, equation (3.11), and expression for the control, equation (3.13). Table 1 . AML model parameters. Parameters correspond to those presented with the original model [52] , with immune response parameters introduced in subsequent work [5] .

characteristic rate of the immune response royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241

We obtain final time conditions, λ 1 (10) = λ 2 (10) = 0, via the transversality conditions as usual; however we do not prescribe λ 3 (10) . Instead, we make two guesses for λ 3 (10); for instance, l 

In this section, we outline several techniques for acceleration of iterative schemes. Where appropriate, we first present the univariate/scalar version of the method for familiarity, then provide the multivariate/vector analogue of the method for use with accelerating the FBSM. We attempt to use notation that aligns most closely with commonly used notation in the literature, while maintaining internal consistency in this work. In the scalar case, we consider the iterative process x (k+1) = f (x (k) ), where x (k) is the kth iterate and f is the iterating function. In the vector case, we consider X (k+1) = F(X (k) ), where X ðkÞ ¼ ½x We note that many of the methods presented here can be written in several different forms. While some forms better facilitate analysis of aspects such as convergence speed and numerical stability, others emphasize ease of understanding and implementation. In this work, we prioritize usability and present methods and algorithms in forms reflective of their implementation where possible. For the purpose of this work, we feel it is sufficient to present the methods and discuss their implementation without delving into their derivation or rigorous theoretical convergence results. For readers interested in these aspects, we suggest these articles [61, 62] , and numerical analysis texts [63, 64] .

Newton's method is one of the most prevalent root-finding algorithms, due to its relatively straightforward implementation and potential for quadratic convergence [64] . For a univariate function, Newton's method is given by

We arrive at the scalar secant method by replacing the derivative term, f 0 (x (k) ), in equation (4.1) with a finite difference approximation

x ðkþ1Þ ¼ x ðkÞ À f ðx ðkÞ Þ

x ðkÞ À x ðkÀ1Þ f ðx ðkÞ Þ À f ðx ðkÀ1Þ Þ :

Newton's method for multivariate systems is

where DX ðkÞ is obtained by solving J k DX ðkÞ ¼ ÀFðX ðkÞ Þ:

Here, J k is the Jacobian matrix of F evaluated at X (k) [64] . Setting aside the interpretation of the Jacobian in the context of the FBSM, numerically approximating an N × N Jacobian matrix using finite differences requires OðN 2 Þ FBSM iterations at each Newton step. A range of quasi-Newton methods have been developed to minimize the computational expense associated with computing the Jacobian at each Newton step. It is not immediately apparent how the secant method should be extended to multivariate systems, but one such interpretation is the quasi-Newton Broyden's method. Broyden's method reduces the number of function evaluations required at each Newton step by forming the full Jacobian only initially, then updating the Jacobian matrix via a rank-one update based on the secant method [63, 65] . We later discuss the Wegstein method [66] , which is another interpretation of the secant method in multivariate settings.

In the context of accelerating the FBSM, techniques that require forming or approximating a full Jacobian, even once, are not appropriate. We have an N + 1 dimensional system, where N + 1 is the number of time points in the discretization of the ODEs, so we expect N to be large, relative to the number of iterations required for the FBSM to converge without acceleration techniques, via equation (2.6). As such, we restrict our focus to Jacobian-free methods in the remainder of this section; in particular, we discuss and implement the Wegstein and Aitken-Steffensen methods and Anderson acceleration. We provide a broad overview alongside the key equations here, and provide complete algorithms alongside notes for implementation in §4 of the electronic supplementary material.

Wegstein's method can be thought of as an element-wise extension of the secant method to multivariate systems [67] . Although Wegstein's method appears less popular than other methods considered in this work, it has found practical utility, particularly in chemical and process engineering software [68, 69] . We include it here due to the striking similarity it bears to the control update with relaxation presented in equation (2.6) . It is also one of the more straightforward techniques, both in conception and implementation:

x ðkþ1Þ ¼ f ðx ðkÞ Þ,

x ðkþ1Þ ¼ q ðkÞ x ðkÞ þ ð1 À q ðkÞ Þx ðkþ1Þ ,

ð4:2Þ

where q ðkÞ ¼ a ðkÞ a ðkÞ À 1 , and a ðkÞ ¼ f ðx ðkÞ Þ À f ðx ðkÀ1Þ Þ x ðkÞ À x ðkÀ1Þ : ð4:3Þ

In implementation, from an initial value x 0 , it is necessary to perform two function evaluations, i.e. x 1 = f (x 0 ), and f (x 1 ), before it is possible to compute equation (4.3) for the Wegstein method [66] . In subsequent iterations only one new function evaluation is required.

The extension of Wegstein's method to multivariate systems follows exactly the process outlined in equations (4.2) and (4.3), as it is extended element-wise. While convergence is guaranteed when using Wegstein's method for a single nonlinear equation, the uncoupling implied by the element-wise extension can lead to divergence [70] .

In equations (4.2) and (4.3), q (k) denotes q in the kth iteration; however, we note that it is not necessarily most effective to update q every iteration. As such, in this work, we explore various updating regimes. There is also the option of applying bounds on q. Bounds of −5 < q i < 0, 8i, where i denotes the ith element of the system, are frequently applied when implementing Wegstein's method [71, 72] . This bounding appears to work reasonably well for the small nonlinear test systems we consider in §5 of the electronic supplementary material, although we were not able to identify a theoretical result supporting this specific choice. For the control problems we consider, this bounding is not effective, so we apply different bounds, discussed further in §5. The univariate Wegstein method can be thought of as a modification of the Aitken method, which at the time the Wegstein method was developed, was only understood for the univariate case [73] .

Aitken's Δ 2 method, also referred to as Aitken's delta-squared process or Aitken extrapolation, was originally posed by Aitken in 1927 as a means of extending Bernoulli's method of approximating the largest root of an algebraic equation. This extension facilitates numerically approximating not only the largest root, but all roots of the equation [74] . Aitken's method generates a new sequence,x, in parallel to the fixed point iteration.

x ðkÞ ¼ x ðkÞ À ðx ðkþ1Þ À x ðkÞ Þ 2 x ðkþ2Þ À 2x ðkþ1Þ þ x ðkÞ , or

where Δ is the difference operator; Δx (k) = x (k+1) − x (k) , and the higher order operator is applied recursively; [64] . From an initial value, x (0) , two function evaluations, iterations of the underlying fixed point process, must be performed to obtain x (1) and x (2) , before equation (4.4) can be computed. The derivation of Aitken's method assumes an underlying linearly converging series of iterates. The order of convergence of the resulting Aitken accelerated series is still linear; however, this series converges faster than the original series [63] . We discuss Aitken's Δ 2 method and Steffensen iteration together, as Steffensen iteration is a straightforward extension of Aitken's method, whereby the Aitken value,x ðkÞ , is used to continue the fixed point iteration, i.e. x ðkþ1Þ ¼x ðkÞ . Despite the striking similarity, Steffensen's method was seemingly developed shortly after (1933) and without knowledge of Aitken's method [75] . Steffensen iteration can achieve quadratic convergence [ [78] and in a series of papers by Noda [79] [80] [81] .

Aitken and Steffensen iteration can be extended to multivariate systems [64] . In the following statements, we outline the method for an N + 1 dimensional system, where ΔX (k) = X (k+1) − X (k) , X ðkÞ is a matrix constructed with columns (X (k) , X (k+1) , …, X (k+N ) ), such that X ðkÞ is a square matrix of dimension N + 1, with DX ðkÞ ¼ X ðkþ1Þ À X ðkÞ , and D 2 X ðkÞ ¼ DX ðkþ1Þ À DX ðkÞ . In the form given by equation (4.5) , there are glaring issues with using the Steffensen method to accelerate convergence of the FBSM. Setting aside the question of whether D 2 X ðkÞ is invertible, forming X ðkÞ would require OðNÞ iterations of the FBSM to be performed, and since N relates to the number of time points in the discretization of the ODEs in the FBSM, we expect N to be large, relative to the number of iterations required for the FBSM to converge without acceleration.

We instead consider a modification of the Steffensen method, requiring fewer function evaluations per iteration. Introduce m < N, and define ΔX (k) = X (k+1) − X (k) as before, X ðkÞ is now a rectangular matrix constructed with columns (X (k) , X (k+1) , …, X (k+m+1) ), such that X ðkÞ [ R Nþ1Âmþ2 , with DX ðkÞ ¼ X ðkþ1Þ À X ðkÞ , and D 2 X ðkÞ ¼ DX ðkþ1Þ À DX ðkÞ , both of dimension N + 1 × m. We now interpret the matrix inverse in equation (4.5) as the Moore-Penrose pseudoinverse [82] , a generalization of the matrix inverse for singular and rectangular matrices; we discuss this further in §3 of the electronic supplementary material. This partial implementation requires only m + 1 function evaluations per iteration. For the remainder of this document when referring to the Steffensen method we are specifically referring to this partial Steffensen implementation. We present the derivation of the multivariate Aitken-Steffensen method and outline where the partial implementation differs in §3 of the electronic supplementary material.

Anderson acceleration or Anderson mixing, originally denoted as the extrapolation algorithm by Anderson in the 1960s [83] , is a technique developed for accelerating convergence of fixed point iteration problems with slowly converging Picard iterations [84] . Anderson acceleration is of particular interest in this work, as it has recently been implemented to accelerate the convergence of a regularized version of the FBSM [48] . In contrast to a standard fixed point iteration, whereby the next iterate depends only on the immediately preceding iterate, Anderson acceleration has 'memory' through the inclusion of previous iterates [85] . Unlike other methods considered in this work, Anderson acceleration explicitly uses the differences between residuals of subsequent iterates alongside iterates and their differences in computing future iterates.

Anderson acceleration involves solving a least-squares problem at each iteration. The problem can be expressed in both constrained and unconstrained forms, with the updating step dependent on the form [86, 87] . We solve the following unconstrained least-squares problem in each iteration of where G (k) = F(X (k) ) − X (k) is the residual, dX (k) is a matrix with columns (ΔX (k−m) , ΔX (k−m+1) , …, ΔX (k) ), and dG (k) is a matrix with columns (ΔG (k−m) , ΔG (k−m+1) , …, ΔG (k) ), and m indicates the number of previous iterates that are incorporated.

As a precursor to implementing these acceleration methods for control problems, we apply them to solve example nonlinear systems of dimension 2 × 2, 3 × 3 and 4 × 4. We provide these systems and the results of the acceleration methods compared to standard fixed point iteration in §5 of the electronic supplementary material. We do not discuss these results in detail, although broad comparisons regarding the application of the acceleration methods to these systems and to control problems are made in §6. We provide code on GitHub (https://github.com/Jesse-Sharp/Sharp2021) for implementing the acceleration algorithms to solve systems of arbitrary size.

In this section, we discuss the results of applying the acceleration algorithms. When discussing results we are solely focused on reducing N , the number of function evaluations required for the control problems to reach convergence; as in all convergent cases we arrive at the same optimal control results. We first discuss the aspects of each method that can be tuned, then outline the results of the standard FBSM with the best choice of ω but without acceleration methods applied, to establish a baseline against which to compare the acceleration methods. A detailed suite of results for each control problem and each acceleration method, for various combinations of tuning parameters, is provided in §6 of the electronic supplementary material.

Each method we consider has parameters that can be tuned to improve performance for a given problem. For the FBSM without acceleration, we can select ω ∈ [0, 1); the parameter that weights the contribution of the control from the previous iteration, and the newly calculated control, to the control used in the next iteration, as stated in equation (2.6) . Control problems based on the linear model are able to converge via direct updating, as given in equation (2.5) , equivalent to ω = 0. Increasing ω in this case only serves to increase N , so we do not attempt to tune ω when considering the linear model. Using the standard FBSM without acceleration the continuous linear problem requires N ¼ 57 while the bang-bang linear problem requires only N ¼ 8.

royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241

In figure 9 , we plot N against ω ∈ [0, 1), for the continuous and bang-bang AML problems. As expected, for small ω we find that the problem does not converge, and for large ω, N increases. For the continuous AML problem we identify ω = 0.55 as the best choice, with N ¼ 38. For the bang-bang AML problem, we find that ω = 0.4 is best, with N ¼ 34.

Recall that fixed endpoint problems are solved using the adapted FBSM; this entails solving several control problems to convergence with the FBSM. Each of these control problems can have a different optimal ω. In this instance, ω = 0.55 also happens to be best for the AML fixed endpoint problem if holding ω constant, when considering ω ∈ [0, 1) at increments of 0.05. These ω values will not coincide in general. When applying the acceleration methods to the fixed endpoint problems, we employ the tuning parameters that perform best for the continuous problem. This does not imply that we are using the best tuning parameters for the acceleration methods in the context of the fixed endpoint problem. Importantly, this demonstrates whether or not the techniques can effectively reduce S, the cumulative function evaluations required for convergence of the adapted FBSM for fixed endpoint problems, without requiring prohibitive tuning.

With the Wegstein method, we only select ω for the two FBSM iterations required for initialization, and specify n, such that we update q every nth iteration. We generate results for n ∈ {1, 2, … , 10}. We also bound q; however identifying suitable bounds is challenging. In this work, we select bounds that perform reasonably, but acknowledge that we do not search for optimal bounds, nor do we think that attempting to do so is realistic. This drawback of Wegstein's method contributes to its inconsistent performance relative to other methods. For the partial Aitken-Steffensen methods, we choose ω, and the parameter m that specifies the dimension of the N + 1 × m matrices in the updating step, requiring m + 1 function evaluations per iteration. We generate results for m ∈ {1, 2, … , 10}. Similarly, for Anderson acceleration we select ω and M, where M determines the maximum number of previous iterations to retain when solving the least-squares problem and performing the updating step. We produce results for M ∈ {1, 2, …, 10}.

For the continuous linear problem, we apply bounds of −2 ≤ q ≤ 0. For the bang-bang linear problem, we leave q unbounded. For both AML problems, we apply bounds −1 ≤ q ≤ 1. We explore the effect of updating q every nth iteration, n ∈ {1, 2, … , 10}. For the continuous linear problem, n = 4 minimizes N , although n ∈ {1, 2, … , 5} all perform well. For the linear bang-bang problem, the Wegstein method converges without bounding on q, and varying n does not affect convergence. The Wegstein method outperforms other acceleration methods for the linear bang-bang problem with N ¼ 9, but does not improve upon N ¼ 8 for the FBSM without acceleration.

For the continuous AML problem, the performance of Wegstein's method is inconsistent. With n = 6 and ω = 0.55, the Wegstein method achieves convergence with N ¼ 26, outperforming the FBSM; however, almost every other combination of tuning parameters considered with ω ∈ [0, 1) and n ∈ {1, 2, …, 10} require larger N than the FBSM without acceleration. Generally, increasing n produces worse outcomes. We do, however, observe that the Wegstein method can induce convergence for ω < 0.4, where the standard FBSM does not converge. For the bang-bang AML problem, the Wegstein method appears more robust; consistently outperforming the standard FBSM across most of the tuning parameter space. The best result requires only N ¼ 9, with ω = 0 and n = 7, although several other combinations of tuning parameters are similarly successful. For ω ≥ 0.4, corresponding to values that the underlying FBSM converges, we find that moderate n ∈ {3, 4, …, 7} produces the best results; while for smaller ω, larger n ∈ {6, 7, … , 10} consistently performs best. Once again we observe that convergence is achieved for ω values where the underlying FBSM would not converge.

For the linear fixed endpoint problem, the adapted FBSM with the Wegstein method consistently generates a moderate reduction in S, compared to the adapted FBSM without acceleration, for all n ∈ {1, 2, … , 10}, −2 ≤ q ≤ 0. For the AML fixed endpoint problem we do not observe improvement. Using the tuning parameters that perform best for the continuous AML problem, we find that S for the adapted FBSM with Wegstein's method is more than double that of the adapted FBSM without acceleration. This results from the inconsistency of Wegstein's method with poor tuning. In §6 of the electronic supplementary material, it can be seen that some control problems within the adapted FBSM that require N % 50 without Wegstein's method, require N % 200 with the specified Wegstein tuning parameters.

Both Aitken and Steffensen methods significantly and consistently outperform the FBSM without acceleration for the continuous linear problem. The Aitken method performs best for m ∈ {1, 2, 3}, requiring N ¼ 12. Steffensen's method performs best when m = 6, requiring only N ¼ 8. In the linear bang-bang case, both Aitken and Steffensen methods perform marginally worse than the FBSM without acceleration, which requires only N ¼ 8. In the best cases, with m = 1 the Aitken Figure 9 . Here, we plot N against ω ∈ [0, 1) in increments of 0.05, for the AML continuous (in red) and bang-bang (in black) control problems, using FBSM with no acceleration. Results correspond to model parameters given in table 1, time-step dt = 4.88 × 10 −4 , over the interval 0 ≤ t ≤ 10. The continuous problem is solved with pay-off weightings of a 1 = 1 for control, and a 2 = 2 for the state variable L(t), while the bang-bang problem is solved with a 1 = 1 and a 2 = 3. Where an ω value does not have a corresponding marker, this indicates that the procedure fails to converge within N ¼ 200.

royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241 method requires N ¼ 10, and with m = 7 the Steffensen method requires N ¼ 9.

For the continuous AML problem, we observe a stark difference between the Aitken and Steffensen methods; while the Steffensen method is able to achieve convergence for values of ω where the underlying FBSM fails to converge, ω ≤ 0.35, particularly for m ∈ {1, 2, 3, 4}, the Aitken method only converges to the optimal control for ω values where the underlying FBSM converges. For ω ≤ 0.35, the Aitken method achieves apparent convergence; the iterative procedure terminates as the convergence criteria are met. However, explicitly calculating the pay-off associated with these controls via equation (3.9) , and comparing this result to the pay-off associated with the control obtained via the standard FBSM, indicates that the controls obtained via the Aitken method for ω ≤ 0.35 are not optimal, as they fail to minimize J. The best result for the Aitken method, with ω = 0.5 and m = 5, requires N ¼ 30, marginally improving on the FBSM without acceleration, requiring N ¼ 38. Steffensen's method produces more significant improvements, requiring only N ¼ 19 with ω = 0.5 and m = 5. In each case, neighbouring combinations of tuning parameters also yield equivalent or comparable improvement over the standard FBSM. In the bang-bang AML problem, we observe similar behaviour; for ω values that the underlying FBSM fails to converge, the Steffensen method consistently converges. The Aitken method achieves apparent convergence for these values of ω; the iterative procedure terminates as the convergence criteria are met; however, the resulting controls contain intermediate values between the lower and upper bounds. As such the resulting controls are not bang-bang, so we treat these results as failing to converge. At best, Aitken's method requires N ¼ 8, with ω = 0.5 and m = 1, while Steffensen's requires only N ¼ 7, with ω = 0.5 and m = 5. The vast majority of tuning parameter combinations yield improvements over the N ¼ 34 of the standard FBSM.

Aitken and Steffensen methods consistently offer significant improvement over the standard adapted FBSM for the linear fixed endpoint problem for m ∈ {1, 2, …, 10}, with the exception of m = 1 for the Steffensen method, which yields only marginal improvement. Using the best performing tuning parameters for the continuous AML problem, we find that both Aitken and Steffensen methods improve upon the standard adapted FBSM for the AML fixed endpoint problem. Relative to S ¼ 434 required without acceleration, the S ¼ 360 required with Aitken's method reflects a modest improvement, while the S ¼ 238 required with the Steffensen method is a significant improvement.

Anderson acceleration performs exceptionally well on the continuous linear problem, requiring only N ¼ 7 for M ∈ {4, 5, … , 10} compared to N ¼ 57 for the standard FBSM. For the linear bang-bang problem, however, it is the worst performing acceleration method; achieving at best N ¼ 11, with M = 1.

Similarly to the Wegstein and Steffensen methods, Anderson acceleration achieves convergence in both the continuous and bang-bang AML problems for ω values where the underlying FBSM fails to converge. Anderson acceleration achieves the best individual result for the continuous AML problem, requiring only N ¼ 17, with ω = 0.85 and M = 6. Again, we observe comparable improvement over a wide range of tuning parameters. For the bang-bang AML problem Anderson acceleration consistently outperforms FBSM without acceleration, particularly for ω < 0.7, at best requiring N ¼ 17, with ω = 0.35 and M ∈ {7, 8, 9, 10} , with other nonneighbouring tuning parameter combinations also yielding N ¼ 17.

For both the linear and AML fixed endpoint problems Anderson acceleration produces the most significant reduction in S, and improves upon the adapted FBSM over a wide range of tuning parameters. In the linear case, Anderson acceleration requires only S ¼ 24 for M ∈ {4, 5, … , 10}. In the AML fixed endpoint problem, using the tuning parameters that perform best for the continuous AML problem, Anderson acceleration converges in only S ¼ 204; less than half as many as the standard adapted FBSM.

Results presented in figures 10 and 11 provide comparison of the error, ε, as each method approaches convergence, for the linear and AML problems, respectively. Error is measured as the Euclidean norm of the difference between subsequent controls; ε = | |F(X (k) ) − X (k) | |, with the exception of Aitken's method, where error is measured as the difference between subsequent values in the Aitken series; 1 ¼ kX ðkÞ ÀX ðkÀ1Þ k. Convergence is achieved when ε ≤ 1 × 10 −10 , marked in black dash. In each case, we are plotting the result that minimizes N for each method, over the space of tuning parameters considered, including the best tuning of ω for the FBSM without acceleration. Error is plotted on a logarithmic scale. For the linear bang-bang problem with the Wegstein and Anderson methods, and the AML bang-bang problem with the Wegstein method, the error after the final iteration is ε = 0, as two subsequent iterates for the control are identical. This is represented on the logarithmic scale as a line that intersects the horizontal axis.

Modelling processes in systems biology is complex; frequently involving large state systems consisting of several ODEs [88] [89] [90] , including canonical examples such as the mitogenactivated protein kinase cascade [91] , Wnt/β-catenin signalling pathway [92] , early incarnations of whole-cell models [93, 94] , and other cellular signalling, metabolic and regulatory processes and mechanisms [95, 96] . The acceleration methods we implement act only on the control term; the number and form of state equations has no bearing on the mathematical and computational complexity of the acceleration methods. As such, the methods scale excellently with system complexity. In this section, we discuss the results presented in §5, and draw insights into the convergence behaviour of the FBSM when augmented with acceleration techniques. We highlight opportunities for application of these methods, and outline several avenues for further investigation.

In evaluating the performance of each acceleration method, we are interested in: (1) how significantly they are able to reduce N , (2) method robustness and (3) method accessibility. In this context, we use robustness to refer to how consistently the method outperforms the best tuned FBSM over the range of tuning parameters considered. We judge the accessibility of each method based on implementation and conceptual complexity. Overall, we find that the acceleration methods, particularly Anderson and Steffensen, significantly and robustly reduce N . Anderson acceleration appears most effective for continuous control, while the Steffensen method appears best for bangbang control. The Aitken method occasionally outperforms Steffensen, but overwhelmingly the Steffensen method appears to be the better option of the two for the range of parameters we consider. Implementing the Anderson and Steffensen methods introduces challenge beyond that of the underlying FBSM, although it is not prohibitively difficult; particularly with reference to the code where we implement these methods, that we make available on GitHub (https://github.com/Jesse-Sharp/ Sharp2021). Both methods introduce conceptual complexity, perhaps marginally less so for the Steffensen method due to the similarities it shares with the familiar Newton's method.

We produce heatmaps to visualize the convergence behaviour of the acceleration methods across the range of tuning parameters considered. Figure 12 corresponds to the AML continuous control problem, while figure 13 corresponds to the AML bang-bang control problem. Recall that with the tuning of ω that minimizes N , the FBSM with no acceleration requires N ¼ 38 for the AML continuous control problem, and N ¼ 34 for the AML bang-bang control problem. Tuning parameter combinations that reflect a reduction in N relative to the these FBSM results are shaded in the green spectrum, while In identifying tuning parameter combinations that yield significant reductions in N , we are looking for bright green areas in the heatmaps. We assess the robustness of each method by considering whether we observe large contiguous areas in the green spectrum, such as in figure 12c, indicating robustness, or patchy areas with both green spectrum and red spectrum, such as figure 13d, suggesting a lack of robustness.

In table 2, we provide our subjective but informed rating of the methods against the criteria of reduction in N , robustness and accessibility. We consider the continuous and bang-bang control cases separately in terms of reduction in N and robustness.

Despite its conceptual simplicity and straightforward implementation, Wegstein's method is significantly hampered by the difficulty in choosing bounds. If there were a more informed approach for identifying suitable bounds, Wegstein's method could be particularly useful for bang-bang control problems. Due to the effect of ω, intermediate control iterates of the FBSM do not appear bang-bang; as such the bulk of N are incurred in refining the control about the switching points. Wegstein's method can accelerate this refinement by adaptively setting q i = 0 where appropriate.

As outlined in §5, the linear model control problems converge with ω = 0. It may at first seem counterintuitive that Wegstein's method can improve upon this, given that the computed q in Wegstein's method acts as a stand-in for ω. There are two aspects of distinction that enable Wegstein's method to generate improvement in this case: first, while ω is held constant both within the time discretization and between iterations, the element-wise nature of Wegstein's method enables each element of the discretization to have a different value, q i , i ∈ {0, 1, … , N}, and q can be updated between iterations; second, observing the values of q i in Wegstein's method indicates that q < 0 can be appropriate. This suggests that ω < 0 could also be used to accelerate the standard implementation of the FBSM. Preliminary investigation suggests that this is true for the linear model; Figure 12 . This heatmap provides insight into the convergence behaviour of the acceleration methods for the AML continuous control problem. Here, we visualize N against ω and the method specific tuning parameter: n for Wegstein (a), m for partial Aitken (b) and partial Steffensen (c), and M for Anderson acceleration (d ). Tuning parameter combinations requiring N ¼ 38, equivalent to the best tuned FBSM without acceleration, are shaded yellow. Colours in the green-yellow spectrum represent a reduction in N relative to FBSM without acceleration, while colours in the yellow-red spectrum represent an increase in N .

however, we do not pursue this further as we expect it to be of limited applicability beyond contrived problems. We apply the acceleration methods to small nonlinear test systems in §5 of the electronic supplementary material. We know these systems have multiple fixed points; all methods we consider aside from Aitken's method, in some of our examples, reach different fixed points to fixed point iteration. By contrast, when applied to accelerate control problems, we observe only the Aitken method converging to a result other than the optimal control obtained via the FBSM, as discussed in §5. This apparent convergence of the Aitken method to controls that are not optimal is a significant deterrent to Figure 13 . This heatmap provides insight into the convergence behaviour of the acceleration methods for the AML bang-bang control problem. Here, we visualize N against ω and the method specific tuning parameter: n for Wegstein (a), m for partial Aitken (b) and partial Steffensen (c), M for Anderson acceleration (d ). Tuning parameter combinations requiring N ¼ 34, equivalent to the best tuned FBSM without acceleration, are shaded yellow. Colours in the green-yellow spectrum represent a reduction in N relative to FBSM without acceleration, while colours in the yellow-red spectrum represent an increase in N . Table 2 . Method comparison. We rate the methods considered in this work against key factors such as the reduction in N that they deliver and how robustly they perform over the range of tuning parameters considered, for both continuous (Cts) and bang-bang (BB) control problems. We also consider how accessible the methods are from the standpoints of ease of implementation (Imp) and conceptual complexity. Methods are rated as being either strongly positive (✓✓), positive (✓), neutral (∼), negative (✗) against each aspect.

reduction in N robustness accessibility method Cts BB Cts BB Imp complexity

royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20210241 using the Aitken method in situations where the optimal control is not known a priori. Outside of this issue with the Aitken method, each acceleration method produces the same optimal control for a given problem. However, they each approach the converged control differently. In figure 14 , we plot the control as it converges for the FBSM and acceleration methods. In the code, we provide on GitHub (https://github. com/Jesse-Sharp/Sharp2021), users can view the control iterates of each method as they approach convergence. Visualizing these methods as they converge gives insight into how they may be able to arrive at different fixed points; under certain circumstances the accelerated series of iterates may leave the basin of attraction for the fixed point found via fixed point iteration.

In this work, we review the theory and implementation of the FBSM for solving TPBVPs that arise from application of PMP in solving optimal control problems. We study a single-variable linear model and a multiple-variable nonlinear model and consider continuous, bang-bang and fixed endpoint control problems. Conceptualizing the FBSM as a fixed point iteration, we leverage and adapt existing acceleration methods to significantly and robustly increase the convergence rate of the FBSM for a range of optimal control problems. The Anderson and partial Steffensen methods appear to perform best, without requiring prohibitive tuning.

Accelerating the convergence of the FBSM, and reducing the importance of appropriately selecting ω for a single control problem, is promising. That said, the real utility of the robust acceleration methods in this work is in application to families of control problems. We provide a glimpse of this benefit through considering fixed endpoint control problems, though other excellent opportunities for application arise due to the uncertainty prevalent in the life sciences. First, it is common for there to be uncertainty around model parameters and structure [97, 98] . In this case, solving optimal control problems over several model structures and sets of model parameters provides insight into the sensitivity of the control strategy [99] [100] [101] [102] . Secondly, when performing multi-objective optimization, a trade-off is made between objectives. For example, in equation (3.14), we seek to minimize the cumulative negative impact of leukaemia and of the control; parameters a 1 and a 2 weight the relative important of each objective. In practical applications, it is not always clear how to determine these weightings. It can therefore be useful to generate a family of optimal controls that are each optimal for their specific combination of pay-off weighting parameters, akin to a Pareto frontier [49, 103, 104] . Producing these sets of control results benefits significantly from acceleration techniques such as the Anderson and Steffensen methods, where a consistent reduction in N is obtained without optimal tuning.

In this work, multi-objective optimization is considered in the form of a control problem with a single cost function comprising a scalar combination of state and control terms. More Results for the Aitken method are not shown as they are visually similar to the Steffensen result. We present the results corresponding to the tuning parameters that minimize N , outlined in §5. Where it is visually distinguishable, we indicate the number of function evaluations corresponding to a particular iterate. While all methods produce the same eventual result for u*, they follow considerably different series of iterates. Note that the vertical scale in (a) and (b) differs from that of (c) and (d ).

generally, multi-objective optimization can be formulated as a control problem with a vector-valued cost function, with the goal of minimizing each component simultaneously. There are a range of strategies for handling multi-objective optimal control problems formulated in this way, and we direct readers to [105] for a recent and extensive survey. Here, we have only considered systems subject to a single control. While this is reflective of the vast majority of applications featured in the control literature, there are instances where we are interested in applying multiple controls simultaneously [13, 14, 51] . The FBSM can be readily applied to solve problems with multiple controls [51] ; a logical extension of this work is to adapt the acceleration methods or identify suitable alternative methods for accelerating convergence of the FBSM for problems with multiple controls.

Over a range of tuning parameters the Wegstein, Steffensen and Anderson methods are able to induce convergence where the underlying FBSM fails to converge; such as in the AML control problems with ω = 0. This behaviour has been documented for Anderson acceleration [87] and Wegstein's method [67] when applied to standard fixed point iteration problems. This presents an opportunity for future exploration, in identifying control problems that cannot be solved via the FBSM for any ω, and attempting to produce solutions using these acceleration techniques.

The examples we consider in this work include a variety of control problem formulations that arise in systems biology. However, it is worth noting that the examples are not exhaustive. Further challenges can be introduced; either through the formulation of the control problem, or as a result of the behaviour of the underlying system. Examples of such challenges include control problems with singular arcs, path constraints, multiple local solutions, discontinuous dynamics and sensitivity to the initial guess of the control [35] . These challenges can introduce numerical difficulties, and complications in terms of the optimal control theory; for example, control problems with singular arcs typically require additional necessary conditions for optimality beyond those obtained from the PMP [38] . A thorough assessment of the appropriateness of the FBSM as a method for solving control problems with such complications is an avenue for further investigation. We stress that the acceleration techniques that we develop and survey in this work are able to accelerate convergence when compared to a naive FBSM implementation, and in some cases induce convergence where the naive FBSM fails to converge. We anticipate that these trends will persist if these acceleration techniques are applied to appropriately conceived implementations of the FBSM for the various complications outlined here.

Data accessibility. This article has no additional data. Competing interests. We declare we have no competing interests. 

2020 Optimal control design of impulsive SQEIAR epidemic models with application to COVID-19

Modeling optimal intervention strategies for cholera

Polytherapeutic strategies with oncolytic virus-bortezomib and adjuvant NK cells in cancer treatment

Randomised comparison of addition of autologous bone-marrow transplantation to intensive chemotherapy for acute myeloid leukaemia in first remission: results of MRC AML 10 trial

Optimal control of acute myeloid leukaemia

Rapid predictive simulations with complex musculoskeletal models suggest that diverse healthy and pathological human gaits can emerge from similar control strategies

Task-dependent recruitment across ankle extensor muscles and between mechanical demands is driven by the metabolic cost of muscle contraction

Task-level regulation enhances global stability of the simplest dynamic walker

Control theory meets synthetic biology

Synthetic biology and regulatory networks: where metabolic systems biology meets control engineering

Optimising cell aggregate expansion in a perfused hollow fibre bioreactor via mathematical modelling

A strategy to determine operating parameters in tissue engineering hollow fibre bioreactors

Handling overheads: optimal multi-method invasive species control

Applying optimal control theory to a spatial simulation model of sudden oak death: ongoing surveillance protects tanoak while conserving biodiversity

Optimal and robust control of invasive alien species spreading in homogeneous landscapes

The mathematical theory of optimal processes

Dynamic programming

A history of the calculus of variations from the 17th through to the 19th century

Optimal control-1950 to 1985

A survey of numerical methods for optimal control

An historical survey of computational methods in optimal control

Neural approximations for optimal control and decision

Learning-supported approximated control for autonomous vehicles in the presence of state dependent uncertainties

Deep reinforcement learning for the control of microbial cocultures in bioreactors

Fractional optimal control dynamics of coronavirus model with Mittag-Leffler law

Optimal control for a fractional tuberculosis infection model including the impact of diabetes and resistant strains

On the fractional optimal control problems with a general derivative operator

A new and efficient numerical method for the fractional modeling and optimal control of diabetes and tuberculosis co-existence

In press. A hybrid fractional optimal control for a novel Coronavirus (2019-nCov) mathematical model

An optimization technique for solving a class of nonlinear fractional optimal control problems: application in cancer treatment

2020 Optimal control for cancer treatment mathematical model using Atangana-Baleanu-Caputo fractional derivative

Optimality and identification of dynamic models in systems biology: an inverse optimal control framework

Sequential activation of metabolic pathways: a dynamic optimization approach

Using optimal control to understand complex metabolic pathways

Deciphering the regulation of metabolism with dynamic optimization: an overview of recent advances

2021 Persistence as an optimal hedging strategy

Optimal control applied to biological models

Matrix iterative analysis

A method for finding the optimum successive over-relaxation parameter

The determination of the optimum accelerating factor for successive over-relaxation

A practical technique for the determination of the optimum relaxation factor of the successive over-relaxation method

Successive overrelaxation (SOR) and related methods

Iterative solutions of large linear systems

A numerical method for solving parabolic equations with opposite orientations

Convergence of the forward-backward sweep method in optimal control

Maximum principle based algorithms for Deep Learning

Symplectic Runge-Kutta discretization of a regularized forward-backward sweep iteration for optimal control problems

Robust multi-objective optimal control of uncertain (bio)chemical processes

Optimal control theory-applications to management science and economics

Designing combination therapies using multiple optimal controls

Feedback mechanisms control coexistence in a stem cell model of acute myeloid leukaemia

Numerical recipes: the art of scientific computing

Some applications of continuous Runge-Kutta methods

Natural continuous extensions of Runge-Kutta methods

Ordinary differential equations and linear algebra: a systems approach

Applied optimal control: optimization, estimation, and control

Chemotherapy-resistant human AML stem cells home to and engraft within the bone-marrow endosteal region

In vivo imaging of specialized bone marrow endothelial microdomains for tumour engraftment

Population dynamics of normal and leukaemia stem cells in the haematopoietic stem cell niche show distinct regimes where leukaemia will be controlled

The genesis and early developments of Aitken's process, Shanks' transformation, the ε-algorithm, and related fixed point methods

Iterative residual-based vector methods to accelerate fixed point iterations

Numerical analysis

Elements of numerical analysis

A class of methods for solving nonlinear simultaneous equations

Accelerating convergence of iterative processes

An iterative method of Wegstein for solving simultaneous nonlinear equations

A tool for assessing the scalability of pressure-retarded osmosis (PRO) membranes

Dyssol-an open-source flowsheet simulation framework for royalsocietypublishing.org/journal/rsif

Convergence promotion in the simulation of chemical processes with recycle-the dominant eigenvalue method

Acceleration algorithms for process design simulations

Computer aided chemical process design: the FLOWTRAN system

Convergence promotion in the simulation of chemical processes-the general dominant eigenvalue method

On Bernoulli's numerical solution of algebraic equations

Remarks on iteration

Classical and modern numerical analysis: theory, methods and practice

Aitken's and Steffensen's accelerations in several variables

The condition of Steffensen's acceleration in several variables

The Steffensen iteration method for systems of nonlinear equations

The Aitken-Steffensen formula for systems of nonlinear equations

The Steffensen iteration method for systems of nonlinear equations

A generalized inverse for matrices

Iterative procedures for nonlinear integral equations

Comments on 'Anderson Acceleration, Mixing and Extrapolation

A comparative study on methods for convergence acceleration of iterative vector sequences

Two classes of multisecant methods for nonlinear acceleration

Anderson acceleration for fixed-point iterations

Large-scale networks in engineering and life sciences

Using constraints and their value for optimization of large ODE systems

Methods of model reduction for large-scale biological systems: a survey of current methods and trends

Ultrasensitivity in the mitogen-activated protein kinase cascade

A multiple timescale analysis of a mathematical model of the Wnt/β-catenin signalling pathway

Incorporating genome-wide DNA sequence information into a dynamic whole-cell model of Escherichia coli: application to DNA replication

A whole-cell computational model predicts phenotype from genotype

Dynamic analysis of integrated signaling, metabolic and regulatory networks

Modeling regulatory mechanisms in IL-6 signal transduction in hepatocytes

Systems interface biology

Universally sloppy parameter sensitivities in systems biology models

Optimal control of a vectored plant disease model for a crop with continuous replanting

Optimal control of plant disease epidemics with clean seed usage

Optimal control problems of epidemic systems with parameter uncertainties: application to a malaria two-age-classes transmission model with asymptomatic carriers

Optimal control approaches for combining medicines and mosquito control in tackling dengue

Multi-objective Paretooptimal control: an application to wastewater management

Design search and optimization in aerospace engineering

A survey of recent trends in multiobjective optimal control-surrogate models, feedback control and objective reduction

Acknowledgements. We thank two anonymous referees for their helpful comments.