ODE-based models
Often, an explicit analytical expression for a given mathematical model is not known. Instead, the model might be defined implicitly, e.g. as the solution to a system of ordinary differential equations. Especially in fields such as systems biology, modeling in terms of (various kinds of) differential equations appears to be the norm.
As a toy example, we will consider the well-known "SIR model" in the following, which groups a population into susceptible, infected and recovered subpopulations and assumes mass action kinetics with constant transmission and recovery rates to describe the growths and decays of the respective populations.
While the DifferentialEquations.jl ecosystem offers many different ways of specifying such systems, we will use the syntax introduced by ModelingToolkit.jl since it is particularly convenient in this case.
using InformationGeometry, ModelingToolkit, Plots
@parameters t β γ
@variables S(t) I(t) R(t)
Dt = Differential(t)
SIReqs = [ Dt(S) ~ -β * I * S,
Dt(I) ~ +β * I * S - γ * I,
Dt(R) ~ +γ * I]
SIRstates = [S, I, R]; SIRparams = [β, γ]
@named SIRsys = ODESystem(SIReqs, t, SIRstates, SIRparams)
Here, the parameter β
denotes the transmission rate of the disease and γ
is the recovery rate. Note that in the symbolic scheme of ModelingToolkit.jl, the equal sign is represented via ~
.
An infection dataset which is well-known in the literature is taken from an influenza outbreak at a English boarding school in 1978. Its numerical values can be found e.g. in table 1 of this paper. As no uncertainties associated with the number of infections is given, we will assume the $1\sigma$ uncertainties to be $\pm 15$ as a reasonable value. Further, it is known that the total number of students at said boarding school was $763$ and we will therefore assume the initial conditions to be
SIRinitial = [762, 1, 0.]
for the respective susceptible, infected and recovered subpopulations on day zero. Next, the DataSet
object is constructed as:
days = collect(1:14)
infected = [3, 8, 28, 75, 221, 291, 255, 235, 190, 126, 70, 28, 12, 5]
SIRDS = DataSet(days, infected, 15ones(14); xnames=["Days"], ynames=["Infected"])
DataSet with N=14, xdim=1 and ydim=1:
x-data [Days] : [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0, 12.0, 13.0, 14.0]
y-data [Infected] : [3.0, 8.0, 28.0, 75.0, 221.0, 291.0, 255.0, 235.0, 190.0, 126.0, 70.0, 28.0, 12.0, 5.0]
Standard deviation associated with y-data:
[15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0, 15.0]
Finally, the DataModel
associated with the SIR model and the given data is constructed by
SIRobservables = [2]
SIRDM = DataModel(SIRDS, SIRsys, SIRinitial, SIRobservables, [0.001, 0.1]; tol=1e-11)
where SIRobservables
denotes the components of the ODESystem
that have actually been observed in the given dataset (i.e. the second component which are the infected in this case). The optional vector [0.001, 0.1]
is our initial guess for the parameters [β, γ]
for the maximum likelihood estimation and the keyword tol
specifies the desired accuracy of the ODE solver for all model predictions.
Instead of specifying the observable components of an ODE system as an array, it is also possible to provide an arbitrary observation function with argument signature f(u)
, f(u,t)
or f(u,t,θ)
. Similarly, (parts of) the initial conditions for the ODE system can be included as parameters of the problem and estimated from data by providing a splitter function of the form θ -> (u0, p)
. The first entry of the returned tuple will be used as the initial condition for the ODE system and the second argument enters into the ODEFunction
itself.
In this particular example, one might include the initial number of infections as a dynamical parameter via the splitter function θ -> ([763.0 - θ[1], θ[1], 0.0], θ[2:3])
.
It is now possible to compute properties of this DataModel
such as confidence regions, confidence bands, geodesics, profile likelihoods, curvature tensors and so on as with any other model.
sols = ConfidenceRegions(SIRDM, 1:2)
VisualizeSols(SIRDM, sols)
While it visually appears as though the confidence regions are perfectly ellipsoidal and the model would therefore be linearly dependent on its parameters β
and γ
, this is of course not the case. The non-linearity with respect to the parameters becomes much more apparent further away from the MLE, as one can confirm e.g. via radial geodesics emanating from the MLE or the profile likelihood.
Finally the exact confidence bands associated with the computed confidence regions are obtained as:
plot(SIRDM; Confnum=0)
ConfidenceBands(SIRDM, sols[2])
plot(SIRDM; Confnum=0) # hide
M = ConfidenceBands(SIRDM, sols[2]; plot=false) # hide
InformationGeometry.PlotConfidenceBands(SIRDM, M; label=["$(round(InformationGeometry.GetConfnum(SIRDM,sols[2])))σ Conf. Band" nothing]) # hide