PETSc pointwise functions and PDE solvers#
As we saw in [the Finite Element Pages], the finite element method provides a very general way to approach the numerical solution of a very wide variety of problems in continuum mechanics using a standardised, matrix-based formulation and with considerable flexibility in the choice of discretisation, mesh geometry, and the ability to deal very naturally with jumps in material properties.
However, the FEM is based upon a variational, “weak” form of the governing equations of any problem while most publications outline the strong form of any problem and this can make it difficult for anyone without a strong mathematical background to translate quickly from publication to a new model.
PETSc provides a mechanism to automatically generate a finite element weak form from the strong (point-wise) form of the governing equations. This takes the form of a template equation and, in the case of non-linear problems, a template for the corresponding Jacobian derivatives.
The PETSc strong-form interface asks the user to provide pre-compiled functions with a pre-defined pattern of arguments relating known nodal-point variables, shape-functions and derivatives at any point in the domain. If the strong form is written
Where the \(f_0\) term, generally speaking, represents the forces and \(f_1\) comprises flux terms. Then a corresponding weak form is
The discrete form of this equation has some obvious similarities to the standard finite element matrix form except that the functions \(f_0\) and \(\mathbf{f}_1\) are not directly expressed in terms of the basis function matrices:
where \(q\) represents a set of quadrature points, \(W\) is a matrix of weights and \(B\), \(D\) are the usual basis function matrices but here evaluated at the quadrature points. \(\epsilon\) restricts the terms to the element. See [KBRS13] for details.
The user must provide a compiled representation of the terms \(f_0\) and \(\mathbf{f}_1\) but it is also necessary to provide the corresponding compiled representations of the Jacobians which satisfy
This formulation greatly simplifies writing optimal code as the user sets up a problem without touching the bulk of the implementation. The only difficulty is ensuring that the Jacobians are derived consistently and correctly. As the Jacobians need to be calculated and implemented as separate functions this introduces a potential point of failure.
In underworld
, we provide a fully symbolic approach to specifying the strong form terms, \(f_0\) and \(\mathbf{f}_1\) using the sympy
symbolic algebra python package. sympy
is also able to differentiate
\(f_0\) and \(\mathbf{f}_1\) to obtain the relevant Jacobian matrices in symbolic form. We also take
advantage of sympy
s automatic code generation capabilities to compile functions that match
the PETSc templates.
This provides a very natural mapping between the formulation above and the sympy
python code.
# X and U are sympy position vector, unknown vector
# that may include mesh variables
U_grad = sympy.derive_by_array(U, X)
F0 = f
F1 = kappa * U_grad
# Jacobians are
J_00 = sympy.derive_by_array(F0, U)
J_01 = sympy.derive_by_array(F0, U_grad)
J_10 = sympy.derive_by_array(F1, U)
J_11 = sympy.derive_by_array(F1, U_grad)
# Pass F0, F1, J_xx to sympy code generation routines
# ...
Note: some re-indexing may be necessary to interface between sympy
and PETSc
especially for vector problems.
Note: underworld provides a translation between mesh variables and their sympy
symbolic representation on the user-facing side that also needs to translate
to PETSc data structures in the compiled code.
Underworld Solver Classes#
We provide 3 base classes to build solvers. These are a scalar SNES solver, a vector SNES solver and a Vector SNES saddle point solver (constrained vector problem). These are bare-bones classes that implement the pointwise function / sympy approach that can then be used to build solvers for many common situations.
A blank slate is a very scary thing and so we provide templates for some common equations and examples to show how these can be extended.
Implementation & Examples#
Poisson Solvers#
(link to another document) Diffusion
Darcy flow
Advection-diffusion (SLCN)
Advection dominated flow#
(link to another document)
Swarm-based problems
Projection / swarm evaluation
Advection-diffusion (Swarm)
Material point methods
Incompressible Stokes#
(link to another document)
Saddle point problems
Stokes, boundary conditions, constraints
Navier-Stokes (Swarm)
Viscoelasticity
Remarks#
The generic solver classes can be used to construct all of the examples above. The equation-system classes that we provide help to provide a template or scaffolding for a less experienced user and they also help to orchestrate cases where multiple solvers come together in a specific order (e.g. the Navier-Stokes case where history variable projections need to be evaluated during the solve).
Creating sub-classes from the equation systems or the generic solvers is an excellent way to build workflows whenever there is a risk of exposing some fragile construction at the user level.
Some of the need for these templates is a result of inconsistencies in the way sympy
treats matrices, vectors and tensor (array) objects. We expect this to change over time.
Example 1 - The Poisson Equation#
The classical form of the scalar Poisson Equation is
Where \(\psi\) is an unknown scalar quantity, \(\alpha\) is a constitutive parameter that relates gradients to fluxes, and \(f\) is a source term.
This equation is obtained by considering the divergence of fluxes needed to balance the sources. For example, in thermal diffusion we identify \(\psi\) with the temperature, \(T\), and the constitutive parameter, \(k\), is a thermal conductivity.
In this form, \(\mathbf{q} = k \nabla T\) is Fourier’s expression of the heat flux in terms of temperature gradients.This form matches the template above if we identify:
and, in fact, this is exactly what we need to specify in the underworld equation system.
solver.L= sympy.derive_by_array(solver.U, solver.X).transpose()
# f0 residual term (weighted integration) - scalar function
solver.F0 = -h
# f1 residual term (integration by parts / gradients)
solver.F1 = k * solver.L
which means the user only needs to supply a mesh, a mesh variable to hold the solution and sympy expressions for \(k\) and \(h\) in order to solve a Poisson equation.
The SNES_Poisson
class is a very lightweight wrapper on
the SNES_Scalar
class which provides a template for the flux
term and very little else.
\(F_0\) and \(F_1\) are inherited as an empty scalar and vector respectively.
These are available in the template for the user to extend the equation as needed.
This notebook compares the generic class and the one with the flux templated.
Example 2 - Projections and Evaluations#
PETSc has a very general concept of discretisation spaces that do not necessarily admit to continuous interpolation to or from arbitrary points. For this reason, a more general concept is to create projections that map between representations of the data. For example, in Finite Elements, fluxes are generally not available at nodal points because shape functions have discontinuous gradients there. To compute fluxes at nodal points, we would establish a projection problem to form a best fitting continous function to the values at points where we can evaluate the fluxes. In addition, sympy functions (including those for fluxes) that contain derivatives of finite element variables can not be evaluated numerically by sympy but can be evaluated as compiled functions in the context of a solver.
We write these evaluations using the Projection
solver classes. This is
the simplest of the solvers and we are only discussing it second because it
is almost too simple to be instructive (and because the weak form of this
equation is the natural one to work with).
We would like to solve for a continuous, nodal point solution \(u\) that satisfies as best possible,
where \(\tilde{u}\) is a function with unknown continuity that we are able to evaluate at integration points in the mesh.
The generic solver specification in underworld looks like this
# f0 residual term (weighted integration) - scalar function
solver.F0 = solver.u.fn - user_uw_function
# f1 residual term (integration by parts / gradients)
solver.F1 = 0.0
where user_uw_function
is some sympy expression in terms of spatial
coordinates that can include mesh or swarm variables. solver.u.fn
is the
mesh variable (in function form) where the solution will reside.
In principle we could add a smoothing term via solver.F1
and, importantly,
we can also add boundary conditions or constraints that need to be satisfied in
addition to fitting the integration point values.
We provide projection operators for scalar fields, vector fields and solenoidal vector fields (ensuring that the projection remains divergence free). These provide templates for the \(F_0\) and \(F_1\) terms with generic smoothing. For an explanation of the divergence free projection methodology, see the next example on the incompressible Stokes problem.
This notebook has an example of each of these cases.
Example 3 - Incompressible Stokes Equation#
The incompressible Stokes Equation is an example of a problem with an additional constraint equation that must be satisfied by the solution. Variational formulations (the weak form of classical finite elements is one) naturally lend themselves to the addition of multiple constraints into the functional to be minimised.
Unfortunately, the incompressiblity constraint needs to be enforced very strongly to obtain reasonable solutions and this can lead to unacceptably ill conditioned systems that are slow or impossible to solve.
Alternatively, we solve a coupled problem in which additional, kinematic, parameters of the constraint term are introduced. This forms a new block system of equations that is a general expression for a large range of constrained problems.
These are often known as saddle point problems which represents the trade-off between satisfying the original equations and the constraint equations. (Saddle point here refers to the curvature of the functional we are optimising) See: M. Benzi, G. Golub and J. Liesen, Numerical solution of saddle point problems, Acta Numerica 14 (2005), pp. 1–137. for a general discussion.
The coupled equation system we want to solve is
with the constraint
The saddle-point solver requires us to specify both of these equations and to provide two solution vectors \(\mathbf{u}\) and \(\mathbf{p}\). In this system, \(\mathbf{p}\) is the parameter that enforces the incompressiblity constraint equation and is physically identifiable as a pressure.
# definitions
U_grad = sympy.derive_by_array(solver.U, solver.X)
strainrate = (sympy.Matrix(U_grad) + sympy.Matrix(U_grad).T)/2
stress = 2*solver.viscosity*solver.strainrate
# set up equation terms
# u f0 residual term (weighted integration) - vector function
solver.UF0 = - solver.bodyforce
# u f1 residual term (integration by parts / gradients) - tensor (sympy.array) term
solver.UF1 = stress
# p f0 residual term (these are the constraints) - vector function
solver.PF0 = sympy.vector.divergence(solver.U)
In underworld
, the SNES_Stokes
solver class is responsible for managing the
user interface to the saddle point system for incompressible Stokes flow.
Example 4 - Advection in the absence of diffusion#
The pure transport equation can be written in Lagrangian
or in Eulerian form
In the Lagrangian form, there is nothing to solve, provided the fluid-transported reference frame is available. In the Eulerian form, the non-linear advection term \(\mathbf{u} \cdot \nabla \psi\) is reknowned for being difficult to solve, especially in the pure-transport form.
Underworld provides discrete Lagrangian swarm
variables [ CROSSREF ] that make it straightforward to work with transported quantities
on a collection of moving sample points that we normally refer to as particles.
Behind the scenes, there is complexity in 1) following the Lagrangian reference frame accurately,
2) mapping the fluid-deformed reference frame to the stationary mesh, and 3) for
parallel meshes, migrating particles (and their data) across the decomposed domain.
The swarm
that manages the variables is able to update the locations of the particles
when provided with a velocity vector field and a time increment and will handle the
particle re-distribution in the process.
Each variable on a swarm has a corresponding mesh variable (a proxy variable) that is automatically updated when the particle locations change. The proxy variable is computed through a projection (see above).
Note: If specific boundary conditions need to be applied, it is necessary for the user to define their own projection operator, apply the boundary conditions, and solve when needed. (Feature request: allow user control over the projection, including boundary conditions / constraints, so that this is not part of the user’s responsibility)
Example 5 - The Scalar Advection-diffusion Equation#
The situation where a quantity is diffusing through a moving fluid.
where \(\mathbf{u}\) is a (velocity) vector that transports \(\psi\) and \(\alpha\) is a diffusivity. In Lagrangian form (following \(\mathbf{u}\)),
As before, the advection terms are greatly simplified in a Lagrangian reference frame but now we also have diffusion terms and boundary conditions that are easy to solve accurately in an Eulerian mesh but which must also be applied to variables that derive from a Lagrangian swarm (which has no boundary conditions of its own).
Advection-diffusion equations are often dominated by the presence of boundary layers where advection of a quantity (along the direction of flow) is balanced by a diffusive flux in the cross-stream direction. Under these conditions, there is some work to be done to ensure that these two terms are calculated consistently and this is particularly important close to regions where boundary conditions need to be applied.
The approach in underworld
is to provide a solver structure to manage
advection-diffusion problems on behalf of the user. We use
a swarm-variable for tracking the history of the \(\psi\) as it is transported
by \(\mathbf{u}\) and we allow the user to specify (solve for) this flow, and
to update the swarm positions accordingly. The history variable, \(\psi^*\)
is the value of \(\psi\) upstream at an earlier timestep and allows us to
approximate \(D \psi/Dt\) as a finite difference approximation along the
characteristics of the advection operator:
Here, the subscript \(p\) indicates a value at a particle in the Lagrangian swarm.
This approach leads to a very natural problem description in python that corresponds closely to the mathematical formulation, namely:
solver.L = sympy.derive_by_array(solver.U, solver.X).transpose()
solver.Lstar = sympy.derive_by_array(solver.U_star, solver.X).transpose()
# f0 residual term
solver._f0 = -solver.f + (solver.U.fn - solver.U_star.fn) / solver.delta_t
# f1 residual term (backward Euler)
solver._f1 = solver.L * solver.k
## OR
# f1 residual term (Crank-Nicholson)
solver._f1 = 0.5 * (solver.L + solver.Lstar) * solver.k
In the above, the U_star
variable is a projection of the Lagrangian history variable
\(\psi^*_p\) onto the mesh subject to the same boundary conditions as \(\psi\).
In the SNES_AdvectionDiffusion_Swarm
class (which is derived from SNES_Poisson
),
the solve
method solves for U_star
using an in-built projection and boundary
conditions copied from the parent, before calling a standard Poisson solver. This class manages every aspect of the creation, refresh and solution of the necessary
projection subroutines Lagrangian history term, but not the update of this variable or the advection.
Caveat emptor: In the Crank-Nicholson stiffness matrix terms above, we form the derivatives in both the flux and the flux history with the same operator where, strictly, we should transport the derivatives (or form derivatives with respect to the transported coordinate system).