caricato da Utente15976

A-posteriori error analysis of linear finite elements

OUTLINE FOR THE LECTURES ON ADVANCED COMPUTATIONAL TECHNIQUES
A-posteriori error analysis of linear finite elements for a two-point value problem for a
complete 2nd order ordinary differential equation
by
ANTONIO ORLANDO
1
Strong formulation
We consider a two point value problem for a complete 2nd order linear differential equation as model
problem. Let Ω =]0, L[, consider then the following problem.
Data: k ∈ C1 (Ω); β, c and f ∈ C 0 ([0, L])
Find: u ∈ C 2 ([0, L]) such that
ODE: − (k(x)u′ (x))′ + β(x)u′ (x) + c(x)u(x) = f (x)
BCs: u(0) = 0
u(L) = 0 ,
for all x ∈ [0, L]
(BVP)
where ′ denotes the classical derivative. For sake of simplicity only homogeneous essential boundary
conditions are considered. The classical setting for the formulation of the above mathematical model
requires strong assumptions on the data if the boundary value problem BVP is expected to have solutions.
Moreover such assumptions restrict class of physical problems which can be described by BVP. In order
to further enlarge the application areas the mathematical model requires some generalizations.
2
Weak formulation
One possible generalisation of BVP is obtained by interpreting its derivatives in the weak sense and
assuming the appropriate smoothness of the problem data. Assume then f ∈ L2 (Ω), k, c ∈ L∞ (Ω) and
β ∈ H 1 (Ω) with
κ1 ≥ k(x) ≥ κ0 > 0 and c(x) −
β ′ (x)
≥0
2
a.e. x ∈ Ω̄ .
(2.1)
In (2.1), κ0 , κ1 are constants, that is, they do not depend on x. Condition (2.1)1 is referred to as uniform
ellipticity wheraes (2.1)2 is called Garding inequality.
By using test functions v meeting homogeneous boundary conditions on the part of ∂Ω where essential
boundary conditions are assigned (in our case this is all ∂Ω), the weak formulation corresponding to BVP
reads as follows:
Find: u ∈ H01 (Ω) such that
Z L
a(u, v) =
[k(x)u′ (x)v ′ (x) + β(x)u′ (x)v(x) + c(x)u(x)v(x)] dx =
0
=
Z
L
0
f (x)v(x) dx = (f, v)L2 (Ω) for all v ∈ H01 (Ω)
(WBVP)
Remark 2.1. If c = 0 then a = a(u, v) is symmetric.
The Sobolev space V = H01 (Ω) used in stating the WBVP is a Hilbert space. The assumptions (2.1) allow
one to conclude that the bilinear form a(u, v) and the linear functional ℓ = ℓ(v) = (f, v)L2 (Ω) satisfy the
hypotheses of the Lax-Milgram theorem. Hence, (WBVP) has solution and is unique. One obtains also
the following stability result. There exists a constant C > 0 such that there holds
kukH 1 (Ω) ≤ Ckf kL2 (Ω)
The value of the constant is important from the numerical point of view as it reflects the sensitivity of the
solution to data changes.
For a given u ∈ V (= H01 (Ω)),
ru (v) := (f, v)L2 (Ω) − a(u, v)
1
for all v ∈ V
March 15, 2013 7:22
(2.2)
denotes a linear functional defined over V which is continuous. Hence ru is an element of the dual of V 1 .
The functional ru is referred to as the residual functional associated with u. As result, WBVP may be also
expressed as the problem of finding u ∈ V such that its corresponding residual ru (v) is the zero element
of V ′ .
Remark 2.2. In mechanics, the residual functional expresses the difference between the internal and
external virtual power with the space of the test functions as space of the virtual velocities.
3
Discrete Problem
The difficulty in trying to solve WBVP arises in the infinite dimension of the space V over which the
functional ru (v) is defined (which is referred to as the space of the test or weighted functions), and in the
infinite dimension of the space V in which the solution u is sought for (which is referred to as the space
of the trial functions). The replacement of the space V with one of its finite dimensional subspaces Vh is
the underlying idea of the Galerkin methods. If Vh denotes a finite dimensional subspace of V we define
the discrete problem as follows
Find: uh ∈ Vh ⊂ V such that
a(uh , v) = (f, v)L2 (Ω) for all v ∈ Vh
3.1
(GWBVP)
Galerkin orthogonality
The comparison of the exact solution u of WBVP with the solution uh of GWBVP gives, for any v ∈ Vh ⊂ V
a(u − uh , v) = a(u, v) − a(uh , v) = (f, v) − a(uh , v) = 0
(3.1)
that is, the Galerkin approximation uh is such that the corresponding residual ruh (v) defined over V is
equal to zero only on Vh ⊂ V , and consequently uh is, in general, different from u, the exact solution of
WBVP. The element of V given by e = u − uh is referred to as the discretization error.
3.2
Error representation equation
The discretization error e is such that, for any v ∈ V
a(e, v) = a(u − uh , v) = a(u, v) − a(uh , v) = (f, v) − a(uh , v)
and recalling the definition (2.2), it follows
a(e, v) = (f, v) − a(uh , v) = ruh (v) for all v ∈ V
(3.2)
that is, the discretization error e is a solution of the following weak formulation
Find: ψ ∈ V such that
a(ψ, v) = ruh (v) for all v ∈ V
(ERWP)
Since this problem has solution and is unique, then the discretization error is the unique solution of the
problem ERWP. This property, along with (3.1) will be the point of departure for the error analysis of
the finite element method.
4
Considerations on the convergence of Galerkin approximations
The concept of closeness of the approximate solution uh to the exact solution u may be made precise if it
is related to the one of convergence. In the Galerkin method, for each choice of the discrete space Vh ⊂ V ,
an element uh of the space V is obtained. Therefore, any convergence consideration requires first a family
of discrete solutions to be determined (i.e. the criterion of choice of the space Vh to be given), and then,
1 Given a vector normed space V , the dual of V , denoted by V ′ , is the space of all linear continuous functionals ℓ defined
over V . For a linear functional ℓ : V → R defined over a normed space V , continuity is equivalent to boundness, i.e.
∃M > 0 : |ℓ(x)| ≤ M kxkV for all x ∈ V
.
2
March 15, 2013 7:22
the topological structure (necessary to analyse the convergence) to be specified. For this purpose, the
Cea’s lemma plays a fundamental role. It refers to a basic property of the Galerkin approximation uh
expressed by the following inequality
ku − uh kV ≤ C inf ku − vh kV
vh ∈Vh
where
(4.1)
inf ku − vh kV denotes the distance in V of the exact solution u to Vh and C is a constant
vh ∈Vh
independent of Vh given by the ratio of the continuity constant to the coercivity constant of the bilinear
form a = a(u, v). When the bilinear form a = a(u, v) is symmetric, i.e. when β = 0 in WBVP, since
a(u − uh , v) = 0 for all v ∈ Vh , it follows that uh represents the projection onto Vh of the exact solution
u ∈ V with respect to the inner product defined by a(u, v).
Equation (4.1) shows that the natural setting to study the convergence of a family of Galerkin approximations is given by the same topological structure as the space V in which WBVP is formulated. Furthermore,
the concept of convergence of a family of Galerkin approximations may be made precise as well. In fact,
if there exists a family (Vh )h∈]0,1[ with Vh ⊂ V such that the distance of u to the subspace Vh approaches
to zero, that is,
lim inf ku − vh kV = 0
(4.2)
h→0 vh ∈Vh
the family of the corresponding solutions (uh )h∈]0,1[ is convergent to the exact solution u, that is,
lim ku − uh kV = 0
h→0
⇐⇒
lim uh = u in V .
h→0
(4.3)
Proof. Just pass to the limit in (4.1) and account for (4.2).
Given a familty of spaces (Vh )h∈]0, 1[ with the property (4.2), the problem of estimating the distance of u
to the subspace Vh is object of the approximation theory in Sobolev spaces.
The previous considerations regard the convergence of a family of Galerkin approximations (uh )h∈]0,1[ in
the natural norm of V . However, the same family of Galerkin approximations (uh )h∈]0,1[ can be convergent
to the exact solution u even by assuming other metrics to measure the distance of uh to p
u. For instance, if
the bilinear form a = a(u, v) is continuous, elliptic and symmetric, the function k|u|k = a(u, u), referred
to as energy norm, may be used to define a norm on V which is equivalent to its natural norm (see below
next section). Since the natural norm of V , which is the H 1 -norm, is stronger than the L2 -norm, then
there holds
lim uh = u in V ⇒ lim uh = u in L2
h→0
h→0
but not viceversa.
The previous considerations refer to generic finite dimensional spaces Vh ⊂ V . By specifying Vh is then
possible to address, for instance, problems such as the study of the order of convergence of a family of
Galerkin approximations and the evaluation of quantitative bounds for the discretization error. These
bounds express estimates of the error and are generally referred to as error estimates, which will be object
of study of the following sections.
We conclude this section by introducing some notations which will be needed for building and analysing
the error estimates in the subsequent sections.
The finite element space Vh which we will refer to is defined as follows. Let T be a finite partition of
n
Ω =]0, L[ such that Ω = ∪ Ωi where Ωi =]xi−1 , xi [ and for every (i, j) with i, j ∈ (1, 2, . . . , n), Ωi ∩Ωj = ∅.
i=1
Assume x0 = 0 and xn = L and define the mesh size function h = h(x) as h(x) = hi = xi − xi−1 if x ∈ Ωi .
Once the function h(x) has been given, the set Vh of the continuous functions defined over Ω such that v
is a linear polynomial on each subinterval Ωi and v(0) = 0 and v(L) = 0 is uniquely determined2 . It is
trivial to show that Vh is a closed finite dimensional linear subspace of H01 and is therefore a Hilbert space
with the norm of H1 . Because Vh ⊂ V , the finite element is referred to as conforming finite element.
2 It is worth noting that for a given n, specifying the number of elements which the domain Ω is subdivided into, there
is an infinite number of partitions of Ω consisting of n elements, and consequently there is an infinite number of different
spaces Vh corresponding to the same number n of elements
3
March 15, 2013 7:22
5
A simple a-posteriori error estimate for the solution of a linear
system
Let A ∈ Rn×n be a positive definite matrix and f ∈ Rn . Consider then the following linear system
(5.1)
Ax = f
Given any x̃ ∈ Rn , in order to check whether this is solution of (5.1) one should evaluate Ax̃ and check
whether it is equal to f . If that is not the case, then we define the error e as
e = x − x̃
where x is the solution of (5.1). Now, we ask whether it is possible to give an estimate of some norm of
the error. From the definition of e, we soon check that e ∈ Rn is solution of the following linear system
Ae = r
(5.2)
r = f − Ax̃ .
(5.3)
where r is called residual and is defined as
Equation (5.2) shows that if r = 0, then e = 0 (since A is positive definite). Can we then assume a measure
of the size of r as a measure of the size of e?
Well this depends clearly on the matrix as it appears from the following proposition. Let k · kRn denote
the Euclidean norm in Rn induced by the Euclidean inner product in Rn , that is, for x, y ∈ Rn with
x = (xi )i=1,...,n and y = (yi )i=1,...,n we have
q
(5.4)
kxkRn = x21 + . . . + x2n and x · y = x1 y1 + . . . + xn yn .
Then we can state the following.
Proposition 5.1. Let e ∈ Rn be solution of (5.2) and r ∈ Rn be defined by (5.3). Then there holds
kekRn ≤
1
krkRn
λ1
where λ1 is the smallest eigenvalue of A.
Proof. Since A is positive definite, from the property of the Rayleigh quotient we have that
λ1
Ay · y
for all y ∈ Rn ,
kyk2Rn
(5.5)
with λ1 being the smallest eigenvalue of A. Since A is positive definite, then λ1 > 0. From (5.5) for y = e
and using Cauchy-Scwarz inequality it follows
kek2 ≤
1
1
1
Ae · e =
r·e≤
krkkek ,
λ1
λ1
λ1
which proves the statement.
As result of this proposition, we conclude that if λ1 is very small, a measure of the size of only the residual
cannot be an indication on the size of the error.
6
A-posteriori error estimates
Error estimates are usually distinguished as a-priori and a-posteriori. They are called a-priori when
they are based on an a-priori knowledge of the regularity properties of the solution such as its degree
of smoothness. Their main use is to establish convergence properties along with optimal global rate of
convergence. A-posteriori error estimates, on the other hand, express estimates of the error akin to the
current approximation and therefore are much more appealing to finite element practitioners.
As result of the Cea’s lemma, the error analysis of the Galerkin methods for second order elliptic problems
is most easily described in the H 1 -norm. The inequality (4.1) is the starting point in establishing a-priori
4
March 15, 2013 7:22
error estimates and in studying asymptotic rates of convergence of the finite element method. In these
analysis, a-priori error estimates are evaluated as bounds for the interpolation/approximation error of
polynomials in Sobolev spaces.
If the bilinear form a = a(u, v) is symmetric and elliptic,
a(u, v) may be used to define p
an inner product in
p
V with the corresponding norm given by k|u|k = a(u, u) called energy norm. Since a(u, u) is a norm,
one can investigate the convergence in V with the energy norm, that is, looking at the error measured
with the energy norm. If the bilinear form a(u, v) is also continuous, the energy norm is then equivalent
to the H 1 -norm that is, there holds
αkuk2H 1 ≤ a(u, u) ≤ M kuk2H 1
(6.1)
where α and M are the coercivity and continuity constants of the bilinear form, respectively. Hence, any
considerations in H 1 -norm applies to energy norm and viceversa.
If the bilinear form
(6.1) still
p a(u, v) is not symmetric, while it is coercive and continuous, the inequality
3
a(u,
u)
is
no
longer
a
norm
as
the
triangular
inequality
is
not
satisfied
.
As
a result,
holds.
However,
p
a(u, u) should be considered only simply as an estimate of the error in H1 -norm.
Finally, note that since from (6.1) it is
kukH 1 ≤
1p
a(u, u)
α
p
it follows that a(u, u) cannot be considered as a robust error estimator because it fails to be meaningful
for those problems having very small coercivity costant α, i.e. when kmin in (2.1) is very small. The case
kmin very small should deserve special attention as it leads in general to singularly pertubed boundary
value problems where a boundary or interior layer phenomenon emerges. An error estimate is said to be
robust if it performs well in relatively general circumstances. More precise and quantitative definitions
can be found in literature.
6.1
Explicit residual type error estimates in energy norm
Explicit a posteriori residual type error estimates are obtained by providing an explicit estimate of the
norm of the residual associated with the Galerkin approximation uh in the dual space. This residual
functional is constitued by the pointwise residual defined over the interior of each element and the jump
of the finite element solution across the element boundaries. Let
k|e|k2 = a(e, e)
(6.2)
with the convention that (6.2) is a norm if a(u, v) is symmetric. Using (3.2) and noting that the discretization error e ∈ V (for u ∈ V and uh ∈ Vh ⊂ V ), we have then
k|e|k2 = a(e, e) = ruh (e)
(6.3)
where
ruh (v) = (f, v) − B(uh , v) =
Z L
f (x)v(x) dx+
=
−
Z
0
L
0
[k(x)u′h (x)v ′ (x) + β(x)u′h (x)v(x) + c(x)uh (x)v(x)] dx
(6.4)
Since uh is smooth over each subinterval Ωi , the integral which gives a(uh , v) must be then evaluated as
follows
n Z xi
X
[k(x)u′h (x)v ′ (x) + β(x)u′h (x)v(x) + c(x)uh (x)v(x)] dx
a(uh , v) =
i=1
3 The
xi−1
triangular inequality of the norm, that is,
kx + ykV ≤ kxkV + kykV for all x, y ∈ V
is invoked to prove that given the set S of all open sheres, there is a unique topology having S as one of its bases. This
topology is referred to as topology generated by the metric.
5
March 15, 2013 7:22
In order to have the residual corresponding to uh with respect to the strong form of the differential
equation, a formal application of the integration by parts yields
a(uh , v) =
n Z
X
xi
xi−1
i=1
− (k(x)u′h (x))′ v(x) + β(x)u′h (x)v(x) + c(x)uh (x)v(x)] dx+
h
i xi + k(x)u′h (x)v(x)
(6.5)
xi−1
Replacing (6.5) into (6.4) gives
ruh (v) =
n Z
X
xi
xi−1
i=1
{f (x) − [(k(x)u′h (x))′ + β(x)u′h (x)+
h
i xi =
+ c(x)uh (x)]}v(x) dx + k(x)u′h (x)v(x)
xi−1
=
n Z
X
i=1
xi
Ruh (x)v(x)dx +
xi−1
n h
X
i=1
where we have set
Ruh : x ∈
[
i=1,...,n
i xi
k(x)u′h (x)v(x)
xi−1
(6.6)
Ωi → Ruh,i (x)
with
Ruh,i (x) = f (x) − [ − (k(x)u′h (x))′ + β(x)u′h (x) + c(x)uh (x)]
i = 1, . . . , n
(6.7)
the pointwise residual corresponding to uh . Notice that Ruh,i is well defined since uh is smooth over
Ωi =]xi−1 , xi [.
Let uh,j = uh,j (x) denote the restriction of the finite element solution uh over the element Ωj =]xj−1 , xj [,
rearranging (6.6) as follows:
ruh =
n Z
X
i=1
xi
Ruh,i (x)v(x)dx +
xi−1
n−1
X
j=1
d(xj )[u′h,j (x) − u′h,j−1 (x)]v(xj )
(6.8)
shows that ruh is made up of two terms: (i) the pointwise residual Ruh , which is referred to as the regular
part of the residual and is defined over each subdomain Ωi ; and (ii) the term k(xj )[u′h,j (x) − u′h,j−1 (x)]
which is referred to as the singular part of the residual and is defined by the jump of the finite element
solution uh across the element boundaries.
Let Πh e denote the Vh -nodal linear interpolant of e, that is Πh e ∈ C 0 (Ω), Πh e|Ωi ∈ P1 (Ωi ) and Πh e(xi ) =
e(xi ). Since Πh e ∈ Vh ,
ruh (Πh e) = 0 .
Due to the linearity of ruh , it follows
ruh (e) = ruh (e − Πh e)
which combined with (6.3) yields
k|e|k2 = ruh (e − Πh e)
(6.9)
where
ruh (e − Πh e) =
n Z
X
i=1
xi
Ruh (x)(e(x) − Πh e(x)) dx+
xi−1
h
i xi ′
+ k(x)u (x)[e(x) − Πh e(x)]
xi−1
(6.10)
Since in 1-D problems the boundary of each subdomain is formed by only nodes, the term involving the
boundary in ruh vanishes as Πh e(xi ) − e(xi ) = 0. Then, it follows
k|e|k2 = ruh (e − Πh e) =
6
n Z
X
i=1
xi
xi−1
Ruh (x)(e(x) − Πh e(x)) dx
March 15, 2013 7:22
(6.11)
Remark 6.1. Notice that this is a typical 1D result. For Ω ⊂ Rn for n > 1, the singular part of the
residual does not vanish and in some cases this is the dominant term.
Let kmin,i = min {k(x), x ∈ Ωi =]xi−1 , xi [ } and consider the following piecewise constant function
p(x) = pi if x ∈ Ωi with p2i =
h2i
kmin,i
.
We give now the main result of this section.
Proposition 6.1. Denote by uh = uh (x) the continuous piecewise linear finite element solution of
(GWBVP) and by u the exact solution of the (WBVP). Then the following bound on the energy norm of
the error e holds
1/2
n
1 X 2
,
(6.12)
η
k|e|k ≤
π i=1 i
where the error indicators ηi are defined for each element as follows:
Z xi
o2
h2i n
ηi2 =
f (x) − − (k(x)u′h (x))′ + β(x)u′h (x) + c(x)uh (x)
dx .
xi−1 kmin,i
(6.13)
Proof. Rewrite (6.11) as
ruh (e − Πh e) =
=
n Z
X
xi
pi Ruh (x)
i=1 xi−1
L
Z
p(x)Ruh (x)
0
1
[e(x) − Πh e(x)] dx =
pi
1
[e(x) − Πh e(x)] dx .
p(x)
The use of Schwartz’s inequality gives
1
|ruh | ≤ kpRuh kL2 (Ω) k (e − Πh e)kL2 (Ω)
p
where
kpRuh kL2 (Ω) =
=
X
n Z
i=1
xi
xi−1
X
n Z
i=1
X
n Z
xi
xi−1
1/2
p2i Ru2 h (x) dx
=
h2i
kmin,i
1/2
Ru2 h (x) dx
1/2
1
2
[e(x)
−
Π
e(x)]
dx
h
p2
i=1 xi−1 i
1/2
X
Z
n
kmin,i xi
2
[e(x) − Πh e(x)] dx
.
=
h2i
xi−1
i=1
1
k (e − Πh e)kL2 (Ω) =
p
(6.14)
xi
(6.15)
Now in 1D, it is possible to show that for any v ∈ H 1 (Ωi )
kv − Πh vk2L2 (Ωi ) ≤
where
ai (v, v) =
Z
xi
2
2
1
h
h
kv ′ k2L2 (Ωi ) ≤
ai (v, v)
π
π
kmin,i
[k(x)v ′2 (x) + β(x)v ′ (x)v(x) + c(x)v 2 (x)] dx .
xi−1
As a result, using (6.16) with v = e one obtains
Z
1
kmin,i xi
[e(x) − Πh e(x)]2 dx ≤ 2 ai (e, e)
2
hi
π
xi−1
7
March 15, 2013 7:22
(6.16)
which combined with (6.15) yields
1
k (e − Πh e)kL2 (Ω) =
p
≤
X
n
i=1
X
n
i=1
kmin,i
h2i
Z
xi
xi−1
1/2
≤
[e(x) − Πh e(x)] dx
2
1/2
1
a
(e,
e)
.
i
π2
In addition
n
X
ai (e, e) =
n Z
X
i=1
L
i=1
=
Z
xi
[k(x)e′2 (x) + β(x)e′ (x)e(x) + c(x)e2 (x)] dx =
xi−1
[k(x)e′2 (x) + β(x)e′ (x)e(x) + c(x)e2 (x)] dx = a(e, e)
0
hence,
1
1
k (e − Πh e)kL2 (Ω) ≤ [a(e, e)]1/2 .
p
π
In summary, we obtain the following error estimate
1
k|e|k2 = a(e, e) = ruh (e − Πh e) ≤ kpRuh kL2 (Ω) k (e − Πh e)kL2 (Ω)
p
1
≤ kpRuh kL2 (Ω) [a(e, e)]1/2
π
that is,
1
k|e|k ≤
π
X
n Z
xi
h2i n
f (x) − − (k(x)u′h (x))′ + β(x)u′h (x)+
kmin,i
1/2
o2
+ c(x)uh (x)
dx
,
i=1
xi−1
(6.17)
which is (6.12) by accounting for the definition of ηi given by (6.13).
The term on the right hand side of (6.12) is referred to as error estimate and gives an estimate to the global
discretization error e = u − uh . It is composed of locally computable quantities, ηi , named error indicators
of the approximate solution uh , which reflect the contribution of each element to the global error estimate.
Due to the structure of the error estimator obtained, it is suitable to adaptive mesh refinement strategies.
It must, however, be said that in the current literature the term of error indicator is adopted with a wider
meaning to denote simply a mesh quality indicator, i.e. it is meant to give rough information about the
distribution of the error and is sometimes not suitable to evaluate an error estimator. It is also worth
noting the dependence of the estimate on the first power of h which is consistent with the a-priori error
analysis in the enrgy norm. However, one must observe that in deducing this a-posteriori error estimate
we have not required u ∈ H 2 which was invoked for the a-priori error analysis.
We conclude this section by reporting the proof of (6.16). First we need to recall the following result
which can be easily checked.
Lemma 6.1. Let ϕ ∈ H01 (Ωi ), the smallest value for λ such that the following homogeneous two-point
value problem
− ϕ′′ + λϕ = 0
ϕ(xi−1 ) = ϕ(xi ) = 0
has no trivial solution is
λ1 =
and there holds
λ1 ≤
8
(6.18)
π2
h2i
(ϕ′ , ϕ′ )
for all ϕ ∈ H01 (Ωi ) .
(ϕ, ϕ)
March 15, 2013 7:22
(6.19)
Lemma 6.2. Let v ∈ H 1 (Ωi ) and Πh v be the nodal linear interpolant of v with Πh v(xi−1 ) = v(xi−1 ) and
Πh v(xi ) = v(xi ). Then there holds
k(v − Πh v)′ kL2 (Ωi ) ≤ kv ′ kL2 (Ωi ) .
(6.20)
Proof. For any v ∈ H 1 (Ωi ), v − Πh v ∈ H01 (Ωi ). Then we note that
k(v − Πh v)′ k2L2 (Ωi ) = ((v − Πh v)′ , (v − Πh v)′ )L2 = ((v − Πh v)′ , v ′ )L2 − ((v − Πh v)′ , (Πh v)′ )L2
and since
((v−Πh v)′ , (Πh v)′ )L2 =
Z
xi
xi−1
(v−Πh v)′ (Πh v)′ dx = −
Z
xi
xi−1
x i
(v−Πh v)(Πh v)′′ dx+ (v−Πh v)(Πh v)′
=0
xi−1
then we obtain after applying Schwartz inequality
k(v − Πh v)′ k2L2 (Ωi ) = ((v − Πh v)′ , v ′ )L2 ≤ k(v − Πh v)′ kL2 (Ω) kv ′ kL2 (Ω) ,
which yields (6.20).
If we combine (6.19) with (6.20) then we conclude that
hi
1
kv − Πh vkL2 (Ωi ) ≤ √ kv ′ kL2 (Ωi ) = kv ′ kL2 (Ωi ) for any v ∈ H 1 (Ωi ) .
π
λ1
Lemma 6.3. Given the conditions (2.1), there holds
2
2
h
1
h
′ 2
kv kL2 (Ωi ) ≤
ai (v, v)
π
π
kmin,i
where
ai (v, v) =
Z
xi
k(x)v ′2 (x) + β(x)v ′ (x)v(x) + c(x)v 2 (x) dx .
xi−1
Proof. Just observe that
2
2
h
kmin,i ′ ′
h
′ 2
kv kL2 (Ωi ) =
(v , v )L2 (Ωi )
π
π
kmin,i
2
2
h
h
1
1
′
′
(kmin,i v , v )L2 (Ωi ) ≤
ai (v, v) .
=
π
kmin,i
π
kmin,i
6.2
Dual variational formulation for the error
In the previous section we have seen that the error e = u − uh is solution of the weak formulation (ERWP)
which we rewrite for the reader convenience
Find: e ∈ V such that
a(e, v) = ruh (v) = ℓ(v) − a(uh , v) for all v ∈ V .
(6.21)
This formulation is referred to as the primal formulation for the error. If a(u, v) is symmetric, which will
be assumed hereafter, then (6.21) is equivalent to the minimization problem
Find: e ∈ V such that
J (e) ≤ J (v) for all v ∈ V .
(6.22)
with
1
a(v, v) − ℓ(v) + a(uh , v) for all v ∈ V .
2
It then follows that the energy norm of the error e given by (6.3) meets the following bound
J (v) =
k|e|k2 ≥ −2J (v) for all v ∈ V .
9
March 15, 2013 7:22
(6.23)
Proof. From the definition of k|e|k and of J (v) we have that
1
1
k|e|k2 = a(e, e) = J (e) + ℓ(e) − a(uh , e)
2
2
(6.24)
and using (6.21) with v = e,
a(e, e) = ℓ(e) − a(uh , e),
we have
1
1
k|e|k2 = a(e, e) = J (e) + a(e, e)
2
2
which gives
1
− a(e, e) = J (e)
2
and therefore
1
1
k|e|k2 = a(e, e) = −J (e) .
2
2
(6.25)
Recalling then (6.22), (6.23) follows.
Equation (6.23) states that in order to obtain a lower bound to k|e|k2 one just needs to evaluate J (v) at
any element of V . Notice that this bound can be, however, very poor unless v is chosen suitably.
Adaptive processes rely, on the other hand, on upper bounds to the discretization error. Upper bounds
can be obtained by means of the so-called dual variational formulation of (6.22).
We next detail how to obtain such dual formulation for the error e, using the following two-point value
problem as model problem. At variance of (BVP) here we consider the case that a natural boundary
condition is given at x = L).
− (k(x)u′ (x))′ + c(x)u(x) = f (x) for all x ∈ [0, L]
u(0) = u0
u′ (L) = u′L .
(6.26)
To put things in perspective, in order to get the dual formulation, we have to find a functional G such that
the ‘stress variable’
qe = (k(x)e′ , c(x)e)
represents a critical point for G. It is the objective of this note to show that such functional exists and
there holds
G(q − qh ) ≤ G(qe ) for all q ∈ Q
with Q the set of equilibrated stress fields, qh = (k(x)uh , c(x)uh ) where uh denotes any compatible function, that is, we mean a function that is continuous and meets the essential boundary conditions. For
instance, conforming finite element method will yield such uh .
By introducing the following variables
σ(x) = k(x)u′ (x)
T (x) = c(x)u(x) ,
(6.27)
(6.26) is transformed into the following system of first order differential equations in the unknowns σ, T
10
March 15, 2013 7:22
and u
Find σ, T, u such that:
a)
−σ ′ (x) + T (x) = f (x) for all x ∈ [0, L]
b) σ(x) − k(x)u′ (x) = 0 for all x ∈ [0, L]
c)
T (x) − c(x)u(x) = 0 for all x ∈ [0, L]
(6.28)
d) σ(L) = k(L)u′L
e)
T (0) = c(0)u0
f)
u(0) = u0
g)
u′ (L) = u′L .
Equation (6.28)a is called the equilibrium equation whereas (6.28)d represents a natural boundary condition. We introduce then the following notation: σex and Tex denote the exact solution of (6.28) whereas
for any given compatible function uh = uh (x), we set
σh (x) = k(x)u′h (x)
(6.29)
Th (x) = c(x)uh (x) .
Moreover, we denote by Q the set of elements q = (σ, T ) solutions of the following problem stated over
the whole domain Ω =]0, L[
− σ ′ (x) + T (x) = f (x) for all x ∈]0, L[
(6.30)
σ(L) = σex (L) = k(L)u′L ,
obtained from (6.28) considering only the equilibrium equation along with the natural boundary condition.
Given ri = (σi , Ti ) ∈ L2 (Ω) × L2 (Ω) i = 1, 2, we consider the linear space L2 (Ω) × L2 (Ω) equipped with
the following inner product
(r1 , r2 ) := hσ1 , σ2 i + hT1 , T2 i :=
Z
and denote by k|q|k the associated norm, that is,
s
Z
p
k|q|k = (q, q) =
0
L
0
L
σ1 (x)
σ2 (x) dx +
k(x)
σ 2 (x)
dx +
k(x)
Z
0
L
Z
0
L
T1 (x)
T2 (x) dx ,
c(x)
(6.31)
T 2 (x)
dx .
c(x)
We have then the following fundamental result.
Proposition 6.2. Let qex = (σex , Tex ), qh = (σh , Th ) then for any q ∈ Q there holds
(q − qex , qex − qh ) = hσ − σex , σex − σh i + hT − Tex , Tex − Th i = 0 .
(6.32)
Remark 6.2. Equation (6.32) expresses the orthogonality between q − qex and qe = qex − qh . Such
condition is known as Prager-Synge relationship.
Proof. Next the proof of (6.32) is presneted for the case that uh is a conforming finite elment solution. By
11
March 15, 2013 7:22
accounting for (6.31) and applying integration by parts over each subdomain where uh is smooth yields
Z
L
0
=
=
−
(a)
(b)
=
−
(σ − σex )
Z
Z
L
u′h )dx
+
σex − σh
dx +
k(x)
L
(σ −
0
n h
X
i=1
Z L
σex )(u′ex
−
0
0
Tex − Th
dx =
c(x)
L
Z
0
(T − Tex )(uex − uh )dx =
i xi
(σi (x) − σex,i (x))(uex (x) − uh (x)
xi−1
(σ − σex )′ (uex − uh )dx +
0
n h
X
i=1
Z L
(T − Tex )
Z
0
(6.33)
L
(T − Tex )(uex − uh )dx =
i xi
(σi (x) − σex,i (x))(uex (x) − uh (x))
+
xi−1
+
(σ − σex )′ − (T − Tex ) (uex − uh )dx .
By rewriting the term (6.33)a as follows
n h
i xi
X
(σi (x) − σex,i (x))(uex (x) − uh (x))
xi−1
i=1
(a′ )
(b′ )
(6.34)
=
= σ1 (x0 ) − σex,1 (x0 ) uex (x0 ) − uh (x0 ) + σn (xn ) − σex,n (xn ) uex (xn ) − uh (xn ) +
n−1
X n
o
σi (xi ) − σi+1 (xi ) + σex,i+1 (xi ) − σex,i (xi )
uex (xi ) − uh (xi )
+
i=1
we observe that
• The term (6.34)a′ vanishes either for the compatibility of the finite element solution uh = uh (x) or
because σ meets the natural boundary conditions;
• The term (6.34)b′ vanishes for the continuity of the functions σ and σex .
Moreover, also the term (6.33)b is equal to zero as q = (σ, T ) does satisfy the equilibrium equation in
Ω =]0, L[. As result, we have
Z
0
since
L
(σ − σex )′ − (T − Tex ) (uex − uh ) dx =
′
−σex
(x) + Tex (x) = f (x)
−σ ′ (x) + T (x) = f (x)
Z
0
L
′
(−σex
+ Tex ) − (−σ ′ + T ) (uex − uh ) dx = 0
for x ∈ Ωi =]xi−1 , xi [ i = 1, 2, ..., n
(6.35)
for x ∈ Ωi =]xi−1 , xi [ i = 1, 2, ..., n .
(6.36)
Combining these results all together yields finally (6.32).
Once we have established the orthogonality condition (6.32), we are now in the position to formulate the
dual variational principle for (6.22). For any q ∈ L2 (Ω) × L2 (Ω) define the following functional
1
1
G(q) = − (q, q) = − k|q|k2
2
2
(6.37)
with (·, ·) defined in (6.31). We can then state the following result.
Proposition 6.3. Let qex = (σex , Tex ) and qh = (σh , Th ) corresponding to a compatible function uh .
Then there holds
G(qex − qh ) = sup G(q − qh ) .
(6.38)
q∈Q
12
March 15, 2013 7:22
Proof. For any q ∈ Q we can write
(q − qh , q − qh ) = (q − qex + qex − qh , q − qex + qex − qh )
(6.39)
= (q − qex , q − qex ) + 2(qex − qh , q − qex ) + (qex − qh , qex − qh )
and accounting of (6.32) yields
(q − qh , q − qh ) = (q − qex , q − qex ) + (qex − qh , qex − qh ) ,
(6.40)
k|q − qh |k2 = k|q − qex |k2 + k|qex − qh |k2 ≥ k|qex − qh |k2 .
(6.41)
that is,
Recalling then the definition (6.37) of G it follows
(6.42)
−2G(q − qh ) ≥ −2G(qex − qh )
which finally yields (6.38) after multiplying by −1/2.
Exercise 6.1. Show that (6.40) can also be written as
k|q − qh |k = 2k|qex −
qh + q
|k for all q ∈ Q
2
The stationariety condition expressed by (6.38) represents the dual variational formulation of (6.22) once
we note after (6.31) that
− 2G(qex − qh ) = (qex − qh , qex − qh ) = hσex − σh , σex − σh i + hTex − Th , Tex − Th i
=
Z
(6.43)
L
0
k(x)e,2 (x) + c(x)e2 (x) dx = a(e, e) = −2J (uex − uh ) .
In summary, we have therefore
For any v ∈ V
− 2J (v) ≤ −2inf J (v) = −2J (e) = k|e|k2 = a(e, e)
v∈V
= −2G(qex − qh ) = −2sup G(q − qh ) ≤ −2G(q − qh )
q∈Q
for any q ∈ Q.
(6.44)
Examining (6.44) reveals that upper bounds on the discretisation error can be obtained by constructing
elements of the set Q and evaluating therein the value of the functional G.
6.3
Implicit residual type error estimates in energy norm
From the considerations of the previous section, it follows that we can compute upper bounds to the
energy norm of the error as long as we can build an element of the set Q. We recall that Q is the set of
q = (σ, T ) solutions of the equilibrium equations (6.30) stated all over Ω. In this section we are concerned
with finding ways to avoid this global construction and still retain the bound (6.38). The key observation
is that (6.38) holds even if q(x) = (σ(x), T (x)) is in equilibrium over each element Ωi ∈ T with σ ∈ q
continuous on Ω, meeting the natural boundary conditions of (6.30) and the following condition
n h
i xi
X
σi (x)v(x)
xi−1
i=1
h
i
= σn (xn )v(xn ) − σ1 (x0 )v(x0 )
(6.45)
for any compatible function v, that is, for any v ∈ H1 (Ω) that meets the homogeneous essential boundary
conditions. This follows because with such σ, the orthogonality condition (6.32) still holds. In proving
(6.32) in 1D we have invoked the continuity of σ across the nodes whereas for higher dimensions, one must
ensure the continuity of the normal traction σν across the interelement boundaries, with ν a fixed normal
to the element boundary. Now, a way to ensure that the functions σi solutions of the elemental equilibrium
equations can be patched together can be for instance the following. Denote by ∂T the set of all the edges
of the elements Ωi ∈ T the continuity conditions across γ ∈ ∂T can be enforced by associating a smooth
13
March 15, 2013 7:22
scalar field
gγ : γ ∈ ∂T → R
with each γ ∈ ∂T . In 1D, assigning the function gγ means to assign a family of values {gγ (xi ) i = 1, 2, ..., n}
for the value of σ at the nodes. The choice of gγ is arbitrary subject to the constraints to meet the natural boundary conditions on the part of ∂Ω where they have been prescribed and to ensure existence of
solutions for the following problem. Given a scalar field gγ , consider on each element Ωi ∈ T the following
problem
′
(x) + Ti,gγ (x) = f (x) for all x ∈ Ωi =]xi−1 , xi [
− σi,g
γ
σi,gγ (xi−1 ) = −gi (xi−1 ) = −ξi (xi−1 )gγ (xi−1 )
(6.46)
σi,gγ (xi ) = gi (xi ) = ξi (xi )gγ (xi ),
with gi (xi ) = ξgγ (xi ) where


+1 at the node xi−1
ξi = −1 at the node xi


+1 at the nodes x0 and xn , i.e. on the boundary of the domain
which ensures continuity across the nodes of the function σ built element by element by means
Q of (6.46).
QΩi ,gγ the
For the given gγ , let QΩi ,gγ denote the set of solutions qi,gγ = (σi,gγ , Ti,gγ ) of (6.46) and
Ωi ∈T
set of functions q = (σ, T ) such that the restriction over each Ωi is qi,gγ , that is, for x ∈ Ωi q(x)|Ωi =
qi,gγ (x) = (σi,gγ (x), Ti,gγ (x)). We have then the following result.
Q
QΩi ,gγ . If σ ∈ q then σ satisfies condition (6.45).
Proposition 6.4. Let q ∈
Ωi ∈T
Proof. It follows at once by observing that for any compatible v it is
n h
i xi
X
σi (x)vi (x)
i=1
xi−1
n h
i xi
X
gi (x)vi (x)
=
xi−1
i=1
= gγ (x0 )v0 (x0 ) +
n−1
X
i=1
gγ (xi )[vi+1 (xi ) − vi (xi )] + gγ (xn )vn (xn )
= σn (xn )v(xn ) − σ1 (x0 )v(x0 ) .
As result of Proposition 6.4, we can then conclude that (6.38) will hold also with elements of
Q
Ωi ∈T
that is,
G(q − qh ) ≤ G(qex − qh )
∀q ∈
Y
Ωi ∈P
QΩi ,gγ ,
(6.47)
QΩi ,gγ .
For the additivity property of the integral, from (6.44) and recalling the definition (6.37), we obtain
2
k|e|k ≤ −2
+
n Z
X
i=1
xi
xi−1
n
X
i=1
Gi (qi − qh,i ) =
n Z
X
i=1
xi
xi−1
2
1 Ti (x) − Th,i (x) dx
c(x)
2
1 σi (x) − σh,i (x) dx
k(x)
for any q ∈
n
Y
i=1
(6.48)
QΩi ,gγ .
where Gi is defined as (6.37) with the integration limits equal to xi−1 and xi .
Qn
By choosing the function gγ and solving (6.46) in each Ωi we can build in this way elements q ∈ i=1 QΩi ,gγ
with q|Ωi = (σi , Ti ). By evaluating G at this q, according to (6.48) we obtain an upper bound to the energy
norm of error. It is the different
choice of the function gγ and of the criterion to pick the ‘best’ element q
Q
Qi,gγ to deliver different error estimates and to determine its sharpness.
within the resulting set
Ωi ∈P
Remark 6.3. In (6.48) the localization of the upper bound has been obtained by simply using the additivity
of the integral. Consequently, no link is established between global error estimate and solution of element
by element complementary problems. However, the primal hybrid variational formulation of the primal
14
March 15, 2013 7:22
problem (6.22), where one relaxes the interelement continuity of the flux, leads to the following inequality
(for details see [1]).
X
k|e|k2 = −2J (e) ≤ −2
(6.49)
inf J Ωi ,gγ (v) for all Ωi ∈ T
1
Ωi ∈T v∈H (Ωi )
with
1
J Ωi ,gγ (v) = ai (v, v) − ℓi (v) + ai (uh , v) +
2
I
gi vds.
∂Ωi
Applying the complementary principle to the minimization of the functional J Ωi ,gγ (v) we, therefore, find
inf
v∈H 1 (Ωi )
J Ωi ,gγ (v) =
sup
qi,gγ ∈QΩi ,gγ
Gi (qi,gγ − qh ),
thus,
k|e|k2 = −2G(qex − qh ) ≤ −2
≤ −2
X
Ωi ∈T
X
sup
Ωi ∈T qi,gγ ∈QΩi ,gγ
Gi (qi,gγ − qh ) for any q ∈
Y
Ωi ∈T
Gi (qi,gγ − qh )
QΩi ,gγ , for any Ωi ∈ T .
(6.50)
Equation (6.49) links the estimate of the energy norm of the error to the solution of local primal problems
wheras (6.50) links the estimate of the energy norm of the error to the solution of local complementary
problems.
Remark 6.4. Recall that a differential problem stated over the whole domain Ω can be split into local
problems stated over each subdomain belonging to a decomposition T of Ω. Each of these local problems
defines the restriction of the exact solution to the subdomain, provided that suitable boundary conditions
are given on the interelement boundaries. It is the definition of the boundary conditions the most delicate
point as it involves the knowledge of the value of the exact solution (which is the unknown of the problem
we want to solve) or the value of its normal flux across the interior boundaries.
This observation may also be applied to the differential problem which defines the discretization error.
Thus, local problems can be stated over each subdomain such that their solution gives the restriction to
the subdomain of the error. For the generic element Ωi =]xi−1 , xi [ the local problem would read indeed as
follows
−(k(x)ψi′ (x))′ + c(x)ψi (x) = Ruh,i (x) for x ∈ Ωi =]xi−1 , xi [
ψi′ (xi−1 ) = e′i (xi−1 )
ψi′ (xi )
=
(6.51)
e′i (xi ),
where Ruh,i is the pointwise residual and ei (x) = uex (x) − uh,i (x) is the discretization error.
Due to the objective difficulty in stating the boundary conditions, the previous aim is difficult to pursue,
hence we can enquiry if it is possible to modify the local problems so that their solution can give still useful
information at least on the estimate of the error.
By looking at the estimate (6.49) we see that upper bounds on the discretization error have been linked to
the solution of element by element primal problems given by the solution of the infinimum of the functionals
J Ωi ,gγ (v) over H 1 (Ωi ). Hence, the strong form corresponding to each of the local minimizations gives how
the local problems have to be posed, which result in local Neumann boundary value problems. Thus, (6.49)
provides the tool to evaluate and state local problems so that their solution can give useful results for the
estimation of the error in energy norm.
Remark 6.5. From (6.50) follows that exact solution of local complementary problems would detect the
element of QΩi ,gγ which would give the sharpest estimate that one could get with any other element of
QΩi ,gγ . It is also clear that the estimate given by the solution of the local complementary problems could
not be sharp by itself unless the set QΩi ,gγ , and hence the function gγ has been appropriately chosen. In
fact, in (6.50) the sign equal holds if the function gγ is chosen equal to the normal flux of the exact solution
u across the interelement boundaries. With this choice
Q of the function gγ , qex is one of the solutions of
QΩi ,gγ . Furthermore, because of (6.50), nothing
the problem (6.46), hence it belongs to the resulting
Ωi ∈T
15
March 15, 2013 7:22
can be said in general on the relationship between solution of local problems and the restriction of the
error to the subdomain, that is, the solution of local problems does not give any type of information on the
restriction of the error to the subdomain. However, if gγ is chosen equal to the normal flux of the exact
solution across the boundaries, then the solution of the local problem is equal to the restriction of the error
to the subdomain Ωi .
Remark 6.6. The complementary principle appears as a tool by which we can obtain upper bounds on the
energy norm of the error without actually solving any complementary
Q problem. We just need to evaluate
QΩi ,gγ . Hence, the error estimates
the value of the functionals Gi (qi,gγ − qh ) at a suitable element q ∈
Ωi ∈T
presented in literature based on complementary
principle are obtained using the (6.48) and providing a
Q
QΩi ,gγ , given by the solution of (6.46). The price to pay for using
procedure to get an element q ∈
Ωi ∈T
(6.48) consists in the solution of the problem (6.46) and only estimate of the error in energy norm are
provided.
6.3.1
An application of an implicit residual error estimate
In this section we consider the error estimate developed in [2] with a = 0. Hence, the following equation
is analysed
− [k(x)u′ (x)]′ = f (x) for x ∈]0, L[
u(0) = u0 ,
u(L) = uL .
(6.52)
The inequality (6.48), which gives the upper bound on the energy norm of the error, becomes
k|e|k2 ≤ −2
n
X
i=1
Gi (σi − σh,i ) =
n Z
X
i=1
xi
xi−1
n
n
Y
X
2
1 QΩi ,gγ , (6.53)
ηi2 for all σ ∈
σi (x) − σh,i (x) dx =
k(x)
i=1
i=1
where we have let
ηi2 = −2Gi (σi − σh,i ) =
which are referred to as local error indicators.
Z
xi
xi−1
2
1 σi (x) − σh,i (x) dx,
k(x)
To use (6.53), we need, therefore,
n
Y
QΩi ,gγ and
(i) to define the set
i=1
(ii) to give the criterion to pick an element σ of the set
integrals −2Gi (σi − σh,i ).
Qn
i=1
(6.54)
QΩi ,gγ where then we will evaluate the
In the remaining part of this section, we show how for the problem (6.52) the above two questions are
tackeld with.
The set QΩi ,gγ is defined by (6.46) which for (6.52) reads as
′
(x) = f (x)
− σi,g
γ
x ∈ Ωi =]xi−1 , xi [
σi,gγ (xi−1 ) = −gi (xi−1 ) = −ξi (xi−1 )gγ (xi−1 )
(6.55)
σi,gγ (xi ) = gi (xi ) = ξi (xi )gγ (xi ) .
This problem is defined once a function gγ is assigned at the interelement boundaries upon the condition
that gγ meet the natural boundary conditions and (6.55) have solution. Since T = 0, the latter condition
does make sense. It expresses the data equilibrationfor apure Neumann problem and is obvious from the
structure of the differential problem (6.55). For being of the first order, only one initial value is necessary.
Thus, in order to problem (6.55) have solutions an additional condition involving the boundary data is
expected. It is immediate to find this condition, in fact, integrating (6.55)1 over Ωi yields
Z x
f (ξ) dξ for x ∈ Ωi ,
σi (x) = σi (xi−1 ) −
xi−1
16
March 15, 2013 7:22
hence, the following condition must hold
σi (xi ) = σi (xi−1 ) −
Z
xi
f (x)dx ,
(6.56)
xi−1
that is,
gγ (xi ) + gγ (xi−1 ) = −
Z
xi
f (x)dx .
(6.57)
xi−1
Thus, by giving the function gγ , which in 1D consists of the family of values {g(xi ); i = 1, . . . , n}, and
provided that one meets condition (6.57) and possible natural boundary conditions, problem (6.55) is
defined and it can be solved.
In 1D assigning gγ at the nodes is meaningful if essential boundary conditions are given at both the ends
of Ω =]0, L[. If at one of the ends a natural boundary condition is given, then the respect of this condition
along with the equilibration condition (6.57), defines unambiguously the values of gγ at all the nodes
recursively. In fact, in this case problem (6.52) can be split into a system of two indipendent differential
problems of the first order which can be solved separately. This circumstance describes a so-called statically
determined system,. It then follows that there exists only one equilibrated stress field which is the exact
one. For instance, if at x = L the natural boundary condition is given, say σL = k(L)u′L , it is gγ (L) = σL .
By accounting then for (6.57) consequent consistent values of gγ are obtained for the other nodes. The
equilibrated stress field, corresponding to the natural boundary condition σ(L) = σL prescribed at x = L
is then given as follows
Z x
f (ξ) dξ, for x ∈ Ωi =]xi−1 , xi [
σi (x) = σi (xi−1 ) −
xi−1
where
σi (xi−1 ) = σi (xi ) +
Z
xi
xi−1
with
f (x)dx i = {n, n − 1, . . . , 1}
(6.58)
σn (xn ) = σL
The indetermination in defining gγ arises if essential boundary conditions are given at both the ends of
Ω. In this case, which corresponds to a statically indetermined system, more equilibrated stress fields can
be defined and a choice must be done for defining a family of values of gγ . The function gγ is required to
meet only condition (6.57). In 1D, once gγ is given, problem (6.55) can be solved and it admits a unique
solution, hence the set QΩi ,gγ reduces to only one element. This will not be the case in higher dimensions
when the analogous of problems (6.55) will result indetermined.
In summary, if a natural boundary condition is given at one of the ends of Ω, say for instance at x = L,
then the unique equilibrated stress field is given by (6.58) which is also the exact stress field, that is
σ = σex . In this case the error estimate coincides with the energy norm of the error.
In order to provide more generality, we refer next to the case where equilibrated stress fields are indetermined, that is, when essential boundary conditions are given at both the ends of the domain Ω. In this
case, a function gγ must be chosen. Let Vh (Ωi ) denote the finite element space generated by the shape
functions defined over Ωi , in [2] gγ is introduced so that it does meet the following condition
Z xi
Z xi
h
i xi
′
f (x)v(x)dx for any v ∈ Vh (Ωi ).
(6.59)
σh,i (x)v (x)dx −
σi (x)v(x)
=
xi−1
xi−1
xi−1
which is obtained by imposing the orthogonality of σ − σh to the local space Vh (Ωi ), that is,
Z xi
σi (x) − σh,i (x) v ′ (x)dx = 0 for all v ∈ Vh (Ωi ) .
(σi − σh,i , v ′ )L2 (Ωi ) =
(6.60)
xi−1
Equation (6.60) says that σi is the local L2 projection onto Vh (Ωi ).
Condition (6.59) defines the values of the function σ at the nodes xi as result of the finite dimensionality of
the space Vh and of the linearity of condition (6.59) with respect to v. In fact, equation (6.59) is equivalent
17
March 15, 2013 7:22
to the following two conditions
σi (xi−1 ) = −
σi (xi ) =
xi
Z
xi
Z
σh,i (x)ϕ′h,xi−1 (x)dx
xi−1
σh,i (x)ϕ′h,xi (x)dx −
xi−1
Z
+
xi
Z
f (x)ϕh,xi−1 (x)dx
(6.61)
xi−1
xi
f (x)ϕh,xi (x)dx
(6.62)
xi−1
with ϕh,xi−1 and ϕh,xi the element lagrangian shape functions.
Remark 6.7. It is interesting to compare condition (6.60) to the orthogonality condition of the discretization error σex − σh to Vh (Ω) expressed as
(σex − σh , v ′ )L2 (Ω) =
L
Z
0
σex (x) − σh (x) v ′ (x) dx = 0 for all v ∈ Vh (Ω).
(6.63)
This means that the equilibrated stress field σ satisfying condition (6.60) is such that σ − σh is solution
of a problem similar to problem (6.21) characterising the discretization error, in the sense that at r.h.s.
of (6.21) there will still appear a functional vanishing over Vh (Ω) which is, however, different from the
residual functional.
Condition (6.59), albeit local, is consistent with the continuity of σ across the nodes, that is, it is
σi−1 (xi−1 ) = σi (xi−1 ).
Proof. Since it is
σi−1 (xi−1 ) =
σi (xi−1 ) = −
Z
xi−1
xi−2
Z xi
xi−1
σh,i−1 (x)ϕ′h,xi−1 (x)dx −
σh,i (x)ϕ′h,xi−1 (x)dx +
it follows
σi−1 (xi−1 ) − σi (xi−1 ) =
Z
Z
Z
xi
f (x)ϕh,xi−1 (x)dx
xi−1
f (x)ϕh,xi−1 (x)dx
xi−1
xi
xi−2
(6.64)
xi
σh (x)ϕ′h,xi−1 (x)dx −
Z
xi
f (x)ϕh,xi−1 (x)dx
(6.65)
xi−2
which is equal to zero, because the residual functional vanishes over Vh (Ω).
Replacing (6.61) and (6.62) in (6.56) shows that the choice suggested in [2] satisfies the equilibration
condition
Z xi
Z xi
′
f (x)ϕh,xi (x)dx +
σh,i (x)ϕh,xi (x)dx −
σi (xi ) − σi (xi−1 ) =
xi−1
xi
+
=
Z
xi−1
Z xi
xi−1
xi
−
Z
xi−1
and since,
xi−1
Z xi
σh,i (x)ϕ′h,xi−1 (x)dx −
f (x)ϕh,xi−1 (x)dx =
xi−1
′
σh,i (x) ϕh,xi−1 (x) + ϕh,xi (x) dx +
f (x) ϕh,xi−1 (x) + ϕh,xi (x) dx
ϕh,xi−1 (x) + ϕh,xi (x) = 1 for x ∈ Ωi =]xi−1 , xi [
(6.66)
(6.67)
the equilibration condition (6.56) is therefore identically satisfied.
In summary, the functions σi (x) to use in (6.53) are obtained as follows. If a natural boundary condition
is given at one of the ends of Ω, say at x = L, then the functions σi are given by equation (6.58); on the
other hand, if essential boundary conditions are given at both the ends of Ω, the functions σi are solution
18
March 15, 2013 7:22
of the following local problems,
− σi′ (x) = f (x) for x ∈ Ωi =]xi−1 , xi [
Z
Z xi
σh,i (x)ϕ′h,xi−1 (x)dx +
σi (xi−1 ) = −
σi (x)
Z x
σi′ (ξ)dξ =
σi (xi−1 ) +
xi−1
Z
Z xi
′
σh,i (x)ϕh,xi−1 (x)dx +
−
=
=
f (x)ϕh,xi−1 (x)dx.
xi−1
xi−1
that is,
(6.68)
xi
xi
xi−1
xi−1
f (x)ϕh,xi−1 (x)dx −
Z
(6.69)
x
f (ξ)dξ.
xi−1
Remark 6.8. The sharpness of the estimate depends on the choice of the equilibrated stress field σ. Now,
a condition which we would like to have, though no essential, can be that if it happens u′h = u′ex , so that the
error is zero, then also the estimate vanishes. This means to require that if σh happens to be equilibrated,
then the procedure that defines the equilibrated stress field σ should be such that it delivers σ = σh . This
condition is satisfied in 1D by the equilibrated stress field as defined in (6.69).
′
Proof. For f (x) = −σh,i
(x) over Ωi , equation (6.69) becomes
σi (x)
=
=
−
Z
xi
xi−1
σh,i (x)ϕ′h,xi−1 (x)dx −
xi
− σh,i (x)ϕh,xi−1 (x)
xi
xi−1
′
σh,i
(x)ϕh,xi−1 (x)dx +
x
+ σh,i (x)
xi−1
=
Z
Z
x
xi−1
′
σh,i
(ξ)dξ =
=
xi−1
σh,i (xi−1 ) + σh,i (x) − σh,i (xi−1 ) = σh,i (x)
that is the equilibrated stress coincides with the finite element solution if the latter is equilibrated.
If we let
pi = σi − σh,i for every Ωi ∈ T
(6.70)
with σi solution of (6.68), it might be more appropriate to transform (6.68) into another one which has
pi = pi (x) as solution. The differential problem which characterises the functions pi useful to deliver error
estimates is therefore given as follows.
′
′
−σi′ (x) + σh,i
(x) = f (x) + σh,i
= Ruh,i (x) for x ∈ Ωi = (xi−1 , xi ), for Ωi ∈ T
with boundary condition given by
pi (xi−1 ) = σi (xi−1 ) − σh,i (xi−1 ).
By accounting of (6.68)2 we obtain
Z
Z xi
σh,i (x)ϕ′h,xi−1 (x)dx +
pi (xi−1 ) = −
xi
xi−1
xi−1
f (x)ϕh,xi−1 (x)dx − σh,i (xi−1 ) ,
and applying integration by parts to the first integral yields
Z xi
i xi
h
σh,i (x)ϕ′h,xi−1 (x)dx = σh,i (x)ϕh,xi−1 (x)
xi−1
xi−1
that is,
pi (xi−1 ) =
Z
−
Z
xi
xi−1
′
σh,i
(x)ϕh,xi−1 (x)dx
xi
Ruh,i (x)ϕh,xi−1 (x)dx.
xi−1
Hence, in order to obtain an error estimate p must be solution of the following differential problem
− p′i (x) = Ruh,i (x) ∀x ∈ Ωi = (xi−1 , xi )
Z xi
Ruh,i (x)ϕh,xi−1 (x)dx.
pi (xi−1 ) =
xi−1
19
March 15, 2013 7:22
which gives,
pi (x) = pi (xi−1 ) −
that is,
pi (x) =
Z
ηi2 =
Z
xi
xi−1
1 h
k(x)
x
Ruh,i (ξ)dξ.
xi−1
xi
xi−1
so that,
Z
Z
Ruh,i (ξ)ϕh,xi−1 (ξ)dξ −
x
Z
Ruh,i (ξ)dξ.
xi−1
xi
xi−1
Ruh,i (ξ)ϕh,xi−1 (ξ)dξ −
Z
x
Ruh,i (ξ)dξ
xi−1
i2
dx .
Remark 6.9. The differential problem (6.71) applies also for i = 1, though the shape function ϕh,x0 =
ϕh,x0 (x) does not belong to the finite element space.
In fact, for the element Ω1 =]x0 , x1 [ the problem to solve should be the following
− p′1 (x) = Ruh,1 (x) ∀x ∈ Ω1 = (x0 , x1 )
Z x1
Ruh,1 (x)ϕh,x1 (x)dx .
p1 (x1 ) = −
(6.71)
x0
However, it results
Z x1
Ruh,1 (x)ϕh,x0 (x)dx +
Z
x1
Ruh,1 (x)ϕh,x1 (x)dx =
x1
Ruh,1 (x)dx =
x0
x0
x0
Z
h
i x1
= p1 (x0 ) − p1 (x1 )
= − p1 (x)
(6.72)
x0
that is,
p1 (x0 ) = p1 (x1 ) +
Z
x1
Ruh,1 (x)dx =
x0
Z
x1
Ruh,1 (x)ϕh,x0 (x)dx
x0
thus, even for i = 1 the differential problem (6.71) may be expressed as
− p′1 (x) = Ruh,1 (x) ∀x ∈ Ω1 = (x0 , x1 )
Z x1
Ruh,1 (x)ϕh,x0 (x)dx.
p1 (x0 ) =
x0
which is obtained by (6.71) for i = 1.
Remark 6.10. In [2] no link is given with the solution of local complementary problems. The gists of
the procedure is to establish the complementary problem characterising the
Q global discretization error and
indicate a strategy, introduced by assigning gγ , to detect the element q ∈ ni=1 QΩi ,gγ which would deliver
the best estimate.
References
[1] Ainsworth M., Oden J.T., A posteriori error estimation in finite element analysis, Comput. Methods
Appl. Mech. Engrg. 142 (1997) 1-88
[2] Ladeveze P., Leguillon D., Error estimate procedure in the finite element method and applications,
SIAM J. Numer. Anal. 30 (1983) 485-509
20
March 15, 2013 7:22