Appendix A
Lagrange multipliers

A.1  A general introduction to Lagrange multipliers

The idea for the Lagrange multipliers was first put forward in the ninth of Lagrange's "Leçons sur le calcul des fonctions" in 1806. A function M = f0(xi) for i = 1,2,...N is to be maximised, subject to constraints gj for which is known:

gj(xi) = 0 for j = 1,K
(85)

This is equivalent to reducing the dimension of the original system.

A new function L is constructed, adding the dimension lost through the constraints by incorporating Lagrange multipliers. Therefore, the constrained maximisation of function M is equivalent to the unconstrained maximisation of function L where

L = f0(xi) +
å
j 
lj gj(xi) = f0(xi) + l1 g1(xi) + l2 g2(xi) + ...
(86)

This function has N+K unknowns.

The maximisation is performed by differentiating L with respect to both parameters. In the maximum the differentials with respect to the appropriate lk and xk will be 0:

L
lk
= gk(xi) = 0
(87)

and

L
xk
= f0
xk
+
å
j 
lj gj
xk
= 0
(88)

We now have N+K equations (equations 88 and 87, respectively) to solve the N+K unknowns of function L.

A.2  Lagrange multipliers in entropy maximisation

In the general case of maximising entropy, the following entropy formula may be used (this is equation 57):

S(p,q) = - B
å
i = 1 
piln(pi/qi)
(89)

The constraints may be denoted as follows:

B
å
i = 1 
piCij = cj
(90)

for B possible system states. j = 1,M where M is the number of constraints. Since the constraints must always be satisfied, it follows from equation 90 that

B
å
i = 1 
DpiCij = 0
(91)

where Dpi is a small variation in pi. In the maximum of S we have:

B
å
i = 1 
S
pi
Dpi = 0
(92)

Now a multiplier lj is introduced for each constraint. Multiplying equation 90 by lj for each j and summation of the results leads to

B
å
i = 1 
M
å
j = 1 
lj Dpi Cij = 0
(93)

When M £ B, only B-M parameters Dpi are independent, whereas the parameters for which i = 1,M are dependent. This means the multipliers lj may be obtained from

S
pi
= M
å
j = 1 
lj Cij
(94)

A.3  Lagrange multipliers in crystallography

In crystallography the entropy may be maximised in the case of general linear constraints of the form

ó
õ


V 
p( ®
r
 
)Cj( ®
r
 
)d ®
r
 
= cj
(95)

for j = 1,M where M is the number of constraints. Cj = e2pi[h]j·[r] for every reciprocal lattice vector [h]j, and cj = U([h]j) is the unitary structure factor for the reflection corresponding to [h]j. p([r]) is the unknown optimal distribution of atoms and V the volume of the unit cell. The normalisation condition, or the origin term with j = 0, may also be regarded as a constraint:

ó
õ


V 
p( ®
r
 
)d ®
r
 
= ó
õ


V 
p( ®
r
 
)C0( ®
r
 
)d ®
r
 
= c0 = 1
(96)

The constrained maximisation of the entropy S(p,q) (see equation 89) is equivalent to the unconstrained maximisation of

S(p,q) + M
å
j = 0 
(lj ó
õ


V 
p( ®
r
 
)Cj( ®
r
 
)d ®
r
 
-cj)
(97)

where lj are the Lagrange multipliers. q = q([r]) is the prior distribution.

Differentiation with respect to lj returns the constraint equations 95, while differentiation with respect to p([r]) yields:

-1-ln(pME( ®
r
 
)/q( ®
r
 
))+ M
å
j = 0 
lj Cj( ®
r
 
) = 0
(98)

or

pME( ®
r
 
) = q( ®
r
 
)exp(l0-1)exp( M
å
j = 0 
lj Cj( ®
r
 
))
(99)

where pME is the Maximum Entropy posterior distribution. Solving directly for l0 by integrating this equation over [r] and using the normalisation constraint òVpME([r])d[r] = 1 gives l0-1 = -lnZ where

Z(l1,...,lB) = ó
õ


V 
q( ®
r
 
)exp( M
å
j = 1 
lj Cj( ®
r
 
))d ®
r
 
(100)

Z is the partition function which describes the way in which the unit cell has been divided up in fractions B. Then

pME( ®
r
 
) =
q( ®
r
 
)

Z(l1,...,lB)
exp( M
å
j = 1 
lj Cj( ®
r
 
))
(101)

and

ó
õ


V 
pME( ®
r
 
)Cj( ®
r
 
)d ®
r
 
= cj
(102)

File translated from TEX by TTH, version 1.54.
and edited manually afterwards.

Back to the Table of Contents