T-61.263 Advanced course in neural computing
Solutions for exercise 9
- We start with the notion that a neuron
flips from state
to
at temperature
with probability
 |
(1) |
where
is the energy difference resulting from such a flip.
The energy function of the Boltzmann machine is defined by
Hence the energy change produced by neuron
flipping from state
to
is
where
is the induced local field of neuron
.
- In light of the result in Eq.(2), we may rewrite Eq.(1) as
This means that for an initial state
, the probability that neuron
is flipped into state
is
 |
(2) |
- For an initial state of
, the probability that neuron
is
flipped into state
is
 |
(3) |
The flipping probability in Eq.(4) and the one in Eq.(3) are in perfect
agreement with the following probabilistic rule:
where
is itself defined by
- The Boltzmann machine and sigmoid belief network share a common
feature: they are both stochastic machines with their theory rooted
in statistical mechanics.
They differ from each other in the following respects:
- The Boltzmann machine is a recurrent network whereas the sigmoid
belief network is an acyclic feedforward network.
- The learning process in a Boltzmann machine involves two phases:
one clamped (positive) and the other free running (negative). The
negative phase is eliminated from the sigmoid belief network.
- Writing the system of
simultaneous equations (Haykin, Eq. 12.22) in
matrix form:
 |
(4) |
where
Rearranging terms in Eq.(5):
where
is the
-by-
identity matrix. For the solution
to be unique we require that the
-by-
matrix
have an inverse matrix for all possible values of the discount factor
.
- An important property of dynamic programming is the monotonicity property
described by
This property follows from the fact that if the terminal cost
for
stages
is changed to a uniformly larger cost
, that is,

for all
then the last stage cost-to-go function
will be uniformly
increased. In more general terms, we may state the following. Given two cost-to-go
functions
and
with

for all
we find that for all
and
the following relation holds:
This relation merely restates the monotonicity property of the dynamic programming algorithm.
Jaakko Peltonen
2003-11-21