|
The curly braces
are used in this figure to enclose the elements of a set. The
5 horizontal levels along which the sets are organized reflects
the partial order structure of this set of sets. The top set
includes all four of the possible concept instances. The next
level consists of four sets each of which contain a different
combination of three of the four possible concept instances.
Similarly, the next level consists of six sets each of which
contain a different combination of two of the four possible concept
instances. The fourth level consists of 4 sets each containing
one of the four possible concept instances. And, finally, the
bottom level consists of the empty set which contains none of
the possible concept instances.
The lines in
the figure connect a set at level n - 1 with a set
at level n if the set at level n - 1 is
a subset of the set at level n. Now in order to see
how the lattice property proves useful to hypothesis revision,
let us choose one of the sets to represent our current hypothesis
about the concept. Choose the set at the middle level at the
far left. According to this hypothesis the concept is any instance
that is black. And, we assume that this hypothesis is consistent
with the training instances that have been seen to this point.
Now assume
that we encounter an instance that consists of a large white
square. Further, suppose that we are told that this instance
is a positive instance of the concept. Consequently, our hypothesis
must be generalized. Note that there are 5 hypotheses in this
space that are more general than the current hypothesis. How
should we choose among this candidate set of revisions?
Note first
that the current hypothesis is a subset of only 3 of the 5. Thus,
2 of 5, the two at the far right of the figure, can be eliminated
since adopting either of these would result in an hypothesis
that was inconsistent with the training sequence that has been
observed to this point.
Consider next
the generalizations of the current instance. There are seven
generalizations that include this instance. However, only two
of these seven generalizations include both the current instance
and the current hypotheses. And one of these two is the most
general hypothesis at the top level of the lattice. One other
generalization of the hypothesis remains. That, is the set at
the far left of the lattice at level 2. This hypothesis is preferred
since it is the minimal generalization required to bring the
hypothesis in lines with the training sequence.
A similar line
of argument could be illustrated if a negative instance were
presented, such as a small black square. In this case the specialization
of the hypothesis would result in the identification of the set
at the far left of the lattice at level 4.
|