Gene clustering: 2001:
Finding Rules
§ Words of a cluster characterise its instances
§ Construct rules with
  • “necessary” words
  • “sufficient” words
  • “supplementary” words
§ Construct “ordinary” discriminant rules.
  if E.coli & (intergenic or (hypothetical & region)) then UP
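The example rule can be read as a Boolean test over the words attached to an instance. The sketch below is only an illustration, not the original system's implementation: it assumes each instance is represented as a set of words, the function name `matches_up_rule` is hypothetical, and the case-insensitive matching is an added assumption.

```python
def matches_up_rule(words):
    """Return True if the word set satisfies the example rule for cluster UP.

    Rule: E.coli & (intergenic or (hypothetical & region))
    Assumption: an instance is a collection of words; matching ignores case.
    """
    w = {word.lower() for word in words}
    return "e.coli" in w and (
        "intergenic" in w or ("hypothetical" in w and "region" in w)
    )
```

Under this reading, "E.coli" must always be present, while either "intergenic" alone or "hypothetical" together with "region" completes the rule.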
§ Not all words are necessary to discriminate the cluster
  • e.g., if all instances in the dataset have the word “protein”, then “protein” is in every cluster, but “protein” is not useful to discriminate
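One way to picture the “protein” case is to drop every word that occurs in all instances of the dataset before building a rule. This is a minimal sketch under stated assumptions: instances are sets of words, and the helper names `ubiquitous_words` and `useful_cluster_words` are hypothetical, not taken from the original work.

```python
def ubiquitous_words(dataset):
    """Words that occur in every instance of the dataset."""
    instances = [set(instance) for instance in dataset]
    return set.intersection(*instances) if instances else set()


def useful_cluster_words(cluster, dataset):
    """Words seen in the cluster, minus words that every dataset instance has."""
    cluster_words = set().union(*(set(instance) for instance in cluster))
    return cluster_words - ubiquitous_words(dataset)
```

With a toy dataset invented for illustration, where every instance contains "protein", `ubiquitous_words` returns that word and `useful_cluster_words` leaves it out of the cluster's candidate words.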
UP if
  • E.coli
  • intergenic, hypothetical, region
  • protein, in, to