Professional Documents
Culture Documents
Hungry judges rule Columbia University and Hunger and fatigue leave
negatively Ben Gurion University decision makers feeling
(Israel) less forgiving
Example:
Goal: classify a record as will buy computer or
will not buy
Rule might be IF (Income > 92.5) AND
(Education = poor) AND (FamilySize = small)
THEN buy = no (class = 0)
Rules are represented by tree diagrams
no yes no yes
Entropy(D) =
= c - (pc ) log2(pc )
= c - (nbc / nb ) log2(nbc / nb )
Entropy(D) Entropy(D,A)
= Gain(D,A)
= Info(D) Info(D,A)
= - (ntc / nt ) log2(ntc / nt )
- Average entropy on branching on A
= - c(ntc / nt ) log2(ntc / nt )
- ( b ((nb / nt ) x (-c (nbc / nb ) log2(nbc / nb ))))
Gain(D,A)/SplitInfo(D,A)
Accuracy of a classifier
Overfitting
Classifier performance
1400
1200
1000
Revenue
800
600
400
200
0
0 100 200 300 400 500 600 700 800 900 1000
Expenditure
Causes:
Too many predictors
Sample
Explore
Modify
Model
Assess