A look at Boosting

Taken from the paper A decision-theoretic generalization of on-line learning and an application to boosting. Freund and Schapire 1995.

We are interested from section 4 on.

Language:

X is the domain.

A concept is a Boolean function .

A concept class is a collection of concepts.

The learner has access to an oracle providing labeled examples of the form (x, c(x)), where x is chosen randomly according to some fixed but unknown and arbitrary distribution D on X, and is the target concept.

After some time , the learner outputs a hypothesis: .

the error of the hypothesis is where x follows distribution D.

A strong PAC-leraning algorithm is an algorithm, that given , access to random examples, outputs with probability a hypothesis with error at most .

A weak learner, is one that has , .

It does a little bit better than guessing.

Boosting is a method for turning weak learners into strong learners.

The Boosting algorithm proceeds as follows:

Input: the examples, the distribution D over the examples, the weak learning algorithm and the number of iterations, T.

Initialize the weight vector: , .

Do for

Set
Call the weak learner , providing it with ; and get back the hypothesis .
Calculate the error of
Set .
Reweight:

Output the hypothesis:

Up: Class 8 Stat 540 Previous: Where to find additional

Richard Waterman
Fri Mar 5 08:09:37 EST 1999