Perceptron
- Definition
- For a training example input
and output
.
- Hypothesis:
.
- For a training example input
- Conceptual Explanation
- A linear / surface / hyperplane (a hypothesis) that separate the dots (training examples) examples inside a 2D / 3D / ND space.
- (If the features are N-dimensional, then the problem space is N-dimensional, and the hyperplane is N-1 dimensional)
- Learning Algorithm
- Perceptron Learning Algorithm
- Overall: start from some
and correct its mistakes on
.
- Implementation: From
- Find a mistake of
_t called
,
.
- Follow the naive cycle or precomputed random cycle.
- Try to correct the mistake by
.
- Until no more mistakes (a full cycle of not encountering mistakes).
- Return the last
as
.
- Find a mistake of
- Conceptual Explanation:
- Assume there is a hyperplane that splits the two different kinds of the output, the perceptron learning algorithm is trying to find the normal vector
of the hyperplane though doing dot products of the current
and the training examples
.
- The target is making all the positive example vectors has an acute angle with the positive examples and an obtuse angle with the negative examples.
- Assume there is a hyperplane that splits the two different kinds of the output, the perceptron learning algorithm is trying to find the normal vector
- Therom:
.
- Overall: start from some
- Perceptron Learning Algorithm
- Linear Separability
- Definition: If PLA halts
allows some
to make no mistake, call such
linear seperable.
- PLA Fact:
gets more aligned with
.
- The dot product of
and
is getting larger.
- PLA Fact:
does not grow too fast
.
- Therom: (For simplicity, no longer mark w and x as vector)
- Definition: If PLA halts
- Non-Linear Seperatably Data
- Gurantee to work as long as linear speratble, but doesn’t know if it is linear speratble at front, event not sure how long halting takes (
depends on
).
- Line with Noise Tolerance
- Pocket Algorithm: Keeping best weights in pocket. Will be slower than PLA (storing alternative and checking all the examples).
- Gurantee to work as long as linear speratble, but doesn’t know if it is linear speratble at front, event not sure how long halting takes (
Leave a comment