- Hoeffding’s Inequality
.
- Unlike central limit therom, it holds for all N. The statement
is probably approximately correct (PAC). The professor simplified the formula based on the factor the output lies in the interval
. Consequently
.
- Connection to Learning
- Concept Mapping
- The probability
: fixed hypothesis h(x) = target f(x).
- The sample space: the input space
- The sample equals to 1: h(x) is true
- The sample equals to 0: h(x) is wrong
- Sampling: check h(x) on the training data set (with i.i.d inputs)
- The probability
- Conclusion
For any fixed h, can probably infer unknown
by
.
- Problem
The previous discussion assumes a fixed hypothesis. Actually, if there are multiple hypotheses, there can always a probability that the hypothesis fits the training data set well but not the actual problem (overfitting). To overcome this problem, we need a large data setto reduce the probability.
- Concept Mapping
Leave a comment