The perceptron training rule is guaranteed to succeed (to find a weight vector that classifies every training example correctly) if the
training examples are linearly separable and the learning rate
is sufficiently small.
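A minimal sketch of the perceptron training rule, w_i <- w_i + eta*(t - o)*x_i, applied only when an example is misclassified; the learning rate eta, epoch limit, and the tiny OR-style dataset are illustrative assumptions, not part of the text above:

```python
import numpy as np

def perceptron_train(X, y, eta=0.1, max_epochs=100):
    """Perceptron training rule: w_i <- w_i + eta * (t - o) * x_i.

    X: (n_samples, n_features) inputs; y: targets in {-1, +1}.
    Returns the learned weights, with the bias folded in as w[0].
    """
    # Fold the bias into the weight vector via a constant input of 1.
    X = np.hstack([np.ones((X.shape[0], 1)), X])
    w = np.zeros(X.shape[1])

    for _ in range(max_epochs):
        errors = 0
        for x, t in zip(X, y):
            o = 1 if np.dot(w, x) >= 0 else -1   # thresholded output
            if o != t:
                w += eta * (t - o) * x           # update only on mistakes
                errors += 1
        if errors == 0:                          # all examples classified correctly
            break
    return w

# Toy linearly separable data (illustrative only): OR-like labels.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([-1, 1, 1, 1])
print(perceptron_train(X, y))
```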
The linear unit training rule (gradient descent) will
converge to the hypothesis with minimum squared error, given a
sufficiently small learning rate, even when the training data
contain noise and are not linearly separable.
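A sketch of batch gradient descent for a single linear unit, minimizing the squared error E(w) = 1/2 * sum_d (t_d - o_d)^2 with o_d = w . x_d; the dataset, learning rate, and epoch count below are assumptions chosen only for illustration:

```python
import numpy as np

def linear_unit_gradient_descent(X, y, eta=0.01, epochs=500):
    """Batch gradient descent for a linear unit o = w . x,
    minimizing E(w) = 1/2 * sum_d (t_d - o_d)^2.
    Gradient component: dE/dw_i = -sum_d (t_d - o_d) * x_id.
    """
    X = np.hstack([np.ones((X.shape[0], 1)), X])  # bias term as w[0]
    w = np.zeros(X.shape[1])

    for _ in range(epochs):
        o = X @ w                      # unthresholded linear outputs
        grad = -(y - o) @ X            # gradient of the squared error
        w -= eta * grad                # step in the direction of steepest descent
    return w

# Noisy targets (illustrative): y ~ 2*x1 - x2 + noise, so no exact fit exists.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 2))
y = 2 * X[:, 0] - X[:, 1] + 0.1 * rng.normal(size=50)
print(linear_unit_gradient_descent(X, y))
```

Because the outputs are not thresholded during training, the rule converges toward the minimum-error weights whether or not the data are separable.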
Gradient descent has two problems:
Convergence to a local minimum can sometimes be slow.
If the error surface contains multiple local minima, there is no guarantee that the procedure will find the global minimum.
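The squared-error surface of a single linear unit has only one minimum, so the second problem matters mainly for more complex error surfaces. The toy 1-D objective below is an assumption chosen purely to illustrate how the result of gradient descent can depend on the starting point:

```python
def grad_descent_1d(grad, x0, eta=0.01, steps=2000):
    """Plain gradient descent on a 1-D function, given its derivative."""
    x = x0
    for _ in range(steps):
        x -= eta * grad(x)
    return x

# Non-convex toy objective f(x) = x^4 - 3x^2 + x, which has two local minima.
f = lambda x: x**4 - 3 * x**2 + x
grad_f = lambda x: 4 * x**3 - 6 * x + 1

# Different starting points settle into different minima;
# only one of them is the global minimum.
for x0 in (-2.0, 2.0):
    x = grad_descent_1d(grad_f, x0)
    print(f"start {x0:+.1f} -> x = {x:+.3f}, f(x) = {f(x):+.3f}")
```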