Learning a perceptron can be accomplished using the update rule $w_i = w_i + \eta\,(y - \hat{y})\,x_i$.
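For concreteness, here is a minimal sketch of what I mean (NumPy, labels in $\{0, 1\}$, bias column assumed appended to the data; the function name and defaults are my own):

```python
import numpy as np

def perceptron_train(X, y, eta=1.0, max_epochs=100):
    """Perceptron training with the rule w <- w + eta * (y - y_hat) * x.

    Assumes labels y in {0, 1} and a bias column already appended to X.
    """
    w = np.zeros(X.shape[1])  # initial weight vector (see question below)
    for _ in range(max_epochs):
        errors = 0
        for x_i, y_i in zip(X, y):
            y_hat = 1 if np.dot(w, x_i) >= 0 else 0  # threshold activation
            if y_hat != y_i:
                w += eta * (y_i - y_hat) * x_i  # the update rule above
                errors += 1
        if errors == 0:  # converged: every point classified correctly
            break
    return w
```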
All the resources I have read so far say that the learning rate $\eta$ can be set to 1 without loss of generality.
My question is the following: is there any proof that the speed of convergence will always be the same regardless of $\eta$, given that the data is linearly separable? Shouldn't this also depend on the initial weight vector $w$?
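To make the question concrete, here is the kind of toy experiment I have in mind (the dataset and helper below are purely illustrative, building on the sketch above):

```python
import numpy as np

# Toy linearly separable dataset with a bias column appended.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
X = np.hstack([X, np.ones((100, 1))])

def count_updates(eta, w0, max_epochs=100):
    """Count how many weight updates occur before convergence."""
    w = w0.copy()
    updates = 0
    for _ in range(max_epochs):
        changed = False
        for x_i, y_i in zip(X, y):
            y_hat = 1 if np.dot(w, x_i) >= 0 else 0
            if y_hat != y_i:
                w += eta * (y_i - y_hat) * x_i
                updates += 1
                changed = True
        if not changed:
            break
    return updates

# Vary eta with a zero initial vector: w simply scales with eta,
# and sign(w . x) is scale-invariant, so the counts coincide.
for eta in (0.1, 1.0, 10.0):
    print(eta, count_updates(eta, np.zeros(3)))

# A nonzero initial vector can give a different update count.
print(count_updates(1.0, np.array([5.0, -3.0, 1.0])))
```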