From Normalized Augmented Feature Vector, we get the quantity $\boldsymbol{\alpha}^T\mathbf{y}_{j}$ that we want to maximize. Since cost functions are usually things you want to minimize, we negate it and sum over the set $\mathcal{Y}^k$ of samples misclassified at step $k$, giving the perceptron criterion

$$J_{p}(\boldsymbol{\alpha}) = \sum_{\mathbf{y}_{j}\in\mathcal{Y}^k} (-\boldsymbol{\alpha}^T\mathbf{y}_{j})$$

Since $\boldsymbol{\alpha}^T\mathbf{y}_{j} \le 0$ for a misclassified sample, $J_{p}$ is non-negative and hits zero exactly when nothing is misclassified. We want to find the $\boldsymbol{\alpha}$ that minimizes this function.

Obviously, we differentiate with respect to $\boldsymbol{\alpha}$:

$$\nabla J = \frac{ \partial J_{p}(\boldsymbol{\alpha})}{ \partial \boldsymbol{\alpha} } = \sum_{\mathbf{y}_{j}\in\mathcal{Y}^k} (-\mathbf{y}_{j})$$

As such, stepping against the gradient gives $\boldsymbol{\alpha}(k+1) = \boldsymbol{\alpha}(k) + \rho_{k}\sum_{\mathbf{y}_{j}\in\mathcal{Y}^k}\mathbf{y}_{j}$, where $\rho_{k}$ is the learning rate. This is the batch perceptron update formula, i.e. plain gradient descent on $J_{p}$.
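To make the batch update concrete, here is a minimal NumPy sketch (the function name `batch_perceptron`, the fixed $\rho_{k} = 1$, and the toy data are made up for illustration); it assumes the rows of `Y` are already normalized augmented feature vectors, so a solution satisfies $\boldsymbol{\alpha}^T\mathbf{y}_{j} > 0$ for every row:

```python
import numpy as np

def batch_perceptron(Y, rho=1.0, max_iters=1000):
    """Batch perceptron on normalized augmented feature vectors Y (n x d)."""
    alpha = np.zeros(Y.shape[1])
    for _ in range(max_iters):
        # Y^k: the samples currently misclassified, i.e. alpha^T y_j <= 0
        misclassified = Y[Y @ alpha <= 0]
        if len(misclassified) == 0:
            break  # every sample satisfies alpha^T y_j > 0: done
        # alpha(k+1) = alpha(k) + rho * sum of misclassified y_j
        alpha = alpha + rho * misclassified.sum(axis=0)
    return alpha

# Toy 1-D data: class 1 at x = {2, 3}, class 2 at x = {-1, -0.5};
# augment each x with a leading 1, then negate the class-2 rows
# (the normalization step), so one alpha must score all rows positive.
Y = np.array([[ 1.0, 2.0 ],
              [ 1.0, 3.0 ],
              [-1.0, 1.0 ],
              [-1.0, 0.5 ]])
alpha = batch_perceptron(Y)
print(Y @ alpha)  # all entries positive once a separating alpha is found
```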

In the traditional perceptron, we update the learning rate with the variable increment rule:

$$\rho_{k} = \frac{|\boldsymbol{\alpha}(k)^T\mathbf{y}_{j}|}{\| \mathbf{y}_{j} \|^2}$$
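Here is a sketch of how this rate plugs into the single-sample variant (the function name, `lam`, and the small epsilon guard are illustrative assumptions, not from the notes). Note that with $\rho_{k}$ exactly as above, the corrected $\boldsymbol{\alpha}$ lands precisely on the boundary $\boldsymbol{\alpha}^T\mathbf{y}_{j} = 0$, so relaxation-style implementations typically scale it by a factor $\lambda > 1$ to push past the boundary:

```python
import numpy as np

def variable_increment_perceptron(Y, lam=1.5, max_epochs=100):
    """Single-sample perceptron using rho_k = |alpha^T y_j| / ||y_j||^2.

    lam is an assumed relaxation factor: lam = 1 moves alpha exactly
    onto the boundary alpha^T y_j = 0, so lam > 1 pushes past it.
    """
    alpha = np.zeros(Y.shape[1])
    for _ in range(max_epochs):
        errors = 0
        for y in Y:
            if alpha @ y <= 0:  # y is misclassified by the current alpha
                rho = abs(alpha @ y) / (y @ y)
                # guard: the rule yields rho = 0 when alpha^T y = 0
                # exactly (e.g. at the all-zeros initialization)
                alpha = alpha + lam * max(rho, 1e-8) * y
                errors += 1
        if errors == 0:
            break  # one full error-free pass: converged
    return alpha
```

On the same toy `Y` as the previous sketch, `variable_increment_perceptron(Y)` returns an $\boldsymbol{\alpha}$ with every entry of `Y @ alpha` positive.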