Also called the Widrow-Hoff algorithm, it is as follows:

  1. Normalize the augmented feature vectors of all the training samples (refer to Normalized Augmented Feature Vector):

$$ \mathbf{z}_{i}' = \begin{cases} \mathbf{z}_{i} & \text{if } \mathbf{z}_{i} \in \omega_{1} \\ -\mathbf{z}_{i} & \text{if } \mathbf{z}_{i} \in \omega_{2} \end{cases} $$

  2. Initialization: Set $k=0$ and all initial weights to zero, $\boldsymbol{\alpha}(0) = \mathbf{0}$. Set proper target values $b_{i}$ for all samples.

  3. Pick a sample $\mathbf{z}_{j}$ from the training set, compute the gradient, and update the weight:

$$ \boldsymbol{\alpha}(k+1)=\boldsymbol{\alpha}(k) + \rho_{k}(b_{j}-\boldsymbol{\alpha}(k)^T\mathbf{z}_{j})\mathbf{z}_{j} $$

  4. Let $k = k + 1$, and repeat step 3 for all samples until the stopping criterion is met.
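The four steps above can be sketched in NumPy as follows. This is a minimal illustration, not a reference implementation: the function name, the fixed learning rate `rho`, the epoch cap, and the stopping criterion (update norm below a tolerance) are all my own assumptions, and the class labels are taken to be 1 and 2.

```python
import numpy as np

def widrow_hoff(X, y, b, rho=0.05, n_epochs=100, tol=1e-6):
    """Widrow-Hoff (LMS) training on augmented, sign-normalized samples.

    X: (N, d) raw feature vectors; y: labels in {1, 2}; b: (N,) targets.
    Returns the augmented weight vector alpha of length d + 1.
    """
    N, d = X.shape
    # Step 1: augment with a bias term, then flip the sign of class-2 samples
    Z = np.hstack([np.ones((N, 1)), X])
    Z[y == 2] *= -1
    # Step 2: start from the zero weight vector
    alpha = np.zeros(d + 1)
    # Steps 3-4: cycle through the samples, stepping along the LMS gradient
    for k in range(n_epochs * N):
        j = k % N
        update = rho * (b[j] - alpha @ Z[j]) * Z[j]
        alpha += update
        if np.linalg.norm(update) < tol:  # assumed stopping criterion
            break
    return alpha

# Toy usage: two linearly separable clusters, with b_i = 1 for every sample
X = np.array([[2.0, 2.0], [3.0, 3.0], [-2.0, -2.0], [-3.0, -3.0]])
y = np.array([1, 1, 2, 2])
alpha = widrow_hoff(X, y, b=np.ones(len(X)))
```

After training, every sign-normalized sample should satisfy $\boldsymbol{\alpha}^T\mathbf{z}_j' > 0$, i.e. the learned hyperplane separates the two classes.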

There are a few options for how to set the values of $b_{i}$.

  1. If we follow Linear Discriminant Analysis, we can relate the MSE solution to Fisher's linear discriminant,

    and set $$ b_{i} = \begin{cases} N/N_{1} & \text{if } \mathbf{z}_{i} \in \omega_{1} \\ N/N_{2} & \text{if } \mathbf{z}_{i} \in \omega_{2} \end{cases} $$ where $N_{1}$ and $N_{2}$ are the number of samples in each class.

  2. Otherwise, we can approximate the Bayes discriminant instead, by setting $$ b_{i}= 1, \quad i = 1, \dots, N $$
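Both target choices are just different vectors $\mathbf{b}$ fed into the same update rule. The snippet below builds each of them for a small hypothetical label vector; the class-proportional targets $N/N_{1}$, $N/N_{2}$ follow the classical Fisher-discriminant connection for MSE procedures, which is an assumption on my part about what the omitted values were.

```python
import numpy as np

# Hypothetical label vector: 3 samples from class 1, 2 samples from class 2
y = np.array([1, 1, 1, 2, 2])
N = len(y)
N1, N2 = np.sum(y == 1), np.sum(y == 2)

# Option 1 (Fisher/LDA-style targets): b_i = N/N_1 or N/N_2 by class
b_lda = np.where(y == 1, N / N1, N / N2)

# Option 2 (Bayes-approximating targets): b_i = 1 for every sample
b_bayes = np.ones(N)
```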

This is somewhat beyond the scope of the course, so I won't dig into why the approximation works.