Mason Wang

Linear Classifiers

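From CS 231N. A linear classifier computes one score per class as a linear function of the input, $s = Wx + b$, and predicts the class with the highest score.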
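A minimal sketch of a linear classifier, assuming the usual CS 231N score function $s = Wx + b$ with one row of $W$ per class. The function name `linear_scores` and the toy values of `W`, `b`, and `x` are made up for illustration.

```python
def linear_scores(W, b, x):
    """Return one score per class: s_j = sum_k W[j][k] * x[k] + b[j]."""
    return [sum(w_jk * x_k for w_jk, x_k in zip(row, x)) + b_j
            for row, b_j in zip(W, b)]

# Hypothetical toy problem: 3 classes, 2 input features.
W = [[1.0, 0.0],
     [0.0, 1.0],
     [1.0, 1.0]]
b = [0.0, 0.5, -1.0]
x = [2.0, 3.0]

scores = linear_scores(W, b, x)  # [2.0, 3.5, 4.0]
# The prediction is the index of the largest score.
predicted_class = max(range(len(scores)), key=scores.__getitem__)  # 2
```

Each row of `W` acts as a template for one class, and the bias `b` shifts that class's score independently of the input.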

SVM

Regularization

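We can set $\Delta$ to $1.0$ and only tune $\lambda$: scaling the weights $W$ scales all the scores and their differences, so the margin $\Delta$ and the regularization strength $\lambda$ control the same tradeoff.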
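A rough sketch of why $\Delta$ and $\lambda$ are tied together, assuming the CS 231N formulation: the multiclass SVM loss is $L_i = \sum_{j \neq y_i} \max(0, s_j - s_{y_i} + \Delta)$ and the full loss adds an L2 penalty $\lambda \sum W^2$. The function names and toy scores below are illustrative assumptions, not taken from these notes.

```python
def svm_loss(scores, y, delta=1.0):
    """Multiclass hinge loss for one example with correct class y."""
    return sum(max(0.0, s_j - scores[y] + delta)
               for j, s_j in enumerate(scores) if j != y)

def l2_penalty(W):
    """Sum of squared weights (assumed L2 regularizer)."""
    return sum(w * w for row in W for w in row)

def full_loss(all_scores, labels, W, lam, delta=1.0):
    """Mean data loss over the examples plus lam times the L2 penalty."""
    data_loss = sum(svm_loss(s, y, delta)
                    for s, y in zip(all_scores, labels)) / len(labels)
    return data_loss + lam * l2_penalty(W)

# Hypothetical scores for one example whose correct class is 0.
scores = [3.0, 1.0, 2.5]
base = svm_loss(scores, 0, delta=1.0)                       # 0.5

# Doubling W doubles every score; doubling delta as well just doubles the loss.
scaled = svm_loss([2.0 * s for s in scores], 0, delta=2.0)  # 1.0
```

Because rescaling the weights can stretch or shrink the score gaps to meet any fixed margin, the exact value of `delta` matters little; what matters is how large the weights are allowed to grow, which `lam` controls.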

Softmax

\(L_i = - \log \frac{e^{s_{y_i}}}{\sum_k e^{s_k}}\) \(= - s_{y_i} + \log \sum_k e^{s_k}\)
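The second form of the loss, $-s_{y_i} + \log \sum_k e^{s_k}$, is convenient to compute directly. The sketch below does so, subtracting the maximum score before exponentiating to avoid overflow; this shift is a standard numerical trick and does not change the value. The function name `softmax_loss` is an assumption for illustration.

```python
import math

def softmax_loss(scores, y):
    """Cross-entropy loss L_i = -s_y + log(sum_k exp(s_k)) for one example."""
    m = max(scores)  # shift by the max score for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(s - m) for s in scores))
    return -scores[y] + log_sum_exp

# Two equal scores give probability 1/2 to each class, so the loss is log 2.
loss_uniform = softmax_loss([0.0, 0.0], 0)

# Adding a constant to every score leaves the loss unchanged,
# and the max shift keeps large scores from overflowing exp().
loss_small = softmax_loss([1.0, 2.0, 3.0], 2)
loss_large = softmax_loss([1001.0, 1002.0, 1003.0], 2)
```

Without the shift, `math.exp(1003.0)` would raise an `OverflowError`, even though the loss itself is a small number.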