support vector classifiers can be written as where

  • is the bias
  • is the parameter
  • is the observation

The kernel trick is to replace the dot product with a functional :

It allows for non-linear decision boundaries.