where

  • is the observed value
  • is the predicted value

Note: Squaring the error will more heavily penalize a few large errors than many small errors. Even if the sum of errors is the same.