Is a common metric in classification.

Fails when classes are high imbalanced. For these cases f1-score is more appropriate

where

  • is the number of observations;
  • is the indicator function
  • is the predicted value
  • is the observed value