Bayesian Information Criterion
Assumptions
- The approximation is only valid for sample size much larger than the number of parameters in the model.
- The BIC cannot handle complex collections of models as in the variable selection problem in high-dimension.
Definition
The BIC is formally defined as
where
- is the maximized value of the likelihood function of the model , i.e. with is the parameter value that maximizes the Likelihood Function.
- is the observed data.
- is the number of data points in , the number of observations.
- is the number of parameters estimated by the model.