ML.MD

Search

❯

❯

Bayesian Information Criterion

Bayesian Information Criterion

Apr 23, 20241 min read

Bayesian Information Criterion

Assumptions

The approximation is only valid for sample size $n$ much larger than the number $k$ of parameters in the model.
The BIC cannot handle complex collections of models as in the variable selection problem in high-dimension.

Definition

The BIC is formally defined as

BIC = k ln (n) - 2 ln (\hat{L}),

where

$\hat{L}$ is the maximized value of the likelihood function of the model $M$ , i.e. $\hat{L} = p (x ∣ \hat{θ}, M)$ with $\hat{θ}$ is the parameter value that maximizes the Likelihood Function.
$x$ is the observed data.
$n$ is the number of data points in $x$ , the number of observations.
$k$ is the number of parameters estimated by the model.

Remarks

BIC and AIC penalties: BIC is similar to AIC, but with a different penalty for the number of parameters. With AIC this penalty is $2 k$ , whereas with BIC the penalty is $ln (n) k$ .

Graph View

Bayesian Information Criterion
Assumptions
Definition
Remarks

Backlinks

No backlinks found

Created with Quartz v4.2.3 © 2024

GitHub
Discord Community