statsmodels.stats.outliers_influence.variance_inflation_factor

statsmodels.stats.outliers_influence.variance_inflation_factor(exog, exog_idx)[source]

variance inflation factor, VIF, for one exogenous variable

The variance inflation factor is a measure for the increase of the variance of the parameter estimates if an additional variable, given by exog_idx is added to the linear regression. It is a measure for multicollinearity of the design matrix, exog.

One recommendation is that if VIF is greater than 5, then the explanatory variable given by exog_idx is highly collinear with the other explanatory variables, and the parameter estimates will have large standard errors because of this.

Parameters:
  • exog (ndarray) – design matrix with all explanatory variables, as for example used in regression
  • exog_idx (int) – index of the exogenous variable in the columns of exog
Returns:

vif – variance inflation factor

Return type:

float

Notes

This function does not save the auxiliary regression.

See also

xxx
class for regression diagnostics TODO: doesn’t exist yet

References

http://en.wikipedia.org/wiki/Variance_inflation_factor