statsmodels.genmod.generalized_linear_model.GLM.fit_regularized¶

GLM.fit_regularized(method='elastic_net', alpha=0.0, start_params=None, refit=False, opt_method='bfgs', **kwargs)[source]¶

Return a regularized fit to a linear regression model.

Parameters:

method{‘elastic_net’}: Only the elastic_net approach is currently implemented.
alphascalar or array_like: The penalty weight. If a scalar, the same penalty weight applies to all variables in the model. If a vector, it must have the same length as params, and contains a penalty weight for each coefficient.
start_paramsarray_like: Starting values for params.
refitbool: If True, the model is refit using only the variables that have non-zero coefficients in the regularized fit. The refitted model is not regularized.
opt_methodstr: The method used for numerical optimization.
**kwargs: Additional keyword arguments used when fitting the model.

Returns:

Notes

The penalty is the elastic net penalty, which is a combination of L1 and L2 penalties.

The function that is minimized is:

- l o g l i k e / n + a l p h a * ((1 - L 1_w t) * | p a r a m s |_{2}^{2} / 2 + L 1_w t * | p a r a m s |_{1})

where $| * |_{1}$ and $| * |_{2}$ are the L1 and L2 norms.

Post-estimation results are based on the same data used to select variables, hence may be subject to overfitting biases.

The elastic_net method uses the following keyword arguments:

maxiterint: Maximum number of iterations
L1_wtfloat: Must be in [0, 1]. The L1 penalty has weight L1_wt and the L2 penalty has weight 1 - L1_wt.
cnvrg_tolfloat: Convergence threshold for maximum parameter change after one sweep through all coefficients.
zero_tolfloat: Coefficients below this threshold are treated as zero.