Pre 0.5.0 Release History¶
0.5.0¶
Main Changes and Additions * Add patsy dependency
Compatibility and Deprecation
- cleanup of import paths (lowess)
Bug Fixes
- input shapes of tools.isestimable
Enhancements and Additions
- formula integration based on patsy (new dependency)
- Time series analysis - ARIMA modeling - enhanced forecasting based on pandas datetime handling
- expanded margins for discrete models
- OLS outlier test
- empirical likelihood - Google Summer of Code 2012 project - inference for descriptive statistics - inference for regression models - accelerated failure time models
- expanded probability plots
- improved graphics - plotcorr - acf and pacf
- new datasets
- new and improved tools - numdiff numerical differentiation
0.4.3¶
The only change compared to 0.4.2 is for compatibility with python 3.2.3 (changed behavior of 2to3)
0.4.2¶
This is a bug-fix release, that affects mainly Big-Endian machines.
Bug Fixes
- discrete_model.MNLogit fix summary method
- tsa.filters.hp_filter don’t use umfpack on Big-Endian machine (scipy bug)
- the remaining fixes are in the test suite, either precision problems on some machines or incorrect testing on Big-Endian machines.
0.4.1¶
This is a backwards compatible (according to our test suite) release with bug fixes and code cleanup.
Bug Fixes
- build and distribution fixes
- lowess correct distance calculation
- genmod correction CDFlink derivative
- adfuller _autolag correct calculation of optimal lag
- het_arch, het_lm : fix autolag and store options
- GLSAR: incorrect whitening for lag>1
Other Changes
- add lowess and other functions to api and documentation
- rename lowess module (old import path will be removed at next release)
- new robust sandwich covariance estimators, moved out of sandbox
- compatibility with pandas 0.8
- new plots in statsmodels.graphics - ABLine plot - interaction plot
0.4.0¶
Main Changes and Additions
- Added pandas dependency.
- Cython source is built automatically if cython and compiler are present
- Support use of dates in timeseries models
- Improved plots - Violin plots - Bean Plots - QQ Plots
- Added lowess function
- Support for pandas Series and DataFrame objects. Results instances return pandas objects if the models are fit using pandas objects.
- Full Python 3 compatibility
- Fix bugs in genfromdta. Convert Stata .dta format to structured array preserving all types. Conversion is much faster now.
- Improved documentation
- Models and results are pickleable via save/load, optionally saving the model data.
- Kernel Density Estimation now uses Cython and is considerably faster.
- Diagnostics for outlier and influence statistics in OLS
- Added El Nino Sea Surface Temperatures dataset
- Numerous bug fixes
- Internal code refactoring
- Improved documentation including examples as part of HTML
Changes that break backwards compatibility
Deprecated scikits namespace. The recommended import is now:
import statsmodels.api as sm
model.predict methods signature is now (params, exog, …) where before it assumed that the model had been fit and omitted the params argument.
For consistency with other multi-equation models, the parameters of MNLogit are now transposed.
tools.tools.ECDF -> distributions.ECDF
tools.tools.monotone_fn_inverter -> distributions.monotone_fn_inverter
tools.tools.StepFunction -> distributions.StepFunction
0.3.1¶
- Removed academic-only WFS dataset.
- Fix easy_install issue on Windows.
0.3.0¶
Changes that break backwards compatibility
Added api.py for importing. So the new convention for importing is:
import statsmodels.api as sm
Importing from modules directly now avoids unnecessary imports and increases the import speed if a library or user only needs specific functions.
- sandbox/output.py -> iolib/table.py
- lib/io.py -> iolib/foreign.py (Now contains Stata .dta format reader)
- family -> families
- families.links.inverse -> families.links.inverse_power
- Datasets’ Load class is now load function.
- regression.py -> regression/linear_model.py
- discretemod.py -> discrete/discrete_model.py
- rlm.py -> robust/robust_linear_model.py
- glm.py -> genmod/generalized_linear_model.py
- model.py -> base/model.py
- t() method -> tvalues attribute (t() still exists but raises a warning)
Main changes and additions
- Numerous bugfixes.
- Time Series Analysis model (tsa)
- Vector Autoregression Models VAR (tsa.VAR)
- Autogressive Models AR (tsa.AR)
- Autoregressive Moving Average Models ARMA (tsa.ARMA) optionally uses Cython for Kalman Filtering use setup.py install with option –with-cython
- Baxter-King band-pass filter (tsa.filters.bkfilter)
- Hodrick-Prescott filter (tsa.filters.hpfilter)
- Christiano-Fitzgerald filter (tsa.filters.cffilter)
- Improved maximum likelihood framework uses all available scipy.optimize solvers
- Refactor of the datasets sub-package.
- Added more datasets for examples.
- Removed RPy dependency for running the test suite.
- Refactored the test suite.
- Refactored codebase/directory structure.
- Support for offset and exposure in GLM.
- Removed data_weights argument to GLM.fit for Binomial models.
- New statistical tests, especially diagnostic and specification tests
- Multiple test correction
- General Method of Moment framework in sandbox
- Improved documentation
- and other additions
0.2.0¶
Main changes
- renames for more consistency RLM.fitted_values -> RLM.fittedvalues GLMResults.resid_dev -> GLMResults.resid_deviance
- GLMResults, RegressionResults: lazy calculations, convert attributes to properties with _cache
- fix tests to run without rpy
- expanded examples in examples directory
- add PyDTA to lib.io – functions for reading Stata .dta binary files and converting them to numpy arrays
- made tools.categorical much more robust
- add_constant now takes a prepend argument
- fix GLS to work with only a one column design
New
- add four new datasets
- A dataset from the American National Election Studies (1996)
- Grunfeld (1950) investment data
- Spector and Mazzeo (1980) program effectiveness data
- A US macroeconomic dataset
- add four new Maximum Likelihood Estimators for models with a discrete dependent variables with examples
- Logit
- Probit
- MNLogit (multinomial logit)
- Poisson
Sandbox
- add qqplot in sandbox.graphics
- add sandbox.tsa (time series analysis) and sandbox.regression (anova)
- add principal component analysis in sandbox.tools
- add Seemingly Unrelated Regression (SUR) and Two-Stage Least Squares for systems of equations in sandbox.sysreg.Sem2SLS
- add restricted least squares (RLS)
0.1.0b1¶
- initial release