statsmodels.tsa.statespace.representation.Representation

class statsmodels.tsa.statespace.representation.Representation(k_endog, k_states, k_posdef=None, initial_variance=1000000.0, nobs=0, dtype=<class 'numpy.float64'>, design=None, obs_intercept=None, obs_cov=None, transition=None, state_intercept=None, selection=None, state_cov=None, statespace_classes=None, **kwargs)[source]

State space representation of a time series process

Parameters:
  • k_endog (array_like or integer) – The observed time-series process \(y\) if array like or the number of variables in the process if an integer.
  • k_states (int) – The dimension of the unobserved state process.
  • k_posdef (int, optional) – The dimension of a guaranteed positive definite covariance matrix describing the shocks in the measurement equation. Must be less than or equal to k_states. Default is k_states.
  • initial_variance (float, optional) – Initial variance used when approximate diffuse initialization is specified. Default is 1e6.
  • initialization ({'approximate_diffuse','stationary','known'}, optional) – Initialization method for the initial state.
  • initial_state (array_like, optional) – If known initialization is used, the mean of the initial state’s distribution.
  • initial_state_cov (array_like, optional) – If known initialization is used, the covariance matrix of the initial state’s distribution.
  • nobs (integer, optional) – If an endogenous vector is not given (i.e. k_endog is an integer), the number of observations can optionally be specified. If not specified, they will be set to zero until data is bound to the model.
  • dtype (np.dtype, optional) – If an endogenous vector is not given (i.e. k_endog is an integer), the default datatype of the state space matrices can optionally be specified. Default is np.float64.
  • design (array_like, optional) – The design matrix, \(Z\). Default is set to zeros.
  • obs_intercept (array_like, optional) – The intercept for the observation equation, \(d\). Default is set to zeros.
  • obs_cov (array_like, optional) – The covariance matrix for the observation equation \(H\). Default is set to zeros.
  • transition (array_like, optional) – The transition matrix, \(T\). Default is set to zeros.
  • state_intercept (array_like, optional) – The intercept for the transition equation, \(c\). Default is set to zeros.
  • selection (array_like, optional) – The selection matrix, \(R\). Default is set to zeros.
  • state_cov (array_like, optional) – The covariance matrix for the state equation \(Q\). Default is set to zeros.
  • **kwargs – Additional keyword arguments. Not used directly. It is present to improve compatibility with subclasses, so that they can use **kwargs to specify any default state space matrices (e.g. design) without having to clean out any other keyword arguments they might have been passed.
nobs

int – The number of observations.

k_endog

int – The dimension of the observation series.

k_states

int – The dimension of the unobserved state process.

k_posdef

int – The dimension of a guaranteed positive definite covariance matrix describing the shocks in the measurement equation.

shapes

dictionary of name:tuple – A dictionary recording the initial shapes of each of the representation matrices as tuples.

initialization

str – Kalman filter initialization method. Default is unset.

initial_variance

float – Initial variance for approximate diffuse initialization. Default is 1e6.

Notes

A general state space model is of the form

\[\begin{split}y_t & = Z_t \alpha_t + d_t + \varepsilon_t \\ \alpha_t & = T_t \alpha_{t-1} + c_t + R_t \eta_t \\\end{split}\]

where \(y_t\) refers to the observation vector at time \(t\), \(\alpha_t\) refers to the (unobserved) state vector at time \(t\), and where the irregular components are defined as

\[\begin{split}\varepsilon_t \sim N(0, H_t) \\ \eta_t \sim N(0, Q_t) \\\end{split}\]

The remaining variables (\(Z_t, d_t, H_t, T_t, c_t, R_t, Q_t\)) in the equations are matrices describing the process. Their variable names and dimensions are as follows

Z : design \((k\_endog \times k\_states \times nobs)\)

d : obs_intercept \((k\_endog \times nobs)\)

H : obs_cov \((k\_endog \times k\_endog \times nobs)\)

T : transition \((k\_states \times k\_states \times nobs)\)

c : state_intercept \((k\_states \times nobs)\)

R : selection \((k\_states \times k\_posdef \times nobs)\)

Q : state_cov \((k\_posdef \times k\_posdef \times nobs)\)

In the case that one of the matrices is time-invariant (so that, for example, \(Z_t = Z_{t+1} ~ \forall ~ t\)), its last dimension may be of size \(1\) rather than size nobs.

References

[*]Durbin, James, and Siem Jan Koopman. 2012. Time Series Analysis by State Space Methods: Second Edition. Oxford University Press.

Methods

bind(endog) Bind data to the statespace representation
initialize_approximate_diffuse([variance]) Initialize the statespace model with approximate diffuse values.
initialize_known(initial_state, …) Initialize the statespace model with known distribution for initial state.
initialize_stationary() Initialize the statespace model as stationary.

Attributes

design (array) Design matrix\(Z~(k\_endog \times k\_states \times nobs)\)
dtype (dtype) Datatype of currently active representation matrices
endog (array) The observation vector, alias for obs.
obs (array) Observation vector\(y~(k\_endog \times nobs)\)
obs_cov (array) Observation covariance matrix\(H~(k\_endog \times k\_endog \times nobs)\)
obs_intercept (array) Observation intercept\(d~(k\_endog \times nobs)\)
prefix (str) BLAS prefix of currently active representation matrices
selection (array) Selection matrix\(R~(k\_states \times k\_posdef \times nobs)\)
state_cov (array) State covariance matrix\(Q~(k\_posdef \times k\_posdef \times nobs)\)
state_intercept (array) State intercept\(c~(k\_states \times nobs)\)
time_invariant (bool) Whether or not currently active representation matrices are time-invariant
transition (array) Transition matrix\(T~(k\_states \times k\_states \times nobs)\)