Definition

In a switching-kalman-filter with discrete regimes and Markov regime dynamics, the exact filtering posterior is a mixture of Gaussians — one component per possible regime history . This is exponential belief-state growth: the representation cost of the exact posterior doubles (and more) at every time step, making exact inference intractable beyond a handful of steps even though the model is conditionally linear-Gaussian.

Intuition

Suppose at time the prior is a mixture of Gaussians, one per value of . To advance to , each of those Gaussians must be propagated through each of the regime- Kalman dynamics (because the regime can switch), producing Gaussians. Each of those must in turn be advanced through dynamics to time 3, giving . By time , the posterior has components. The components are not redundant in general: they correspond to genuinely different regime histories with different posterior means and covariances.

This is the central computational obstruction in switching state-space models and is the reason every practical SKF inference algorithm — GPB1, GPB2, IMM, MHT, RBPF, variational SKF — is an approximation that controls growth either by collapsing (gpb-imm-collapsing-filters), by selecting a subset of paths (Multiple-Hypothesis Tracking, RBPF), by sampling ( via MCMC), or by breaking the vertical links variationally.

Formal notation

Let be the regime, the continuous state, and the observations. The exact filtering posterior factors as

where each is a Gaussian (the result of running the Kalman filter conditional on the regime path ). The number of components is , and the per-component bookkeeping (Kalman mean + covariance) is , so the total exact-inference cost at horizon is .

Variants

  • History buffer of length — keep only the last regime values, so the mixture has components (this is GPB(); see gpb-imm-collapsing-filters)
  • Path selection — keep only the highest-probability regime paths (Multiple-Hypothesis Tracking, particle filters)
  • Compound regime explosion — if the regime has factored structure with states each and the chains are independent, the effective regime count is and the explosion is . This is the relevant scaling for the CRE asset-pricing model in this workspace, which uses two independent binary chains (monetary policy × wage rigidity) for compound regimes.

Comparison

  • vs HMM forward algorithm cost — an HMM with states has exact inference cost because the latent variable is discrete and can be marginalised at every step. The SKF cannot do this marginalisation in closed form because the continuous state entangles histories.
  • vs Linear Dynamical System (LDS) — an LDS has exact Kalman cost. The SKF inherits this per regime path, so the SKF cost is — exponentially worse in .
  • vs Dynamic Bayesian Network (DBN) — a DBN with all-discrete state of cardinality has junction-tree inference cost. The SKF avoids the discretisation cost of large but pays the history cost instead.

When to use

This is a negative result — it tells you when not to attempt exact inference. Specifically:

  • If and , exact SKF inference is already infeasible (mixture has components)
  • For real macro / financial time series ( in the hundreds), exact inference is utterly impossible at any
  • For the CRE asset-pricing model ( compound regimes, quarterly observations), the exact mixture would have components

The implication is that every production SKF must use one of the four approximation families catalogued in murphy-1998-switching-kalman-filters.

Known limitations

  • The bound is tight in the worst case: you cannot avoid it by exploiting symmetry unless the model has explicit parameter tying that reduces the effective regime count.
  • The exponential growth applies even when most regime histories have vanishing posterior mass — the difficulty is the bookkeeping, not the numerical mass.
  • “Spreading-out” arguments (Boyen-Koller [BK98a, BK98b]) bound the error of the collapsing approximation but do not reduce the exact-inference cost.

Open problems

  • Sharper conditions under which a low-dimensional -step history buffer suffices (i.e. when do far-past regime histories truly stop mattering?)
  • Adaptive history-buffer sizing based on regime entropy / mixing time
  • Exact inference via factor-graph contraction tricks for SKFs with sparse matrices

Key papers

My understanding

The explosion is the central reason every modern switching-state-space algorithm — GPB, IMM, MHT, particle filters, RBPF, variational SKF — exists at all. For this workspace’s CRE asset-pricing model, the relevant cost is exact mixture components, which is the entire reason the model uses a Rao-Blackwellised particle filter that selects a small bag of regime histories rather than enumerating them. Knowing that exact inference is provably infeasible (rather than merely “hard”) is also the right argument to give when defending an approximate-inference design choice in a paper or rebuttal.