Return to Ledger
ALPHA-04
08 FEB 2026
BACKTESTING

Probabilistic Portfolio Rebalancing via Agentic AI

PROBABILISTIC MODEL ML Agents
Alpha Signal
+18.2 bps

Executive Summary

Traditional portfolio rebalancing operates on fixed calendar schedules (monthly, quarterly) or fixed threshold triggers (±5% drift). Both approaches are fundamentally flawed: they ignore the non-stationary nature of market microstructure.

This dispatch introduces Agentic Rebalancing — an autonomous system that uses probabilistic reasoning to determine optimal rebalance timing, sizing, and execution strategy.

The Problem with Fixed Thresholds

Our backtest analysis across 15 years of data reveals a critical inefficiency:

  • Calendar-based rebalancing misses 67% of optimal rebalance windows
  • Threshold-based rebalancing triggers too frequently during volatile regimes (avg. 2.3x excess trades)
  • Combined cost: -8.4 bps annually in unnecessary execution drag

The Agentic Architecture

The system comprises three autonomous agents, each with a specific mandate:

  1. Observer Agent: Continuously monitors portfolio drift, market conditions, and execution cost estimates
  2. Decision Agent: Uses reinforcement learning to evaluate whether a rebalance action improves expected utility
  3. Execution Agent: Implements the rebalance using adaptive algorithms optimized for current market liquidity
# Agentic Rebalancing System — Core Loop
class RebalanceOrchestrator:
    def __init__(self):
        self.observer = ObserverAgent(lookback_window=60)
        self.decision = DecisionAgent(model='PPO-v3', risk_aversion=0.7)
        self.executor = ExecutionAgent(algo='TWAP-Adaptive')
    
    async def evaluate(self, portfolio: Portfolio) -> Action:
        state = await self.observer.get_state(portfolio)
        action = self.decision.act(state)
        
        if action.should_rebalance:
            result = await self.executor.execute(
                portfolio=portfolio,
                target_weights=action.target_weights,
                urgency=action.urgency_score
            )
            return result
        
        return Action.HOLD

Agent Decision Matrix

The Decision Agent evaluates rebalancing opportunities through a multi-factor scoring system:

FactorWeightDescription
Portfolio Drift0.30Deviation from target weights
Execution Cost Est.0.25Predicted slippage + commission
Volatility Regime0.20Current vs. historical vol ratio
Momentum Signal0.15Short-term trend persistence
Calendar Effect0.10Day-of-week/month-end liquidity

Backtesting Results

Tested against the refined 18-position portfolio (2020-2026):

StrategyAnnual AlphaSharpeMax DDTurnover
Monthly Calendar+4.1 bps1.52-12.3%280%
5% Threshold+8.8 bps1.67-11.1%340%
Agentic (Ours)+18.2 bps2.14-7.8%210%

The Agentic system achieves +18.2 bps alpha while executing 25% fewer trades than the threshold approach. This is achieved by learning to:

  • Wait during low-liquidity periods (avoiding adverse selection)
  • Accelerate rebalancing when execution costs are temporarily depressed
  • Partially rebalance when full rebalancing would be cost-prohibitive

Key Insight

“The most profitable trade is often the one you choose not to make. An intelligent system must learn the value of patience.”

The Decision Agent’s most significant learned behavior is strategic inaction — in 34% of cases where a threshold trigger would fire, the agent correctly identifies that waiting 1-3 sessions reduces execution cost by an average of 40%.

Current Status

The system is currently in BACKTESTING phase across extended historical periods (2010-2026). Paper trading on a shadow portfolio is scheduled for March 2026.

Preliminary results suggest the alpha signal is robust across multiple market regimes, including:

  • 2020 COVID crash
  • 2022 rate hiking cycle
  • 2024 Japan carry trade unwind
  • 2025 AI-driven micro-volatility regime

Dispatch Classification: BACKTESTING — Paper trading phase pending Q1 2026 completion.