Quantitative Research Report · February 2026

Systematic Futures Alpha: Statistical Evidence from a Multi-Strategy Portfolio

We present statistical evidence of persistent alpha with near-zero market beta generated by a systematic multi-strategy portfolio over a 74-month evaluation period (January 2020 – February 2026).

Key Findings at a Glance

1

Alpha is statistically significant

t = 5.75, p < 0.001, exceeds Harvey-Liu-Zhu threshold of 3.0

2

Returns are market-independent

Beta = 0.158, R-squared = 0.014 vs NASDAQ-100

3

Profitable in all market regimes

+6.6% bull, +3.7% bear, +3.6% flat months

4

Negative conditional betas

Both upside (-0.08) and downside (-0.24) betas are negative — true market independence

5

Strong consistency

59/74 months positive, 7/7 years positive, rolling 12-mo Sharpe never < 0.99

6

Genuine diversification

Mean pairwise correlation = 0.020, 4.0x Sharpe multiplier across 19 strategies

7

Sub-period persistence

Sharpe 2.13 (2020-22) improving to 3.32 (2023-26)

8

Large sample confirmation

10,361 trades, 44.2% win rate across 74 months, profit factor 1.25

9

Favorably skewed returns

Skewness +1.13, worst month only -9.6% vs NQ's -13.3%

10

Superior risk-adjusted returns

Sharpe 2.51 vs 0.97 benchmark, Calmar 3.88 vs 0.63, Max DD 19.9% vs 32.2%

11

Cross-asset diversification

Gold and ETH strategies add structurally uncorrelated returns

1. Introduction

A central question in evaluating any systematic trading program is whether observed returns represent genuine skill (alpha) or can be explained by exposure to compensated risk factors (beta). This report subjects a 19-strategy portfolio trading equity index futures, commodity futures, and a digital asset to rigorous statistical analysis to distinguish between these explanations.

1.1 Portfolio Description

The portfolio trades four instruments across three asset classes:

Asset ClassInstrument# StrategiesTimeframe
U.S. Equity IndexNASDAQ-100 (NQ)145-minute and 30-minute intraday
U.S. Equity IndexS&P 500 (ES)11-hour
CommodityGold (GC)35-minute intraday
Digital AssetEthereum (ETH)11-hour

The strategies employ a variety of technical and statistical methodologies including trend-following, mean-reversion, volatility regime detection, and pattern recognition. Individual strategy details are proprietary and deliberately omitted from this analysis. The focus is exclusively on the statistical properties of the aggregate portfolio's return stream.

The 19 strategies were selected from a development universe of 133 candidates spanning equity index futures (41 NQ/ES strategies), commodity futures (41 gold + 41 silver strategies), and digital assets (10 crypto strategies). Selection was performed once based on profitability and drawdown criteria; all subsequent analysis uses the fixed portfolio without further optimization. The inclusion of gold commodity futures alongside equity index strategies provides structural diversification through exposure to an asset class driven by distinct factors — inflation expectations, central bank policy, and geopolitical risk — rather than equity market sentiment.

1.2 Methodology

All results are derived from event-driven backtesting against historical market data from January 2020 through February 2026. The backtest incorporates:

  • Equity-proportional position sizing: Position sizes scale with portfolio equity (size = equity / initial capital), with VIX-based scaling overlay applied on top
  • VIX-based risk overlay: Proprietary multi-tier VIX-responsive position scaling that dynamically adjusts exposure based on implied volatility regimes. Our most volatility-sensitive strategy uses additional VIX-based sizing and adaptive stops
  • Daily loss circuit breaker: Proprietary daily loss threshold halts new entries for the remainder of the trading day
  • Instrument-specific transaction costs: 0.5 points per fill on NQ and GC; 0.2 points per fill on ES; 5 basis points per fill on ETH
  • Gross of fees: All returns are presented before management fees, performance fees, and fund-level expenses. Only execution slippage is deducted
  • Execution assumption: Market orders at bar close with slippage deduction
  • No look-ahead bias: All indicators computed using data available at the time of signal generation
  • Strategy selection: 19 strategies selected from 133 candidates across all instruments and asset classes. Selection was performed once; all subsequent analysis uses the fixed portfolio without further optimization

The initial capital base is $100,000. All returns are computed on a mark-to-market basis with running equity denominator.

1.3 Benchmark Selection

The primary benchmark is a NASDAQ-100 buy-and-hold position, selected because the largest allocation (14 of 19 strategies) trades this index. A secondary S&P 500 benchmark is included for cross-reference. Monthly returns for both benchmarks are computed from daily closing prices over the identical evaluation period and from the same data source (Databento back-adjusted continuous futures) as the portfolio strategies.

2. Headline Performance

2.1 Absolute Return Profile

MetricPortfolioNQ Buy-HoldES Buy-Hold
CAGR77.4%20.3%14.3%
Annualized Volatility30.8%21.7%
Sharpe Ratio2.510.970.81
Calmar Ratio3.880.63
Max Drawdown19.9%32.2%
Final Equity ($100K start)$3.3M$312,942
Positive Months59/74 (79.7%)47/74 (63.5%)
Positive Years7/7 (100%)5/7 (71%)

The portfolio generated cumulative returns of 3,219%, versus 213% for the NASDAQ-100 over the same period. The Sharpe ratio of 2.51 exceeds the benchmark by a factor of 2.59x, while annualized volatility of 30.8% reflects the equity-proportional sizing that scales positions with portfolio growth.

The maximum drawdown of 19.9% compares favorably with the NASDAQ-100 benchmark's maximum drawdown of 32.2% — roughly 60% of the benchmark's peak-to-trough loss. This controlled drawdown is achieved through a VIX-based risk overlay that reduces position sizes during elevated volatility, adaptive stop-loss management, and a daily loss circuit breaker. The Calmar ratio of 3.88 (CAGR / max drawdown) indicates strong compensation for peak drawdown risk — substantially better than the benchmark's 0.63.

2.2 Annual Return Consistency

YearPortfolio ReturnNQ ReturnOutperformance
2020+139.2%+48.4%+90.8%
2021+89.1%+26.5%+62.6%
2022+29.7%-31.6%+61.3%
2023+93.0%+67.8%+25.2%
2024+89.1%+24.5%+64.6%
2025+38.8%+20.4%+18.4%
2026 (Jan-Feb)+12.1%-2.9%+15.0%

The portfolio was profitable in every calendar year. Of particular note is 2022, when the portfolio returned +29.7% while the NASDAQ-100 declined -31.6%, demonstrating the portfolio's capacity to generate absolute returns irrespective of market direction. With equity-proportional sizing, the portfolio captured outsized gains in high-conviction years (2020: +139%, 2023: +93%, 2024: +89%) while maintaining positive returns even in the most challenging environments.

Note on position sizing: The backtest uses equity-proportional position sizing (position size = equity / initial capital) with VIX-based scaling overlays. Monthly percentage returns are computed against running equity, providing a realistic view of compounded growth from the $100K starting capital.

3. Alpha-Beta Decomposition

3.1 CAPM Regression

We estimate the single-factor model:

Rp − Rf = α + β(Rm − Rf) + ε

Where Rp is the portfolio monthly return, Rm is the NASDAQ-100 monthly return, and Rf is assumed zero. This is a conservative simplification: using a positive risk-free rate would reduce the benchmark's excess return more than the portfolio's, marginally increasing measured alpha. Results from OLS regression on 74 monthly observations:

ParameterEstimateStd. Errort-statisticp-value
Alpha (monthly)4.82%0.84%5.75< 0.001
Alpha (annualized, x12)57.8%
Beta0.158
R-squared0.014
Correlation to NQ0.118

3.2 Interpretation

1. Alpha is large and statistically significant. The annualized alpha of 57.8% carries a t-statistic of 5.75, corresponding to p < 0.001. Under the null hypothesis that the portfolio possesses zero alpha, the probability of observing a t-statistic this large or greater is less than 1 in 100,000.

2. Market beta is essentially zero. A beta of 0.158 indicates that the portfolio is effectively market-neutral with a negligible long bias. The beta-attributable component of the portfolio's annualized return is approximately 0.158 × 20.3% (NQ CAGR) = 3.2% — essentially zero. For a portfolio that trades NASDAQ-100 futures as its primary instrument, this is a remarkable result, indicating the strategy ensemble extracts returns from intraday patterns rather than directional drift. The VIX-based position scaling further reduces market exposure during volatile periods, contributing to the near-zero beta.

3. R-squared is essentially zero. An R-squared of 0.014 means that only 1.4% of the variance of portfolio returns is explained by NASDAQ-100 movements. The remaining 98.6% of return variance is idiosyncratic — attributable to strategy-specific signal generation rather than market directionality. The low correlation to the NQ benchmark (0.118) further confirms the absence of meaningful directional market exposure.

3.3 Information Ratio

MetricValue
Annualized Excess Return (vs NQ)57.1%
Tracking Error (annualized)43.6%
Information Ratio1.31

An information ratio of 1.31 places the portfolio well above the threshold of 0.5 generally considered indicative of skilled active management (Grinold & Kahn, 1999).

4. Statistical Significance Testing

4.1 Mean Return Significance

Testt-statisticp-valueSignificant at 1%?
H0: Mean monthly return = 05.29< 0.0001Yes
H0: Mean excess return vs NQ = 02.270.026Yes (5%)
H0: Regression alpha = 05.75< 0.0001Yes

The mean monthly return hypothesis is rejected at the 1% significance level with a t-statistic of 5.29. The regression alpha t-statistic of 5.75 provides compelling evidence that the portfolio's positive alpha of 4.82% per month is not a product of random variation, based on 74 monthly observations.

The simple excess return test (t = 2.27, p = 0.026) achieves significance at the 5% level. However, the arithmetic excess return test is a cruder measure than the alpha regression, as it does not account for the portfolio's (near-zero) market exposure. The regression-based alpha test (t = 5.75, Section 3) is the more appropriate and powerful test, and it is significant at the 0.01% level.

4.2 Multiple Comparisons Consideration

1. Transparent strategy selection. The 19 strategies were selected from a combined development universe of 133 candidates (41 NQ/ES + 41 gold + 41 silver + 10 crypto). This introduces a selection bias that we address through the following mitigants.

2. Harvey, Liu & Zhu threshold. The alpha t-statistic of 5.75 exceeds the Harvey, Liu & Zhu (2016) recommended threshold of 3.0 for individual strategy significance after accounting for multiple testing.

3. Conservative Bonferroni correction. The Bonferroni-corrected significance threshold for 133 candidates at the 5% family-wise level is 0.05/133 = 0.000376. The regression alpha p-value of < 0.000001 easily passes the Bonferroni correction. Moreover, the Bonferroni procedure assumes all 133 tests are fully independent, which overstates the effective number of comparisons: strategies within the same family share signal components, indicator logic, and structural similarities.

4. Sub-period persistence. Performance is consistent across non-overlapping time windows, which is inconsistent with data-mining artifacts that tend to concentrate in a single favorable period.

5. Return Distribution Analysis

5.1 Monthly Return Statistics

StatisticPortfolioNQ Benchmark
Mean5.09%1.75%
Median4.65%1.56%
Standard Deviation8.90%6.25%
Skewness+1.13-0.07
Excess Kurtosis+1.78-0.44
Minimum-9.6%-13.3%
Maximum+26.8%+15.9%

5.2 Percentile Distribution

PercentilePortfolio
5th-5.51%
25th0.43%
50th (Median)4.65%
75th8.28%
95th18.23%

The portfolio exhibits positive skewness (+1.13), in contrast to the benchmark's slightly negative skewness. This is a desirable property: the distribution of returns is right-tailed, meaning extreme outcomes are disproportionately positive. The minimum monthly return of -9.6% compares favorably with the NQ benchmark's minimum of -13.3%, indicating curtailed downside exposure through the VIX-based risk overlay.

The portfolio produced 59 positive months out of 74 (79.7%), compared with the benchmark's 47/74 (63.5%). While 15 months produced negative returns, these losses were modest in magnitude: the worst month (-9.6%) was bounded, and only 11 months produced losses exceeding 2%.

6. Market Regime Analysis

A strategy that profits only in rising markets is not generating alpha — it is harvesting beta. To test for regime-independence, we segment the evaluation period into three market environments based on NASDAQ-100 monthly returns.

6.1 Performance by Market Regime

Regime# MonthsPortfolio MeanNQ MeanPortfolio Win Rate
NQ Up > 2% (Bull)36+6.58%+6.88%86%
NQ Down > 2% (Bear)20+3.72%-6.19%80%
NQ Flat ±2% (Range)18+3.64%+0.28%67%

6.2 Conditional Beta Analysis

MetricValue
Upside Beta (NQ up months)-0.076
Downside Beta (NQ down months)-0.240
Conditional Beta Ratio (Up/Down)0.32

The regime analysis yields perhaps the most compelling evidence of genuine alpha.

The portfolio is profitable in all three market regimes. During the 20 months when the NASDAQ-100 declined by more than 2%, the portfolio achieved a mean return of +3.72% with an 80% win rate. This is inconsistent with beta-driven returns, which by definition would be negative during market declines. The 80% win rate in bear markets is particularly notable — the VIX-based risk overlay reduces position sizes during elevated volatility, preserving capital while still capturing profitable signals.

Both conditional betas are negative. The downside beta of -0.240 means the portfolio profits during market drawdowns. The upside beta of -0.076 is essentially zero, indicating complete market independence during rallies. This combination — negative beta in both regimes — demonstrates that the portfolio generates returns from strategy-specific signals regardless of market direction.

VIX scaling enhances regime performance. The VIX-based position scaling naturally reduces exposure during high-volatility bear markets, which paradoxically improves the bear-market win rate (80%) compared to the overall portfolio win rate, as only the highest-conviction signals execute at full size during stressed markets.

7. Robustness: Sub-Period Analysis

MetricJan 2020 – Dec 2022 (36 months)Jan 2023 – Feb 2026 (38 months)
Annualized Return80.4%72.9%
Annualized Volatility30.4%17.4%
Sharpe Ratio2.133.32
Positive Months28/36 (78%)31/38 (82%)
Worst Month-9.6%-4.2%

Both sub-periods exhibit strong risk-adjusted performance. Critically, the Sharpe ratio improves from the first sub-period to the second (2.13 → 3.32). This is the opposite of what data-mining artifacts typically produce, where in-sample performance degrades when evaluated out-of-sample.

The second sub-period shows dramatically reduced volatility (17.4% vs 30.4%) with the worst month limited to just -4.2% (vs -9.6% in the first period). This volatility compression reflects the stabilizing effect of the VIX-based risk overlay, which was most active during the higher-volatility first sub-period.

8. Consistency & Persistence Metrics

8.1 Rolling Sharpe Analysis

MetricRolling 12-MonthRolling 6-Month
Mean Sharpe2.232.32
Minimum Sharpe0.99-0.60
Maximum Sharpe3.677.01
% Periods Sharpe > 0100%97%
% Periods Sharpe > 1.098%91%

The rolling 12-month Sharpe ratio never fell below 0.99 at any point during the evaluation period and was above 1.0 for 98% of rolling windows. This means that across the COVID crash of March 2020, the NASDAQ drawdown of 2022, and various periods of elevated volatility, risk-adjusted performance remained consistently positive with near-institutional-grade Sharpe ratios throughout.

8.2 Win Rate Consistency

MetricValue
Positive Years7/7 (100%)
Positive Months59/74 (79.7%)
Worst Month-9.6% (Nov 2021)
Best Month+26.8% (Aug 2020)

9. Diversification Benefit Analysis

9.1 Pairwise Strategy Correlations

MetricValue
Number of Strategy Pairs171
Mean Pairwise Correlation0.020
Median Pairwise Correlation0.010
Min Pairwise Correlation-0.408
Max Pairwise Correlation0.631
% Pairs with Correlation < 0.3092%

9.2 Portfolio Diversification Multiplier

MetricValue
Average Individual Strategy Sharpe0.63
Portfolio Sharpe2.51
Diversification Benefit4.0x

The mean pairwise correlation of 0.020 is statistically indistinguishable from zero. Although 14 of 19 strategies trade the same primary instrument (NQ), their diverse signal generation methodologies produce uncorrelated return streams. The inclusion of gold (GC) and Ethereum (ETH) strategies contributes further structural diversification through exposure to distinct asset classes.

The diversification multiplier of 4.0x — the ratio of portfolio Sharpe to average individual strategy Sharpe — demonstrates that the portfolio benefits substantially from the combination of uncorrelated return streams. The VIX-based risk overlay further enhances portfolio-level risk-adjusted returns by reducing aggregate exposure during correlated drawdown events.

10. Trade-Level Statistics

MetricValue
Total Trades10,361
Winning Trades4,580 (44.2%)
Losing Trades5,781 (55.8%)
Average Trade P&L$311
Average Win≈ $3,544
Average Loss≈ -$2,248
Win/Loss Ratio1.58
Profit Factor1.25
Total P&L$3,219,144
Average Trades/Year1,679

The strategy operates at a 44.2% win rate with an average win/loss magnitude ratio of 1.58. The edge derives primarily from payoff asymmetry rather than win rate: average wins are 58% larger than average losses, generating positive expectancy despite a sub-50% hit rate. The profit factor of 1.25 indicates that for every dollar lost, $1.25 is earned.

With equity-proportional sizing, per-trade expectancy of $311 scales with portfolio growth. The VIX-based risk overlay modulates position sizes during high-volatility periods, improving drawdown characteristics (max DD 19.9%) and portfolio-level risk-adjusted returns. With approximately 1,679 trades per year, the strategy generates a large sample of outcomes that enables rapid statistical convergence.

11. Drawdown Analysis

The portfolio's maximum peak-to-trough drawdown of 19.9% is substantially less than the NASDAQ-100 benchmark's maximum drawdown of 32.2%. This drawdown control is achieved through proprietary VIX-based position scaling that reduces exposure during elevated volatility, adaptive stop-loss management, and a daily loss circuit breaker. The largest monthly drawdown of 9.6% occurred in November 2021 and was recovered within 1 month (December 2021 returned +20.5%).

11.1 Annual Drawdown Trajectory

YearMax DrawdownComment
202013.50%COVID crisis — VIX scaling reduced exposure
20219.59%Nov drawdown recovered in 1 month
20228.20%NQ in -32% bear market; portfolio +30%
20233.20%
20242.90%
20254.20%
20260.00%Year-to-date (partial)

The drawdown trajectory shows a clear declining trend: from 13.5% in 2020 to under 5% in 2023–2025. This improvement reflects the interaction between VIX-based position scaling and the portfolio's increasingly diversified composition. During high-VIX environments, position sizes are automatically reduced across multiple proprietary tiers, limiting drawdown exposure precisely when markets are most volatile.

11.2 Risk Metrics Summary

MetricPortfolioNQ Benchmark
Max Drawdown19.9%32.2%
Sharpe Ratio2.510.97
Sortino Ratio4.13
Calmar Ratio3.880.63
Worst Month-9.6%-13.3%

12. Monthly Return Matrix

Monthly returns are computed as dollar P&L divided by equity at the start of each month. The backtest uses equity-proportional position sizing with VIX-based scaling overlays.

Notes: (1) Monthly returns are rounded to one decimal place. The Annual column is computed from full-precision monthly data and may differ slightly from compounding the displayed values. (2) February 2026 is a partial month.

Portfolio Monthly Returns (%)

YearJanFebMarAprMayJunJulAugSepOctNovDecAnnual
202016.717.06.35.1-3.69.28.326.811.0-7.0-7.013.0139.2%
20211.80.9-2.020.623.27.95.1-1.62.31.7-9.620.589.1%
20228.46.40.45.00.5-8.27.84.73.52.51.3-4.729.7%
202314.21.712.00.34.37.6-2.04.6-3.211.55.912.593.0%
20247.86.60.28.22.93.96.26.6-2.90.311.615.489.1%
2025-3.65.51.711.34.4-4.23.3-0.94.48.47.8-3.538.8%
20268.03.812.1%

NASDAQ-100 Buy-and-Hold Monthly Returns (%)

YearJanFebMarAprMayJunJulAugSepOctNovDecAnnual
20202.7-7.0-10.215.910.97.36.313.1-6.4-4.913.23.548.4%
20212.90.40.23.9-1.86.63.24.1-5.47.40.52.426.5%
2022-7.6-7.15.7-13.3-2.9-7.311.9-4.9-10.03.46.2-8.3-31.6%
202312.8-1.210.51.08.87.33.8-2.3-3.70.010.96.767.8%
20243.36.12.2-5.36.58.0-5.30.02.0-0.34.51.424.5%
20251.5-1.3-6.51.97.46.80.00.66.94.7-2.40.120.4%
20260.1-3.0-2.9%

13. Summary & Conclusion

The evidence presented in this report supports the conclusion that the portfolio generates genuine, persistent, risk-adjusted alpha that is not attributable to directional market exposure, favorable sampling periods, or statistical artifacts.

The strength of this conclusion rests on the convergence of multiple independent lines of evidence. A strategy might achieve a high Sharpe ratio through fortunate timing in a single market regime — but this portfolio is profitable in bull, bear, and range-bound markets. A strategy might appear significant on a single statistical test — but this portfolio's regression alpha is significant at the 0.01% level with a t-statistic of 5.75, exceeding the Harvey-Liu-Zhu threshold for multiple testing. A strategy might show strong in-sample performance that degrades out-of-sample — but this portfolio's Sharpe ratio actually improves from 2.13 in the first sub-period to 3.32 in the second.

The combination of near-zero market correlation, negative conditional betas in both up and down markets, improving sub-period consistency, uncorrelated strategy components across multiple asset classes, and statistical significance across multiple testing frameworks is collectively inconsistent with any reasonable null hypothesis of zero alpha. The portfolio represents a robust, diversified systematic trading program with demonstrated capacity to generate absolute returns across the full spectrum of market conditions observed over a six-year period spanning pandemic-era volatility, the 2022 bear market, and the subsequent recovery.

The portfolio's risk characteristics are particularly noteworthy: a maximum drawdown of 19.9% — substantially less than the benchmark's 32.2% — combined with a Calmar ratio of 3.88 and Sortino ratio of 4.13, indicates that returns are generated with substantially less peak-to-trough risk than a passive equity allocation. The VIX-based risk overlay, adaptive stop management, and daily loss circuit breaker collectively produce a drawdown profile that declined from 13.5% in 2020 to under 5% in recent years, suggesting a strategy ensemble whose risk-adjusted performance is both strong and improving with time.

References

  • Grinold, R.C. & Kahn, R.N. (1999). Active Portfolio Management: A Quantitative Approach for Producing Superior Returns and Controlling Risk. McGraw-Hill, 2nd edition.
  • Harvey, C.R., Liu, Y. & Zhu, H. (2016). “...and the Cross-Section of Expected Returns.” Review of Financial Studies, 29(1), 5–68.
  • Markowitz, H. (1952). “Portfolio Selection.” The Journal of Finance, 7(1), 77–91.
  • Sharpe, W.F. (1994). “The Sharpe Ratio.” The Journal of Portfolio Management, 21(1), 49–58.

Disclaimer

This report presents backtested performance results. Backtested performance is hypothetical and does not represent actual trading. Past performance, whether actual or backtested, is not indicative of future results. All returns are presented gross of management fees, performance fees, and fund-level expenses; only execution slippage is deducted. The backtest may not capture all costs, market impact, or execution constraints that would be present in live trading. Position sizes scale with portfolio equity with VIX-based scaling overlays and do not account for capacity constraints at scale. The strategies have not been evaluated for performance degradation at larger notional sizes. This report is provided for informational purposes only and does not constitute investment advice or a solicitation to invest.