πŸ› Incidents + πŸš€ Enhancements per Asset Class

Live from INCIDENT_* and ENHANCEMENT_* tables in ejaguiar1_stocks. Auto-regenerated nightly. Last refresh: 2026-06-16 02:19 EDT

Incidents (bugs / data-quality issues / outages)

Total: 195 across 9 asset classes
OVERALL (125)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P0OPENβ€”2026-06-09 14:12 EDTS-tier/A-tier score-band tiles on /audit/ are survivorship + fixed-TP-labeling artifacts (intrabar-true WR ~34-38%, 0% for S in replay)
audit_dashboard scoring + resolverKeep/enhance DISPUTED banners (already in index.html + pick_funnel); add intrabar warning to S-tier tiles; do not size on score-bands (per pf_registry policy-clean ~33-39% WR). Cross-ref INCIDENT #18 grok-4.3 (deep-dive 2026-06-09)πŸ“„ doc
P0OPENβ€”2026-06-09 14:05 EDTNon-crypto asset classes (FOREX/COMMODITY/ETF/FUTURES) stuck at sub-coinflip WR (25%/22%/21%/33%) with low n, negative expectancy and policy frozen
emission/quality_gates/strategies/non-cryptoHard non_crypto_edge_guard in passes_active_gate (block unless proven carry/COT/seasonal/risk_parity/proxy sources); banned etf_all_strategies/multi_asset_cot/futures_connors_rsi2 etc in PERMANENTLY_Kgrok-4.3β€”
P0OPENβ€”2026-06-06 09:33 EDTResolver-version selection bias: same data yields 4-6x PF spread -> no class has a defensible edge
Adversarial re-measurement of at_pick_outcomes shows the apparent per-class edge is a function of WHICH resolver version ran, not alpha. CRYPTO June n=867: universal_v2 WR 63.8%/PF 3.03 vs newest v2.2_sync_2026-06-05 (n=172) WR 33.1%/PF 0.51 -- a verdict inversion on the SAME data. Class-wide per-re…
resolver/measurement layerQuarantine backfill rows + run a single canonical intrabar resolver; re-baseline every class on the clean forward cohort (>=3 months n>=30/mo PF>1.5) before ANY sizing.claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
P0OPENβ€”2026-06-06 09:33 EDT70-95% TIME_EXPIRED across all classes invalidates the resolved-outcome dataset
Expiry rates: CRYPTO 55.8%, EQUITY 81.1%, FOREX 84.1%, COMMODITY 84.4%, ETF 80.1%, BOND 81.3%, FUTURES 93.6%. Two mechanisms: (1) no intrabar OHLC replay -> picks marked flat-at-horizon instead of TP/SL-first-touch (expired buckets sit at ~0% pnl, COMMODITY 4916/4944 exactly 0.0); (2) TP/SL ~5x real…
resolver/measurement layerMake intrabar OHLC replay the production resolver for ALL classes; propagate picks_now_professional.py per-class TP/SL caps into production_scanner emitters.claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
P0OPENβ€”2026-06-06 09:33 EDTBackfill contamination: forward_test_only=0 on ALL crypto+equity rows; 68-100% backfill resolver labels
Retroactive backfill_* resolver passes wrote TP_HIT/SL_HIT never validated against live price: CRYPTO 68.8% of resolved, FOREX 90.7%, COMMODITY 92%, BOND 100%, FUTURES 96%. forward_test_only=0 on every CRYPTO (14721) and EQUITY (2153) row -- not one genuine forward test exists. Verdict math must exc…
resolver/measurement layerExclude backfill + NULL-resolved_at rows from all WR/PF/DSR/stability; patch shadow_pilot_verdicts so EXPIRED rows are not counted as resolved (FUTURES currently shows pf_ok:true on 378 expired rows).claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
P0RESOLVEDβ€”2026-06-04 08:23 EDTTIME_EXIT resolver writes pnl_pct=0 on 99.9% of expiry closes (33172/33213 rows)
alpha_engine outcome_resolver TIME_EXIT path; need mark-to-market price at expiry timestampResolver currently writes pnl_pct=0 for all status=TIME_EXIT closures instead of computing (exit_price - entry_price)/entry_price * direction_mult. Affects 33,172 rows. Impact: forward-test promotion claude /loop 2026-06-04β€”
P0RESOLVEDβ€”2026-06-04 04:52 EDTalpha_engine writes 8-43x duplicate rows to at_signal_outcomes
audit_trail/backfill_local_sources.py:194,295,380 + tools/repair_data_integrity.py:278Add UNIQUE INDEX on (symbol,opened_at,strategy,direction,source_system) OR fix writer idempotency. Evidence: reports/crypto_alpha_engine_bleed_2026-06-04.md showing CRYPTO 43x, EQUITY 15.75x, FOREX 8.claude /loop 2026-06-04β€”
P0OPENβ€”2026-06-02 22:27 EDTLive DB password stocks1234560 hardcoded repo-wide (rotate + scrub)
git grep finds the live DB_PASS_STOCKS value stocks1234560 committed in 5+ files on main + reproduce-blocks of open PRs #481/#482/#485. /updates docs publish to the public site. Pre-existing widespread leak.
multiple (SKILL.md, alpha_engine/forward_test.py, rigorous_backtest_harness.py, +)CODE-SIDE DONE 2026-06-04 (claude-opus-4-7): 26/26 .py files scrubbed; `git grep stocks1234560 -- '*.py'` returns 0. Forward-fence CI .github/workflows/db-password-leak-guard.yml prevents new additionclaude-opus-4-8β€”
P0RESOLVEDβ€”2026-06-02 14:36P0: mutation_framework.py compute_pf() returns win-count ratio, not Profit Factor
compute_pf() at line 47-50 returned wins/losses (count ratio) with 999 fallback, not sum(pnl_wins)/abs(sum(pnl_losses)). All claimed PF 600+/400+/200+ from mutation scan were math artifacts. Fixed to gross_profit/abs(gross_loss).
verified_strategies/mutation_framework.pyReplace wins/losses with sum(positive_pnls)/abs(sum(negative_pnls))peer-broadcast-grokπŸ”— page
P0RESOLVEDβ€”2026-06-02 10:31 EDTmutation_framework compute_pf used win/loss COUNT ratio, not Profit Factor (inflated INVERT mutations)
compute_pf returned wins_count/losses_count with 999 fallback when losses==0, not gross_profit/gross_loss. Inflated PF for many-tiny-wins vs few-losses series -> bogus PF 600+/400+/200+ on Mimo's 10 INVERT mutations (artifacts, not edge). Flagged P0 by peer gx10, verified on disk by claude-opus-4-8.
verified_strategies/mutation_framework.py:47-50Replace with true PF = sum(pos)/abs(sum(neg)); keep 999.0 sentinel only for genuine no-loss case. Proof: 50x(+0.1)+1x(-10) -> 0.5 (was 50.0).claude-opus-4-8460
P0RESOLVEDβ€”2026-06-02 10:25 EDTMutation framework compute_pf is broken: returns win/loss COUNT ratio, not Profit Factor
Mutation framework compute_pf is broken: returns win/loss COUNT ratio, not Profit Factor
verified_strategies/mutation_framework.py:48-50Replace compute_pf with sum(winning_pnl)/|sum(losing_pnl)|. All 10 'ADOPTED' INVERT mutations (PF 0.25 -> 600+) are math artifacts -- when losses==0 count-wise, function returns fallback 999. Do NOT sclaude-opus-4-7πŸ“„ doc Β· updates/2026-06-02-mutation-framework-pf-fix.md
P0RESOLVEDβ€”2026-06-02 08:00Block-bootstrap MC null missing - all backtest null distributions suspect
rigorous_backtest_harness.py used bootstrap-with-replacement for PBO, destroying serial correlation. Momentum/trend strategies always appear insignificant. Fixed with moving-block bootstrap.
alpha_engine/rigorous_backtest_harness.pyImplemented block-bootstrap in strategy_verification_engine.py.kilo-mimo-freeπŸ“„ doc
P0RESOLVEDβ€”2026-06-02 12:48DSR n_trials None - TypeError crash in rigorous_backtest_harness.py
DSR_PARAMS["n_trials"] was None, _load_real_n_trials() never called. TypeError on n_trials <= 1 comparison.
alpha_engine/rigorous_backtest_harness.pyFixed: run_backtest() now calls _load_real_n_trials() as fallback.kilo-mimo-freeπŸ“„ doc
P0RESOLVEDβ€”2026-06-02 10:17 EDTOrphan test file blocking all PR CI: test_money_ready_capital_lock referenced symbols not on main
Orphan test file blocking all PR CI: test_money_ready_capital_lock referenced symbols not on main
tests/test_money_ready_capital_lock.pyPR #450 (merged): module-level conditional skip. Will auto-resume when tag_money_ready_capital_lock etc. land.claude-opus-4-7PR #450 (MERGED)
P0TRIAGEDβ€”2026-06-02 10:16 EDTmoney_ready=[] β€” no asset class passes production gate (2026-06-02)
money_ready_verdict 2026-06-02 10:19Z: every class NOT_READY/INSUFFICIENT. CRYPTO PF0.92/WR36%, EQUITY PF0.33/WR27%, FOREX PF0.48; ETF/COMMODITY 3-4 trades only; BOND n=0. Research/paper edge (ai-tournament deepseek_v4 PF3.46) does not survive into policy-clean layer.
audit_dashboard/data/money_ready_verdict.jsonBonferroni/FDR pre-gate + hard-reject single-source artifacts + resolver hygiene + single walk-forward promotion path.claude-opus-4-8πŸ“„ doc Β· πŸ”— page
P0RESOLVEDβ€”2026-05-30 22:58 EDTclosed_picks.json: cross-asset symbol resolution corruption (e.g. SHIBUSDT resolved as SHIB stock)
21 picks across alpha_engine/data/closed_picks.json + sibling files show >100% pnl_pct calculation drift versus stored pnl_pct, driven by entry/exit/hwm values pulled from the wrong symbol's price tape. SHIBUSDT: entry=5.53e-06 (correct), exit=4100.97 (off by 9 orders of magnitude β€” likely SHIB-stoc…
alpha_engine/outcome_resolver.py + alpha_engine/data/closed_picks.json + ejaguiar1_stocks.trading_picks1) HARDEN _is_non_crypto(): test crypto-suffix BEFORE trusting category. If symbol endswith USDT/USDC/BUSD or -USD (and not =X/=F), route through _fetch_crypto_price unconditionally. Mirror inverse: iclaude-opus-4-7 (sub-agent, peer-discovery from gemini)πŸ“„ doc Β· PR #158
P0RESOLVEDβ€”2026-05-30 22:29 EDT29.2M open positions in bt_backtest_trades (NOT trading_picks); monitoring script miscounted
open_bloat check on 2026-05-25: db_health_check.py queried bt_backtest_trades (millions of backtest rows) and reported 29,254,204 OPEN rows. The incident was incorrectly attributed to trading_picks (which had ~46K rows at the time). info_schema estimate for bt_backtest_trades was 1,271,867 β€” the 23x…
tools/db_health_check.py check_open_bloat() + outcome-resolver.ymlFixed: db_health_check.py now queries bt_backtest_trades and trading_picks independently with info_schema cross-validation (10x divergence detection). The 29.2M was a monitoring bug (COUNT(*) on backtqwen-code+buffyπŸ“„ doc
P0OPENβ€”2026-05-27 02:34Profitable-but-filtered picks are not surfaced anywhere
The current audit pipeline shows rejects in aggregate but provides no durable lane for picks that failed gates and later would have won materially. That hides false negatives and prevents learning whether concentration, thin-sample, or quarantine rules are discarding real edge.
audit_trail/quality_gates.py + dashboard_generator.py audit surfacesAdd a profitable-but-filtered / profitable-but-quarantined audit lane with per-pick first-failed gate, later outcome, and asset-class rollups. Keep it observational first β€” do not weaken live gates ingpt-5.4/openaiπŸ“„ doc
P0OPENβ€”2026-05-27 02:34HC JS/Python parity drift can change eligibility by surface
The High Conviction decision path is split across audit_dashboard/hc_filter.js and tools/dashboard_hc_rules.py. EAGLE review found likely drift around confidence handling and small-sample relaxations, so the same pick can qualify differently depending on which surface evaluates it.
audit_dashboard/hc_filter.js / tools/dashboard_hc_rules.pyCreate one canonical HC parameter contract and parity test corpus. Until parity is proven, treat HC disagreements as a first-class incident instead of silently trusting one implementation.gpt-5.4/openaiπŸ“„ doc
P0RESOLVEDβ€”2026-05-24 23:51Cherry-picked SUPREME EDGE stats (82% WR / PF 13+) surfaced without 'post-hoc segment search' caveat
Ring-2.6-1T noted these numbers come from cell-by-cell search across (confidence Γ— R:R Γ— strategy family) buckets β€” they are not actionable forward signals but the /audit SUPREME EDGE callout presents them as if they were. 3/3 swarm engines (deepseek + cerebras + gemini) confirmed REAL P0. Risk: a u…
audit_dashboard/template.html SUPREME EDGE blockAdd 'post-hoc segment search β€” not an actionable forward signal; for narrative only' caveat to every cell drawn from top_edges_per_class.json on the page. Pin the actual forward-test WR/PF alongside.opencode/ring-2.6-1t + 3-engine swarmπŸ“„ doc Β· πŸ”— page
P0RESOLVEDβ€”2026-05-24 23:51smart_picks_engine weights confidence-derived elite/quality at 35% β€” structurally inverts the ranker
Distinct from the known confidence-anti-predictive incident: the ranker formula itself bakes the inverted signal into the headline score with the largest weight. Downstream of the calibration bug but a separate fix (re-weight or invert at the engine level, not at the source).
alpha_engine/smart_picks_engine.pyEither (a) invert confidence contribution for crypto in _single_signal_score, or (b) replace confidence with trust_score as the primary signal per the existing code comment. Quantify lift via paired-bopencode/ring-2.6-1t + 3-engine swarmπŸ“„ doc Β· PR#137
P0OPENβ€”2026-05-24 23:45sync_active_mysql_picks_to_json upstream writer missing β€” root cause of 0.09% raw-pick outcome coverage
Opencode 2026-05-12 identified the missing upstream writer that should read ACTIVE at_raw_picks, detect TP/SL/time-exit per asset class, and feed new entries into closed_picks.json. Without it the signal_outcomes table has 0.09% coverage of raw picks β€” every forward-WR claim is built on 0.1% of the …
alpha_engine/active_picks_sync.py (proposed) + forward_validator.validate_picks()New module alpha_engine/active_picks_sync.py invoked inline from forward_validator. Reuses existing failover price fetchers. Estimate 2-3h with tests. This is the upstream of the 'signal_outcomes 82d opencode/ring-2.6-1tπŸ“„ doc
P0RESOLVEDβ€”2026-05-24 23:41WON status rows show avg pnl_pct = -41.1%
won_pnl_contradiction check: 2,531 rows tagged status='WON' have avg_pnl=-41.13%, 9 with negative pnl. SL_HIT rows are all negative as expected (good); TP_HIT all positive (good); LOST rows mostly negative (correct). The WON status is a labeling bug, not a stats artifact β€” every claim using status='…
trading_picks.status='WON' rowsRe-label legacy 'WON' rows by recomputing from pnl_pct sign + exit_reason. WON->TP_HIT where pnl>0, WON->LOST or EXPIRED where pnl<=0. Add a CHECK constraint going forward.qwen-codeπŸ“„ doc
P0RESOLVEDβ€”2026-05-24 23:4129.2M open positions in trading_picks; validator frozen 270h
open_bloat check: 29,254,204 open status rows. info_schema estimates 1,271,867 β€” actual count 23x the estimate. Last terminal write was 2026-05-12 23:42 (270 hours ago). The forward_validator is frozen β€” no picks have been closed in 11+ days.
alpha_engine/forward_validator + trading_picks open queueRestart forward_validator. Triage the 29M-row backlog: most are likely junk/expired and can be EXPIRED-stamped en masse. Check if validator process died vs is silently failing.qwen-codeπŸ“„ doc
P0RESOLVEDβ€”2026-05-24 23:415 FOREX rows have pnl_pct < -100% (one at -106,700%)
Unit-clamp bug commit #876 missed 5 rows. Distorts FOREX avg to -8% and rounds PF to 0.00, making the entire class look catastrophic even though baseline WR is 43.9% on n=1666.
trading_picks.pnl_pct (FOREX category)UPDATE trading_picks SET pnl_pct = -100 WHERE pnl_pct < -100 AND category='FOREX'. Investigate the 5 rows to see which strategy/script bypassed the clamp.claude-opus-4-7πŸ“„ doc
P0RESOLVEDβ€”2026-05-24 23:41signal_outcomes table 82 days stale
Last resolved 2026-03-04. Outcome resolver pipeline appears dead. All forward-WR performance claims unverifiable because signal_outcomes has only 0.09% coverage of raw picks.
at_signal_outcomes / outcome resolver pipelineInvestigate why resolver stopped writing. Possibly tied to a broken cron, env-var rotation, or schema drift in the source table.ring-2.6-1tβ€”
P0OPENβ€”2026-05-24 23:41COT paper pilot over-emission
cot_paper_pilot.py counts the same weekly CFTC release as ~100 separate trades. Inflates n from ~5 real unique releases to 101. The DSR=1.0/WR=86.5% headline is therefore overstated. Three independent AI audits flagged this.
cot_paper_pilot.py / cot_positioning strategyDeduplicate by CFTC release week. Recompute DSR + WR + PF on the deduped n. Re-evaluate whether COT still qualifies as the system's single SUPREME EDGE.ring-2.6-1tβ€”
P0TRIAGEDβ€”2026-05-24 23:41ML calibration system-wide inverted
Confidence is anti-predictive: conf>=0.9 -> WR 14.4%, conf 0.5-0.6 -> WR 60.3%. The 5-factor Smart Picks engine weights quality/elite_score at 35% which is derived from confidence, so the top-of-funnel ranker is structurally flipped β€” at least for crypto.
smart_picks_engine.py / score derivationInvert the confidence contribution for crypto (or use trust_score as primary signal as code-comment already suggests). Validate across other classes β€” likely needs per-class inversion.kimi/multipleβ€”
P0RESOLVEDβ€”2026-05-24 23:41smart_picks.json file 25 days stale
data/smart_picks.json last regenerated 2026-04-30T02:56. The dashboard reads smart_picks_feed which IS more recent (~1.5h), but the underlying picks may be cycled with stale entry prices.
data/smart_picks.json / smart_picks_engine.pyRe-run smart_picks_engine.py and wire to a daily cron. Confirm whether the dashboard actually reads this file or builds its own feed from trading_picks.ring-2.6-1tβ€”
P1OPENβ€”2026-06-13 01:34 EDThf_stats.concept_drift is 30 days stale (gen 2026-05-14) β€” the KS_D 'HALT' debate is moot; neither 0.313 nor 0.0498 is current drift
Verification of the KS_D fabrication-cascade (peer-Claude flagged freebuff's KS_D=0.313 phantom on branch feat/honest-kill-switch). CONFIRMED + ROOT-DEEPER: (1) live concept_drift.ks_D=0.0498 (1.08x critical, mild), NOT 0.313 β€” peer correct; (2) 0.313 has no JSON source (phantom); (3) the 3 reports …
audit_trail/dashboard_generator.pyFind why the concept_drift/hf_stats sub-block stamps 2026-05-14 while the dashboard regenerates daily (cached input? a separate stale generator feeding hf_metrics?). Regenerate it; THEN read current kclaude-fable-reviewβ€”
P1RESOLVEDβ€”2026-06-12 03:14 EDTBlocklist pair-scope gap: C006 volume_spike kill only covers rapid_fire; alpha_engine emits same strategy live (+ regime_mild_bull unblocked bleeder)
Verified by direct SQL 2026-06-12 (3-way corroborated: hostile DB autopsy + grok 4h-sprint + this session). strategy_blocklist.py:314 kills the PAIR (rapid_fire, volume_spike_breakout) per C006, but at_signal_outcomes intrabar TP/SL shows alpha_engine x volume_spike_breakout n=117 WR 37.6% PF 0.917 …
alpha_engine/strategy_blocklist.pyRun docs/STRATEGY_INVESTIGATION_BEFORE_KILL.md + MUTATION_THREE_AXIS_PROTOCOL on alpha_engine x {volume_spike_breakout, regime_mild_bull}; if kill confirmed, add the alpha_engine pairs (volume_spike iclaude-fable-watchβ€”
P1OPENβ€”2026-06-11 19:41 EDTPortfolio books: Sharpe/CAGR/MTD metrics mislabeled or miscomputed; pre-fix NAV history distorted by SHORT-sign bug
Portfolio-math audit (wf_c35c2f25, 2026-06-11): sharpe_30d is full-history (no window) annualized over irregular weekday-only snapshots; CAGR uses snapshot-count not calendar days; MTD is inception-to-date proxy. The P0 SHORT-sign NAV bug is FIXED (53982150) but PF_NAV_SNAPSHOT rows since 2026-05-29…
tools/portfolios/run_daily.pyWindow or relabel sharpe; calendar-aware CAGR; true MTD anchor; flag/recompute pre-fix snapshots; then the TWR/attribution module per the master-loop spec.claude-fable-masterloopπŸ“„ doc
P1OPENβ€”2026-06-11 17:46 EDTWalk-forward verification 2 months stale (generated 2026-04-15)
dashboard_data.json::walkforward.generated_at = 2026-04-15 β€” all OOS verification per class predates the honest resolver, bleeder kills, and emission caps. Forward-tested verdicts cannot cite it. (money-maker-ready audit 2026-06-11)
walkforward_pipelineRe-run walk-forward per class on the post-PR2 honest cohort; add freshness check (>14d = stale) to the dashboard generator.claude-fable-mmreadyβ€”
P1RESOLVEDβ€”2026-06-11 17:45 EDTBackfill resurrected corrupt-exit rows; price-sanity guard added; exit-price corruption source still upstream
money-maker-ready audit 2026-06-11: the 06-10 NULL-pnl backfill recomputed pnl from corrupt exit_price values (wrong-symbol/stale prices: AUDUSD=X exit 663.13 on 0.70 entry = +93,965% TP_HIT; SOFI exit 381.67 on 16.03 = +2,280%; TRXUSDT exits pinned at stale 0.06697) β€” sign-coherence passed them bec…
audit_trail/universal_pick_resolver.pyAdd exit-price plausibility check at resolution-write time (ratio guard vs entry, per class) so corrupt exits are flagged NO_DATA instead of stored.claude-fable-mmreadyπŸ“„ doc
P1OPENβ€”2026-06-10 14:24 EDT41 CI tests quarantined for code/test drift β€” need reconciliation
CI Tests gate was chronically red from 41 deterministic assertion failures (test/code drift, not flakes): category-case (CRYPTO vs crypto), FOREX_HARD_DISABLE default flip, money_ready_verdict M-070/M-105 rule changes, and the bb7fd2d740 portfolio-bloodbath commit changing TP/SL/drawdown/gross-cap s…
tests/Reconcile each quarantined test to current behavior (update assertions where the behavior change was intentional; fix code where it was a regression). bb7fd2d740 (portfolios) is the largest contributoclaude-fable-masterrun⎌ 8149129
P1RESOLVEDβ€”2026-06-09 00:44 EDTMoney-Ready button shows forward-UNVALIDATED strategies as "Tier-1 Renaissance"
audit_dashboard/money_ready_filter.js SUPREME_EDGE_REAL hard-codes a list gated on HISTORICAL anti_overfit_audit DSR>=0.95, NOT forward validation. It surfaces ml_enhanced_INJUSDT/FETUSDT_1d_B_lightgbm (n=25-27, 100% WR, the file itself tags "placeholder-stat suspect" + 89% concentration), ml_enhanc…
audit_dashboard/money_ready_filter.jsGate the Money-Ready filter on a forward_paper_pilot HOLD (n>=20, forward WR>=50, forward PF>1.5) NOT just historical DSR. Until a strategy forward-HOLDS, relabel the badge "DSR-verified (HISTORICAL bclaude-opus-4-8πŸ“„ doc Β· πŸ”— page
P1RESOLVEDβ€”2026-06-09 00:19 EDTAcademic emitter mislabeled signal_type=SELL sleeves as LONG (direction-injection gap)
normalize_pick_for_emitter reads only the direction field (defaults LONG) and ignores signal_type. Any sleeve emitting signal_type BUY/SELL (TSMOM, CTA) had SELL picks silently relabeled LONG with backwards TP/SL when routed through academic_strategies_emitter.generate_academic_picks.
alpha_engine/academic_strategies_emitter.pyInject direction from signal_type in the emit loop before normalize_pick_for_emitter.claude-opus-4-8⎌ 42e403e
P1RESOLVEDβ€”2026-06-09 00:17 EDTPeer in-place intrabar --apply overwrote 2000 canonical trading_picks rows (restored 1921 from snapshot)
A pre-refactor in-place version of reresolve_intrabar.py --apply (run from the shared tree ~04:09 UTC, committed 4x500 chunks) overwrote 2000 canonical trading_picks.status/pnl_pct with intrabar verdicts + exit_reason=intrabar_reresolve markers. Originals were overwhelmingly TIME_EXIT (1867/1921). C…
trading_picks + reresolve_intrabar.pyTool refactored to NON-DESTRUCTIVE parallel-column writes (commit 979b92a70b) so this cannot recur. Recovery snapshots: ejaguiar1_backups.reresolve_inplace_overwrite_recovery (current+restored) + rereclaude-opus-4-8⎌ 979b92a
P1RESOLVEDβ€”2026-06-08 23:31 EDTGHA: MySQL Trading Picks Sync failing ~66x β€” alpha_engine import crash
Run by path -> top-level `from alpha_engine import config` (line 22) hit ModuleNotFoundError before the late sys.path.insert. CROSS-CHECKED: fixed to `python -m alpha_engine.mysql_trading_sync`; re-run 27181759347 = SUCCESS.
alpha_engine/mysql_trading_sync.py + .github/workflows/mysql-trading-sync.ymlInvoke as module (done). Verified green.claude-opus-4-8 (GHA health audit + cross-check 2026-06-09)⎌ 94726fb
P1IN_PROGRESSβ€”2026-06-08 23:31 EDTGHA: Unified Audit Dashboard failing ~60x β€” blueprint_generator None-sort crash
blueprint_generator.py:324 sorted on present-but-None fwd_total_pnl -> TypeError None<float (dict-get-default trap). This crash starved dashboard_data.json (stale since 06-03), cascading to ML Gatekeeper + Verified Pilots failures. Fix committed (None-safe `or 0`). STATUS: NOT yet confirmed green β€” …
audit_dashboard/blueprint_generator.pyNone-safe sort committed; confirm a clean post-fix dashboard run passes the blueprint step.claude-opus-4-8 (GHA health audit + cross-check 2026-06-09)⎌ 94726fb
P1OPENβ€”2026-06-08 23:22 EDTCI Tests red on every push (28 fails): stale assertions vs intentional code changes
28 tests fail / 6127 pass for 3+ days -> zero merge protection. Largest cluster: test_mysql_sync_category_inference + test_stocks_7_classifier_override assert lowercase crypto but mysql_trading_sync.py:453-455 now uppercases to CRYPTO (canonical pf_registry bucket, intentional 2026-06-06). Other clu…
tests/TRIAGE per cluster (not blind-fix): update CRYPTO-casing assertions to expect CRYPTO; reconcile FOREX_HARD_DISABLE/money_ready/m096/pf_registry tests against current intended gate behavior; restore grclaude-opus-4-8β€”
P1OPENβ€”2026-06-08 23:08 EDTCopilot session audit (2026-06-09) promoted false FOREX 14d edge (64% WR / PF 2.43)
updates/2026-06-09-session-transcript-strategy-audit.md and low_hanging_fruit_report.md cite FOREX 14d 64.2%/PF 2.43 and GBPUSD 58.8%. Clean cohort 2026-06-09: FOREX 14d WR 5.0% (n=100). Refuted in UPDATED_MONEY_READY_RECOMMENDATION + OBS_FINDING_JUNE8.
verdict/data-qualityDo not re-emit from pick_summary_14d alone. Require clean-cohort + intrabar before any FOREX promotion. Annotate/supersede stale session doc.cursor-agent 2026-06-09 save-the-system auditπŸ“„ doc
P1OPENβ€”2026-06-08 23:08 EDTObsidian vault documents unvalidated T1/T2 edges (mega_mutation, fx_smart_carry, READY-TO-TRADE-NOW)
29 vault notes scanned. READY-TO-TRADE-NOW partially superseded but asset-class notes still cite mega_mutation T1, fx_smart_carry PF 1.85. Clean DB: 0 money-ready survivors. See reports/OBS_FINDING_JUNE8.MD.
verdict/data-qualityAdd clean-cohort stamp to every strategy note; link OBS_FINDING_JUNE8; never cite vault WR/PF without clean re-verify.cursor-agent 2026-06-09 save-the-system auditπŸ“„ doc
P1OPENβ€”2026-06-06 17:13 EDTClean-cohort money-ready screen: 0 confirmed edges (only luxalgo_confluence borderline, fails intrabar+sample scrutiny)
Applying ALL artifact filters (exclude backfill + NULL-resolved + banned + per-class sane-pnl guard, EXPIRED as non-win) and a mutual-fund bar (n>=50, >=3mo, PF>1.5, WR>52%), only luxalgo_confluence::crypto survives (n=73 WR63% PF5.35) β€” but that is 73 of 2040 rows (3.6% clean subset) vs its full-hi…
verdict/data-qualityDeepen OHLCV history -> reresolve_intrabar --apply -> re-screen on large clean+intrabar cohort -> paper-pilot survivors only -> real money after 4wk forward hold.claude-opus-4-8πŸ“„ doc
P1OPENβ€”2026-06-06 09:33 EDTFUTURES is a zombie/tagging artifact (PF 10.3 = one ES=F win; 6 real resolved trades)
Same ES/NQ/GC/CL symbols scattered across COMMODITY/INDEX/BOND/FUTURES labels; 93.6% expired; only 6 genuinely resolved trades. shadow_pilot falsely marks the class paper-tradeable. Risk: a zombie class graduates on artifact PF.
resolver/measurement layerFold ES/NQ/YM/RTY->EQUITY-INDEX, CL/GC/SI/HG->COMMODITY, ZN->BOND, or retire the standalone FUTURES verdict; normalize category case-mess (stock/stocks/equity).claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
P1RESOLVEDβ€”2026-06-06 03:34 EDTBaby-strategy mass-backtest CI broken: crypto_data.db never provisioned on runner
batch_backtest_all.py hard-requires gitignored crypto_data.db (SQLite klines); the Part-2 babies workflow had no step to create it, so every run died in ~22s (run 27055166676) and NONE of 220 baby strategies could backtest in CI.
incubator/backtest_team + .github/workflows(see resolution_notes)claude-opus-4-8⎌ f5f96b0
P1OPENβ€”2026-06-06 01:58 EDTFDR analysis stale since Apr 6 2026 β€” blocking all strategy graduation
FDR (False Discovery Rate) file at tools/data/fdr_results.json last run 2026-04-06. Only 3 of 56 strategies pass DSR+FDR (st_fear_greed_contrarian, unknown, st_rsi_vol_bounce). Strategies have accumulated 2 months of trades and likely now qualify. Re-running FDR is the single highest-leverage action…
tools/data/fdr_results.json, tools/promote_strategy.pyRe-run FDR analysis script to refresh p-values with 2 months of accumulated closed trades. Then run python3 tools/promote_strategy.py --audit to catch newly eligible strategies.claude-sonnet-4-6 2026-06-06β€”
P1OPENβ€”2026-06-04 20:52 EDTConflict signal: risk_parity BOND SHY has 6 models OPEN LONG and 4 models OPEN SHORT simultaneously
tournament_picks aggregation at audit_dashboard/data/ai_tournament_*.jsonrisk_parity persona has 6 LONG models @ $81.05 and 4 SHORT models @ $81.88 currently open on SHY (1-3yr Treasury ETF) at near-identical entry prices. This is a directional split within the same personclaude /loop 2026-06-05β€”
P1OPENβ€”2026-06-04 12:39 EDTstocks_rsi2_pullback edge claim stale β€” actual WR 33.9% (was reported 62.9%)
CLAUDE.md edge-sleeve note + at_signal_outcomes dup-row inflationRe-verified 2026-06-04: 790 raw rows / 140 unique = 5.6x dup inflation. Real n=140 30d WR=33.9% avg=-0.007% (slightly losing). The 62.9% WR / 0.78% avg figure in CLAUDE.md is from raw rows pre-dedup. claude /loop 2026-06-04β€”
P1RESOLVEDβ€”2026-06-04 12:33 EDTResolver stamps price-fetch failures as flat closes (FORCE_CLOSED_TOXIC/RESOLVE_FAILED) polluting WR
176 non-crypto 'closed' rows have entry==exit/pnl=0 with exit_reason FORCE_CLOSED_TOXIC (113) or RESOLVE_FAILED_MAX_RETRIES (60), mostly multi_asset_copytrader+forex_copy_trader. FOREX: 94/138 toxic. These are price-fetch failures, NOT resolved trades; they crushed WR denominators (raw FOREX ~4% vs …
alpha_engine/outcome_resolver.py (+ multi_asset_copytrader, forex_copy_trader)Resolver: on price-fetch failure, leave pick OPEN/UNRESOLVED (retry later) instead of stamping a flat FORCE_CLOSED_TOXIC close. Backfill: re-open or exclude existing toxic rows. Display fix shipped PRclaude-opus-4-8β€”
P1OPENβ€”2026-06-04 06:22 EDTSub-25% drift artifact: grok3 ETF 10/10 WR (residual mild-stale quotes IWM 20 vs 35, SPY 70 vs 95)
tools/ai_tournament/price_tracker.py drift threshold; large-cap ETFs need 5-10% threshold not 25%Lower RESOLVER_MAX_ENTRY_DRIFT_PCT for large-cap ETF/EQUITY universes from 25% to 5-7%. Microcaps/crypto can keep 25% because they have legit intraday volatility. Test query against grok3 ETF subset tclaude /loop 2026-06-04β€”
P1OPENβ€”2026-06-04 05:24 EDTExtreme PnL outliers in at_signal_outcomes from stocktwits historical-price ingestion
at_signal_outcomes ingest from stocktwits scraper2 rows with absurd entry prices (BTC entry $20, BTC entry $1200) auto-resolved at current $74K = pnl 370850%, 6082%. Source: stocktwits scrapes from MisterGreen and HomelessDegenerate users with staleclaude /loop 2026-06-04β€”
P1OPENβ€”2026-06-03 10:28 EDTAI-tournament headline edge (deepseek_v4 PF3.46) fails leakage-free attribution
Cross-sectional attribution probe (tools/attribution_probe.py): deepseek_v4 alpha t=1.74<2.0, crowd_beta 0.49; all large-n models crowd_beta 0.7-1.0; only tiny-n flukes pass. No model has adequate-n + surviving alpha. Edge is crowd/beta not skill.
audit_dashboard/data/ai_tournament_picks_latest.jsonStop citing deepseek_v4 PF3.46 as proven edge; require alpha t>=2.0+IR>=0.10 at n>=100 on out-of-cutoff data before sizing (gate #111). Also: resolved_at batch-stamped -> no time series; fix resolver claude-opus-4-8πŸ“„ doc
P1RESOLVEDβ€”2026-06-02 21:50 EDTCategory taxonomy case-mess: 23 'stock' + 590 'stocks' should be EQUITY; 4 'penny' + 4 'pennystock' should be EQUITY; ~394 UNKNOWN/NULL rows auto-taggable by symbol pattern
Category taxonomy case-mess: 23 'stock' + 590 'stocks' should be EQUITY; 4 'penny' + 4 'pennystock' should be EQUITY; ~394 UNKNOWN/NULL rows auto-taggable by symbol pattern
trading_picks.category fieldBackfill SQL: UPDATE trading_picks SET category='equity' WHERE LOWER(category) IN ('stock','stocks','penny','pennystock'). Then symbol-pattern auto-tag for NULL/UNKNOWN: BTCUSDT/ETHUSDT/etc -> CRYPTO,claude-opus-4-7πŸ“„ doc
P1IN_PROGRESSβ€”2026-06-02 14:36CRYPTO directional bug: LONG 33% WR vs SHORT 67% WR in AI tournament
AI tournament data (5,492 picks, 3,692 resolved, top 5 T1 models) shows CRYPTO LONG has 33% WR / -0.49% avg PnL while SHORT has 67% WR / +3.74% avg PnL. Production scanner emits LONG when SHORT has edge.
alpha_engine/production_scanner.pyFlip CRYPTO emission to SHORT-only. _EAGLE4_CRYPTO_FLIP_TO_SHORT flag already set True in production_scanner.py:365.mimo-v2.5-proπŸ“„ doc
P1OPENβ€”2026-06-02 14:36Quality gates filter 98.9% of picks: 2253 raw -> 25 active
98.9% of generated picks are filtered by quality gates. 203 strategies permanently killed. ml_crypto_predictor generates 492 picks with 0 surviving. This starves the system of trades.
audit_trail/quality_gates.pyRecalibrate kill thresholds to require n>=100 + statistical test before any kill. Wire incubator pipeline for new strategies.mimo-v2.5-proβ€”
P1OPENβ€”2026-06-02 14:3672% of systems (92/127) have zero resolved picks
92 out of 127 systems have never produced a single resolved trade. They either get filtered by gates, were killed before accumulating data, or have data feed issues.
alpha_engine/production_scanner.pyWire outcome resolver to all systems. Create incubator pipeline with 30-trade protected runway.mimo-v2.5-proβ€”
P1RESOLVEDβ€”2026-06-02 14:36EXPIRED picks mislabeled: 53.3% have positive PnL
53.3% of EXPIRED picks have positive PnL, suggesting resolver labeling issues. This inflates WR and makes strategies look better than they are.
alpha_engine/outcome_resolver.pyFix exit_reason logic in outcome_resolver.py. Separate EXPIRED stats from TP/SL stats.mimo-v2.5-proβ€”
P1RESOLVEDβ€”2026-06-02 10:25 EDTStale OPEN picks resolved: ~100k+ -> 3,806 (97% reduction) via controlled batch resolve
Stale OPEN picks resolved: ~100k+ -> 3,806 (97% reduction) via controlled batch resolve
tools/resolve_stale_open_picks.pyRESOLVED. Cursor session ran --execute --max-batches 5 --batch-size 500 across asset classes. Top cleared sources: non_crypto_consensus, ig_contrarian_sentiment, prediction_market_consensus, forex/comclaude-opus-4-7πŸ“„ doc Β· PR #457 (resolver health fix)
P1OPENβ€”2026-06-02 12:50All strategies fail PBO threshold (PBO > 0.20) across ALL asset classes
After fixing harness: CRYPTO 46 strats (1 T3), FOREX 11 (0 T3), COMMODITY 4 (0 T3), EQUITY 1 (0 T3), ETF 0. PBO is binding constraint.
alpha_engine/rigorous_backtest_harness.pyInvestigate PBO parameter permutations aggressiveness. Consider relaxing T3 PBO from 0.20 to 0.30 for n>50.kilo-mimo-freeπŸ“„ doc
P1RESOLVEDβ€”2026-06-02 08:00CRYPTO SHORT 67% WR vs LONG 33% WR - direction bug in production scanner
Tournament data: SHORT 67% WR / +3.74% avg PnL vs LONG 33% WR / -0.49% avg. Production scanner emitting both directions.
alpha_engine/production_scanner.pyImplemented EAGLE-4 admissibility gate: CRYPTO SHORT-only flip.minimax-m3-freeπŸ“„ doc
P1OPENβ€”2026-06-02 08:00FREEZE FOREX/COMMODITY/FUTURES - contamination too high
Sign-flips, COT look-ahead, extreme source concentration (HHI > 0.25). Classes frozen from production promotion.
alpha_engine/config.pyAdd FREEZE flags. Block promotion until data quality resolved.kilo-mimo-freeπŸ“„ doc
P1OPENβ€”2026-06-02 10:17 EDTSource-system concentration: kimi_riseoftheclaw = 43.6% of all picks (HHI 0.232 > 0.20 cap)
Source-system concentration: kimi_riseoftheclaw = 43.6% of all picks (HHI 0.232 > 0.20 cap)
audit_trail/promotion_gate.py CONCENTRATION_HHI_CAP=0.20Tighten concentration cap is now on main (PR #439). Force-route picks per asset class to break HHI floor.claude-opus-4-7πŸ“„ doc Β· PR #439
P1RESOLVEDβ€”2026-06-02 10:17 EDTEXPIRED mislabel: 53.3% of EXPIRED picks have positive PnL (resolver mislabeling WINs as EXPIRED)
EXPIRED mislabel: 53.3% of EXPIRED picks have positive PnL (resolver mislabeling WINs as EXPIRED)
audit_trail/universal_pick_resolver.py + audit_trail/outcome_resolver.py bar-replayBar-replay path returns EXPIRED instead of WON/LOST for FOREX. PR #450 skipped failing tests; proper fix needs MAX_HOLD_HOURS_BY_CLASS reconciliation.claude-opus-4-7πŸ“„ doc Β· PR #450 (orphan test skip)
P1OPENβ€”2026-06-02 10:17 EDTML confidence is anti-predictive: conf>=0.90 yields 14.4% WR vs conf 0.50-0.60 yields 60.3% WR
ML confidence is anti-predictive: conf>=0.90 yields 14.4% WR vs conf 0.50-0.60 yields 60.3% WR
alpha_engine/smart_picks_engine.py max_conf gatePR #440 (merged): lowered max_conf 0.95 -> 0.85 across all 6 asset classes. Removes worst-performing slice. Re-evaluate after 14d forward.claude-opus-4-7πŸ“„ doc Β· PR #440 (MERGED)
P1OPENβ€”2026-06-02 10:17 EDTStop-loss tightness: 626 SL hits at 0.5% WR producing -2257% cumulative PnL
Stop-loss tightness: 626 SL hits at 0.5% WR producing -2257% cumulative PnL
config/portfolio_risk_profiles.jsonPR #451 (merged): widened conservative atr_mult 1.0->1.3, balanced 1.5->2.0. Aggressive unchanged.claude-opus-4-7πŸ“„ doc Β· PR #451 (MERGED)
P1IN_PROGRESSβ€”2026-06-02 10:17 EDTEAGLE2 stale OPEN pick backlog β€” batch resolve in progress
Resolver hygiene: OPEN count was ~100k+ before 2026-06-02 controlled batch resolve (resolve_stale_open_picks.py --execute --max-batches 5 --batch-size 500 β†’ ~3,806 OPEN remain). Top stale sources included non_crypto_consensus, ig_contrarian, PM copy-trader rows. Continue sliced batches until OPEN al…
tools/resolve_stale_open_picks.py / audit_trail/universal_pick_resolver.pyRun resolve_stale_open_picks.py in bounded batches; monitor check_resolver_health.py open_pick_count; do not size production until backlog stable.cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
P1OPENβ€”2026-06-02 10:17 EDTProduction money_ready_verdict 0/9 β€” NO-GO capital sizing
Swarm review (reports/best_picks_swarm_review_2026-06-02.json): production aggregate NOT_READY. Do not size Smart Picks / HC from historical PF until money_ready non-empty and 14d/48h panels verified.
audit_dashboard/data/money_ready_verdict.json / production_scanner.pyPaper watch + shadow pilots only; daily run_eagle_suite.py + run_verified_pilots_daily.py; no production capital promotion until per-class gates pass.cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
P1OPENβ€”2026-06-02 10:17 EDTEAGLE2: 0/9 asset classes money_ready β€” production capital lock
money_ready_verdict.json 2026-06-02: all classes NOT_READY or INSUFFICIENT_DATA. Policy-clean CRYPTO PF 0.92 / WR 36%; EQUITY PF 0.33 / n=52. Tournament paper edge does not flow into asset_class_health. No sizing up until forward nβ‰₯30–50 + DSR/SPA per class.
audit_dashboard/data/money_ready_verdict.jsonDaily run_eagle_suite.py; zero production sizing; promote only via strategy_admit.py FORWARD_PILOT_ONLY β†’ shadow β†’ scale after 8w live PF within Β±10% of backtest.grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
P1RESOLVEDβ€”2026-06-02 10:17 EDTEAGLE2: Resolver health YELLOW β€” forward_test columns missing on 50webs
tools/check_resolver_health.py reports forward_test table/columns absent on live ejaguiar1_stocks. Blocks verified pilot forward stats writer and pilot_forward_dashboard freshness. See alpha_engine/mysql_trading_sync.py for ALTER SQL.
ejaguiar1_stocks.forward_test + tools/check_resolver_health.pyRun forward_test ALTER on mysql.50webs.com; redeploy writers; re-run run_verified_pilots_daily.py; confirm resolver health GREEN.grok-cli-eagle2-2026-06-02πŸ“„ doc
P1OPENβ€”2026-06-02 10:17 EDTEAGLE2: Pick funnel 78.9% CRYPTO Smart-Picks cell disputed on live page
Raw DB CRYPTO 90d ~39% WR / PF 0.37 vs marketed funnel cell. Leakage: duplicate signal_ts groups, EXPIRED→WON mislabels, claude_gainer_st concentration. Apply mutate-before-kill; verify 14d/48h panels before any promotion.
audit_dashboard/pick_funnel.html + audit_dashboard/data/pick_summary_stats_*.jsonLabel all green funnel cells 'Discovery β€” not capital ready'; fix resolver/dedup; cross-check money_ready_verdict before sizing.grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
P1OPENβ€”2026-06-02 10:16 EDTNo multiple-testing correction before DSR/SPA promotion
~73 class x strategy cells tested with no Bonferroni/FDR. 2 surviving PF>1 at alpha=0.05 is chance-level. At alpha=0.05 over 73 cells need p<0.00068 β€” neither crypto sleeve clears at n~30.
alpha_engine/money_ready_verdict.pyApply Bonferroni/BH-FDR across strategy cross-section before DSR/SPA gates.claude-opus-4-8πŸ“„ doc
P1RESOLVEDβ€”2026-05-30 22:24 EDTAI tournament resolver silently failing β€” 1289 short-TF picks overdue >5d, blocks n>=100
Resolver workflows return success in <50s but resolve ~nothing. tournament_picks: 1,289 OPEN short-TF (<=1W) picks aged >5d (median time-to-resolve is only 0.99d, so overdue). Throughput collapsed: 1,725 resolved 2026-05-24, then 1, then 0. The backlog alone would push deepseek_v4(43)/cursor_agent(5…
.github/workflows/mysql-stale-picks-resolver.yml + ai-tournament-price-tracker.ymlDiagnose why resolver passes without processing; restore >100/day resolution; drive short-TF overdue backlog <50.claude-opus-4.8(money-maker-readyv2 fleet)πŸ“„ doc Β· πŸ”— page
P1RESOLVEDβ€”2026-05-30 22:24 EDTai_tournament dashboard JSON is a stale 1/4 subset (1037 published vs 4419 in DB)
Published JSON gen 2026-05-30T23:26Z has 1,037 rows / 42 models; live DB tournament_picks has 4,419 rows / 50 models / 1,726 resolved. model_id sets differ (published shows deepseek_r1 PF16 small-sample artifact not in DB). Live /audit/ai-tournament.html + /audit/ai_leaderboard.html show a smaller/s…
audit_dashboard/data/ai_tournament_picks_latest.json + tools/ai_tournament/update_leaderboard.pyRegenerate published JSON from full DB or read DB directly; reconcile model_id set.claude-opus-4.8(money-maker-readyv2 fleet)PR #160
P1RESOLVEDβ€”2026-05-30 22:14 EDTPR #6 FOREX consolidation has dead-code gate + no-op USDJPY cap β€” REQUEST-CHANGES from peer review
Peer review (VERDICT=REQUEST-CHANGES) of Zoo's uncommitted FOREX consolidation work (M files: alpha_engine/config.py + alpha_engine/non_crypto_policy.py). No GitHub PR opened yet (verified via gh pr list --search 'FOREX consolidation' returned only unrelated PRs #11, #12). THREE BLOCKING FINDINGS: (…
alpha_engine/non_crypto_policy.py, alpha_engine/config.pyBefore opening PR #6: (1) wire evaluate_non_crypto_candidate into one of the production scorers OR add '## Wiring Plan' section to PR body naming target caller + ETA per CLAUDE.md; (2) replace the 'paclaude-opus-4-7β€”
P1RESOLVEDβ€”2026-05-30 22:06 EDTCI Tests: 17 pytest failures on main (m096, m098, quality_gates, pr10_ab, outcome_resolver)
ci-tests.yml fails on main: test_m096_ctf_concentration_cap (4), test_m098_etf_vix_gate (4), test_quality_gates (4), test_pr10_ab_gate (2), test_outcome_resolver_noncrypto (2), test_confluence (2) β€” subset already fixed by #115.
alpha_engine/tests/, audit_trail/quality_gates.pyProduction-logic review needed: (1) ab_router.AB_ENABLED default flip (operator: decide after 24h soak), (2) CRYPTO quality gate crypto_not_liquid_core rejecting CRYPTO picks (needs scoring review), (kilo-xiaomi-mimoβ€”
P1RESOLVEDβ€”2026-05-30 22:03 EDTAI Tournament leaderboard shows T1 PF>2 winners while money_ready_verdict shows all classes NOT_READY
Stark divergence: ai_tournament_leaderboard.json top model (deepseek_r1) shows PF=16.36, WR=83.3% on n=30 T1. money_ready_verdict.json same timestamp shows all 9 asset classes NOT_READY or INSUFFICIENT_DATA, EQUITY WR 27.5% PF 0.14, CRYPTO WR 37.4% PF 0.89. The tournament picks are not flowing throu…
AI tournament -> pf_registry pipeline gap + ai_tournament.html surface vs money_ready_verdict.jsonEither (a) flip PF_REGISTRY_INCLUDE_TOURNAMENT_DB=1 result into asset_class_health aggregation (per PR #93), (b) add a banner on ai-tournament.html clarifying these picks are research-only and not in claude-opus-4-7-desktopparallel-session-ops 2026-05-31
P1RESOLVEDβ€”2026-05-30 22:03 EDTpick_summary_stats_48h.json is 43h stale on live site
audit_dashboard/data/pick_summary_stats_48h.json generated_at=2026-05-29T06:38:50Z while sibling dashboard_data.json + money_ready_verdict.json are fresh (2026-05-31T00:02Z). 48h recency panel not refreshing β€” blocks PR #119 (resolver DESC fix) verification + all 48h Goal #1 freshness claims.
audit_dashboard/data/pick_summary_stats_48h.json + writerIdentify the writer of pick_summary_stats_48h.json. Cron-driven or part of audit-dashboard.yml? If part of audit-dashboard.yml, why is it lagging vs dashboard_data.json. Investigate audit_trail/dashboclaude-opus-4-7-desktopparallel-session-ops 2026-05-31
P1RESOLVEDβ€”2026-05-29 07:49 EDTclaudes_test_state.json gitignored -> audit-hourly-update crashes every CI run
STATE_FILE gitignored (.gitignore:469), absent in fresh CI checkout; load_state() fail-loud -> audit-hourly-update.yml + deploy-competition crash each run. Fake-empty seed would publish misleading live portfolio data -> needs owner decision.
audit_dashboard/generate_hourly_update.py; .gitignore:469OWNER DECISION: rebuild-from-DB before read / persist state / graceful-skip on absent (NOT a fake seed)claude-opus-4-7PR #94 + PR #102
P1RESOLVEDβ€”2026-05-29 07:49 EDTjob-health.md self-commit loop spams main (~every 11min)
branch-large-file-dup-guard.yml prepends a timestamped alert + commits updates/job-health.md on every run whenever cross-branch dup blobs exist (always), full-block churn polluting main history.
.github/workflows/branch-large-file-dup-guard.ymlcontent-idempotent commit: signature=sorted(blob:branch_count); skip when unchangedclaude-opus-4-7PR#48
P1RESOLVED2026-06-16
due in 0d
2026-05-29 07:49 EDTNode 20 action deprecation β€” 94 workflow files on old majors
GitHub forces Node 24 default 2026-06-16 (Node 20 removed 2026-09-16). ~94 workflows still on checkout@v4/setup-python@v5/upload-artifact@v4/etc. Zero SHA-pins; tag rewrite safe.
.github/workflows/* (tail ~94 files)bump to checkout@v6/setup-python@v6/cache@v5/upload-artifact@v5/etc on a clean .github-scoped branchclaude-opus-4-7PR#47
P1RESOLVEDβ€”2026-05-29 07:49 EDTfailure-guardian blind to masked (green-job/failed-step) failures
actions_failure_guardian.py keys only on top-level job conclusion, so 316 continue-on-error step failures across 60 workflows are invisible (green but lying).
scripts/actions_failure_guardian.pydetect_masked_failures() via GET /runs/{id}/jobs; flag success-job with failed step; report-only + Discordclaude-opus-4-7PR#50
P1OPENβ€”2026-05-27 02:34Multi-AI panel reached wrong COMMODITY consensus on ungrounded prompt
5-engine NVIDIA NIM panel (Kimi K2.6 + GPT-OSS-120B + GLM-5.1 + Nemotron Super 49B + Mistral Nemotron) unanimously declared COMMODITY the system's #1 alpha, recommending 20-30% allocation. The 3-engine codex/grok/gemini panel (shown the same numbers PLUS leakage signals) classified the same cell DAT…
tools/swarm/api_consult.py + consult-nvidia-models / consult-cloudflare-models skillsMandate inclusion of reports/hypothesis_registry.json rejected-hypothesis entries that intersect the prompt's asset class. Update consult-nvidia-models/SKILL.md + consult-cloudflare-models/SKILL.md toclaude-opus-4-7+roo-deepseek-sessionπŸ“„ doc
P1RESOLVEDβ€”2026-05-24 23:51summary_picks.json shows identical last_pick_at=2026-05-24T11:00:00Z across all asset classes β€” fixture suspicion
All asset classes report the same last-pick timestamp to the second. Statistically implausible β€” strongly suggests this file is auto-generated/simulated rather than computed from real picks. 3/3 swarm REAL.
audit_dashboard/data/summary_picks.json + its writerIdentify the writer of summary_picks.json. If it's a fixture, replace with a real query that pulls MAX(created_at) per category. If it's a real query that's bugged, fix the GROUP BY.opencode/ring-2.6-1t + 3-engine swarmπŸ“„ doc Β· πŸ”— page Β· verified 2026-05-31
P1OPENβ€”2026-05-24 23:41PnL integrity mismatch on 38.97% of sampled closed picks
db_health.json reports 10,501 / 26,945 sampled rows have a >1% pnl discrepancy between stored pnl_pct and recomputed (entry/exit/direction). 12,735 have >0.01% mismatch. Tier: RED. All cohort WR/PF stats built on top of trading_picks.pnl_pct are suspect at this drift level.
trading_picks.pnl_pct integrityRe-resolve historical closed picks via re_resolve_historical_v2.py (referenced in template.html). Quantify per-strategy drift and re-publish asset_class_health post-fix.qwen-codeπŸ“„ doc
P1RESOLVEDβ€”2026-05-24 23:41Top-N Rank Backtest tool returned Access denied
tools/top_n_rank_backtest.py read DB_STOCKS_PASSWORD but this host sets DB_PASS_STOCKS. Fell back to default password 'stocks' -> MySQL 1045 Access denied. Also queried asset_class/score columns that don't exist on live trading_picks (uses category/elite_score).
tools/top_n_rank_backtest.pyTwo commits: 702eac27 (env-var aliasing) + c5fcbdc1 (schema columns). Verified live: EQUITY 90d returns n=85, top-10/day cum PnL +1.16%.ring-2.6-1t⎌ 702eac2 · ⎌ c5fcbdc
P1RESOLVEDβ€”2026-05-24 23:41Smart Picks 'Signal Time' is dashboard-file age, not pick age
smart_picks_feed pick objects lack the signal_time field. Template logic falls back to age_hours which is computed at dashboard JSON build time. So all rows display the same '1.4h ago' regardless of when the pick actually fired.
audit_trail/dashboard_generator.py (smart_picks_feed builder)Populate signal_time = trading_picks.created_at on every entry in the smart_picks_feed payload. One-line addition.claude-opus-4-7πŸ“„ doc Β· πŸ”— page
P1OPENβ€”2026-05-24 23:41Swarm Picks tab effectively abandoned
data/swarm_picks.json has 38 picks; newest is dated 2026-05-12 (13 days old). Workflow swarm-pick-review.yml runs daily but no longer adds picks β€” only resolves the existing 38.
audit/ Swarm Picks tab / .github/workflows/swarm-pick-review.ymlEither revive multi_model_pick_gen.py so fresh consensus picks flow in, OR deprecate the Swarm Picks tab and redirect to /audit/ai-tournament.html.claude-opus-4-7πŸ“„ doc Β· πŸ”— page
P2RESOLVEDβ€”2026-06-12 21:48 EDTBATTLE_REPORT.md publishes simulated funding rates + Feb-17 static results as hourly 'real-time battle test'
Agent-verified 2026-06-13: battle_test.yml regenerates BATTLE_REPORT.md hourly; battle_test_real_time.py:103 SIMULATES funding rates ('in real system, fetch from Binance/Bybit') yet renders them as 100%-confidence Live Signals; Survivors/Eliminated load from forward_test_results.json whose period EN…
battle_test_real_time.pyEither retire the workflow or truth-label the output (SIMULATED funding banner + real data-vintage stamp Feb-17) + add price failover. Do not let peers cite its survivors list.claude-fable-mdreviewβ€”
P2OPENβ€”2026-06-12 21:48 EDTDashboard reads KIMI_RISEOFTHECLAW files directly at ~10 sites β€” AI surfaces bypass the honest ledger
PROGRESS.MD claim agent-CONFIRMED 2026-06-13: dashboard_generator.py reads KIMI_RISEOFTHECLAW/data/*.json|sqlite directly (lines 3779, 4223-4224, 8785, 9039, 9173, 10087, 10846, 11181, 12436, 16430) β€” kimi-attributed numbers on dashboard surfaces never pass at_signal_outcomes resolution/guards. Same…
audit_trail/dashboard_generator.pyMigrate the ~10 direct reads to ledger-backed queries (or label those panels NON-LEDGER/UNAUDITED); PROGRESS.MD's reconciliation work is the companion.claude-fable-mdreviewβ€”
P2OPENβ€”2026-06-12 14:41 EDTTrack-record stats are DIRECTION-BLIND: sym_track_wr keys on strategy/source+symbol only (dashboard_generator.py:5495)
Validated 2026-06-12 (agent finding, code-verified): _build_source_symbol_track_stats + _build_strategy_symbol_track_stats key on (system|strategy)+symbol WITHOUT direction. The active-picks Track column therefore shows the same WR for a LONG and a SHORT on the same symbol. Given the now-proven CRYP…
audit_trail/dashboard_generator.pyAdd UPPER(direction) to the track keys (+the _pit at-issue variant), bump n-floor note in template Track tooltip; audit consumers of sym_track_* before changing key shape.claude-fable-edgehuntβ€”
P2RESOLVEDβ€”2026-06-12 02:40 EDTml_gatekeeper A/B router last-mile broken: tagged picks never reach the analysis reader (n_ab_tagged=0)
ML audit: _ab_sleeve picks written to ml_gatekeeper/data/active_picks_ab_new.json (gatekeeper.py:1001) which is never committed/merged into alpha_engine/data/closed_picks.json that ab_analysis.py:95 reads; plus _ab_arm vs _ab_sleeve field mismatch. Cheap fix, high information (the A/B answers whethe…
ml_gatekeeperMerge ab_new picks into the closed-picks path (or point ab_analysis at the ab file) + unify the field name.claude-fable-mlauditβ€”
P2IN_PROGRESSβ€”2026-06-11 17:45 EDTTerminal NULL-pnl regrowing: active writers close picks without pnl
737 terminal rows with NULL pnl (was 131 after the 06-10 recovery). Ongoing writers: multi_asset_copytrader (latest 2026-06-11 19:13), alpha_engine, cta_replicator. The backfill recovers but the writers should set pnl at close; each NULL row is invisible to PF/WR until backfilled. (money-maker-ready…
multi_asset_copytrader + cta_replicator close pathsSet pnl_pct at close-time in the three writer paths; schedule backfill_resolved_pnl.py weekly as a safety net (now price-sanity-guarded).claude-fable-mmreadyβ€”
P2WONTFIXβ€”2026-06-09 13:53 EDTAlgorithm Competition workflows fail: STOCKS/competition/run_competition.py lost in 2026-05-23 history strip
GHA "Algorithm Competition Refresh" (algorithm-competition-refresh.yml) + "[torontoevent] Algorithm Competition Refresh" (torontoevent-algorithm-refresh.yml) both fail every scheduled run: "python3 STOCKS/competition/run_competition.py -> No such file or directory" (exit 2). The script is absent fro…
.github/workflows/algorithm-competition-refresh.yml + torontoevent-algorithm-refresh.yml; STOCKS/competition/run_competition.pyOPERATOR: restore run_competition.py from a pre-2026-05-23 clone / runner cache, git add + commit. STOP-GAP (claude): guard both workflow steps to skip-with-warning if the file is missing, so the scheclaude-opus-4-8 (GHA audit)β€”
P2OPENβ€”2026-06-09 00:50 EDTkimi_direct 66.1% WR is stale + losing-tail (correctly excludes MISPRICED but misleading)
model.html?id=kimi_direct WR=66.1% (37W/19L). VERIFIED it correctly EXCLUDES 46 MISPRICED_ENTRY picks (model.html resolved=WIN/LOSS/EXPIRED only; live DB WR incl/excl mispriced both 66.1%). BUT misleading: (1) data ends 2026-05-22 β€” model dark 18 days; (2) the 3 most-recent resolved picks were ALL L…
audit_dashboard/model.html / tournament_picks(model_id=kimi_direct)Add a recency/staleness badge + "last resolved N days ago" to model.html; flag models dark >14d as STALE; dedup tournament_picks NZDUSD row; surface most-recent-3 outcome strip so a losing tail is visclaude-opus-4-8πŸ”— page
P2RESOLVEDβ€”2026-06-08 23:31 EDTGHA: Daily Per-Class Scrutiny Engine failing β€” scipy not installed
Engine imports scipy but workflow only `pip install pymysql`. CROSS-CHECKED: added scipy+numpy; re-run 27181760292 = SUCCESS.
.github/workflows/daily-scrutiny-engine.ymlAdd scipy to deps (done). Verified green.claude-opus-4-8 (GHA health audit + cross-check 2026-06-09)⎌ 94726fb
P2IN_PROGRESSβ€”2026-06-08 23:31 EDTGHA: AI Leaderboard Freshness failing β€” (a) missing DB creds FIXED, (b) residual regime_report.json missing on runner
Two causes. (a) DB-reading steps lacked DB password -> 'no DB password found': FIXED via job-level AUDIT_DB_*/DB_PASSWORDS_JSON env (committed). (b) RESIDUAL: workflow still red because the FTP step's `deploy_audit_files.py --only audit_data` reports 'local missing' for alpha_engine/data/regime_repo…
.github/workflows/ai-leaderboard-freshness.yml + tools/deploy_audit_files.pyDB-creds fixed. For residual: either generate regime_report.json in-workflow before deploy, OR make deploy_audit_files treat a missing-LOCAL source as skip+warn (not a hard exit-1), since these data fclaude-opus-4-8 (GHA health audit + cross-check 2026-06-09)⎌ 94726fb
P2RESOLVEDβ€”2026-06-06 03:34 EDTBabies sweep aborts entirely when one strategy file raises SystemExit on import
baby_strategies/backtest_batch_round3.py does raise SystemExit("pip install yfinance") at import; SystemExit derives from BaseException so the loader except Exception missed it and the whole sweep died at file 10/15 (run 27056111987 exit 1).
incubator/backtest_team/real_data_sweep_runner.py(see resolution_notes)claude-opus-4-8⎌ daf08d2
P2OPENβ€”2026-06-06 03:34 EDTThree models get 'T1' badges with n_resolved 41-98 (< charter n>=100) and PF lower-CI straddling 1.0
Tier is assigned by absolute PF/WR thresholds (T1 = PF>=2 & WR>=55) gated only at n>=30, not the project charter's T1 requirement of n>=100 clean trades. kimi_direct (n_resolved=54, pf 2.573, pf_ci_lo 1.313), grok3 (n=98, pf 2.216, pf_ci_lo 1.338), and llm7_qwen (n=41, pf 2.088, pf_ci_lo 0.929) all …
audit-surfaceGate the T1 badge itself on n_resolved>=100 AND pf_ci_lo>=1.5 (charter Tier-1 = PF>2); render anything below as 'T1-provisional' or downgrade to T2/T3 until n>=100. Suppress the T1 color until the CI claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P2OPENβ€”2026-06-06 03:34 EDTMISPRICED_ENTRY exclusion drops 58.5% of all picks; high-rank models have the heaviest exclusion
4,154 of 7,099 picks (58.5%) are excluded as MISPRICED_ENTRY (entry_price drifted >25% from market at submission). The exclusion is not random across the leaderboard: grok3 has 262 MISPRICED vs 52 resolved (83% of resolvable picks excluded), and deepseek_v4 shows n_picks=352 but only 39 resolved (28…
audit-surfaceAdd a per-model 'exclusion rate' column (n_excluded / (n_excluded+n_resolved)) and flag models where it exceeds e.g. 60% as 'survivorship-suspect'. Investigate why entry_price drifts >25% (data-feed lclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P2OPENβ€”2026-06-06 03:34 EDTFunnel WR% excludes the majority of economically-resolved trades by bucketing TIME_EXIT as non-decisive
_classify_status() in extract_funnel.py maps TIME_EXIT/EXPIRED/STALE/FLAT to CLOSED_FLAT (non-decisive), counting only TP_HIT as WIN and SL_HIT/LOST as LOSS. For CRYPTO this drops 11,727 TIME_EXIT closes that DO carry real pnl_pct (5,499 positive / 6,226 negative). The displayed decisive denominator…
audit-surfaceIn _classify_status / the funnel WR computation, score TIME_EXIT (and EXPIRED with non-null pnl) by pnl_pct sign rather than dumping to CLOSED_FLAT, OR surface a separate 'time-exit win-rate' column aclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P2OPENβ€”2026-06-06 03:34 EDTAdvertised multi-asset coverage is a facade β€” CRYPTO/BOND/FUTURES never produced; TOP-20 is 80% equity
The page is positioned as 'best picks across stocks/crypto/forex/bonds/ETFs/commodities/futures' and references BOND (TLT/IEF/SHY) and CRYPTO in methodology/tier tables. But the live data never contains them as actual picks: picks_now.json 'all' (50) = EQUITY 31 / ETF 10 / FOREX 5 / COMMODITY 4, zer…
audit-surfaceEither wire real CRYPTO/BOND/FUTURES candidate generation (yfinance/CoinGecko for crypto, bond-ETF momentum for BOND) so the page delivers on its multi-asset promise, OR relabel the page honestly as 'claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P2OPENβ€”2026-06-06 03:34 EDTNo staleness gate: header stuck at 'Loading...' and JSON age never shown/guarded
The page header (#header-updated line 112) and market strip (line 133) ship literal 'Loading...' and are only replaced client-side from data.generated_at; there is no max-age check. If picks_now.json fails to refresh (the GHA picks-now-refresh workflow or FTP deploy stalls), the page will silently d…
audit-surfaceCompute age = now - generated_at in the render JS; if > ~30min show a red 'STALE β€” prices may be outdated, do not trade' banner and gray the cards. Have the refresh workflow write generated_at and verclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P2RESOLVEDβ€”2026-06-06 01:58 EDTcta_golden_cross listed in `elite` tier but PF=0.59 WR=54.55% β€” should not be promoted
strategy_tiers.json lists cta_golden_cross in `elite` array alongside prediction_market_consensus (PF 11.84). Actual stats: PF=0.592, avg_pnl=-0.037, sharpe_proxy=-0.11. This is a tier data quality error β€” the strategy has negative expected value and should be in `mutation_candidates` or demoted, no…
alpha_engine/data/strategy_tiers.jsonRemove cta_golden_cross from `elite` array in strategy_tiers.json. Move to `mutation_candidates` for Axes 2/4 investigation (direction gate or vol-normalized threshold).claude-sonnet-4-6 2026-06-06β€”
P2OPENβ€”2026-06-04 22:34 EDTPer-strategy perf trackers stale (fx_algo Feb / strategy_symbol_perf Mar / rolling Apr)
fx_algo_performance last 2026-02-09 (all 8 rows 0 picks); at_strategy_symbol_performance (410, only crypto true-PF) snapshot 2026-03-06; algorithm_rolling_perf max calc_date 2026-04-27; algorithm_performance Feb-Mar all-negative. Per-strategy perf not refreshing -> stale per-class edge reads.
fx_algo_performance, at_strategy_symbol_performance, algorithm_rolling_perf, algorithm_performanceRe-point the nightly strategy-stats rebuild at these tables; or deprecate dead ones (fx_algo_performance) and rebuild FX/equity perf from at_pick_outcomes.claude-opus-4-8πŸ“„ doc
P2RESOLVED2026-06-02 17:30 UTC
OVERDUE 14d
2026-06-02 14:20 EDTPR branch fast-forward corruption: docs/eagle4-eagle5-gates-2026-06-02 lost EAGLE-4/5 commits
Branch docs/eagle4-eagle5-gates-2026-06-02 (carrying PR #447) was fast-forwarded past my 2 EAGLE-4/5 commits (c39d266bfa, 47e3ec4f7c) by concurrent agent activity (PR #455 walkforward-lazy-imports merge). Branch tip moved from c39d266bfa to bda830d7b9 with no EAGLE-4/5 work in the new tip. Rebase te…
git-branch-stateWhen a doc-only branch must coexist with concurrent auto-syncs, build on a clean new branch from current origin/main and add only the doc files. Cherry-picking old commits onto a fast-forwarded tip wiminimax-m3-freeπŸ“„ doc Β· #447(closed) Β· #461 Β· #455
P2OPENβ€”2026-06-02 14:36Source concentration: kimi_riseoftheclaw = 43.6% of all picks
Single source system dominates the pick book at 43.6% concentration (HHI=0.232). Should be < 40%.
alpha_engine/smart_picks_engine.pyCap any single source at 40% of the aggregate book.mimo-v2.5-proβ€”
P2RESOLVEDβ€”2026-06-02 10:17 EDTcheck_resolver_health forward_test probe false YELLOW (conn closed early)
tools/check_resolver_health.py main() queried forward_test_only counts after conn.close(), surfacing false YELLOW. Fix on branch fix/resolver-health-forward-test-2026-06-02 (PR #457). forward_test columns on at_pick_outcomes were ALTER-applied 2026-06-02 (forward_test_only, forward_validated, _gated…
tools/check_resolver_health.pyMerge PR #457; re-run check_resolver_health.py on main; confirm forward_test section green when columns exist.cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· 457
P2RESOLVEDβ€”2026-05-30 22:24 EDTSilent: anti_overfit_audit 20d stale + walkforward by_class 6wk empty (|| non-fatal swallow)
anti_overfit_audit.json generated_at=2026-05-11 (~20d stale); dashboard_data.json::walkforward.by_class empty, sub-block generated_at=2026-04-15 (6wk). Both run hourly behind '|| echo non-fatal' so they fail silently -> per-strategy DSR table + OOS-divergence detector run on stale data.
tools/anti_overfit_audit_sidecar.py + alpha_engine/walkforward_validator.py (audit-dashboard.yml)Remove the non-fatal swallow or alert on staleness; fix the underlying sidecar/walkforward failures.claude-opus-4.8(money-maker-readyv2 fleet)PR #156
P2RESOLVEDβ€”2026-05-30 22:06 EDTtorontoevent-forward-test.yml still on main referencing deleted forward_test.py
PR #113 was created to rename to .disabled but merge was blocked by branch conflict. The workflow still references STOCKS/competition/forward_test.py which does not exist on origin/main. Every run fails at python invocation step.
.github/workflows/torontoevent-forward-test.ymlRe-attempt: git mv .github/workflows/torontoevent-forward-test.yml .github/workflows/torontoevent-forward-test.yml.disabled. Same disposition as PR #97.kilo-xiaomi-mimoPR #155
P2RESOLVEDβ€”2026-05-30 22:06 EDTAI Tournament snapshot race β€” recurring regression to 18/478 picks
During this session agents observed tournament data regressing from 40 models/3,873 picks back to 18 models/478 picks multiple times. Root cause: a peer merge_submissions_to_latest pipeline overwrites the agent-rebuilt JSON with stale data. Auto-restore works but root cause not fixed.
tools/ai_tournament/rebuild_latest_from_db.py, alpha_engine/merge_submissions_to_latest.pyFind and disable the peer pipeline that overwrites ai_tournament_picks_latest.json with stale snapshot. Or protect the agent-rebuilt JSON via exclusive lock / timestamp check.kilo-xiaomi-mimoPR #161
P2RESOLVEDβ€”2026-05-30 22:06 EDTat_strategy_stats table has 0 rows β€” never populated
refresh_strategy_stats_mysql() landed in PR #91 (audit_trail/mysql_client.py) but is never called from any cron pipeline or workflow. The table remains empty. No feedback loop exists for strategy performance tracking.
audit_trail/mysql_client.py, .github/workflows/Wire refresh_strategy_stats_mysql() into a cron workflow (audit-dashboard.yml or new hourly job). Also wire into at_strategy_stats consumption β€” verify which dashboard/audit components read this tablekilo-xiaomi-mimowave-3 backfill
P2RESOLVEDβ€”2026-05-30 22:06 EDTBond Emitter workflow fails (FRED_API_KEY secret missing)
alpha-engine-bond.yml fails: pandas_datareader/fredapi missing + FRED_API_KEY not set. All FRED backends timed out at 30s.
.github/workflows/alpha-engine-bond.ymlOperator: (1) Add FRED_API_KEY to GH secrets, (2) pip install pandas_datareader fredapi in the workflow, (3) Test with FRED test endpoint. BOND data is currently unavailable β€” Goal #6 BOND class analykilo-xiaomi-mimoβ€”
P2RESOLVEDβ€”2026-05-30 22:06 EDTDB Backups SMTP relay failure (550 relay not permitted)
db-backup-email.yml fails: SMTP rejected mail relay (550). Backup itself succeeds but notification fails.
.github/workflows/db-backup-email.ymlOperator: provision SMTP creds on 50webs or switch to a working SMTP provider. Backup logic is correct; only the notification email fails.kilo-xiaomi-mimoPR#168
P2RESOLVEDβ€”2026-05-30 22:06 EDT15 residual NULL trust_score closed picks (strategies not in at_signal_outcomes)
After trust_score backfill (32,683 rows + 15,166 floored at trust=2), 15 closed trading_picks rows still have NULL trust_score for strategies that do not appear in any at_signal_outcomes source system. These are orphaned strategies with zero outcome history.
trading_picks.trust_scoreFloor at trust=2 UNPROVEN (already done for majority). These 15 have no outcome history anywhere β€” cannot derive stats. Consider whether they should be excluded from HC overlay entirely or treated as kilo-xiaomi-mimowave-1 backfill
P2RESOLVEDβ€”2026-05-30 22:03 EDTdashboard_data.json 52h stale (fails 2h freshness gate)
As of 2026-05-31 02:30 UTC, dashboard_data.json generated_at=2026-05-28T21:29Z (age 52.5h). The money-maker-readyv2 freshness gate fails fast >2h. money_ready_verdict.json and pf_registry.json ARE fresh (2026-05-30T23:05Z) so the main verdict pipeline runs, but the dashboard_data.json regenerator (w…
audit_dashboard/data/dashboard_data.jsonInvestigate the dashboard_data.json regeneration job; ensure it runs on the same cadence as money_ready_verdict.json.claude-opus-4.8(money-maker-readyv2)β€”
P2RESOLVEDβ€”2026-05-30 22:03 EDTma_strategy_leaderboard.json is 44h stale AND empty (total_variants=0)
audit_dashboard/data/ma_strategy_leaderboard.json generated_at=2026-05-29T05:27:24Z (~44h stale) with leaderboard=[]. This is the MA Strategy Forward-Tracker v2 output (PR session 2026-05-29). Either the periodic cron is not wired, or tools/ma_strategy_forward_tracker.py is crashing silently.
tools/ma_strategy_forward_tracker.py + ai_leaderboard.html consumerWire ma_strategy_forward_tracker.py into a periodic cron (weekly is fine β€” strategy survey, not per-pick). If the empty-leaderboard is intentional (no golden cells passed gate), document that explicitclaude-opus-4-7-desktopPR #163
P2RESOLVEDβ€”2026-05-24 23:51IPO asset class advertised as 'tracked' on /audit but has zero coverage
/audit lists IPO as one of the tracked asset classes but the codebase has zero IPO-specific strategy or pick writer. 1/3 swarm REAL + 2/3 NOISE β€” added at P2 because the UI advertises the class and the user-facing claim is misleading even if low impact.
audit_dashboard tab listing / IPO scanner (missing)Either (a) remove the IPO claim from the UI until a writer exists, or (b) build a minimal IPO scanner using the PEAD framework adapted for lockup expiry + insider selling + revenue trajectory.opencode/ring-2.6-1t + 1/3 swarmπŸ“„ doc Β· πŸ”— page Β· PR #150
P2RESOLVEDβ€”2026-05-24 23:4156,559 ghost rows in trading_picks (top cohort: 20,474 identical MATICUSDT entries)
ghost_rows audit: 12 cohorts with thousands of identical (asset_class, strategy, symbol, direction, pnl_pct) rows. Top: CRYPTO/quan_engine/MATICUSDT/LONG/pnl=-15.0 with n=20,474 from 1 distinct entry. MEMECOIN/meta_strategy variants make up the next 10. This single cohort alone is dragging quan_engi…
trading_picks ghost-row write pathDEDUP via (asset_class, strategy, symbol, direction, pnl_pct, created_at) where distinct_entries=1 and n>50. Investigate the writer that's emitting the duplicates. quan_engine + meta_strategy are the qwen-codeπŸ“„ doc
P2OPENβ€”2026-05-24 23:41UNKNOWN asset_class on 951 active + 54 closed picks
Category is NULL/UNKNOWN for 951 active picks (~10% of active set) and 54 closed (35.2% WR). UI can't apply per-class gates to UNKNOWN rows. Cross-class stats undercount these.
trading_picks.category writer / classifierBackfill UNKNOWN rows using symbol pattern matching (USDT/BTC suffix -> CRYPTO; =X suffix -> FOREX; etc.). Add a classifier guard at write time.claude-opus-4-7β€”
P2OPENβ€”2026-05-24 23:41trust_score NULL on 99.99% of closed picks
trading_picks.trust_score is NULL on 38,884 of 38,889 closed picks. HC overlay requires trust_score>=4 (CRYPTO) / >=5 (EQUITY). Cited CRYPTO 60.3% N=562 and EQUITY 68.1% N=72 stats unreproducible β€” only 5 closed picks have a non-NULL trust_score.
trading_picks.trust_score / audit_dashboard/hc_filter.jsBackfill trust_score from strategy registry OR move HC gate to a field that IS populated (elite_score / derived TRUST tier). Or mark HC overlay UNVERIFIABLE on UI until backfill lands.claude-opus-4-7πŸ“„ doc
P3OPENβ€”2026-06-06 03:34 EDTWire-Up Rule: failover modules feed only the price-fill path, not score/gate functions
Per the repo Wire-Up Rule grep: `grep -rln forex_price_failover|commodity_price_failover alpha_engine/ tools/ audit_trail/` returns only the two modules + universal_price_enricher.py. The enricher IS in the production path (imported by production_scanner.py and run by .github/workflows/alpha-engine-…
audit-surfaceNo code change required for the rule. Document in the module docstring that failover prices feed PnL + TP/SL closure (audit numbers), and that FRED/ECB proxy sources must be display-only per the P1/P2claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P3OPENβ€”2026-06-06 03:34 EDTSummary table renders concrete PF up to 4.67 / WR up to 75% on n=2-4 samples
The Model-Summary table shows full PF/WR cell values for statistically meaningless samples: gpt4o_mini PF 4.67 on n=4, claude_opus PF 3.88 on n=2, fireworks_qwen 75.0% WR / PF 2.25 on n=4, nvidia_minimax_m2 PF 1.71 on n=2, alpha_engine PF 1.82 on n=2. The only mitigation is an amber color + '⚠' glyp…
audit-surfaceFor rows with resolved<MIN_N_TO_RANK, gray out (force pnl-neu) the PF and WR cells, append '~' or show them as 'PF β€” (n=4)', and never apply the green pnl-pos class below n=30. Optionally render PF asclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P3RESOLVEDβ€”2026-05-30 22:50 EDTat_local_picks 304k duplicates claim NOT_REPRODUCED
Peer claim alleged ~304,000 duplicate rows in ejaguiar1_stocks.at_local_picks. Re-ran tools/at_local_picks_dedup.py (dedup key: symbol+direction+source_system+signal_timestamp) on 2026-05-31 02:49 UTC: table holds 37,763 rows total with 0 duplicate groups. Claim does not reproduce β€” likely confused …
at_local_picksNone required β€” claim not reproduced. If the claim resurfaces, ask for the exact dedup key and table size at the time of measurement before re-running.claude-opus-4-7πŸ“„ doc
P3RESOLVEDβ€”2026-05-30 22:06 EDTat_signal_outcomes SL_HIT rows have 24% with positive pnl_pct (labeling inconsistency)
24,363 SL_HIT rows in at_signal_outcomes: 18,511 have pnl<0 (correct) but 5,852 (24%) have pnl>0 β€” a stop-loss hit that made money is contradictory. Likely entry_price/exit_price stored with different sign conventions across sources.
at_signal_outcomesInvestigate source_system breakdown: which engines produce SL_HIT+pnl>0? Decide whether to reclassify these as TP_HIT or normalize the sign convention.kilo-xiaomi-mimoπŸ“„ doc Β· PR#164 eltonaguiar/findtorontoevents_antigravity.ca
STOCKS (12)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P0RESOLVEDβ€”2026-05-30 22:24 EDT~90% of EQUITY picks mistagged as crypto β€” corrupts all EQUITY verdicts
FIRING13 report: 198/218 'EQUITY' symbols are mistagged crypto -> real EQUITY n~=20 not 39. Every EQUITY number (n=39 verdict, db_raw 90d PF 5.555 DISPUTED, 48h panel) is unreliable until tagging fixed. Same EXPIRED->WON leakage family as the disputed CRYPTO 78.9%.
asset-class tagging in pick pipeline (see reports/continual_research/6gate_validation/FIRING13_VT_PATTERN_SWEEP_EQUITY_SUBREPORT_2026-05-21.md)Fix asset_class tagging so real-EQUITY symbol share >=95% in resolved cohort; re-run money_ready_verdict.claude-opus-4.8(money-maker-readyv2 fleet)β€”
P0OPENβ€”2026-05-30 22:06 EDTEQUITY emission unlocked (1,424 outcomes) but all strategies PROBATION-tier (trust_score=3)
at_signal_outcomes EQUITY surged 169β†’1,424 after PR #121 + live DB backfill. But 0 EQUITY picks meet HC overlay gate (trust>=5). WR=42.1%, PF=0.39 β€” class is still deeply losing. Goal #1 EQUITY acceptance (>=1000 outcomes) MET but no statistically-valid edge exists yet.
at_signal_outcomes, alpha_engine/equity strategiesBuild or rewire EQUITY strategies that produce profitable picks. Current picks come from PROBATION-tier strategies (trust=3). Need WR>=50% + PF>=1.5 on n>=100 to reach T2. Investigate: are existing EQkilo-xiaomi-mimoβ€”
P0OPEN2026-06-01 17:00 EST
OVERDUE 15d
2026-05-24 23:41PEAD equity strategy stuck in shadow mode
The only WF-VERIFIED equity strategy (62.2% OOS WR on 2-day window) is the new pead_equity, but it never made it past shadow. Meanwhile the broken earnings_drift (0% WR on 92 picks) was active in prod.
alpha_engine/pead_equity (shadow mode)Promote pead_equity from shadow -> probation. Document wire-up in updates/ per the Wire-Up Rule.ring-2.6-1tβ€”
P1OPENβ€”2026-06-12 23:41 EDTpicks-now regime guard reads market_regime (intraday-rebounding) not risk_off_explanation (authoritative) β€” under-fires on falling knives
MiniMax finding CONFIRMED live 2026-06-13: picks_now.json shows market_regime='NEUTRAL' AND risk_off_explanation='Market appears RISK-OFF (recent 5d declines across SPY/QQQ/BTC)' simultaneously β€” they disagree. The regime variable (picks_now_professional.py:1410) is a 5d-return 2-state classifier th…
tools/picks_now_professional.pyWhen market_regime disagrees with risk_off_explanation, trust the conservative signal (demote). Simplest: derive the guard's regime from the same condition that builds the explanation, OR make the guaclaude-fable-reviewβ€”
P1RESOLVEDβ€”2026-06-09 00:19 EDTpicks-now dividend yield double-multiplied (yfinance *100) β€” impossible yields + corrupted ranking
tools/picks_now_professional.py yfinance path did div_yield *= 100, but yfinance already returns dividendYield as a percent-like value (GOOGL 0.24=0.24%). Live picks-now.html showed GOOGL 24%, SBUX 262%, PLD 300%, and the >3 score gate handed a spurious +5 bonus to 14 of 20 picks, corrupting rank or…
tools/picks_now_professional.pyRemoved the *100 scale on the yfinance primary path (FMP fallback fixed separately by peer); div_yield used as-is.claude-opus-4-8πŸ“„ doc Β· πŸ”— page Β· ⎌ 6053b3e
P1OPENβ€”2026-06-06 09:21 EDTstocks_rsi2_pullback emission stopped 2026-05-29 β€” 8+ days dark, edge lost
alpha_engine/production_scanner.pyRe-add stocks_rsi2_pullback to production_scanner.py dispatch β€” removed 2026-05-28 (line 488), config.py un-kill 2026-05-31 never restored caller. Validated 30d: WR=58.8% PF=2.68 n=894 dedup-clean.claude-codeβ€”
P1OPENβ€”2026-06-06 03:34 EDTTop pick AMZN (Score 133, 6/7 AI APPROVE) hides a true DB record of 3W/34L (8.1%)
AMZN is the page's #1 headline pick (highest score, 6/7 models approve, 'classic TREND+DIP'). But its own at_pick_outcomes resolved record is WON=3, LOST=34 (8.1% WR excl EXPIRED). The card surfaces analyst consensus, Piotroski, Altman-Z but NOT the catastrophic internal hit-rate, because the dbVal …
audit-surfaceSurface the resolved DB WR on every card (not just when >=55%) and add a red 'DB hit-rate poor (n)' badge when db_wr<40% with db_n>=10. Down-weight the multi-factor score by the DB-edge overlay when tclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P1OPENβ€”2026-06-05 01:39 EDTHermes/Nemotron-3-Super backupless trading_picks mutation 2026-06-05T05:30Z
tools/mysql_stale_picks_resolver.py + raw SQL UPDATEDocument that ~250 status renames + ~9 LOSS->WIN outcome flips were applied without pre-snapshot to ejaguiar1_backups. Audit policy: any future status mutation requires archive_table_slice to ejaguiarclaude-opus-4-7 MD-review-loopβ€”
P1OPEN2026-06-05 17:00 EST
OVERDUE 11d
2026-05-27 02:34Penny/meme names still pollute the main EQUITY sleeve
Research and backtest evidence is concentrated in cleaner large-cap equity universes, but live EQUITY still carries penny/meme contamination. This distorts both edge claims and gate calibration for the parent class.
alpha_engine/config.py EQUITY universe / live EQUITY routingSplit LARGE_CAP_EQUITY from PENNY research-only names and report them separately. Do not let speculative names share the same production quality story as the large-cap sleeve.gpt-5.4/openaiπŸ“„ doc
P1OPEN2026-06-08 17:00 EST
OVERDUE 8d
2026-05-24 23:41US Equity screener emits zero picks
The /audit/ueps tab is rendered (n=0/100 disclaimer shown) but no picks have ever been emitted. Composite (Magic Formula x Piotroski x Acquirer's Multiple x SafetyGate) is documented but has no live writer.
alpha_engine equity scanner / US Equity Picks tabWire the UEPS composite to a weekly scanner. First emit can be sample/seed to validate plumbing end-to-end.claude-opus-4-7+ringπŸ“„ doc Β· πŸ”— page
P1OPEN2026-06-10 17:00 EST
OVERDUE 6d
2026-05-24 23:41EQUITY production scanner may not be routed
code grep found no _run_equity_scanner or similar routing function in production_scanner.py main loop. Strategies (connors_rsi2, quality_compounders, equity_momentum_regime, pead_equity) exist in code but may never be called.
alpha_engine/production_scanner.py main loopAdd explicit per-class routing functions; verify each documented strategy is reachable from main(). Add a smoke test.ring-2.6-1tβ€”
P2OPENβ€”2026-06-02 10:17 EDTEQUITY production FAIL β€” tournament symbol picks WEAK
Swarm: deepseek_v4/gpt4o tournament SUPPORT for paper; BAC/JPM/MSFT/NVDA equity picks WEAK. EQUITY class FAIL+INSUFF-N in money_ready (PF ~0.90, WR ~33%, n=33). Production scanner routing unproven.
alpha_engine/production_scanner.py / audit/ai-tournament.htmlKeep EQUITY on paper/tournament track; fix routing + n before probation; no production size-up.cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
ETFS (3)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P2OPENβ€”2026-06-06 03:34 EDTETF 90d WR 8.0% and INDEX 100% (n=2) shown without small-sample / quality caveat
The 90d table shows ETF WR 8.0% (win 2 / loss 23) and INDEX WR 100.0% (win 2 / loss 0, n=2). INDEX 100% on n=2 is a small-sample artifact and ETF 8% is catastrophic; neither carries an on-page INSUFF-N or quality flag, and the WebFetch summary even mis-narrated INDEX as a 'top performer' (perfect 10…
audit-surfaceApply the same small-n / INSUFF-N badge logic used elsewhere on /audit (n<20 or n<100 thresholds) to the 90d funnel table cells, and grey-out / footnote WR for classes with decisive n below the per-clclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P2OPENβ€”2026-06-02 10:17 EDTEAGLE2: Live ETF policy-clean n=3 β€” sleeve stats not promotable from symbols
money_ready ETF INSUFFICIENT_DATA (n=3). Tournament symbols EEM/IWM/GLD show paper WR but cherry-pick β‰  sleeve. Lab etf_verified_dual_momentum is FORWARD_PILOT_ONLY (WF PASS).
verified_strategies/strategies/etf_dual_momentum.pyPromote sleeve-level dual momentum only; ignore single-symbol tournament cells until forward nβ‰₯30 then 100 on pilot book.grok-cli-eagle2-2026-06-02πŸ“„ doc Β· PR #458
P2OPENβ€”2026-05-24 23:41All 5 ETF strategies on probation with ZERO verified forward trades
etf_dual_momentum, etf_sector_momentum, etf_risk_parity_rotation, etf_faber_tactical, etf_trend_following all allow_without_forward=True. No track record.
alpha_engine ETF strategies / configPick one (etf_faber_tactical has strongest academic backing per Ring) and graduate to probation with a real forward floor. Document promotion path.ring-2.6-1tβ€”
CRYPTO (21)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P0OPENβ€”2026-06-02 10:17 EDTCRYPTO directional bug: production emits LONG (33% WR) when SHORT edge is 67% WR
CRYPTO directional bug: production emits LONG (33% WR) when SHORT edge is 67% WR
alpha_engine/production_scanner.pyFlip CRYPTO emission default to SHORT or invert at scanner. EAGLE-4 minimax-m3-free claims fix landed; needs verification.claude-opus-4-7πŸ“„ doc Β· PR #439 (Pillar 1 freeze landed)
P1OPENβ€”2026-06-12 14:23 EDTVALIDATED: CRYPTO LONG is the book's biggest PnL leak (WR 30.1%, sum -508.6%, PF 0.684, n=1051 intrabar) β€” P0C direction block execution-ready
Direct-SQL validation 2026-06-12 of BUFFY_STRATEGY_AUDIT_JUNE122026.MD headline (numbers match to the decimal). at_signal_outcomes intrabar TP/SL CRYPTO: LONG n=1051 WR 30.1% sum=-508.6% PF 0.684; SHORT n=104 WR 55.8% PF 1.359 β€” SHORT survives time-split (H1 1.03 / H2 1.74) AND symbol-day dedup (n_e…
alpha_engine/config.pyExecute docs/plans/2026-06-12-P0C-m036-long-gate-plan.md: add LONG/STRONG_BUY to CRYPTO_BLOCKED_DIRECTIONS, SIZED-lane-only with forward_test_only/shadow exemption (REQUIRED β€” else starves rsi5070 + hclaude-fable-mdreviewβ€”
P1OPENβ€”2026-06-12 02:40 EDTml_strategy_reviver + inverse both lose on honest intrabar (PF 0.83/0.69) β€” geometry-structural; M-105 quarantine defaults OFF
ML audit 2026-06-12: reviver emits 76 picks/7d; both the strategy AND its inversion lose on intrabar (PF 0.83 n=470 / 0.69 n=221) β€” structural TP/SL-geometry+cost failure, not direction. ML_ENHANCED_CRYPTO_QUARANTINE (money_ready_verdict.py:84) still defaults OFF.
ml_strategy_reviverMutate-before-kill: geometry repair trial in replay; default-ON the M-105 quarantine meanwhile.claude-fable-mlauditβ€”
P1IN_PROGRESSβ€”2026-06-10 14:24 EDTCRYPTO ADV liquidity gate (is_liquid_crypto) built but unwired
alpha_engine/asset_class.py:315 is_liquid_crypto() exists + tested but the only caller is tools/coingecko_adv_fetcher.py (a fetcher) β€” it is NOT in the CRYPTO emission/score gate path. Illiquid-symbol picks (a known concentration/no-edge driver) are not filtered at emit. Verified 2026-06-10 master-r…
alpha_engine/asset_class.pyCall is_liquid_crypto() in the CRYPTO emission gate (passes_active_gate / scanner path); block or shadow-tag illiquid symbols. Wire-Up Rule.claude-fable-masterrunπŸ“„ doc Β· ⎌ 68dab0a
P1OPENβ€”2026-06-09 01:00 EDTA/S-tier crypto scoring tiles are resolver artifacts (100%/72.6% WR vs ~38% intrabar-true)
Swarm cross-review (4 dimensions, HIGH confidence, forward intrabar replay): the /audit crypto S-Tier (score>=70, 100% WR 16/0/0) and A-Tier (55-70, 72.6% WR PF4.0) tiles are a survivorship+labeling ARTIFACT, NOT a tradable edge. (1) SELECTION: tiles bucket the curated recent_closed slice (template.…
audit_dashboard/template.html:6597-6617 + dashboard_data.json + snapshot resolverAdd a prominent tile DISCLAIMER (like the DISPUTED 78.9% cell): score-band WR is survivorship-biased; intrabar-true band WR ~34-38%, full-pop CRYPTO 47.7%/PF0.945. Switch tile WR to intrabar_status TPclaude-opus-4-8 (swarm w976lbctb)πŸ“„ doc Β· πŸ”— page
P1OPENβ€”2026-06-03 10:07 EDTclaude_ml_moderate_mut bootstrap PASS is single-row JUPUSDT outlier (945x)
PR #481/#482 bootstrap CI: IS_PF=310.77 on n=67 but pf_lo_95=1.31. One row (id=214622) JUPUSDT pnl_pct=76573 drives gross-profit sum; without it PF collapses. Do not promote to live or forward-pilot until sustained_pf / pf_lo_95>=1.5 on clean sample.
verified_strategies/claude_ml_moderate_mut / bootstrap CI gateBlock promotion; add sustained_pf resample metric (see updates/2026-06-02-suspicious-pass-investigation.md). B_flip and inverse_ml_enhanced_BTCUSDT_15m_D are legit forward-test candidates instead.grok-2026-06-03πŸ“„ doc
P1OPENβ€”2026-06-02 10:17 EDTCRYPTO top_source_share 0.55 concentration β€” EAGLE2 Phase 0 gate
Pick pulse / resolver health flagged top_source_share=0.55 on CRYPTO (live policy-clean cohort sub-T2: PF ~1.14, WR ~43%, n=728 per money_ready_verdict). Concentration gate must hold before any size-up.
alpha_engine/production_scanner.py / CRYPTO emittersEnforce concentration caps; mutate-before-kill per MUTATION_THREE_AXIS_PROTOCOL; paper watch SHORT bias only (swarm SUPPORT), not production capital.cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
P1IN_PROGRESSβ€”2026-06-02 10:17 EDTEAGLE2: Production LONG-heavy vs tournament SHORT edge (67% vs 33% WR)
Swarm + EAGLE3: tournament SHORT cohort ~67% WR vs LONG ~33% on nβ‰ˆ216. Production scanner historically LONG-heavy. EAGLE-4 flip wired in production_scanner.py but shadow only β€” policy-clean CRYPTO still NOT_READY PF 0.92.
alpha_engine/production_scanner.pyForward-proof SHORT flip per regime (β‰₯3/4 cells); pair with verification_engine once on main; do not size until policy-clean PFβ‰₯1.0 or sleeve replaces bulk emitters.grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page Β· PR #458
P1RESOLVEDβ€”2026-06-02 10:16 EDTCRYPTO CVaR95 -85% / MDD 1.0 = resolver label pollution
money_ready_verdict CRYPTO mdd=1.0, cvar_95=-85.1%: fingerprint of mislabeled/never-closed tails inflating loss tail.
alpha_engine/outcome_resolver.pyResolver hygiene pass on CRYPTO (+FOREX): purge disputed/never-closed rows, source-provenance tagging.claude-opus-4-8πŸ“„ doc Β· 478
P1OPENβ€”2026-06-02 10:16 EDTCRYPTO 55% source concentration + 2 PF>1 sleeves 100% single-source artifact
policy-clean-net: top_source 55% of CRYPTO book. The only 2 sleeves with PF>1 n>=20 (crypto_liquidity_wick_reversal PF1.55, atr_percentile_gate PF1.10) are both 100% single-source (is_single_source_artifact).
audit_dashboard/data/pf_registry.jsonEnforce is_single_source_artifact as hard reject before DSR/SPA promotion.claude-opus-4-8πŸ“„ doc
P1RESOLVEDβ€”2026-05-30 22:06 EDTCRYPTO 48h closures at 0 β€” resolver DESC fix landed but first test pending
PR #119 changed ORDER BY signal_timestamp ASC to DESC at alpha_engine/active_picks_sync.py:125. Fix landed 2026-05-31T01:20Z. First audit-dashboard.yml hourly run post-fix at 02:10Z will test whether CRYPTO 48h closures emerge from the backlog of 39,706 OPEN CRYPTO picks.
alpha_engine/active_picks_sync.py, .github/workflows/audit-dashboard.ymlMonitor 02:10Z audit-dashboard.yml run + inspect pick_summary_stats_48h.json CRYPTO closed_n. If still 0 after 3-4 hours, investigate deeper (yfinance rate-limit, DB connection, etc.).kilo-xiaomi-mimoPR#142 Β· PR#119
P1RESOLVEDβ€”2026-05-24 23:51CRYPTO ML strategies DSR>=0.9995 on n=25-34 displayed without 'insufficient n' badge
ml_enhanced_INJUSDT_1d_B_lightgbm (n=25 WR 100%), ml_enhanced_DYDXUSDT_15m_D (n=31 WR 96.8%), ml_enhanced_FETUSDT_1d_B (n=25 WR 100%), ml_enhanced_RENDERUSDT_1h_D (n=34 WR 85.3%) on /audit/anti_overfit.html show DSR>=0.9995 as 'publishable confidence' but n is too small for the DSR formula to be rel…
audit_dashboard/anti_overfit.html / DSR sidecar renderingAdd 'insufficient n β€” awaiting n>=100' badge to any row with n<100 on the anti-overfit page. Reorder so n>=100 rows come first.opencode/ring-2.6-1t + 3-engine swarmπŸ“„ doc Β· πŸ”— page Β· PR #100
P1TRIAGEDβ€”2026-05-24 23:45meta_strategy template explosion β€” 1.6M template rows across ~140 symbol/dir pairs in bt_backtest_trades
Opencode flagged 1.6M template rows from meta_strategy across MEMECOIN/CRYPTO symbol+direction pairs in backtest_trades. Same root cause as the ghost-rows finding from Qwen's db_health (top 11 ghost cohorts are meta_strategy MEMECOIN). Defer blanket-block until db_health ghost_rows.top_cohorts repop…
meta_strategy emitter / bt_backtest_trades writerWait 1-2 cron cycles for db_health refresh post-commit d317560ac9c. Then decide: blanket-block meta_strategy on CRYPTO/MEMECOIN OR symbol-triple enumeration.opencode/ring-2.6-1tπŸ“„ doc Β· ⎌ d317560
P1TRIAGEDβ€”2026-05-24 23:41ML 'edges' with PF 99-1094 are likely look-ahead leakage
Pick-funnel top_edges_per_class found cells like 'copy_trader_intel & LONG' (n=21, PF 1094) and 'conf=0.80-0.85 & ml' (n=42, PF 674) β€” values that high on tiny samples almost always indicate look-ahead bias in the feature pipeline, not real edge.
alpha_engine ml_enhanced_* family / copy_trader_intel feature pipelineAudit the feature pipeline for look-ahead bias. Add walk-forward gate before any ML strategy claims edge. Mark current 'DSR=0.9995' claims as 'small-sample, awaiting n>=100 confirmation' on the dashboclaude-opus-4-7+deepseek+cerebrasπŸ“„ doc
P1OPENβ€”2026-05-24 23:41quan_engine_scalp degraded to PF 0.42 / WR 37%
edge_decay_heatmap shows quan_engine_scalp at n=4236, WR 37.4%, PF 0.42 β€” verdict 'dead'. Yet it remains a substantial share of open CRYPTO volume per CLAUDE.md ('18% volume @ PF 0.70 drag elite strategies down').
alpha_engine quan_engine_scalp emitterPer the mutation-three-axis protocol: cut volume share, mutate, or kill. Required to lift the CRYPTO class PF above the T2 threshold.claude/edge_stabilityπŸ“„ doc
P2OPENβ€”2026-06-12 02:40 EDTIntrabar scale outlier: +1,706,212% pnl row inflates ml_crypto_predictor family PF to 385
ML audit: one corrupt intrabar pnl row (+1.7M%) in the ml_crypto_predictor family; trimmed truth = WR 40.5%/PF 1.07 gross n=1805 (net-negative after fees). Needs the exit-ratio guard applied to the intrabar writer path too, + quarantine of the row.
tools/reresolve_intrabar_signal_outcomes.pyQuarantine the outlier row (backup first); add ratio sanity to the intrabar pnl computation.claude-fable-mlauditβ€”
P2RESOLVEDβ€”2026-06-09 01:10 EDTStale April-06 edge claims refuted on current data (quan_engine_swing/st_rsi/TRXUSDT)
Peer (Zoo) Quant review cited EDGE_FINDINGS_2026-04-06.md (2 months old) as "the edge exists". Refuted vs live trading_picks 2026-06-09: quan_engine_swing PF 0.81 / WR 41.3% / avg -0.11% / n=46 (LOSING; in PERMANENTLY_KILLED) β€” NOT the claimed PF 1.57. st_rsi_momentum_confluence n=0 resolved (dead) …
docs/EDGE_FINDINGS_2026-04-06.md (stale) vs live trading_picksQuant reviews must re-query live DB with artifact filters (backfill/banned/sane-pnl/EXPIRED-honest), not cite April docs. quan_engine_swing/scalp + st_rsi stay killed.claude-opus-4-8β€”
P2OPENβ€”2026-06-06 03:34 EDT3 DSR/FDR-passing baby strategies TRAPPED below n>=30 promotion gate (no path to accumulate trades)
Three strategies are status=pending_backtest, explicitly need more trades to hit the n>=30 promotion gate, and have backtest_metrics with total_trades=None β€” meaning the sweep has never produced trade counts for them because CI is broken. They cannot graduate: (1) rsi_vol_bounce_v2_expanded β€” 'Paren…
audit-surfaceUnblocking the CI DB-provision (finding #1) is the prerequisite. After that, these strategies need broader symbol/timeframe coverage to accumulate n: fetch_and_populate_db.py already fetches 14 pairs claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
P2OPENβ€”2026-06-04 22:34 EDTcr_algo_performance corrupt: avg_return 95,692% (units/aggregation bug)
CR Mean Reversion row shows avg_return_pct=95692.81 on 288 trades. Clear units/aggregation bug; all 8 rows untrustworthy. Do not cite cr_algo_performance.
cr_algo_performanceRecompute cr_algo_performance with correct per-trade pnl_pct aggregation (likely summing instead of mean, or fraction-vs-percent).claude-opus-4-8πŸ“„ doc
P2OPENβ€”2026-06-02 20:33 EDTPnL outlier: at_signal_outcomes id=214622 (JUPUSDT) pnl_pct=76573% β€” single-row 945x meme-coin return distorts claude_ml_moderate_mut PF=310
Investigated 3 suspicious bootstrap CI PASSes (IS_PF>30) flagged in PR #481. Found: - claude_ml_moderate_mut (n=67, IS_PF=310): single row id=214622 JUPUSDT LONG pnl_pct=76573 (entry=0.0002, exit=0.1892, real 945x return). One meme-coin moon dominates the sum. - B_flip_PriceRocMeanReversion (n=1…
ejaguiar1_stocks.at_signal_outcomesNo code change β€” bootstrap CI correctly down-weights via pf_lo_95. Add per-strategy sustained_pf (median PF over 10 random sub-resamples) as a future enhancement to detect strategies that only work beminimax-m3-free (2026-06-02)#481
P2RESOLVEDβ€”2026-05-30 22:03 EDTpick_funnel DISPUTED CRYPTO banner is 6 days stale (raw DB numbers moved)
DISPUTED banner (Last verified 2026-05-25) cites raw-DB CRYPTO 90d WR 39.4% (n=2001). Live at_raw_picks query 2026-05-31 shows CRYPTO 90d=41.9% (n=7198), 30d=42.6%, 14d=50.9% (recovering), 2d=41.0%. Dispute direction still valid (raw ~42% vs Smart-Picks-claimed 78.9%) but the specific banner figures…
audit_dashboard/pick_funnel.html / reports/2026-05-25_crypto_78pct_wr_verification.mdRe-run the verification against live at_raw_picks and refresh the banner numbers + Last-verified date; or auto-generate the banner from a nightly query.claude-opus-4.8(money-maker-readyv2)πŸ”— page Β· parallel-session-ops 2026-05-31
FOREX (18)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P0OPENβ€”2026-06-09 14:12 EDTFOREX 25% WR n=24 PF 0.078 (policy-clean); primary multi_asset_scanner + low n (sub-coinflip, no real money)
forex emission + gatesKeep hard disable + bans (multi_asset_scanner etc.); promote carry/COT/cta_tsmom SHORT only on probation; paper until n>=50 clean PF>1.0 per subagent plan.grok-4.3πŸ“„ doc
P0OPENβ€”2026-06-02 10:17 EDTkimi_signal_tracking sign-flip: 142 of 367 total sign-flipped rows = 38.7% (worst single source)
kimi_signal_tracking sign-flip: 142 of 367 total sign-flipped rows = 38.7% (worst single source)
alpha_engine/strategy_blocklist.py + audit_trail/sign_coherence_check.pyRun staged purge PR #433: luxalgo 6 -> ml 15 -> battleground 63 -> mega 141 -> kimi 142. Set SIGN_FLIP_BASELINE=0 after.claude-opus-4-7πŸ“„ doc Β· PR #433 (operator-staged purge)
P0OPENβ€”2026-05-24 23:41All FOREX strategies losers except cta_cross_asset_tsmom SHORT (93% USDJPY concentration)
Per benchmark report: forex_carry_momentum, forex_rsi2_mean_reversion, myfxbook_retail_contrarian all losing. Only cta_cross_asset_tsmom SHORT has WR 57.6% but is 93% concentrated in USDJPY β€” not a diversified edge, just one carry trade.
alpha_engine FOREX strategies (concentration risk)Block all FOREX strategies except cta_cross_asset_tsmom SHORT. Force symbol diversification on that one (cap USDJPY at <50%). Add forex_carry (Ring's recommendation) as the second leg.qwen-code+ring-2.6-1tπŸ“„ doc
P1OPENβ€”2026-06-10 14:39 EDTFOREX emission re-enabled while FREEZE incident open and policy-clean FOREX FAILs
Commit a3f11a99dd (2026-06-05) flipped FOREX_HARD_DISABLE default 1->0 citing the forex_carry_g10 backtest (PF 1.59/WR 60.4%/n=197), and 16f7e66623 wired it through quality_gates + dashboard_generator β€” so FOREX emission is LIVE today. This contradicts open incident #77 (FREEZE FOREX/COMMODITY/FUTUR…
alpha_engine/config.pyDecide posture: (a) keep enabled but force forward_test_only on carry_g10 until intrabar n>=100 passes gates, or (b) flip FOREX_HARD_DISABLE back to 1. Do not let sized FOREX emission ride a backtest claude-fable-ultracode⎌ a3f11a9
P1OPENβ€”2026-06-10 14:24 EDTFOREX carry signal is a hardcoded snapshot, not live FRED rates
The FOREX carry input is a static snapshot rather than live FRED policy-rate differentials; carry decays as central-bank rates move, so the signal is stale. Flagged in the 90day FOREX plan + 2-month sweep (O5/O6). Related: COT proxy is not real CFTC data.
alpha_engine/forexWire live FRED API (FRED_API_KEY in env) for policy-rate differentials; replace the COT proxy with real CFTC API (CFTC_API_KEY). Both keys flagged missing in operator_status_check.claude-fable-masterrunπŸ“„ doc
P1OPENβ€”2026-06-09 14:05 EDTFOREX class 25% WR n=24 PF=0.078 (sub-coinflip, neg exp, policy_frozen); primary emitter multi_asset_scanner + low n
quality_gates/emission/forex_strategiesEnforce non_crypto_edge_guard; ban multi_asset_scanner et al; focus on forex_carry_g10 / COT / cta_cross_asset_tsmom only. Re-eval after n>=50 clean WR>45% PF>1.0grok-4.3πŸ“„ doc
P1RESOLVEDβ€”2026-06-08 23:34 EDTFOREX bleeder family emitting ~3600 garbage picks β€” not in BANNED_SOURCES (forex_carry_momentum 1% WR, forex_rsi2_mean_reversion 12.7%)
emitter_discipline.HARD_KILL_STRATEGIES listed these FOREX bleeders but is orphaned (zero importers, never runs). Clean at_pick_outcomes: forex_carry_momentum n=1183 WR 1.0% (PF 9.0 = single +428% CADJPY feed-bug win); forex_rsi2_mean_reversion n=2441 WR 12.7% PF 0.39 (incl -100% NZDUSD feed bug); c…
alpha_engine/production_scanner.py BANNED_SOURCESMigrated the confirmed bleeders into the wired apply_source_ban_gate BANNED_SOURCES (commit d9c35b9277). quan_engine_* deliberately NOT banned (gold_standard).claude-opus-4-8⎌ d9c35b9
P1RESOLVEDβ€”2026-06-06 03:34 EDTDB edge WR excludes EXPIRED outcomes β€” GBPUSD shown 58.8% vs true 6.1%
The page shows GBPUSD=X as a FOREX pick with 'DB_n=114 WR=58.8% (edge-corroborated)'. The generator's WR query (tools/picks_now_professional.py load_db_edge, line ~308-314) filters WHERE status IN ('WON','LOST'), silently dropping all EXPIRED rows. For GBPUSD=X the raw at_pick_outcomes split is WON=…
audit-surfaceChange the WR denominator to count EXPIRED as non-wins (WR = WON / (WON+LOST+EXPIRED+FLAT)), OR display BOTH 'resolved-only WR' and 'EXPIRED-inclusive WR' with the expiry rate. Add an explicit 'expireclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ e94bc85
P1RESOLVEDβ€”2026-06-06 03:34 EDTFOREX/COMMODITY picks use 8% TP / 4% SL β€” 5x the enforced production FX cap (1.5%/1.0%)
All 5 FOREX and all 4 COMMODITY picks in picks_now.json carry suggested_tp_pct=8.0, suggested_sl_pct=4.0. alpha_engine/production_scanner.py hard-caps TP_CAP_FOREX=0.015 (1.5%) and SL_CAP_FOREX=0.010 (1.0%) precisely because median daily FX ATR is 0.3-0.8% (see updates/2026-04-25-forex-tpsl-review.m…
audit-surfaceApply the production TP_CAP/SL_CAP per class in the generator before writing picks_now.json (import the caps or replicate the table). For FX use ATR-based TP/SL (e.g. 1.5x/1.0x daily ATR) rather than claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ e94bc85
P1RESOLVEDβ€”2026-05-30 22:27 EDTPR#6 FOREX whitelist promotes a CONFIRMED LOSER (cta_cross_asset_tsmom PF 0.254 DSR 0.0)
_FOREX_ALLOWED={cta_cross_asset_tsmom(SHORT),forex_carry}. cta_cross_asset_tsmom has ZERO policy-clean FOREX rows AND anti_overfit rates it n=288 PF 0.254 DSR 0.0 OVERFIT_LIKELY β€” a proven loser, not 'the sole proven FOREX sleeve' as the code comment claims. The only wired FOREX strategy with a real…
alpha_engine/non_crypto_policy.py:585 _FOREX_ALLOWEDReplace cta_cross_asset_tsmom in _FOREX_ALLOWED with dxy_trend_filter; hard-block regime_accumulation/regime_terminal for FOREX; require live n>=50 PF>=1.3 before sizing.claude-opus-4.8(money-maker-readyv2 fleet)PR#6
P1RESOLVEDβ€”2026-05-30 22:03 EDTPR#6 FOREX consolidation gate is dead code + no-op USDJPY cap
Code review of Zoo's PR#6: the FOREX consolidation gate lives in evaluate_non_crypto_candidate() which has ZERO production callers (live path uses passes_non_crypto_policy() which does not delegate to it) -> the block-all-except-cta_cross_asset_tsmom-SHORT logic never executes. USDJPY concentration …
alpha_engine/non_crypto_policy.py:573-590 (gate), :702-710 (USDJPY pass)Wire FOREX block into passes_non_crypto_policy() (the called gate) or copy_trader_bridge; implement or delete the USDJPY pass; justify the cta_cross_asset_tsmom survivor with real n/WR/PF or switch toclaude-opus-4.8(money-maker-readyv2 review)PR#6
P1OPENβ€”2026-05-27 02:34FOREX class still aggregates losers around a small winner subset
EAGLE review found the class story is dominated by a few stronger sleeves while the aggregate is dragged down by broad losers. The dashboard does not expose that isolate-the-winner vs kill-the-drag distinction cleanly enough.
FOREX class aggregation / per-sleeve visibilityAdd per-sleeve isolation reporting and treat FOREX as a basket of sleeves, not one monolith. Promote only the proven sleeve(s) in audit visibility and keep the rest explicitly quarantined or paper-onlgpt-5.4/openaiπŸ“„ doc
P1OPENβ€”2026-05-24 23:41forex_carry.py exists in repo but is NOT in allowlist
alpha_engine/new_strategies/forex_carry.py implements G10 interest-rate differential carry with claimed 55-60% WR / PF 1.2-1.5 but is not registered in non_crypto_policy.NON_CRYPTO_STRATEGY_POLICY so it never emits picks.
alpha_engine/non_crypto_policy.py allowlistAdd forex_carry to NON_CRYPTO_STRATEGY_POLICY with probation thresholds. Document wire-up in updates/.ring-2.6-1tβ€”
P1TRIAGEDβ€”2026-05-24 23:41FOREX SL at 0.5% sits at median daily FX ATR
Causes 44% SL hit rate vs 12% TP hit (3.7x more stops than targets). After April 2026 widening (TP 0.75%->1.5%, SL 0.5%->0.8%) the situation improved but still asymmetric.
alpha_engine FOREX TP/SL configWiden FOREX SL to >=1.0% (or use 1.5x daily ATR). Backtest before deploying.ring-2.6-1tβ€”
P2IN_PROGRESSβ€”2026-06-06 03:34 EDTMixed-freshness sources (ECB daily / OER hourly) feed intraday PnL & TP/SL without a staleness guard
The forex chain falls Yahoo(real-time) -> Frankfurter(ECB, updates once/day) -> OER(hourly). When Yahoo fails on a GHA runner (the stated reason these modules exist), the system silently uses an ECB daily reference rate as the 'current' price and stamps asof=data['date'] (Frankfurter) or today (OER)…
audit-surfacePropagate asof/source into the pick (e.g. pick['price_source']=result['source'], pick['price_asof']=result['asof']) and gate TP/SL auto-closure on freshness: only allow closure when source in {yahoo_vclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ fb27cfa
P2RESOLVEDβ€”2026-06-06 03:34 EDTWR 'edge' overlay is fed by BANNED/REFUTED sources (myfxbook, ig_contrarian)
The DB overlay backing the FX picks aggregates at_pick_outcomes across ALL strategies with no source filter. For GBPUSD/EURGBP the resolved rows come from myfxbook_retail_contrarian (46) and ig_contrarian_sentiment (76) among others β€” both flagged/refuted in project memory (myfxbook PF 3.79 fat-tail…
audit-surfaceIn load_db_edge, exclude rows whose strategy is in BANNED_SOURCES / the per-class banned frozensets (import from production_scanner / hedge_fund_quality_gate). Recompute db_n/db_wr on the cleaned set claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ e94bc85
P2RESOLVEDβ€”2026-06-06 03:34 EDTNegative-expectancy FX pairs labeled STRONG_BUY
EURUSD=X (db_avg_pnl -0.0011), AUDUSD=X (db_avg_pnl -0.2521) are emitted as direction=STRONG_BUY with scores 90/82 and 8% size. A symbol whose own resolved average PnL is negative should not carry a STRONG_BUY label or full position size, even pre-EXPIRED-fix.
audit-surfaceGate STRONG_BUY / full sizing on db_avg_pnl>0 (after the EXPIRED-inclusive recompute) and on the cleaned, non-banned WR. Demote negative-expectancy symbols to WATCH or drop them.claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ e94bc85
P3RESOLVEDβ€”2026-06-06 03:34 EDTOER/Finnhub cross-rate raises KeyError-into-broad-except, silently dropping valid exotic pairs
In _adapter_open_er_api and _adapter_finnhub_forex the cross-rate uses rates[quote]/rates[base] with direct subscript. If either currency is missing from the USD-based table (exotic/EM pair), this raises KeyError caught by the broad except, returning None and falling through. Functionally safe (no w…
audit-surfaceUse rates.get(quote)/rates.get(base) with explicit None checks and a logger.info on missing currency so coverage gaps are visible in CI logs rather than swallowed as a generic adapter failure.claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ fb27cfa
COMMODITIES (9)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P0OPENβ€”2026-06-02 10:17 EDTCOT publication lag look-ahead bias: cot_positioning PF dropped 4.6 -> 0.51 post-dedup
COT publication lag look-ahead bias: cot_positioning PF dropped 4.6 -> 0.51 post-dedup
alpha_engine/strategy_blocklist.py + COT signal pipelineEnforce 3-day COT publication lag at signal receipt. Remove CT=F (57% concentration) from universe. 60d post-fix live test.claude-opus-4-7πŸ“„ doc Β· PR #439 (COMMODITY emission frozen)
P0OPENβ€”2026-05-27 02:34COMMODITY headline PF/WR still contaminated by pre-clean COT aggregation
EAGLE review converged with the existing COT forensic concern: the class story remains unsafe while pre-clean or over-emitted COT history can still dominate class-level PF/WR claims. The page should not treat COMMODITY as trustable until independent-cycle-only stats are canonical.
COMMODITY class-health aggregation / COT-derived historyRecompute class health from deduped independent COT cycles only, then re-derive the honest class verdict. Block promotional Tier claims until the cleaned aggregation is the live source of truth.gpt-5.4/openaiπŸ“„ doc
P0OPENβ€”2026-05-24 23:41Class-level COMMODITY 11.9% WR / PF 0.29 / Sharpe -0.534
Benchmark says CRITICAL β€” cot_positioning at the STRATEGY level is strong (DSR=1.0 per Ring) but at the CLASS level (n=140 closed) numbers are catastrophic because cot_positioning is now BLOCKED per audit benchmark, and remaining cta_cross_asset_tsmom + cta_commodity_momentum_term are losers.
alpha_engine commodity strategies (post cot_positioning block)Retire all remaining COMMODITY strategies. Rebuild from non-COT signals (term structure, EIA inventory, weather overlay). Reconcile the Ring 'cot DSR=1.0' claim vs the audit-benchmark 'cot BLOCKED' clqwen-code+freebuffπŸ“„ doc
P0OPENβ€”2026-05-24 23:41Reconcile: cot_positioning DSR=1.0 (Ring) vs BLOCKED (audit benchmark) β€” contradiction
Ring's 2026-05-25 audit says cot_positioning is the SUPREME EDGE (DSR=1.0, WR=86.5%, n=104). audit_benchmark_analysis_2026-05-24 says cot_positioning is BLOCKED and the COT-dedup audit downgraded WR to 5% / PF 0.12 on n=20 post-dedup. Both can't be true.
cot_positioning evaluation (pipeline vs paper-pilot vs class aggregate)Run the COT-dedup audit live, compute n + WR + PF under (a) raw, (b) deduped-by-release-week, (c) cot_paper_pilot-only sleeve. Publish the truth-table; update the page's SUPREME EDGE callout to match.claude-opus-4-7πŸ“„ doc
P1RESOLVEDβ€”2026-05-30 22:27 EDTCOMMODITY PF 1.81 is a 2-trade outlier + SHIBUSDT mislabeled as commodity
COMMODITY policy-clean n=9 PF 1.812 collapses to PF 1.15 (drop 1 winner), 0.47 (drop 2 winners). Biggest winner vwap_rsi_confluence +7.3% is symbol SHIBUSDT (a CRYPTO) mislabeled COMMODITY. True clean edge ~break-even, 78% GC=F(gold) concentration. Best wired backtest gold_safe_haven (PF 1.98/n=61/S…
asset_class tagging + pf_registry by_asset_class_strategy_symbolFix SHIBUSDT->COMMODITY mislabel; require single-symbol conc <60%; accumulate n>=50 clean post-COT-dedup before trusting PF.claude-opus-4.8(money-maker-readyv2 fleet)PR#147 Β· #166 Β· #149
P1RESOLVEDβ€”2026-05-30 22:27 EDTcot_positioning 'DSR 1.0 / TIER_1_RENAISSANCE' is a falsified 7.33x over-emission
cot_positioning headline (n=104, DSR 1.0, TIER_1) is over-emission: 44 raw emissions of only 6 unique CFTC releases (7.33x). Deduped one-trade-per-release: n=6, WR 33%, cum PnL -,547.57 (NEGATIVE). friction-adjusted MC (cot_step7_friction_adjusted_mc.json n=126) gives DSR 0.0 FAIL / NOT LIVE_ELIGIBL…
cot_paper_pilot_status.json + anti_overfit_audit.json (stale 2026-05-11)Dedup COT emissions to one-trade-per-release; keep cot_positioning SHADOW until >=20 unique releases; refresh anti_overfit_audit.claude-opus-4.8(money-maker-readyv2 fleet)πŸ”— page Β· PR#149 Β· PR#157
P2OPENβ€”2026-05-17 12:00 ESTCOMMODITY filter-survival gap: dashboard 85.5% WR vs raw 60.2% (-126 picks excluded by _is_valid_resolved_pick)
Dashboard shows COMMODITY WR=85.5% (n=228) but the raw cohort is WR=60.2% (n=354). The -126 difference is picks dropped by _is_valid_resolved_pick() for missing/corrupt pnl_pct, concentrated in LOSING cta_replicator and combined_confidence trades β€” so the filter survivorship-biases the displayed WR …
_is_valid_resolved_pick() / dashboard COMMODITY WR cellSurface the conservative raw WR (60.2%) alongside or instead of the filtered 85.5% on the COMMODITY cell, OR annotate that 85.5% is filtered-cohort-only. Investigate why cta_replicator/combined_confidclaude-opus-4-8 (banner-to-incident conversion 2026-06-09)β€”
P2RESOLVEDβ€”2026-06-06 03:34 EDTFRED fallback returns macro SPOT price labeled as the =F futures contract -> phantom basis PnL + false closures
_FRED_MAP maps GC=F->GOLDAMGBD228NLBM (London gold fixing, spot), CL=F->DCOILWTICO (WTI spot), NG=F->MHHNGSP (Henry Hub spot), HG=F->PCOPPUSDM (monthly copper). These are macro spot/cash prices, NOT the front-month futures the pick was entered on. The result dict still sets symbol=GC=F, currency=USD…
audit-surfaceEither (a) drop FRED from the futures price chain entirely (it answers a different question than the contract), or (b) keep it but mark the result is_proxy=True / price_basis='spot' and have the enricclaude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ fb27cfa
P2OPENβ€”2026-05-24 23:41cftc_cot_commercial_signal BLOCKED (19% WR on n=16)
Strategy is in code but blocked from production. Either rehab via mutation protocol or formally retire.
alpha_engine cftc_cot_commercial_signalRun mutation analysis (docs/MUTATION_THREE_AXIS_PROTOCOL.md). If no axis recovers, formally retire and remove from allowlist.ring-2.6-1tπŸ“„ doc
BONDS (3)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P0OPENβ€”2026-05-24 23:41Antigravity_bond: 0% WR on n=9 β€” kill emission
audit_benchmark_analysis_2026-05-24.md: BOND class is 0% WR / PF 0.00 / Sharpe -2.465. Only strategy is antigravity_bond with 1 historical pick. Already flagged P0 in Freebuff 2026-05-17.
alpha_engine antigravity_bondKill BOND emission entirely. Re-enable only after a viable yield-curve or duration strategy is built (see ENHANCEMENT_BONDS).qwen-code+freebuffπŸ“„ doc
P1RESOLVEDβ€”2026-05-30 22:24 EDTalpha-engine-bond.yml emitter failing 5 consecutive days -> 0 resolved bonds
Bond Emitter workflow has failed daily 2026-05-26..05-30 (~17min each). 5 open bond picks exist in active_picks_bond.json but n_resolved=0 because the emitter/resolution leg is red. bond_connors_rsi2 (PF 1.34/WR 50/Sharpe 2.22/n=201 over 21y) is wired and ready; the blocker is the failing workflow, …
.github/workflows/alpha-engine-bond.ymlFix the failing bond emitter workflow so emitted picks reach the resolver/DB; accumulate n>=30 resolved.claude-opus-4.8(money-maker-readyv2 fleet)β€”
P3OPENβ€”2026-05-24 23:41bond_connors_rsi2 new, probation, no forward trades
Claims 73% WR but is brand new β€” needs forward-test data before promotion.
alpha_engine/new_strategies/bond_connors_rsi2.pyRun for 60 days in shadow; gate to probation when n>=20 with WR>=55%.ring-2.6-1tβ€”
FUTURES (2)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P1OPENβ€”2026-05-27 02:34FUTURES is a zombie tile with real futures hidden under COMMODITY
The standalone FUTURES class has near-zero useful activity while real futures exposure is represented under COMMODITY. This makes the /audit taxonomy misleading and blocks honest per-class review.
asset-class taxonomy / FUTURES vs COMMODITY reportingReplace the empty FUTURES story with a unified futures taxonomy or clearly scope FUTURES as research-only financial futures. The page should stop presenting a zombie tile as if it were a live class.gpt-5.4/openaiπŸ“„ doc
P3OPENβ€”2026-05-24 23:41futures_mean_reversion and ema_stack_momentum BANNED at 0% WR
Both strategies sit in code with BANNED status. Remove from registry to declutter.
alpha_engine futures_mean_reversion / ema_stack_momentumFormal retirement entry. Move source files to deprecated/ subfolder.ring-2.6-1tβ€”
PENNY (2)
SevStatusTargetCreatedTitleComponentRecommended fixReporterLinks
P2OPENβ€”2026-05-24 23:41skyrocket_detector NOT wired to production
alpha_engine/skyrocket_detector.py has the SIDU pattern framework ($0.63->$3.79 example) but is not called from production_scanner.py.
alpha_engine/skyrocket_detector.pyWire to production scanner per Wire-Up Rule. Add tests + integration doc.ring-2.6-1tβ€”
P3OPENβ€”2026-05-24 23:41penny_deep_oversold BLOCKED by Gate 0
Strategy emits but every pick is rejected at Gate 0 (initial filter). Either fix Gate 0 to allow penny-class scores or move to a class-specific scoring path.
audit_trail/quality_gates.py Gate 0Investigate Gate 0 logic. Likely needs per-class score floor.ring-2.6-1tβ€”

Enhancements (scoring / gate / data-feed / UI proposals)

Total: 208 across 9 asset classes
OVERALL (161)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
HIGHSBACKLOGβ€”2026-06-09 14:12 EDTUIUpdate data integrity warning on /audit/ (quoted 2026-06-09 message) - confirm remediation (ghosts 0 per banner) and close/ update if outdated while noting persistent P0s (resolver #110, backfill #112)
Banner reflects current health (ghost 0, mismatch 0); new incident or resolution note for the reviewβ€”grok-4.3 (deep dive 2026-06-09)πŸ“„ doc
HIGHSIMPLEMENTEDβ€”2026-06-09 00:20 EDTGATEEnforce per-class TP/SL caps at the raw-insert chokepoints
The cap function only ran inside production_scanner; emitters writing direct to at_raw_picks (copy_trader 8%/4%, multi_asset 3% FX) bypassed it, producing unreachable targets and 56-94% TIME_EXPIRED. Added clamp_non_crypto_tp_sl at BOTH raw-insert chokepoints (mysql_record_raw_pick + sync_all_picks.…
non-crypto TIME_EXPIRED rate drops; FOREX emitted TP <= 1.5%β€”claude-opus-4-8β€”
HIGHSBACKLOGβ€”2026-06-05 01:32 EDTDATA_FEEDBackfill 1034 zero-PnL rows in at_pick_outcomes after _compute_pnl fix
PR landed in audit_trail/universal_pick_resolver.py (_compute_pnl helper @ line 260, wired into 10 call sites). Live DB still shows 1034 WON/LOST rows with pnl_pct=0 from pre-fix round(...,2) collapse. Need targeted backfill: re-run resolution on the 1034 affected pick_ids using new helper. Verified…
at_pick_outcomes WHERE status IN ('WON','LOST') AND pnl_pct=0 count drops from 1034 to <50β€”claude-opus-4-7 MD-review-loopβ€”
HIGHSBACKLOGβ€”2026-06-05 00:59 EDTOTHERAdd explicit kill switch on luxalgo_confluence pilot β€” circuit-break if WR<20% for 7 consecutive days
Currently the pilot has shadow-block (verified working in alpha_engine/verified_promotion_gate.py:81 etf_scanner_shadow_active gate) but no automatic kill switch. Swarm review 2026-06-05 (DeepSeek + xAI both flagged) identifies this gap. Add a daily check in tools/run_verified_pilots_daily.py: if luxalgo_confluence rolling-7d WR drops below 20% OR cumulative drawdown >15%, auto-disable the pilot and file P0 incident. Currently Day 1 state is n=14 WR 29% β€” shadow block holds but no automatic stopβ€”β€”
HIGHSBACKLOGβ€”2026-06-04 20:52 EDTOTHERPromote top 5 bias-survivor picks to paper-pilot tracking (USDCAD LONG / BND LONG / SPY LONG / SHY signals / ZS=F LONG)
After 4 weeks of paper-pilot tracking: at least 3 of 5 maintain bias-survivor status (WR>=60%, sym HHI<0.5, fam HHI<0.5, replay<25%) on out-of-sample forward window. Top candidates by mlflow_high_quality_picks tool: (1) carry_trade x FOREX x USDCAD LONG n=16 WR 87.5% avg +0.87%; (2) risk_parity x BOND x BND LONG n=26 WR 69.2% +0.52% with 3 models open @ $71.69; (3) deep_value x ETF x SPY LONG n=6 WR 66.7% +0.93% with 2 models open @ $747.93; (4) risk_parity x BOND x SHY LONG n=21 WR 66.7% +0.5% β€”β€”
HIGHSBACKLOGβ€”2026-06-04 10:33 EDTOTHERKill inverse_ml_enhanced_RENDERUSDT_4h_D β€” PF 0.12, avg -1.73%, n=52 (genuine no-edge)
Strategy removed from production scanner / paper-pilot loop. Post-INCIDENT-94 backfill confirmed: 52 closed trades, WR 50% but win:loss skew destroys edge. Reference: reports/verified_strategies_unlock_2026-06-04.mdβ€”β€”
HIGHSBACKLOGβ€”2026-06-04 08:08 EDTOTHERCalibrate TP/SL distances on verified strategies β€” TIME_EXIT bias blocking promotion
b_flip + inverse_ml decisive-close rate >= 50% (currently <10%). Either tighten TP/SL by 30-40% OR widen TIME_EXIT window beyond strategy's typical excursion period. Current: b_flip 0/40 decisive, inverse_ml_RENDERUSDT 0/57 decisive. Affects forward-promotion gate which counts only decisive closes.β€”β€”
HIGHSIMPLEMENTED2026-06-02 17:30 UTC
OVERDUE 14d
2026-06-02 14:19 EDTGATEEAGLE-4 + EAGLE-5 gates user-guide document (minimax-m3-free, 2026-06-02, PR #461)
Full user-facing guide to the EAGLE-4 admissibility gate (kills 4 noise personas, kills 8 negative-edge classΓ—direction combos, flips CRYPTO LONGβ†’SHORT) and the EAGLE-5 promotion gate (+20% boost for 33 tournament-validated symbols, +15% for 16 promoted personas). 7-section HTML report with ELI5 per…
Page live at findtorontoevents.ca/updates/eagle4-eagle5-gates-2026-06-02.html, ELI5 readability, links to all source commits and the EAGLE3/4 plan docs
Plan: 7-section HTML page (TL;DR / 8 tasks w/ ELI5 / 5 best picks w/ rationale / 2-week short-term / 12-week long-term /
minimax-m3-freeπŸ“„ doc Β· πŸ”— page Β· #461 Β· #447(c)
HIGHSVALIDATEDβ€”2026-06-02 10:36 EDTMETHODOLOGYWalk-forward validated macd_rsi_m048 strategy (PF 3.33, 75.4% WR)
macd_rsi_m048 passed purged walk-forward validation: 65 trades, 75.4% WR, test PF 3.33, 5/5 folds profitable, negative decay (improves OOS). Best validated strategy in the system.
Forward PF >= 2.0 on 30 live picksβ€”mimo-v2.5-proπŸ“„ doc
HIGHSVALIDATEDβ€”2026-06-02 10:36 EDTMETHODOLOGYEquity Momentum 12-1 strategy (B-Tier on SPY/QQQ/GLD)
Jegadeesh-Titman momentum applied to single equities. B-Tier on SPY (Sharpe 1.62, +104%), QQQ (Sharpe 1.63, +165%), GLD (Sharpe 1.18, +128%). Walk-forward validated.
Forward PF >= 1.3 on 30 live picksβ€”mimo-v2.5-proπŸ“„ doc
HIGHSVALIDATEDβ€”2026-06-02 10:36 EDTMETHODOLOGYETF Sector Rotation strategy (B-Tier on XLK/SPY)
Rotates among SPDR sector ETFs based on 3-month momentum. B-Tier on XLK (Sharpe 1.73, +217%), SPY (Sharpe 1.86, +122%).
Forward PF >= 1.3 on 30 live picksβ€”mimo-v2.5-proπŸ“„ doc
HIGHSIMPLEMENTEDβ€”2026-06-02 10:36 EDTMETHODOLOGYQuant Ops Monitor (real-time health checks)
Monitors concentration HHI, resolver disputes, per-class health, strategy culling status. Run via python3 verified_strategies/quant_monitor.py.
Daily monitor runs, alerts on degradationβ€”mimo-v2.5-proβ€”
HIGHSVALIDATEDβ€”2026-06-02 10:36 EDTMETHODOLOGYDonchian Breakout strategy (B-Tier on BTC/ETH/BNB)
Turtle Trading-inspired breakout for crypto. B-Tier on BTC (Sharpe 1.91, +371%), ETH (Sharpe 1.69, +574%), BNB (Sharpe 1.57, +358%).
Forward PF >= 1.5 on 30 live picksβ€”mimo-v2.5-proπŸ“„ doc
HIGHSVALIDATEDβ€”2026-06-02 10:36 EDTMETHODOLOGYMulti-Timeframe Momentum strategy (B-Tier on ETH/XRP)
Combines 20-day and 50-day momentum with 200 SMA filter. B-Tier on ETH (Sharpe 1.60, +272%), XRP (Sharpe 1.72, +1169%).
Forward PF >= 1.5 on 30 live picksβ€”mimo-v2.5-proπŸ“„ doc
HIGHSVALIDATEDβ€”2026-06-02 10:36 EDTMETHODOLOGYBlue-Chip Compounders strategy (NVDA/MSFT/AAPL/GOOGL)
Buy-and-hold mega-cap compounders with SMA trend filter. NVDA +2285% backtest, Sharpe 2.08. GOOGL +733%, AMZN +441%, JPM +336%.
Forward PF >= 1.2 on 20 live picksβ€”mimo-v2.5-proπŸ“„ doc
HIGHSIMPLEMENTEDβ€”2026-06-02 10:36 EDTMETHODOLOGYEAGLE Swarm Synthesis β€” multi-model review of 8 EAGLE documents
Consolidated findings from Claude Opus 4.7, minimax-m3-free, Mimo v2.5 Pro, Grok. Identified AI tournament edge matrix, directional biases, best symbols, best personas.
β€”mimo-v2.5-proπŸ“„ doc
HIGHSBACKLOGEAGLE-3 P0 follow-up2026-06-02 10:25 EDTMETHODOLOGYReplace mutation_framework.compute_pf with proper PF formula + reject 999 fallback inflation
Replace mutation_framework.compute_pf with proper PF formula + reject 999 fallback inflation
All mutation PF values are computable from gross win/loss sums; no fallback artifact above 5.0β€”claude-opus-4-7β€”
HIGHSIMPLEMENTEDβ€”2026-06-02 10:18 EDTGATECRYPTO SHORT-only direction flip in production scanner
Tournament data: SHORT 67% WR / +3.74% avg PnL vs LONG 33% WR / -0.49% avg. Flip production scanner to SHORT-only for CRYPTO.
CRYPTO WR improves from ~45% to >55%
Implemented in production_scanner.py EAGLE-4 admissibility gate.
minimax-m3-freeπŸ“„ doc
HIGHSBACKLOGEAGLE-3 P12026-06-02 10:17 EDTMETHODOLOGYFix MC null hypothesis: bootstrap-with-replacement -> block bootstrap
Fix MC null hypothesis: bootstrap-with-replacement -> block bootstrap
MC p-value distribution shifts; previously rejected strategies (VWAPReversion OOS Sharpe 3.10) can promoteβ€”claude-opus-4-7β€”
HIGHSBACKLOGEAGLE-3 P12026-06-02 10:17 EDTMETHODOLOGYDSR n_trials: read real cumulative hypothesis count from reports/hypothesis_registry.json
DSR n_trials: read real cumulative hypothesis count from reports/hypothesis_registry.json
DSR values shift to correct multi-testing-deflated magnitudesβ€”claude-opus-4-7β€”
HIGHSBACKLOGEAGLE-3 P1 follow-up2026-06-02 10:17 EDTMETHODOLOGYLand block-bootstrap helper in verified_strategies/strategy_verification_engine.py (file now on main via PR #455)
Land block-bootstrap helper in verified_strategies/strategy_verification_engine.py (file now on main via PR #455)
MC null hypothesis preserves serial structure for trend strategiesβ€”claude-opus-4-7β€”
HIGHSACCEPTEDβ€”2026-06-02 10:17 EDTMETHODOLOGYMerge PR #457 β€” resolver health forward_test fix + ALTER helper
Ship check_resolver_health.py fix (forward_test before close), tools/ensure_forward_test_outcome_columns.py idempotent ALTER, and stale-open batch-resolve documentation.
PR #457 merged; check_resolver_health exits 0 green on forward_test when columns present.β€”cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· 457
HIGHSIMPLEMENTEDβ€”2026-06-02 10:17 EDTGATEHard-reject single-source-artifact sleeves at promotion
is_single_source_artifact currently advisory. Make it a hard reject (or require >=3 sources) before any sizing. Kills the 2 false-positive crypto sleeves until de-concentrated.
No promoted sleeve with single_source_pct>0.6.
Shipped shadow-first in PR #470: _single_source_gate() flags classes whose profitable edge is entirely single-source. St
claude-opus-4-8πŸ“„ doc Β· 470
HIGHSACCEPTED2026-06-09
OVERDUE 7d
2026-06-02 10:17 EDTMETHODOLOGYEAGLE2: Daily operator bundle β€” run_eagle_suite.py + deploy audit JSON
Morning checklist: run_eagle_suite.py (--skip-swarm on laptop), verify money_ready_verdict, FTP strategy_admissibility.json + pilot_forward_dashboard.json, zero sizing on NOT_READY.
Suite runs green 7 consecutive days; verdict JSON fresh <24h on live /audit.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
HIGHSIMPLEMENTED2026-06-09
OVERDUE 7d
2026-06-02 10:17 EDTUIEAGLE2: Pick-funnel β€” label discovery cells 'not capital ready'
All green pick_funnel cells carry discovery disclaimer; cross-link strategy_admissibility.json and money_ready_verdict. Prevents tournament/lab numbers sizing production.
Live pick_funnel.html shows disclaimer on every green cell; zero user reports of funnel=live edge.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-01
OVERDUE 15d
2026-05-29 02:08 EDTDATA_FEEDUpdate 'forward validator frozen 270h' incident title/description to accurate root cause (outcome resolver git add pathspec, not forward_validator)
From reports/2026-05-26_forward_validator_freeze_diagnosis.md + money maker v2 unblock plan. Current incidents.html / DB entry misattributes the 270h freeze. Accurate: outcome-resolver.yml failing on missing closed_picks.json in git add since 2026-05-25 19:54. This is the real blocker for signal_out…
incidents.html and DB accurately reflect the outcome resolver root cause; triage focuses on the real one-line workflow fix
--- title: "Phase 1.1 β€” 'Forward validator frozen 270h' root-cause diagnosis" date: 2026-05-26 status: diagnosed, fix pr
grok-4.3-autosweep-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-02 17:00 EST
OVERDUE 14d
2026-05-29 02:08 EDTDATA_FEEDFix outcome-resolver.yml git add step (pathspec closed_picks.json error blocking signal_outcomes)
From reports/2026-05-26_forward_validator_freeze_diagnosis.md (cross-ref money maker v2 unblock plan). The 'forward validator frozen 270h' claim is misdirected. Real freeze is in outcome-resolver workflow: fatal pathspec 'alpha_engine/data/closed_picks.json'. Last success 2026-05-25 18:51, failures …
outcome-resolver.yml runs green hourly; signal_outcomes written <2h fresh; money maker v2 unblocked on data trust
--- title: "Phase 1.1 β€” 'Forward validator frozen 270h' root-cause diagnosis" date: 2026-05-26 status: diagnosed, fix pr
grok-4.3-autosweep-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-10
OVERDUE 6d
2026-05-29 01:46 EDTDATA_FEEDSchema drift watchdog nightly workflow (information_schema snapshot + automated diff vs version-controlled baseline)
From daily_ideas_edge_sweep_2026_05_17.md + synthesis + this Daily Ideas subagent. Nightly snapshot of information_schema to JSON + automated diff against version-controlled baseline in schemas/. Currently no such workflow despite high consensus. Integrates with existing db-freshness guardian and cr…
Zero unexplained schema drift in CI; faster MTTR on generator/schema bugs; reproducible dashboard numbers
# Daily Ideas Edge Sweep β€” 2026-05-17 **Sources:** 19 DAILY_IDEAS files from 12+ AI agents (Antigravity, Cursor, Grok,
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-08
OVERDUE 8d
2026-05-29 01:46 EDTDATA_FEEDMySQL sync workflow silent-fail removal + explicit failure + alert (remove || echo non-fatal swallow)
From daily_ideas_edge_sweep_2026_05_17.md + synthesis + this Daily Ideas subagent. Remove '|| echo "non-fatal"' in mysql-trading-sync.yml line 43 (and similar) that swallows DB failures. 4+ agent consensus; 5-min fix. Aligns with existing db-freshness guardian.
DB outages surfaced in <15min (vs hours); higher fidelity live_picks/resolver data for audit
# Daily Ideas Edge Sweep β€” 2026-05-17 **Sources:** 19 DAILY_IDEAS files from 12+ AI agents (Antigravity, Cursor, Grok,
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-10
OVERDUE 6d
2026-05-29 01:45 EDTDATA_FEEDSchema drift watchdog nightly workflow (information_schema snapshot + diff vs baseline in schemas/)
From daily_ideas_edge_sweep_2026_05_17.md + synthesis + this Daily Ideas subagent. Nightly snapshot of information_schema to JSON + automated diff against version-controlled baseline. Currently no such workflow despite high consensus. Integrates with existing db-freshness and cross-DB audit.
Zero unexplained schema drift in CI; faster detection of generator bugs; reproducible dashboard numbers
# Daily Ideas Edge Sweep β€” 2026-05-17 **Sources:** 19 DAILY_IDEAS files from 12+ AI agents (Antigravity, Cursor, Grok,
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-08
OVERDUE 8d
2026-05-29 01:42 EDTUIFix duplicate leaderboard entries (source="" inflation) + reconcile alpha_engine aggregate loss vs sub-strat buried winners
From AUDIT_DASHBOARD_BURIED_WINNERS_2026-05-29.md + this subagent. Every leaderboard entry appears exactly twice (source="" + real); inflates counts. alpha_engine aggregate losing (βˆ’5.56, 37.1% WR) while sub-strats (mega_mutation etc.) win big. Dedup by strategy+source + surface buried + per-strateg…
No double-counting on /audit; buried winners (mega_mutation PF4.39 etc.) appear in Smart Picks/aggregates; accurate counts
# Audit Dashboard Buried Winners Analysis **Date:** 2026-05-29 **Analyst:** Grok 4.3 (autonomous) **Data source:**
grok-4.3-autosweep-05-27-synth-05-29-hunt-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-10
OVERDUE 6d
2026-05-29 01:42 EDTDATA_FEEDFix incidents.html EST timestamps + add enhancement_plan + target_date columns to ENHANCEMENT tables
From 2026-05-29-COMPREHENSIVE-QUANT-ANALYSIS.md + this subagent (Tier 1 recs #3/#4). incidents.html timestamps broken; no audit trail for enhancement plans/dates. Add schema columns + population + generator fixes for credible /audit/incidents.html as Goal #1 task board.
incidents.html shows correct EST + every enhancement has plan + target_date; full audit trail
# Comprehensive Quant Data Dive β€” May 29, 2026 > **Purpose:** Full-spectrum quantitative audit of pick performance, edg
grok-4.3-autosweep-05-27-synth-05-29-hunt-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-12
OVERDUE 4d
2026-05-29 01:40 EDTUItriage_dashboard_P0_regression_diff_and_escalation
Add post-render job + TRIAGE DASHBOARD section on incidents.html showing P0 counts, age>7d escalation, and OPEN->RESOLVED->OPEN regressions. From 2026-05-25_incidents_triage_consult_codex + grok. Makes nightly incidents page credible for Goal #1 work.
All P0s >7d or regressed flagged daily; triage becomes actionable
# Incidents Triage Consult β€” codex **Date:** 2026-05-25 **Engine:** codex **Prompt:** see /tmp/consult_prompt.md (also
grok-4.3-autosweep-EAGLE-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-08
OVERDUE 8d
2026-05-29 01:40 EDTMETHODOLOGYmulti_AI_panel_grounding_failure_mandate
Ungrounded 5-engine panel called COMMODITY #1 alpha (PF 4.64 from 85% CT=F leakage); grounded panel correctly flagged DATA_QUALITY_LEAKAGE. From 2026-05-25_multi_ai_panel_meta_review. Mandate leakage signals + hypothesis_registry rejected entries in every consult prompt.
0 ungrounded COMMODITY-style false positives in future panels
# Multi-AI Panel Meta-Review β€” 2026-05-25 **TL;DR:** Two parallel multi-AI panels reached **opposite verdicts on the sa
grok-4.3-autosweep-EAGLE-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-10
OVERDUE 6d
2026-05-29 01:40 EDTDATA_FEEDappend_only_incidents_seed_with_finding_key
Nightly seed_incidents_enhancements.py re-inserts destroy triage state and create false 'new' P0 churn. From 2026-05-25_incidents_triage_consult_codex.md + gemini. Add deterministic finding_key (asset_class + issue_type + normalized_title) + INSERT ... ON DUPLICATE KEY that only touches evidence/upd…
Stable status/assigned_to across nightly runs; zero duplicate 'new' P0s on identical findings
# Incidents Triage Consult β€” codex **Date:** 2026-05-25 **Engine:** codex **Prompt:** see /tmp/consult_prompt.md (also
grok-4.3-autosweep-EAGLE-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-28
in 12d
2026-05-27 02:34 EDTVerify the 648-for-0 un-gated-picks claim (DeepSeek session)
Roo's NIM panel session (2026-05-25) reports moderate_confidence (n=455) and low_confidence (n=193) buckets went 0-for-648 over the 6-day window 2026-05-16..21, destroying -825% PnL, while 300 gated picks generated +994%. If real this is the single highest-leverage filter in the system. 0-for-455 is…
Verified n / WR per quality_tier bucket from raw DB; circular-definition determination documented in reports/
# Multi-AI Panel Meta-Review β€” 2026-05-25 **TL;DR:** Two parallel multi-AI panels reached **opposite verdicts on the sa
claude-opus-4-7πŸ“„ doc
HIGHSBACKLOG2026-06-28
in 12d
2026-05-25 00:31 EDTMETHODOLOGYRuntime _assert_no_lookahead leakage guard in walk-forward harness
Qwen step-2: assert no test-period features were populated during training. Check columns NaN pre-cutoff AND populated post-cutoff as red-flag for forward-fill leakage. Raise DataLeakageError on detection inside every fold iteration.
zero leakage flags across all candidate strategies in CI
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
HIGHSBACKLOG2026-06-28
in 12d
2026-05-24 23:45 EDTSCORINGWON-vs-PnL backfill SQL β€” re-label legacy contradicted rows
Opencode P0 DRAFT. UPDATE pass that re-computes status from pnl_pct for any (status='WON', pnl_pct<0) or (status='LOST', pnl_pct>0) row. Sign-coherence guard already stops NEW contradictions; this backfills the historical 2,531+ WON rows with negative PnL flagged in db_health.json::won_pnl_contradic…
All WON rows have pnl_pct >= 0; all LOST/SL_HIT rows have pnl_pct <= 0; aggregates re-published
# Audit picks methodology and UI edge validation **Session ID:** ses_1a2d77a8dffe4tBYbdlLp6CGV2 **Created:** 5/25/2026,
opencode/ring-2.6-1tπŸ“„ doc
HIGHSBACKLOG2026-06-28
in 12d
2026-05-24 23:41 EDTGATEAdd VIX/realised-vol regime tag at pick submission
Cheapest single fix per persona-survey β€” addresses 7 personas / ~470 picks. ~30% of picks fire in the wrong regime today.
Picks tagged with regime; backtest shows >=3pp WR improvement when filtering by regime-aligned subset
# Persona Improvement Survey β€” AI Tournament **Date:** 2026-05-25 **Author:** investigation subagent **Goal alignment:*
claude+persona_surveyπŸ“„ doc
HIGHMBACKLOGβ€”2026-06-10 14:24 EDTGATELook-ahead leakage guard as a pipeline-failing CI gate
A look-ahead/leakage check exists only in research tools, not as a CI gate that fails the pipeline. Given the honest-measurement layer now shipped (entry-anchored resolver, intrabar_truth), a CI guard that asserts no future-bar references in the resolver/feature path would prevent regression of the …
CI fails on any new look-ahead in resolver/feature code; protects the honest measurement chainβ€”claude-fable-masterrunπŸ“„ doc
HIGHMBACKLOGβ€”2026-06-09 14:12 EDTGATEWire TSMOM/residual/carry as opt-in sidecars post-P0 (intrabar/OHLCV/TP-SL) for low-n classes (ETF/FUTURES proxies, FOREX carry, EQUITY residual)
n>=50 clean post-filters for pilots; >=1 class reaches n>=100 WR>=50 PF>=1.5 on policy-clean + forward 4wkβ€”grok-4.3 (subagents 2026-06-09)πŸ“„ doc
HIGHMBACKLOGβ€”2026-06-09 14:05 EDTGATENon-crypto edge guard + class-specific source curation and score floors for FOREX/COMMODITIES/ETFS/FUTURES
Implemented non_crypto_edge_guard in quality_gates.passes_active_gate to hard-block emission for these classes unless from proven carry/COT/seasonal/risk_parity/proxy sources. Banned additional universal losers (etf_all_strategies, multi_asset_cot, futures_connors_rsi2) in config.PERMANENTLY_KILLED_…
n_resolved >=50 clean per class with WR>=48% PF>=1.1 on resolved; reduced concentration in bad universal scanners; progress toward money_ready for 1+ non-crypto classβ€”grok-4.3 (subagents 2026-06-09)πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOGβ€”2026-06-08 23:08 EDTMETHODOLOGYDeepen crypto_ohlcv 1h history to >=180 days (gating dependency for intrabar re-resolution)
crypto_ohlcv holds only ~30d contiguous 1h bars (720/symbol). reresolve_intrabar.py dry-run: 1422/3000 picks no_data. Without deep history, full-book intrabar re-baseline is impossible.
>=180d 1h bars for top symbols; reresolve no_data rate <10%β€”cursor-agent 2026-06-09 save-the-system auditπŸ“„ doc
HIGHMBACKLOGβ€”2026-06-08 23:08 EDTMETHODOLOGYApply intrabar re-resolution to production pick book (backup-first)
Dry-run: CRYPTO orig WR 52.3% -> intrabar-true 42.9%, PF 1.22. 26.4% TP->SL reclassification. tools/reresolve_intrabar.py --apply gated.
Clean+intrabar cohort n>=2000; at least one strategy clears n>=100/3mo/PF>1.5/WR>52% OR honest no-edge verdictβ€”cursor-agent 2026-06-09 save-the-system auditπŸ“„ doc
HIGHMBACKLOGβ€”2026-06-08 23:06 EDTMETHODOLOGYSAVE-3: Re-baseline every class on clean+intrabar cohort, then paper-pilot survivors only
After SAVE-1/2 + the shipped artifact filters (backfill quarantine, sane-pnl guard, EXPIRED-honest, banned-source exclusion), re-run the clean-cohort money-ready screen. Only strategies clearing n>=100, >=3 months, PF>1.5, WR>52%, intrabar-validated, multi-source advance to a >=4-week forward paper-…
>=1 strategy x class clears the full bar on a forward (post-fix) cohort before any sizing.β€”claude-opus-4-8 (save-the-system roadmap 2026-06-09)πŸ”— page
HIGHMBACKLOGβ€”2026-06-08 23:06 EDTMETHODOLOGYSAVE-4: Wire dormant academic sleeves (TSMOM / residual-momentum / carry) AFTER resolver is trustworthy
Strategy supply is fine; the academically-correct sleeves are already coded but dormant: tsmom_strategy.py+cta_bridge.py (universal, trailing-stop exits avoid the expiry trap), residual_momentum.py (EQUITY), bond_strategy_harness.py + commodity_basis_carry.py (carry). Promote into production_scanner…
TSMOM + residual-momentum + carry each have >=1 production caller; forward-tracked.β€”claude-opus-4-8 (save-the-system roadmap 2026-06-09)πŸ”— page
HIGHMBACKLOGβ€”2026-06-06 09:33 EDTMETHODOLOGYWire Time-Series Momentum (vol-scaled) as the universal cross-class sleeve
TSMOM (Moskowitz-Ooi-Pedersen 2012) is validated across every class in this audit; exits on trailing stop / sign-flip rather than far fixed TP, structurally curing the TIME_EXPIRED disease. tsmom_strategy.py + cta_bridge.py already exist but are dormant. Pair with Moreira-Muir (2017) vol-managed siz…
After resolver fix: TSMOM sleeve live in production_scanner with vol-targeting; >=3 months clean forward n>=30/mo PF>1.3.β€”claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
HIGHMBACKLOGβ€”2026-06-06 09:33 EDTMETHODOLOGYPromote residual (idiosyncratic) momentum into production_scanner
Blitz-Huij-Martens (2011) residual momentum ~doubles vanilla-momentum Sharpe (~0.8-1.0) and is crash-resistant. residual_momentum.py / equity_qmom_residual.py already coded but stranded in the academic sidecar (no production caller). v1 needs only stock_ohlcv + SPY-residual.
residual_momentum wired into production_scanner equity path; forward PF>1.3 on clean cohort.β€”claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
HIGHMBACKLOGβ€”2026-06-06 01:58 EDTPERSONAWire st_fear_greed_contrarian into AI tournament provider pool
st_fear_greed_contrarian was just promoted to anti-overfit registry (DSR=19.70, FDR p=0.0, WR=53%, n=430). Currently emits only via paper_trading pipeline. Should be wired into the AI tournament ingestion so its FNG-driven crypto picks appear alongside OpenAI/DeepSeek/Google. Strategy: BUY when Fear…
fear_greed_contrarian picks appear in ai_tournament_picks_latest.json within 1 week. Target: β‰₯50 tournament picks from this strategy in the first week.β€”claude-sonnet-4-6 2026-06-06β€”
HIGHMBACKLOGβ€”2026-06-05 01:48 EDTDATA_FEEDTournament leaderboard JSON missing per-row DISPUTED/artifact caveat field
ai_tournament_leaderboard.json tags 67-91% WR models as tier=T1/T2 with NO per-row disputed/artifact/caveat field. Per memory ai-tournament-wr-artifact-2026-06-03, these WRs are suspected single-snapshot resolver artifacts (fix = intrabar OHLC replay). The DISPUTED banner is HTML-only β€” non-browser …
ai_tournament_leaderboard.json includes 'caveat' or 'wr_artifact_risk' per row when WR>=60% AND resolution method = daily-OHLC. UI banner pulls from this field instead of hardcoding.β€”claude-opus-4-7 /loop-1hβ€”
HIGHMBACKLOGβ€”2026-06-05 00:50 EDTOTHERLive-or-die kill switch monitoring auto-files P0 incidents on breach (MASTERPLAN action 5)
See MASTERPLAN_JUNE52026_CLAUDE.MD for acceptance criteria, dependencies, and 30-day timeline. Reviewed by 2-engine swarm (DeepSeek + xAI) β€” both unanimous on action item priority.β€”β€”
HIGHMBACKLOGβ€”2026-06-05 00:50 EDTOTHERFreeze LLM tournament for 30-day live-or-die test of 7 confirmed micro-edges (MASTERPLAN action 1)
See MASTERPLAN_JUNE52026_CLAUDE.MD for acceptance criteria, dependencies, and 30-day timeline. Reviewed by 2-engine swarm (DeepSeek + xAI) β€” both unanimous on action item priority.β€”β€”
HIGHMBACKLOGβ€”2026-06-05 00:50 EDTOTHERHarden promotion_gate.py with walk-forward + DSR>=0.80 + two-regime test (MASTERPLAN action 4)
See MASTERPLAN_JUNE52026_CLAUDE.MD for acceptance criteria, dependencies, and 30-day timeline. Reviewed by 2-engine swarm (DeepSeek + xAI) β€” both unanimous on action item priority.β€”β€”
HIGHMIMPLEMENTEDβ€”2026-06-04 22:34 EDTDATA_FEEDStrategy-level tracking gap: FUTURES/ETF/BOND/COMMODITY/PENNY emit picks, 0 perf rows
at_strategy_stats covers only CRYPTO/MEMECOIN (+thin EQUITY). FUTURES (4094 raw picks, 246/14d), ETF (421), BOND (90), COMMODITY (5723 outcomes), PENNY (494) have ZERO strategy-level perf rows β€” only raw/outcome ledgers. Can't evaluate per-class edge without tracking. Build per-class strategy stats …
Shipped: tools/rebuild_strategy_stats_all_classes.py + daily cron build strategy_perf_by_class for all 11 classes (542 c
claude-opus-4-8πŸ“„ doc Β· 532
HIGHMBACKLOG—2026-06-04 11:53 EDTOTHERCRYPTO LONG→SHORT flip bypass: 10 emitters skip production_scanner gate (991 LONG/BUY in 7d)
Add the _EAGLE4_CRYPTO_FLIP_TO_SHORT flip at alpha_engine/mysql_trading_sync.py upsert layer so it catches all 10 bypass paths (ml_crypto_predictor 230, copy_trader_polymarket 172, luxalgo_filters 116, alpha_engine_fast 116, prediction_market_agents 106, alpha_engine 87, ml_strategy_reviver 84, quan_engine 29, genome_mutations 26, battleground_luxalgo 25). Risk: changes direction on ~900 picks/week β€” needs operator approval + canary mode. Alternative: keep current per-scanner approach and add flβ€”β€”
HIGHMBACKLOGβ€”2026-06-04 10:33 EDTOTHERPromote inverse_ml_enhanced_ADAUSDT_15m_D β€” PF 1.73 / WR 55.6% / avg +0.11% (n=36)
Strongest verified strategy post-INCIDENT-94 backfill. Reach n=100 forward closed and reconfirm PF>1.5 before any live capital. Reference: reports/verified_strategies_unlock_2026-06-04.mdβ€”β€”
HIGHMBACKLOGβ€”2026-06-04 07:53 EDTOTHERPaper pilot signal rarity blocks promotion β€” b_flip n=2, inverse_ml_btc n=3 (need n>=100)
Both strategies reach n=100 forward closed within 60 days. b_flip lab PF=35.91 from 157 trades but forward gen too sparse. Options: (1) loosen entry filters, (2) add more symbols to universe, (3) extend timeframe coverage. Reference: reports/bootstrap_forward_stats_latest.json (2026-06-04T11:24)β€”β€”
HIGHMBACKLOGnext sprint2026-06-02 21:50 EDTMETHODOLOGYSL/TP audit using intrabar OHLC replay (EAGLE_JUNE2 Β§7.1 #3)
SL/TP audit using intrabar OHLC replay (EAGLE_JUNE2 Β§7.1 #3)
Per-class SL/TP audit report with concrete tightening/widening recommendationsβ€”claude-opus-4-7β€”
HIGHMIMPLEMENTEDβ€”2026-06-02 10:36 EDTMETHODOLOGYAdmissibility Pipeline (10-step promotion gate)
Pre-register, data validation, purged walk-forward, cost model, DSR/PBO, block bootstrap, regime robustness, concentration check, forward validation, promotion decision.
All new strategies pass pipeline before capital deploymentβ€”mimo-v2.5-proπŸ“„ doc
HIGHMIMPLEMENTED30-90 days forward data accumulation2026-06-02 10:25 EDTMETHODOLOGYDaily verified pilot loop (PR #452 MERGED): ETF dual_momentum + crypto WF forward stats + faber forward stats
Daily verified pilot loop (PR #452 MERGED): ETF dual_momentum + crypto WF forward stats + faber forward stats
Forward n >= 30 (checkpoint) / n >= 100 (promotion) on ETF dual_momentum pilotβ€”claude-opus-4-7β€”
HIGHMIMPLEMENTEDβ€”2026-06-02 10:18 EDTMETHODOLOGYBlock-bootstrap MC null for strategy verification
Replace bootstrap-with-replacement with moving-block bootstrap (Kunsch 1989) for Monte Carlo null hypothesis testing. Preserves serial correlation, trend/momentum structure, and volatility clustering.
Sharpe p-values for top strategies drop to 0.000 (strong edge signal)
Implemented in alpha_engine/strategy_verification_engine.py. Block size auto-computed via n^(1/3).
kilo-mimo-freeπŸ“„ doc
HIGHMIMPLEMENTEDβ€”2026-06-02 10:18 EDTMETHODOLOGYPBO via parameter grid permutations (not sign flips)
Implement Probability of Backtest Overfitting per Bailey & Lopez de Prado (2015) using parameter grid permutations. 5 perturbation types: window shift, threshold flip, holding period, trailing stop, volatility scaling.
PBO distinguishes robust (PBO<0.10) from overfit (PBO>0.50)
Implemented in strategy_verification_engine.py compute_pbo_parameter_perms().
kilo-mimo-freeπŸ“„ doc
HIGHMACCEPTEDβ€”2026-06-02 10:18 EDTMETHODOLOGYETF Dual Momentum as first promotion candidate
ETF Dual Momentum (Antonacci 2013) is only lab strategy passing Tier-2 gates: PF 1.60, n=104, Sharpe 1.91. First candidate for paper trading pilot.
Forward test PF>1.2, Sharpe>0.8, MDD<15% over 60-day pilot
Created alpha_engine/backtest_etf_dual_momentum.py (1056 lines).
kilo-mimo-freeπŸ“„ doc
HIGHMIMPLEMENTED2026-06-02 12:00 UTC
OVERDUE 14d
2026-06-02 10:18 EDTGATEEAGLE-5 promotion gate (minimax-m3-free, 2026-06-02)
Second in-pipeline gate (6f2.6 in production_scanner.py) that BOOSTS confidence for 33 tournament-validated symbols and 16 personas that have demonstrated >=60% win rates. Multiplicative boost: +20% on whitelisted symbol (_EAGLE5_SYMBOL_BOOST=1.20), +15% on whitelisted persona (_EAGLE5_PERSONA_BOOST…
Distribution of _eagle5_boosted picks on whitelisted symbols matches the scanner universe; paper-trade forward PnL on whitelisted symbols is > paper-trade PnL on non-whitelisted within 4 weeks; no picked confidence >1.0 (cap holds).
Shipped 2026-06-02. Implementation: alpha_engine/eagle_gates.py (standalone module) + alpha_engine/production_scanner.py
minimax-m3-freeπŸ“„ doc Β· ⎌ 7510035 Β· 447)guide-doc:PR #461
HIGHMIMPLEMENTED2026-06-02 12:00 UTC
OVERDUE 14d
2026-06-02 10:17 EDTGATEEAGLE-4 admissibility gate (minimax-m3-free, 2026-06-02)
First in-pipeline gate (6f2.5 in production_scanner.py) that operates on the 4 noise personas + 8 negative-edge class x direction combos identified in the AI tournament analysis (5,492 picks, 3,692 resolved). Three actions: (1) KILL 4 NOISE personas β€” momentum_scalp (28% WR), breakout_scanner (28% W…
All 3 action categories execute on every scanner run; live CRYPTO SHORT ratio jumps from pre-gate to >=60% within 1 week of forward data; NOISE personas emit 0 picks post-gate; PENNY/COMMODITY/ETF/EQUITY SHORT/SELL count drops to 0 post-gate.
Shipped 2026-06-02. Implementation: alpha_engine/eagle_gates.py (standalone module) + alpha_engine/production_scanner.py
minimax-m3-freeπŸ“„ doc Β· ⎌ e9b2d73 Β· ⎌ 7510035 Β· 447)guide-doc:PR #461
HIGHMBACKLOGafter first 30d shadow on ETF Verified DM2026-06-02 10:17 EDTGATEWire promotion_gate.is_admissible_for_production() into production_scanner emission path
Wire promotion_gate.is_admissible_for_production() into production_scanner emission path
Strategies emit only when on PROMOTED_STRATEGIES allowlist + class not frozenβ€”claude-opus-4-7β€”
HIGHMBACKLOGnext sprint2026-06-02 10:17 EDTDATA_FEEDAdd current_price + unrealized_pnl_pct columns to PF_POSITION + daily mark step
Add current_price + unrealized_pnl_pct columns to PF_POSITION + daily mark step
CURRENT $ and UNREALIZED % cells on portfolio drill pages show live numbersβ€”claude-opus-4-7β€”
HIGHMBACKLOGEAGLE-3 P12026-06-02 10:17 EDTMETHODOLOGYReal k-fold PBO using 8-fold purged WF (already coded at rigorous_backtest_harness.py:249-320 but never called)
Real k-fold PBO using 8-fold purged WF (already coded at rigorous_backtest_harness.py:249-320 but never called)
PBO numbers become meaningful; can detect real overfittingβ€”claude-opus-4-7β€”
HIGHMBACKLOGafter data hygiene fixes land2026-06-02 10:17 EDTGATESymbol whitelist: restrict production emission to AI-tournament-proven symbols (BAC/JPM/MSFT 64-100% WR; KULR/RGTI/ASTS/RKLB 83-100% WR for penny)
Symbol whitelist: restrict production emission to AI-tournament-proven symbols (BAC/JPM/MSFT 64-100% WR; KULR/RGTI/ASTS/RKLB 83-100% WR for penny)
Live aggregate WR per asset class lifts to within 80% of tournament WRβ€”claude-opus-4-7β€”
HIGHMIMPLEMENTEDβ€”2026-06-02 10:17 EDTMETHODOLOGYDaily verified pilots operator loop (EAGLE suite + forward stats)
Cron/local: run_eagle_suite.py then run_verified_pilots_daily.py (ETF dual momentum pilot, crypto_wf/faber/etf forward stats, pilot_forward_dashboard.json). GitHub workflow verified-pilot-daily.yml 06:15 UTC.
pilot_forward_dashboard.json refreshed daily; verified_edge_status honest empty states when no logs.β€”cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page Β· 452
HIGHMACCEPTEDβ€”2026-06-02 10:17 EDTGATEContinue stale OPEN resolution β€” bounded batch operator runbook
After first 5×500 batch (~100k→~3.8k OPEN), schedule additional resolve_stale_open_picks.py runs until OPEN count matches hold-window expectations per asset class.
OPEN count <5k and check_resolver_health overall GREEN on open_pick_staleness.β€”cursor-composer-eagle2-2026-06-02πŸ“„ doc
HIGHMIMPLEMENTEDβ€”2026-06-02 10:17 EDTGATEBonferroni/FDR multiple-testing pre-gate before DSR/SPA
Add cross-section multiple-testing correction (Bonferroni or BH-FDR q=0.10) as a hard pre-gate before DSR/SPA in money_ready_verdict. Closes the false-Tier1-PASS hole.
0 promotions that fail BH-FDR; documented adjusted-p per promoted sleeve.
Shipped shadow-first in PR #466: _fdr_gate() per-strategy BH-FDR q=0.10 + Bonferroni, stamps fdr_ok/n_fdr_pass/n_bonferr
claude-opus-4-8πŸ“„ doc Β· 466
HIGHMIMPLEMENTED2026-06-02 14:00 UTC
OVERDUE 14d
2026-06-02 10:17 EDTGATEEAGLE-6 statistical admissibility gate v1 (minimax-m3-free, 2026-06-02)
Final production pipeline gate (6f2.7 in production_scanner.py) sitting on top of EAGLE-4 (6f2.5) and EAGLE-5 (6f2.6). Kills picks whose strategies fail any of: (a) DSR noise filter β€” strategy NOT in the 27 dead strategies from tools/deflated_sharpe_results.json (sr_haircut=2.5034 for 164 strategies…
Live scanner emits _eagle6_verdict on 100% of picks; live ratio of INADMISSIBLE picks reflects DSR-noise set membership (target: ~25-35% INADMISSIBLE for the strategies currently in the live scanner universe); no false-positive kills on strategies with n>=30 and HHI<=0.20
v1 (shipped): DSR noise + n>=30 + HHI<=0.20. v2 backlog: (a) PBO<0.5 gate (blocked on tools/cpcv_pbo_results.json not ye
minimax-m3-freeπŸ“„ doc Β· ⎌ 2b4d7ce Β· ⎌ 0365999 Β· 456
HIGHMIMPLEMENTED2026-06-09
OVERDUE 7d
2026-06-02 10:17 EDTMETHODOLOGYEAGLE2: Merge PR stack #458 #447 #444 β€” walkforward pilots + admissibility + HTML
PR #458 and #457 merged on main 2026-06-02. walkforward_suite pilot PASS etf_dual_momentum OOS PF 2.75. #447/#444/#461 still open.
walkforward_suite.py --only pilot completes on CI; PRs merged; WALKFORWARD_REPORT.json refreshed.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc Β· PR #458 Β· PR #457
HIGHMACCEPTED2026-06-16
due in 0d
2026-06-02 10:17 EDTDATA_FEEDEAGLE2: Weeks 1–2 resolver audit β€” EXPIRED+PnL, duplicate signal_ts, emitter census
Weekly SQL from EAGLE doc Part E: CRYPTO/FOREX EXPIRED wins, duplicate signal_ts groups, depromote battleground/regime_terminal concentration emitters.
Resolver dispute rate <1%; duplicate-ts groups trending down; census report in reports/.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc
HIGHMBACKLOG2026-06-23
in 7d
2026-06-02 10:17 EDTMETHODOLOGYEAGLE2: strategy_admit.py for every promotion candidate + Bonferroni from registry
Re-run walkforward_suite on 50webs when source available; strategy_admit per sleeve; Bonferroni Ξ±_adj from hypothesis_registry.json (M-107).
Every promotion PR cites admit verdict JSON; no sleeve promoted without harness row.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc
HIGHMBACKLOGβ€”2026-05-30 22:24 EDTGATEWire proven-but-dormant backtest edges to live emission (highest-ROI money-ready bridge)
Fleet investigation: the bottleneck is NOT strategy discovery or Monte-Carlo (validation is over-served, runs hourly) β€” it is live signal VOLUME. Multiple PROVEN backtest edges exist but emit ~0 live picks: ETF etf_cross_sectional_momentum (PF 2.05/WR 70/Sharpe 0.97, wired+ENABLED, just young); EQUI…
Each wired edge accumulates n>=30 live resolved at PF>=1.3 within 60-90d; >=2 classes reach Tier-2β€”claude-opus-4.8(money-maker-readyv2 fleet)β€”
HIGHMACCEPTEDβ€”2026-05-29 07:47 EDTGATEMasking-policy linter + baselined manifest (PR-only gate)
Tiered continue-on-error policy: manifest grandfathers 38 existing silent maskers; lint_workflow_masking.py --fail-on-new blocks only NEW silent maskers at PR time (zero new red X). PR-only, never on push-to-main.
0 new silent maskers merged; known_silent baseline drawn down over timeβ€”claude-opus-4-7PR #51
HIGHMACCEPTEDβ€”2026-05-29 07:47 EDTMETHODOLOGYGuardian step-level masked-failure detection
Extend failure-guardian to surface green-job/failed-step masking (the 316-coe blind spot) via the /runs/{id}/jobs API, bounded for quota, report + Discord.
masked failures appear in actions_failure_guardian.json + amber Discord alertβ€”claude-opus-4-7PR #50
HIGHMBACKLOG2026-06-09 18:00 EST
OVERDUE 7d
2026-05-29 06:40 EDTMETHODOLOGYWire ADX_Trend (and donchian_vol_breakout companion) as first universal cross-asset momentum strategy with n>=30 + 5-fold OOS/MC p<0.05 gate
From reports/CYCLE_12_STRATEGY_HUNT_2026-05-29.md: 'Key Discovery: **ADX_Trend is a universal winner across ALL asset classes** β€” the first strategy to show PF>1.5 on EQUITY, ETF, COMMODITY, FOREX, and CRYPTO simultaneously.' Table with specific Tier 1: 'ADX_Trend | NVDA | EQUITY | 5.17 | 80.0% | 10…
EQUITY (currently FAIL+INSUFF-N) + other 5 classes reach pf_registry Tier 2 (PF>1.5/WR>50/MDD<20, n_resolved>=100 policy-clean); incidents.html row closure; measurable lift in 14d/48h WR/PF and pick_funnel.html.β€”grok-4.3-autosweep-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-09 18:00 EST
OVERDUE 7d
2026-05-29 06:10 EDTMETHODOLOGYExtend rigorous_backtest_harness (purged WF + DSR + PBO + costs) to all 81 strategies + implement 7 new <=2-param economic designs (crypto_funding_carry_reversion etc.) to drive PBO<0.20 + T3+
From reports/STRATEGY_ROADMAP_COMPREHENSIVE_2026-05-29.md: Strategies with DSR/PBO computed | 2. Average PBO (all sized) | 0.613. Root Cause: High PBO (0.3-0.7) indicates most strategies were data-mined. The solution: fewer parameters + economic rationale + purged walk-forward. The 7 New Strategy De…
DSR/PBO computed + reported for >=50/81 strategies (pf_registry); >=1 design per weak class reaches T3 (PBO<0.20 + DSR>0.80 + n>=20 policy-clean); 14d/48h OOS stability; incidents.html tracks harness coverage + new designs.β€”grok-4.3-autosweep-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-12
OVERDUE 4d
2026-05-29 01:46 EDTSCORINGSingle-persona swarm-pick backfill + tier-gate (backfill 60d for tier=single; promote only if PF>=1.30 & WR>=50% at n>=100)
From daily_ideas_synthesis_2026-05-16.md + edge_sweep + this Daily Ideas subagent. 22/38 swarm picks are tier=single (1/1 vote) with no backing. Operational risk; backfill + enforce tier-gate so only verifiable high-quality singles get promoted to TV-eligible/Smart Picks.
All swarm picks have verifiable PF/WR backing; no unbacked singles in Smart Picks or funnel
# DAILY_IDEAS Cross-Agent Synthesis β€” 2026-05-16 **Sources:** 15 files across 6 agents (Antigravity, Cursor, Claude, Gr
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-12
OVERDUE 4d
2026-05-29 01:43 EDTSCORINGConfidence calibration table + MySQL anomaly detector (at_confidence_calibration + mysql_prediction_anomaly_scanner for inverted confidence/direction conflicts)
From daily_ideas_synthesis_2026-05-16.md + edge_sweep + KimiCode + this Daily Ideas subagent (HIGH, fixes anti-predictive confidence inversion blocking all WR/PF claims). Create at_confidence_calibration (bucket-level drift tracking + auto-quarantine) + tools/mysql_prediction_anomaly_scanner.py (inv…
+8-12pp class WR via prevented inverted picks; reproducible HC stats (trust_score gaps closed); anomaly flags reduce ghost/PnL drift
# DAILY_IDEAS Cross-Agent Synthesis β€” 2026-05-16 **Sources:** 15 files across 6 agents (Antigravity, Cursor, Claude, Gr
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-15
OVERDUE 1d
2026-05-29 01:43 EDTGATEPCG-5 portfolio gate stack shadow-mode wire (5-gate exec-time reject for concentration/correlation/regime/thin-sample)
From daily_ideas_synthesis_2026-05-16.md + edge_sweep + this Daily Ideas subagent review (multi-agent consensus). 5-gate exec-time reject layer (portfolio concentration + correlation + regime + thin-sample + other) proposed for shadow to block live sizing on correlated/thin/quarantined signals. No p…
Measurable DD reduction in sims/paper (target <20% MDD on sized books); zero concentration violations; gates surface in pick_funnel/audit
# DAILY_IDEAS Cross-Agent Synthesis β€” 2026-05-16 **Sources:** 15 files across 6 agents (Antigravity, Cursor, Claude, Gr
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-15
OVERDUE 1d
2026-05-29 01:41 EDTSCORINGPer-strategy Smart Picks filtering + buried winners promotion + health panels (mega_mutation, kimi scouts etc.)
From AUDIT_DASHBOARD_BURIED_WINNERS_2026-05-29.md + this subagent review. Smart Picks only has 4 picks despite 16+ strategies with forward PF>2.0. mega_mutation PF4.39 n=110; kimi_riseoftheclaw scouts PF>2.5 with 0 active. Add per-strategy filtering, promote buried high-PF to active sizing, add heal…
Smart Picks reflects buried winners (mega_mutation etc. active); >10 strategies with fwd PF>2.0 surfaced; measurable lift in class PF/WR
# Audit Dashboard Buried Winners Analysis **Date:** 2026-05-29 **Analyst:** Grok 4.3 (autonomous) **Data source:**
grok-4.3-autosweep-05-25-29-MDs-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-10
OVERDUE 6d
2026-05-29 01:41 EDTMETHODOLOGYUniversal Monte Carlo permutation test (1000x, p<0.05) as mandatory promotion gate + scaffold wire
From 2026-05-27_pr_set_plan_and_blockers.md + AUTONOMOUS_STRATEGY_HUNT_CYCLES + this subagent. No current gate; small-sample luck survives. Add tools/permutation_test.py + wire to quality_gates/promotion: require real PF in 99th percentile of noise. Proven in hunt (91.5% profitable, ensemble 92% sig…
0 promotions without 99th %ile MC pass; all wired strats p<0.05
--- title: "PR Set Plan + Items I Cannot Solve Without [X]" date: 2026-05-27 04:00 UTC status: PR #8 opened; PR #9, #10
grok-4.3-autosweep-05-25-29-MDs-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-20
in 4d
2026-05-29 01:39 EDTDATA_FEEDAdd MySQL anomaly detector + predictor scorecard tables (at_predictor_scorecard, at_confidence_calibration, mysql_prediction_anomaly_scanner) from daily ideas corpus
From daily_ideas_synthesis_2026-05-16.md (KimiCode + synthesis). Inverted confidence, direction conflicts, silent-dead strategies, drift tracking. Directly improves Goal #1 data integrity and incidents visibility.
Tables populated nightly; at least 3 new P0/P1 incidents auto-detected per week from anomaly scanner
# DAILY_IDEAS Cross-Agent Synthesis β€” 2026-05-16 **Sources:** 15 files across 6 agents (Antigravity, Cursor, Claude, Gr
grok-4.3-autosweep-daily-ideas-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-07-28
in 42d
2026-05-27 02:34 EDTGATEAdd profitable-but-filtered / profitable-but-quarantined audit lane
Create a non-admission-changing observability lane that records picks rejected by gates or quarantine rules but later resolved positively. This turns hidden false negatives into measurable backlog without weakening live safety gates on day one.
Dashboard exposes per-asset-class counts and PF/WR for profitable filtered picks; every row includes first-failed gate + later outcome.
# Audit quick wins β€” EAGLE review **Timestamp:** 2026-05-27 02:17 EST (Toronto local review window) **Model / provide
gpt-5.4/openaiπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-27 02:34 EDTGATEAdd bounded hot-streak exemption with explicit audit trail
Current streak logic influences scoring but does not create a controlled exemption path. Add a time-boxed, per-sleeve exemption contract so repeated clean winners can earn temporary gate relief without silently changing the system.
Every hot-streak exemption has a minimum clean sample, expiry timestamp, explicit reason, and automatic rollback on deterioration.
# Audit quick wins β€” EAGLE review **Timestamp:** 2026-05-27 02:17 EST (Toronto local review window) **Model / provide
gpt-5.4/openaiπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-25 01:15 EDTSCORINGAdopt multi-model-consensus 0-100 algorithm quality score (DeepSeek+Kimi+8 CF models)
10-model panel (NVIDIA DeepSeek-R1-32B + kimi-k2.6 + CF qwen3-30b/qwq-32b/llama-3.3-70b/llama-4-scout/nemotron-3-120b/gpt-oss-120b/glm-4.7-flash/mistral-small-24b) converged on 8 core scoring dimensions: Sharpe(20%)+Sortino(20%)+MaxDD(20%)+n(10%)+Bonf-sig(10%)+OOS-ratio(10%)+cost-burden(5%)+bias-aud…
quality_score column live on /audit/pick_funnel.html nav-surface matrix; every promoted strategy carries a documented 0-100 score; deploy band (>=80) blocks promotion until all 10 Kimi gates pass
# Multi-Model Stock-Algorithm Scoring Synthesis β€” v1 (partial, 2 of 11 models landed) **Date:** 2026-05-25 **Status:**
multi-model-consensusπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-25 00:31 EDTMETHODOLOGYStress-test 2020-March COVID scenario as mandatory pre-deploy gate
Qwen gate-10: replay 2020-03-09 through 2020-03-23 (flash crash) on candidate strategy. Fail deploy if drawdown >30%. Catches strategies that work in normal regime but blow up in correlation-to-1 events. Add stress_test_2020_march() helper, run once per candidate in CI.
every candidate strategy carries documented 2020-Mar drawdown and passes 30% floor
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-25 00:31 EDTMETHODOLOGYWard hierarchical clustering for sector/correlation concentration caps
Qwen pitfall #4: 4 'different' US-tech holdings = 1 sector. Use scipy.cluster.hierarchy.linkage(method=ward) on correlation distance matrix, cap risk per cluster <=20%. Replaces naive sector-string caps in portfolio_construction.
max per-cluster risk contribution <=20% on every rebalance
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-25 00:31 EDTMETHODOLOGYParameter sensitivity sweep +-20% as mandatory deploy gate
Qwen gate-9: shift each key parameter +-20% and re-run OOS. Fail deploy if any single shift causes >15% PnL degradation. Detects fragile parameter overfits. Add _run_parameter_sensitivity_analysis() to StabilityReport.
all promoted strategies pass <=15% degradation on +-20% sweep
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-25 00:31 EDTMETHODOLOGYExtend Bonferroni multi-test correction from top_edges to live promotion gate
Agent B (c9ba1589) added Bonferroni to top_edges.py. KEY FINDING: CRYPTO's 5 apparent edges DO NOT survive correction across 673-cell space; only COMMODITY produces robust edges. Currently live score_pick / passes_smart_gate uses raw thresholds. Wire wr_z + bonferroni_pass fields from top_edges outp…
verified-alpha promotion rate drops to align with bonferroni_pass=True cells only
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-25 00:31 EDTMETHODOLOGYMonte Carlo permutation test on OOS Sharpe before strategy admission
Qwen step-4: shuffle outcome labels N=100, re-run walk-forward, derive empirical permutation p-value on mean OOS Sharpe. Required gate: p<0.10 before shadow->probation promotion. Add _shuffle_and_revalidate() helper, return permutation_p_value field in WalkForwardResult.
p<0.10 enforced on every promotion event
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:54 EDTSCORINGReplace stationary covariance with EWMA + Ledoit-Wolf shrinkage (framework pitfall #1)
Current portfolio constructor likely uses sample covariance which fails on regime breaks (Mar 2020 correlations spike to ~1 and equal-risk allocation blows up). Switch to exponentially-weighted covariance with Ledoit-Wolf shrinkage; recompute on a rolling basis. Required input for the planned CVaR o…
Portfolio constructor uses EWMA + LW shrunk covariance; backtest on Mar 2020 + 2022 vol-shock shows DD reduction vs sample cov baseline
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwen-code+framework-reviewπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:51 EDTMETHODOLOGYShip tools/wf_validator.py β€” purged walk-forward (30d purge / 5d embargo / >=8 folds)
Ring's quant-workflow spec: a systematic walk-forward runner with purged k-fold + embargo to prevent leakage. Currently no such runner exists in repo; DSR sidecar is the only overfit guard. Complements the kilocode/nemotron purged-k-fold proposal but is more specific (30d purge, 5d embargo, >=8 fold…
tools/wf_validator.py shipped, integrated into strategy-promotion CI; >=3 strategies validated through it within 14 days
# Opencode Ring-2.6-1T Session Deep-Scan β€” Net-New Items **Source:** `session-ses_1a2d.md` (22,048 lines, Ring-2.6-1T v
opencode/ring-2.6-1t + 3-engine swarmπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:51 EDTMETHODOLOGYAdd 10-gate deployment checklist as CI gate on shadow->probation promotion
Per Qwen framework: OOS Sharpe>=0.5, max DD<20%, WR binomial p<0.05, Sharpe CoV<0.5, edge-decay slope>-0.05, permutation p<0.10, net-Sharpe-after-costs>0.30, zero leakage flags, parameter sweep stability <15% PnL loss, stress-test (Mar 2020) DD<=30%. Failures block promotion or force reduced sizing.
CI workflow exists that runs all 10 gates on a candidate strategy + writes pass/fail JSON; no strategy promotes without all 10 green
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwen-codeπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:47 EDTSCORINGAdd momentum + macro composite signal template to alpha_engine (kilocode pseudo-code)
Kilocode shipped a reference pseudo-code: 50%/50% blend of (a) rolling-z 20d log-return momentum, (b) rolling-z macro-surprise series (CPI/PMI/yield-curve from FRED). Outputs a rank-able composite score. Closes the long-standing 'FRED data wired but not used in pick gen' gap from HEDGE_FUND_GAP_ANAL…
New alpha_engine/new_strategies/momentum_macro_composite.py wired into production_scanner; emits >=20 picks/30d with WR>=50%
# Hedge Fund Persona Investment Analysis **Session ID:** ses_1a2fc349fffeM2WL8LafLjajpC **Created:** 5/25/2026, 2:43:10
kilocode/nemotron-3-superπŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:45 EDTMETHODOLOGYBatch-DSR backtest the 206 baby_strategies/ files (zero currently wired)
Opencode found 206 files in alpha_engine/baby_strategies/, ZERO wired to production. Massive untapped pipeline. Surface a batch DSR runner (anti_overfit_audit_sidecar.py over baby_strategies/*) to find DSR-real candidates and promote them per the Wire-Up Rule.
DSR audit completes on 206 strategies; >=3 candidates with DSR>=0.95 promoted to probation with documented production caller
# Audit picks methodology and UI edge validation **Session ID:** ses_1a2d77a8dffe4tBYbdlLp6CGV2 **Created:** 5/25/2026,
opencode/ring-2.6-1tπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:41 EDTDATA_FEEDBackfill trust_score on historical closed picks
Backfill from strategy registry so HC overlay claims become reproducible. Unblocks all 'closed-book edge' callouts on the page.
HC-gated closed picks recompute to claimed CRYPTO 60.3%/EQUITY 68.1% (within 5pp tolerance)
# findtorontoevents.ca/audit β€” UI / Edge / Freshness Audit **Date:** 2026-05-25 03:30 UTC Β· **Author:** Claude Opus 4.
claude-opus-4-7πŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:41 EDTMETHODOLOGYImplement random-guess audit self-flag prompt
After each AI-tournament submission, re-prompt the model: 'Are these picks based on cited live market data or speculation? Mark each.' Store flag in tournament_pick_research.research_basis.
research_basis populated on every new pick within 7 days of rollout
# DAILY IDEAS LOG ## 2026-05-13 β€” User brainstorm dump (alt-data + asset-class expansion) Captured for future swarm ro
claude-opus-4-7πŸ“„ doc
HIGHLBACKLOGβ€”2026-06-10 14:24 EDTGATEPortfolio-level factor-risk (beta) gate + de-gross kill-switch
HHI concentration gate measures source-label collision, NOT BTC/SPY/VIX-beta exposure. No covariance/factor overlay exists; only a recommendation string tighten_risk_and_reduce_gross_exposure at production_scanner.py:6823. Directly relevant to /audit Model Portfolios β€” Risk-Managed Books. (Kimi-K2.6…
Per-book gross + net factor-beta computed; auto de-gross when portfolio beta or drawdown breaches thresholdβ€”kimi-k2.6-peerπŸ“„ doc
HIGHLBACKLOGβ€”2026-06-09 01:10 EDTSCORINGForward-weighted elite_score (replace anti-predictive backtest metrics)
Scoring is anti-predictive (incident #65: conf>=0.90 -> 14.4% WR; Zoo: backtest-vs-forward WR r=-0.91). Replace backtest-derived score components with ROLLING FORWARD WR as the primary signal + score decay for stale strategies.
Pearson(elite_score, realized_fwd_pnl) > 0.2 (currently ~0)β€”zoo + claude-opus-4-8β€”
HIGHLBACKLOGβ€”2026-06-08 23:06 EDTMETHODOLOGYSAVE-1: Backfill deep OHLCV history (crypto_ohlcv/stock_ohlcv) β€” gating dependency for intrabar truth
crypto_ohlcv holds only ~30 days of contiguous 1h bars (BTCUSDT=720), so intrabar re-resolution can only cover recent picks (1422/3000 no_data). Backfill ~6-12 months of 1h history via Binance->CoinGecko->KuCoin failover so the full pick book can be honestly re-resolved. This unblocks SAVE-2/3 and e…
crypto_ohlcv has >=180d 1h bars for top-80 symbols; reresolve_intrabar no_data rate <20%.β€”claude-opus-4-8 (save-the-system roadmap 2026-06-09)πŸ”— page
HIGHLBACKLOGβ€”2026-06-08 23:06 EDTMETHODOLOGYSAVE-2: Make intrabar OHLC replay the PRODUCTION resolver (all classes)
outcome_resolver.py does not replay intrabar OHLC, so TP_HIT may have hit SL first. tools/reresolve_intrabar.py (dry-run, de-biased) shows CRYPTO 52.3%->42.9% WR / 26.4% TP->SL reclass. Promote first-touch replay into the production resolver + run reresolve_intrabar.py --apply (backup-first) to rela…
All resolved rows carry forward_test_only=1 + intrabar resolver_version; re-measured verdicts are reproducible across resolver versions.β€”claude-opus-4-8 (save-the-system roadmap 2026-06-09)πŸ”— page
HIGHLBACKLOGβ€”2026-06-05 01:32 EDTMETHODOLOGYEXEC_PLAN_06 PROMOTED_STRATEGIES VRP entry blocked by 11 methodology bugs
docs/EXEC_PLAN_06 plans Day-75 promotion of vrp_harvest_v1 to PROMOTED_STRATEGIES. Independent 2-source review (Claude + DeepSeek-Chat-Direct via local LiteLLM :4000) found 11 critical bugs in verified_strategies/vol_risk_premium_harvest.py: (1) variance-units-to-returns dimensional bug, (2) ^VIX in…
Either (a) vrp_harvest_v1 refactored to address bugs 1-11 with new backtest results before Day 75, OR (b) EXEC_PLAN_06 superseded by a strategy that passes independent skeptic review.β€”claude-opus-4-7 MD-review-loopβ€”
HIGHLBACKLOGβ€”2026-06-05 00:50 EDTOTHERBuild tools/clean_ingest_v2.py with split-aware drift rejection at write time (MASTERPLAN action 2)
See MASTERPLAN_JUNE52026_CLAUDE.MD for acceptance criteria, dependencies, and 30-day timeline. Reviewed by 2-engine swarm (DeepSeek + xAI) β€” both unanimous on action item priority.β€”β€”
HIGHLBACKLOGβ€”2026-06-05 00:50 EDTOTHERAdd 3 non-LLM feature emitters - funding rate, term structure, VIX regime (MASTERPLAN action 3)
See MASTERPLAN_JUNE52026_CLAUDE.MD for acceptance criteria, dependencies, and 30-day timeline. Reviewed by 2-engine swarm (DeepSeek + xAI) β€” both unanimous on action item priority.β€”β€”
HIGHLIMPLEMENTEDβ€”2026-06-02 23:28 EDTMETHODOLOGYReturn-attribution + leakage-control gate for LLM-agent sleeves (KTD-Fin)
Per KTD-Fin (arXiv 2605.28359): LLM-agent backtest edge fails via knowledge leakage (recent window in model cutoff = memorization) + return attribution (beta/style not alpha). Our ai-tournament deepseek_v4 PF3.46 is on a recent named-ticker window -> doubly suspect.
deepseek_v4 alpha-component (Barra-style) > 0 on out-of-cutoff forward window AND PF holds under masked tickers; else demote tournament 'edge' claim.β€”claude-opus-4-8πŸ“„ doc Β· 495
HIGHLIMPLEMENTEDβ€”2026-06-02 10:18 EDTMETHODOLOGYStrategy Verification Engine β€” comprehensive pipeline
Standalone verification engine combining block-bootstrap MC null, PBO, DSR, purged walk-forward, regime analysis, cost model, and tier assignment. Single source of truth.
All strategies validated through single pipeline
Created alpha_engine/strategy_verification_engine.py (736 lines).
kilo-mimo-freeπŸ“„ doc
HIGHLBACKLOG2026-06-30 17:00 EST
in 14d
2026-06-02 10:18 EDTGATEEAGLE-6 v2 statistical gates β€” PBO + walk-forward OOS + bootstrap CI (minimax-m3-free, 2026-06-02)
Backlog for EAGLE-6 v2: add three more statistical gates on top of v1's DSR+n+HHI to harden the final admissibility check. (a) PBO<0.5 β€” Combinatorially-Purged Cross-Validation Probability of Backtest Overfitting. Blocked: tools/cpcv_pbo_results.json not yet generated. (b) Walk-forward OOS PF >= 0.8…
v1 (shipped) already in production. v2 success: each of the 3 additional gates (PBO, WF OOS, bootstrap CI) is active in the live scanner, fail-open semantics preserved, and the four v1 gates (DSR noise + n>=30 + HHI<=0.20 + the new windowed-HHI) work in concert without double-killing the same pick.
EAGLE-6 v2 4-gate cascade COMPLETE: PBO=1.0 FAIL (PR #471) -> WF OOS 18 PASS (PR #473) -> Bootstrap CI 9 PASS (PR #481)
minimax-m3-freeπŸ“„ doc Β· 456 Β· #471 Β· #473 Β· #481 Β· #482 Β· #485
HIGHLBACKLOGCapital staging step 12026-06-02 10:17 EDTMETHODOLOGY30-day shadow paper pipeline for ETF Verified Dual Momentum (first promotion candidate)
30-day shadow paper pipeline for ETF Verified Dual Momentum (first promotion candidate)
Forward PF within 30% of OOS lab PF over 30d -> promote to 0.5x sizingβ€”claude-opus-4-7β€”
HIGHLIMPLEMENTEDβ€”2026-06-02 10:17 EDTMETHODOLOGYSingle purged-embargoed walk-forward promotion path (kill two-scoreboard split)
Tournament (ai-tournament.html, deepseek_v4 PF3.46 paper) and production money_ready use different labels/costs. Unify: one purged-embargoed walk-forward harness (30d purge+10d embargo) as the only promotion path.
Tournament and production WR/PF reconcile within 10% on same picks.
Shipped in PR #476: verified_strategies/promotion_path.py. canonical_promotion() delegates to AdmissibilityPipeline (sin
claude-opus-4-8πŸ“„ doc Β· 476
HIGHLBACKLOG2026-08-27
in 72d
2026-05-24 23:52 EDTSCORINGAdopt Grok unified 1-10 cross-asset rating system (Edge_Quality+Regime_Fit+Calibration+Risk_Adjusted+Diversification)
Final consolidated 162-line spec from Grok (3-round swarm review + DB investigation + Quant 4-phase vetting). Unified 5-sub-score framework with class-specific factors for EQUITY/ETF/CRYPTO/FOREX/COMMODITY/FUTURES/BOND + strict high-risk bucket for PENNY/Cheap/IPOs. Grounded in real DB findings (top…
Spec implemented as alpha_engine/score_v3.py opt-in sidecar; >=3pp WR lift vs current score_pick.py on paired-bootstrap of 90d closed picks; promoted to production after WF gate clears
# Complete Cross-Asset 1-10 Pick Rating System + Quant Vetting **Version:** Final (2026-05-25) **Status:** Consolidate
grok-4.3-gx10-c9b9πŸ“„ doc
HIGHLBACKLOG2026-08-27
in 72d
2026-05-24 23:51 EDTSCORINGAdd CVaR-aware portfolio constructor with turnover penalty (Qwen pitfall #2 + #4)
Replace current rank-and-fill logic with constrained min-vol optimiser that includes (a) CVaR constraint, (b) sector exposure caps, (c) turnover penalty inside the objective. Operationalises Qwen's pitfall #2 (transaction cost feedback) and pitfall #4 (hidden sector concentration).
Portfolio constructor returns weights that respect leverage + per-asset + sector caps; backtest shows lower DD vs current logic on same signal feed
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwen-codeπŸ“„ doc
HIGHLBACKLOG2026-08-27
in 72d
2026-05-24 23:47 EDTMETHODOLOGYEnforce purged-k-fold + walk-forward validation as new-strategy admission gate
Kilocode/Nemotron 2026-05-25 session proposes: expanding-window (T0=250d / 21d test), 5-fold purged TS-CV with 5-day gap, Bayesian-shrunk Sharpe + bootstrap CI, Ljung-Box on residuals, Monte-Carlo return-shuffle (>95th percentile), regime-split via HMM. Currently only DSR sidecar enforces overfit gu…
Every new strategy promoted from shadow has a JSON validation report attached with all 6 metrics passing thresholds; backtest CI workflow blocks promotion if any fail
# Hedge Fund Persona Investment Analysis **Session ID:** ses_1a2fc349fffeM2WL8LafLjajpC **Created:** 5/25/2026, 2:43:10
kilocode/nemotron-3-superπŸ“„ doc
HIGHLBACKLOG2026-08-27
in 72d
2026-05-24 23:41 EDTSCORINGCross-model consensus tier-rating extractor
Build tools/ai_tournament/consensus_tier_algorithm.py: for each (asset_class, feature_concept), take median weight across all models in tournament_rating_algorithms. Features with >=2-model consensus seed alpha_engine/score_v3.py.
score_v3.py opt-in sidecar shows >=3pp WR lift vs current score_pick.py on 90d closed picks (paired bootstrap p<0.05)
# DAILY IDEAS LOG ## 2026-05-13 β€” User brainstorm dump (alt-data + asset-class expansion) Captured for future swarm ro
claude-opus-4-7+swarmπŸ“„ doc
HIGHXLBACKLOG2026-09-26
in 102d
2026-05-24 23:51 EDTMETHODOLOGYAdopt Qwen 7-stage momentum+macro+regime pipeline as alpha_engine reference architecture
Qwen 2026-05-25 framework: (1) PipelineConfig, (2) ingestion, (3) momentum+macro features, (4) regime detection, (5) signal generation, (6) inverse-vol + capped-Kelly sizing, (7) CVaR-aware portfolio constructor + execution queue. Modular swap-in for any stage. Pairs with the 8-pitfall avoidance + 1…
alpha_engine refactored into the 7 modules with explicit stage boundaries; >=2 strategies running through the pipeline within 30 days
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwen-codeπŸ“„ doc
MEDIUMSBACKLOGβ€”2026-06-12 02:40 EDTSCORINGRetire 5 zombie ML workflows (~40 runs/day, zero consumers)
ml-model-autotraining (trains into nonexistent dirs), ml-battleground-retrain (systems dead since March, <=5.6% honest WR), mercury2-scan (no pick since 03-28), quick-guess-ml (no sink), hierarchical-bayes (no persisted output). Plus system-F 25,669 perpetually-unresolved rows to expire. ML audit 20…
CI minutes freed; zombie cron count 0; system-F rows expiredβ€”claude-fable-mlauditβ€”
MEDIUMSBACKLOGβ€”2026-06-06 01:58 EDTUIAdd DB-proven strategy picks section to picks-now.html
picks-now.html currently shows only the yfinance-based quant screener output. A second card row should surface proven strategies from the DB: mega_mutation (T1, PF 2.86, n=204), fear_greed_contrarian (promoted, WR 53%, n=430), and other anti_overfit_registry entries. These would appear as "DB-Proven…
DB strategy picks appear on picks-now.html with strategy attribution. Measurable: β‰₯5 DB strategies shown alongside quant screener results.β€”claude-sonnet-4-6 2026-06-06β€”
MEDIUMSBACKLOGβ€”2026-06-05 01:48 EDTUIWR unit inconsistency: money_ready stores decimal, pf_registry stores percent
money_ready_verdict.classes.<CLASS>.wr is a decimal (e.g. CRYPTO=0.3613). pf_registry.by_asset_class_policy_clean_net[<CLASS>].win_rate_pct is a percent (e.g. CRYPTO=36.129). Consumers reading the wrong field show off-by-100x WR. Either unify (rename one, add converter, or expose both with explicit …
All audit JSON files use either 'wr' (decimal 0-1) OR 'win_rate_pct' (0-100) consistently. Either a converter helper in dashboard_generator.py, or a schema linter test fails CI on mixed units.β€”claude-opus-4-7 /loop-1hβ€”
MEDIUMSBACKLOGβ€”2026-06-05 00:59 EDTOTHERHard n>=100 minimum check before any promotion_gate evaluates β€” prevent premature promotion
Swarm review 2026-06-05 flagged tournament_shadow_book.py (n=25) risk: 'someone treats n=25 as validated and promotes prematurely.' Add a guard at the top of alpha_engine/verified_promotion_gate.py: if state.rolling_30d_n_closed < 100, force promotion_status='INSUFFICIENT_N' regardless of WR/PF. Prevents bypass via small-sample windows. Both DeepSeek and xAI in 4-engine swarm independently flagged this same risk.β€”β€”
MEDIUMSBACKLOGβ€”2026-06-04 20:52 EDTOTHERAuto-bias-scrutiny on every WR>60% cell via mlflow_bias_detector β€” wire to incidents page
Every new persona x class cell that crosses WR>60% gets a bias_score computed and logged to mlflow.db. Composite bias_score>0.5 auto-files an enhancement (or incident if >0.7) so operator sees the warning before any trade decision. Wire to incidents-enhancements-nightly.yml workflow. Reference: tools/mlflow_bias_detector.py shipped 2026-06-04 23:30 EDT (cta_trend x COMMODITY 86.7% WR caught as WINDOW_ARTIFACT bias_score=0.53).β€”β€”
MEDIUMSBACKLOGβ€”2026-06-04 06:22 EDTOTHERFOREX direction asymmetry: LONG 61.7% WR vs SHORT 42.8% β€” opposite of CRYPTO (LONG bad)
Down-weight SHORT FOREX submissions in tournament; if SHORT FOREX baseline weights remove, expect tournament FOREX PF lift from 0.62 to >0.9β€”β€”
MEDIUMSBACKLOGβ€”2026-06-04 05:53 EDTOTHERNormalize trading_picks.category case-mess (stock/stocks/equity, penny/pennystock)
DISTINCT category count drops from 14 to 8 canonical values (crypto/equity/etf/forex/commodity/bond/futures/penny). 669 misclassified rows reclassed.β€”β€”
MEDIUMSBACKLOGthis sprint2026-06-02 21:50 EDTDATA_FEEDCategory-taxonomy normalization: collapse stock/stocks/penny/pennystock to EQUITY + symbol-pattern auto-tag for NULL/UNKNOWN
Category-taxonomy normalization: collapse stock/stocks/penny/pennystock to EQUITY + symbol-pattern auto-tag for NULL/UNKNOWN
All trading_picks rows have category in canonical uppercase enum setβ€”claude-opus-4-7β€”
MEDIUMSVALIDATEDβ€”2026-06-02 10:36 EDTMETHODOLOGYFX USD Momentum strategy (B-Tier on USDJPY, Sharpe 2.02)
Captures USD regime shifts across major pairs. B-Tier on USDJPY (Sharpe 2.02, +130%), C-Tier on EURUSD/GBPUSD.
Forward PF >= 1.2 on 20 live picksβ€”mimo-v2.5-proπŸ“„ doc
MEDIUMSIMPLEMENTEDβ€”2026-06-02 10:36 EDTUINon-interactive CLI wrapper (tools/agent_run.sh)
Agents can now run verify, monitor, mutation, deploy, eagle, litellm, swarm, consult without human babysitting. Auto-injects --yes for tools that support it.
Agents complete CLI tasks without human interventionβ€”mimo-v2.5-proβ€”
MEDIUMSIMPLEMENTEDβ€”2026-06-02 10:36 EDTUIEAGLE2 Quant Review HTML page with ELI5 explanations
Full HTML report at updates/eagle2-quant-review-2026-06-02.html. Root causes, best picks, walk-forward validated strategies, short/long-term plans.
β€”mimo-v2.5-proπŸ”— page
MEDIUMSIMPLEMENTEDthis sprint2026-06-02 10:25 EDTDATA_FEEDPR #457 OPEN: resolver health forward_test_tag_awareness fix + idempotent DB ALTER helper
PR #457 OPEN: resolver health forward_test_tag_awareness fix + idempotent DB ALTER helper
Resolver health forward_test_tag_awareness GREEN on prod after mergeβ€”claude-opus-4-7β€”
MEDIUMSIMPLEMENTEDβ€”2026-06-02 10:18 EDTGATEDSR n_trials from hypothesis_registry.json
Deflated Sharpe Ratio n_trials must reflect actual strategy variants tested. Load from hypothesis_registry.json with conservative floor of 500.
DSR correctly penalizes for multiple testing
Fixed in rigorous_backtest_harness.py.
kilo-mimo-freeπŸ“„ doc
MEDIUMSBACKLOGnext sprint2026-06-02 10:17 EDTUICompute CAGR + Sortino in compute_metrics (currently null on portfolio drill pages)
Compute CAGR + Sortino in compute_metrics (currently null on portfolio drill pages)
CAGR + SORTINO 30D cells on pf.html populate with numbers instead of '-'β€”claude-opus-4-7β€”
MEDIUMSBACKLOG2026-07-13
in 27d
2026-05-27 02:34 EDTVerify kimi_signal_tracking + aggregated_picks 6-day source-system claims
Roo's session reports: kimi_signal_tracking 168 picks WR 53.6% +257.34% (best source by total PnL); aggregated_picks 58 picks WR 74.1% +111.02% (underpowered but interesting). Run the same per-source rollup over a longer window (30d/90d) to test if these are persistent edges or 6-day noise. Apply th…
Per-source 30d/90d WR/PF confirmed with dedup + concentration flag; advancement-or-rejection documented
# Multi-AI Panel Meta-Review β€” 2026-05-25 **TL;DR:** Two parallel multi-AI panels reached **opposite verdicts on the sa
claude-opus-4-7πŸ“„ doc
MEDIUMSBACKLOG2026-07-13
in 27d
2026-05-24 23:54 EDTGATEContinuous realized-vs-predicted vol telemetry tracker (framework pitfall #6)
When ex-ante vol estimate is too optimistic (predicted 8% vs realised 14%), position sizes are too large and drawdowns blow through risk budget. Need a daily job that logs predicted vs realised vol per asset class and auto-widens sizing buffers when realised > predicted for N consecutive days.
Daily JSON sidecar audit_dashboard/data/vol_calibration.json with per-class predicted/realised series; >=3pp DD reduction when buffer-widening trigger fires
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwen-code+framework-reviewπŸ“„ doc
MEDIUMSBACKLOG2026-07-13
in 27d
2026-05-24 23:41 EDTUIAdd signal_time field to smart_picks_feed payload
One-line addition in dashboard_generator.py. Stops the 'all picks show 1.4h ago' misleading display.
Smart Picks rows display per-pick ages spanning the actual pick lifetime (not all same value)
# findtorontoevents.ca/audit β€” UI / Edge / Freshness Audit **Date:** 2026-05-25 03:30 UTC Β· **Author:** Claude Opus 4.
claude-opus-4-7πŸ“„ doc Β· πŸ”— page
MEDIUMMBACKLOGβ€”2026-06-10 14:24 EDTGATEPer-class reward:risk floor as a uniform admission gate
A per-class R:R floor table + gate ordering is fully specified in reports/peer_claude-topic_rr-floor_kimi_2026-05-31.md but is not enforced as a uniform admission gate. Sub-1.0 R:R geometry (e.g. bollinger_squeeze WR 52.9%/PF 0.10) passes emission today. Complements the entry-conditioning lane.
Emissions below the class R:R floor are blocked/shadow-tagged; PF-killing geometry stops reaching sized booksβ€”kimi-k2.6-peerπŸ“„ doc
MEDIUMMBACKLOGβ€”2026-06-09 01:10 EDTUIBootstrap confidence intervals + p-values on all dashboard WR figures
Replace point-estimate WR with WR + 95% CI + p-value (bootstrap/Monte Carlo). Prevents tiny-n tiles (S-Tier n=16, 100%) reading as real edge. Per Zoo Quant review.
Every headline WR shows CI; n<30 flaggedβ€”zoo + claude-opus-4-8β€”
MEDIUMMBACKLOGβ€”2026-06-08 23:08 EDTMETHODOLOGYWire TSMOM vol-scaled across NON-crypto classes (equity/forex/commodity/bond) after intrabar resolver fix
Academic sleeve exists (tsmom_strategy.py + cta_bridge.py) but dormant. Exits on trailing stop/signal-flip β€” structurally fixes TIME_EXPIRED trap. Wire-Up Rule: no production caller today.
Forward PF>1.3 on clean cohort n>=100
CRYPTO leg IMPLEMENTED 2026-06-09 (ENHANCEMENT_CRYPTO id=11, commit 42e403e79d, paper-pilot). This row now tracks ONLY t
cursor-agent 2026-06-09 save-the-system auditπŸ“„ doc
MEDIUMMBACKLOGβ€”2026-06-08 23:06 EDTMETHODOLOGYSAVE-5: ROI dashboard + 6-month kill switch on paid AI spend
Build /audit/roi_dashboard.html: recurring AI-provider spend vs cumulative live P&L, cost per validated edge, break-even timeline. Kill criterion: if no gate-passing live strategy after 6 months, halt paid API spend and pivot. Protects against sunk-cost. (Do NOT name credential files publicly.)
roi_dashboard.html live with spend-vs-PnL + documented kill criteria.β€”claude-opus-4-8 (save-the-system roadmap 2026-06-09)πŸ”— page
MEDIUMMBACKLOGβ€”2026-06-06 09:33 EDTMETHODOLOGYWire carry + roll-down across BOND/COMMODITY/FX (the diversifier sleeve)
Koijen-Moskowitz-Pedersen-Vrugt (2018) Carry: model-free, low-turnover, ~0.6-0.8 Sharpe/class, low correlation to momentum. bond_strategy_harness.py (carry+roll-down, bootstrap Sharpe, BH-FDR, walk-forward, Sharpe ~0.68) is fully built with ZERO production callers; commodity_basis_carry.py coded. Bo…
bond/commodity carry sleeve wired with >=1 production caller; forward-tracked.β€”claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
MEDIUMMBACKLOGβ€”2026-06-06 09:33 EDTMETHODOLOGYSwitch ETF/BOND emitters to monthly-rebalance / signal-flip exits (not few-day TP timers)
Faber 10-month-MA and Antonacci GEM dual-momentum are long-horizon by design; a few-day expiry timer guarantees ~0 wins (ETF 144/145 expired within +/-3%, etf_faber_tactical 16/16 expired). Re-wire to price-vs-200d-MA / 12-1 momentum-flip exit or >=20-trading-day min hold.
ETF/BOND picks use long-horizon exits; expiry rate <30%.β€”claude-opus-4-8 (per-class edge workflow wxdjdjztu 2026-06-06)πŸ“„ doc
MEDIUMMBACKLOGβ€”2026-06-06 03:34 EDTMETHODOLOGYAdd forward-tracking outcome panel for picks_now_tracker so the page proves itself
picks_now_tracker has exit_price/exit_pnl_pct/exit_reason/resolved_at columns but the page never shows realized outcomes of prior picks-now batches. To become a trustworthy 'what to buy now' surface it should display its OWN forward track record (how did last week's picks-now calls do), which is the…
Add a 'Our picks-now track record (forward)' section computed from resolved picks_now_tracker rows, with a resolver that honors intrabar fills and counts EXPIRED honestly. This is the metric a quant would actually trust.β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
MEDIUMMBACKLOGβ€”2026-06-06 03:34 EDTMETHODOLOGYPer-asset-class breakdown is absent from the ranked leaderboard (only an aggregate per model)
The leaderboard ranks models on a single blended cohort across up to 8 asset classes. A model could be T1-shaped purely on a few lucky CRYPTO picks while being negative on EQUITY/FOREX. The COMMODITY row in the static lower table is still 'TBD/TBD' (template line 449). Per the project's per-asset-cl…
Extend ai_tournament_leaderboard.json with a per-class sub-object {asset_class: {n_resolved, wr, pf, wr_ci_lo, pf_ci_lo}} per model and render an expandable per-class panel in the drill-down, gating tier per class at n>=30 (the kill_gate.py already groups WHERE status IN ('WIN','LOSS') by asset_class β€” reuse it).β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
MEDIUMMIMPLEMENTEDβ€”2026-06-02 19:36 EDTGATEEAGLE-6 v2 windowed-HHI: replace per-pick HHI with moving-window HHI
Avoid over-zealous small-population flagging: windowed HHI computed over last 50 picks per strategy/source_system should be <0.20 in >=80% of windows
EAGLE-6 v1 uses per-pick HHI across the entire pick history of each (strategy, source_system) pair. On small populations
minimax-m3-free (2026-06-02)#475 Β· #481 Β· #485
MEDIUMMBACKLOGnext sprint2026-06-02 10:17 EDTDATA_FEEDInvestigate TP_HIT_REPLAY regression in audit_trail/outcome_resolver.py bar_replay()
Investigate TP_HIT_REPLAY regression in audit_trail/outcome_resolver.py bar_replay()
TestBarReplay class tests pass without skipβ€”claude-opus-4-7β€”
MEDIUMMIMPLEMENTEDβ€”2026-06-02 10:17 EDTMETHODOLOGYShadow-size survivors <=0.5% capital for 4-8 weeks before sizing
Any sleeve clearing the harness runs at <=0.5% live for 4-8wk; promote to real capital only after 2 consecutive 4-week windows within +-10% of backtest PF.
Shipped in PR #475: shadow_size_plan() in admissibility_pipeline.py Step 10. Cleared sleeve -> SHADOW 0.5%, PROMOTE only
claude-opus-4-8πŸ“„ doc Β· 475
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-25 00:31 EDTMETHODOLOGYEdge-decay slope diagnostic on rolling 3-fold Sharpe
Qwen step-4: np.polyfit(range,rolling_sharpe.dropna(),1)[0] slope. If slope<-0.05, flag as 'edge appears to decay, possible regime overfit'. Add to StabilityReport, deploy gate slope > -0.05.
every promoted strategy carries documented edge_decay_slope and passes -0.05 floor
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-25 00:31 EDTMETHODOLOGYRealized vs predicted vol ratio tracker wired to live promotion gate
Qwen pitfall #6: shipped as vol-calibration tracker (f54ee2ed). This entry tracks the WIRE-UP into the active deploy gate: surface realized/predicted on /audit, gate strategy promotion on trailing-3mo median<=1.3.
ratio panel visible on /audit and enforced as promotion blocker
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwenπŸ“„ doc
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-24 23:54 EDTGATEDrift-triggered rebalancing + VWAP execution (framework pitfall #8)
Calendar-rebalancing during flash crashes moves the market against you. Replace 'rebalance every Mon' with 'rebalance when drift exceeds 1.5x expected'. Use VWAP / iceberg slicing on any single fill > 0.5% ADV to limit market impact. Reduces rebalance turnover by ~40% historically.
Rebalance trigger driven by drift threshold not calendar; VWAP slicing on large fills; turnover drops >=25% with no Sharpe degradation in backtest
# Qwen β€” Momentum + Macro Signal Pipeline Framework, Risk-Budgeting Pitfalls, OOS Walk-Forward Validation **Date:** 202
qwen-code+framework-reviewπŸ“„ doc
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-24 23:41 EDTMETHODOLOGYUniverse expansion v1.2 β€” match AI tournament universe to /audit traded symbols
Currently the AI tournament locks symbols to 2026-05-19 snapshot. Widen to S&P 500 + active /audit picks per class so cross-system comparison is apples-to-apples.
Per-class universe doubles or matches /audit symbol count; tournament leaderboard remains stable across switchβ€”claude-opus-4-7πŸ“„ doc Β· πŸ”— page
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-24 23:41 EDTUIPick-funnel rejection visibility on /audit
Show why each symbol scanned but not picked was rejected (which gate killed it). Pick-funnel automation already extracts this; needs UI surface beyond /audit/pick_funnel.html.
Each asset class shows funnel: scanned -> passed score -> passed trust -> passed regime -> opened. Visible from main /audit page.
<!doctype html> <html lang="en"> <head> <meta charset="utf-8"> <title>Pick Funnel Β· findtorontoevents.ca/audit/pick-funn
claude-opus-4-7πŸ“„ doc Β· πŸ”— page
MEDIUMLBACKLOGβ€”2026-06-09 01:10 EDTMETHODOLOGYRegime tagging at pick time + signal-resolution funnel + score-PnL scatter
Store market_regime_id + volatility_atr on each pick; add a Generated->Validated->Executed->TP/SL/Expired funnel (surfaces the 70-95% expiry) and a live elite_score vs realized_pnl scatter to detect scoring decay. Per Zoo Quant review.
Regime-conditional WR view live; funnel shows expiry rateβ€”zoo + claude-opus-4-8β€”
MEDIUMLBACKLOG2026-07-15
in 29d
2026-06-02 10:17 EDTMETHODOLOGYEAGLE2: Tournament virtual book for deepseek_v4 (paper nβ‰₯100 target)
deepseek_v4 #1 tournament leaderboard (n=208, PF~3.5 paper). Bridge tournament picks through admissibility β€” separate universe from at_raw_picks until virtual forward book matches production gates.
Virtual book nβ‰₯100 resolved; live PF within Β±10% of paper for 8 weeks before merge.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
MEDIUMLBACKLOG2026-08-31
in 76d
2026-06-02 10:17 EDTUIEAGLE2: Quant ops dashboard β€” PF, WR, MDD, HHI, resolver dispute rate
Grafana-style panel per EAGLE2 Β§4.6: per-class PF/WR/MDD, portfolio HHI <0.20 target, resolver dispute alerts, admit pipeline latency ≀5 min/sleeve.
Dashboard or /audit section shows 5 metrics with alert thresholds documented in updates/.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
LOWSBACKLOGβ€”2026-06-12 14:23 EDTSCORINGstrategy_tier_tracker spams ~30+ timestamped reports/*.md per day β€” write to one canonical path
A 15-min agent loop runs tools/strategy_tier_tracker.py which writes a NEW reports/strategy_tier_tracker_<ts>.md every run (30+ on 2026-06-12 alone). Repo bloat + buries substantive reports. Fix: default output to a single canonical reports/strategy_tier_tracker_latest.md (timestamped only with --ar…
<=1 tracker file per day in reports/β€”claude-fable-mdreviewβ€”
LOWMBACKLOGβ€”2026-06-06 03:34 EDTMETHODOLOGYAI-tournament WR/PF table likely reproduces the known single-snapshot resolver artifact
The page's model leaderboard (deepseek_v4 55.8% WR / PF 3.72 n=43, cursor_agent 66.1% / 2.35 n=59, etc.) mirrors the /audit tournament 73-91% WR figures that project memory documents as a single-snapshot resolver artifact (not edge), fixed only by intrabar OHLC replay. Showing these high WR/PF besid…
Add the same intrabar-replay caveat banner used on ai-tournament.html, or link out to it, and annotate the table as 'snapshot, not OOS-validated'. Long-term: feed these WR/PF from the intrabar-replayed resolver.β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
LOWMBACKLOGβ€”2026-06-06 03:34 EDTMETHODOLOGYPersist source + asof onto the pick so /audit can attribute price provenance and exclude proxy-priced closures
The enricher throws away result['source'] and result['asof'] (only price is kept). Carrying these onto each pick would let the audit pipeline (a) filter out picks whose exit was driven by a daily/monthly/spot proxy when computing class WR/PF, and (b) flag which FX/commodity classes are surviving on …
In _fetch_yfinance_prices, return (price, source, asof) tuples (or a parallel dict) and have enrich_all_picks stamp pick['price_source'] / pick['price_asof']; surface in pick_funnel / asset_class_health so proxy-priced trades can be down-weighted.β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
LOWMBACKLOGβ€”2026-06-06 03:34 EDTMETHODOLOGYAdd a build-time DB<->JSON reconciliation badge and generated-at staleness check on the page
Because the page is built from an exported snapshot that can diverge from the DB (root cause of the grok3 discrepancy), add a visible reconciliation indicator: at generation time compare each model's JSON WIN/LOSS to a live COUNT(*) and stamp the page with 'DB-reconciled YYYY-MM-DD' (green) or 'snap…
In update_leaderboard.py after writing JSON, run a SELECT model_id, SUM(status='WIN'), SUM(status='LOSS') FROM tournament_picks GROUP BY model_id and diff against the JSON; emit a 'reconciliation' object into the JSON and render a banner. Fail loudly (non-zero exit) when drift exceeds tolerance so the deploy step can hold a bad snapshot.β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
LOWMBACKLOGβ€”2026-06-06 03:34 EDTMETHODOLOGYShow DB-wide honest baseline (48.9% WR, n=1785) as an explicit reference line on the leaderboard
The single most grounding number is missing from the page: across all 1,785 resolved tournament picks the WR is 48.9% (872 WIN / 913 LOSS), i.e. the tournament as a whole has no demonstrated edge over a coin flip yet. Putting this baseline on the page (as a horizontal reference and a one-line 'fleet…
Add a pinned 'FLEET AGGREGATE' summary row (n_resolved, WR, PF across all models) computed by the generator, and draw it as the baseline the CI-shrunk score is measured against. Reuse the existing 'All-time in snapshot' PF element scaffold (template line 135).β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
LOWMBACKLOGβ€”2026-06-06 03:34 EDTMETHODOLOGYAdd a staleness guard so a stale or schema-shifted JSON cannot silently render
The page consumed a 2026-06-06 JSON that silently lost a key the template depends on, with no visible error β€” the fallback `||{}` hides it. There is no on-page assertion that required keys (smart_picks_db_stats, funnel_by_class) exist, nor a max-age check on generated_at. The banner's whole credibil…
In the page bootstrap, validate that pick_funnel_90d.json contains funnel_by_class AND smart_picks_db_stats and that generated_at is < 48h old; if not, render a red 'data integrity' notice in the relevant panel instead of a perpetual 'Loading…'. Optionally add a CI check on the emitted JSON schema in pick-funnel-nightly.yml.β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)β€”
STOCKS (7)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
HIGHSBACKLOG2026-06-05 17:00 EST
OVERDUE 11d
2026-05-29 01:37 EDTGATEWire existing VIX<22 regime gate (feat/equity-vix-regime-gate-sidecar-2026-05-13) to production_scanner + equity_strategies.py per 90-day QW-1
From reports/2026-05-27_quick_wins_from_90day_plans.md + equity_vix_regime_breakthrough_20260513.md + consolidated_plan_2026-05-29.md. Backtest lift: VIX<22 PF 4.55 / MDD 16.8% vs baseline 2.82/24% on large-cap universe. Single production caller needed per Wire-Up Rule. Goal #1 critical for EQUITY e…
EQUITY 14d/48h WR >=55% and PF>1.5 with gate active; >=30 new forward trades in 14 days
# Asset Class Consolidated Plan & Live Task Tracker β€” 2026-05-29 Update **Supersedes / augments:** `reports/90day_plan_
grok-4.3-autosweep-2026-05-29 + 90day-consolidationπŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-12 17:00 EST
OVERDUE 4d
2026-05-27 02:34 EDTMETHODOLOGYSplit LARGE_CAP_EQUITY from PENNY research-only names
The main EQUITY sleeve should reflect the clean large-cap / regime-controlled strategy set, while penny/meme names live in a separate research-only bucket. This improves both reporting honesty and future gate calibration.
Main EQUITY class no longer contains penny/meme symbols; parent-class PF/WR and gate calibration are recomputed on the clean universe.
# Audit quick wins β€” EAGLE review **Timestamp:** 2026-05-27 02:17 EST (Toronto local review window) **Model / provide
gpt-5.4/openaiπŸ“„ doc
HIGHSBACKLOG2026-06-01 17:00 EST
OVERDUE 15d
2026-05-24 23:41 EDTMETHODOLOGYPromote pead_equity from shadow to probation
Only WF-VERIFIED equity strategy (62.2% OOS WR). Currently dormant.
pead_equity emits >=30 forward picks in first 30 days post-promotion with WR>=55%β€”ring-2.6-1tβ€”
MEDIUMSIMPLEMENTEDβ€”2026-06-09 00:20 EDTGATEQuarantine EQUITY cta_replicator dead weight (n=6 WR 0%)
Added (EQUITY, cta_replicator) to emitter_whitelist HARDCODED_TOXIC_PAIRS. pf_registry policy_clean: EQUITY cta_replicator n=6 WR=0% PF=0. Per-class only (FOREX cta_replicator allowlist intact). Held the broader cta_* kill (cta_golden_cross n=3 too small; cta_tsmom_blend/donchian_55/cross_asset_tsmo…
EQUITY clean cohort WR/PF lifts as dead weight stops emittingβ€”claude-opus-4-8β€”
MEDIUMSIMPLEMENTEDβ€”2026-06-09 00:20 EDTUIpicks-now: demote STRONG_BUY on negative analyst upside + validity-filter SAFEST bucket
STRONG_BUY/BUY with analyst upside < -10% (e.g. MU -22%) now demoted to WATCH. SAFEST bucket now requires rvol>0 and excludes AVOID/WATCH so a degenerate flat series (ARB-USD, no data) cannot float to the top.
β€”claude-opus-4-8πŸ”— page
MEDIUMSVALIDATEDβ€”2026-06-02 10:17 EDTSCORINGConsensus Quick Picks (CQP) β€” analyst/13F/moat aggregation methodology
Pure-consensus (no backtest) stability-tilted basket from 6-model panel. Top: MSFT(6/6), BRKB(5/6), SGOV, VOO; AVOID INTC, NVDA divisive. Documented in ejaguiar1_backtests.eagle2_consensus_picks + eagle2_methodology.
Live verification vs TipRanks/Morningstar before sizing.β€”claude-opus-4-8πŸ“„ doc
LOWMBACKLOGβ€”2026-06-03 23:50 EDTMETHODOLOGYEQUITY cross-sectional momentum (clean-bar) β€” MIXED: high-beta + survivorship, not promoted
48mo clean-bar mega-cap momentum: PF3.53 Sharpe1.65 but #111 attribution vs SPY FAILS (alpha t=1.98<2.0, beta1.13=amplified equity beta) + survivorship bias (hindsight universe). Not promoted. Needs point-in-time universe to be a real test.
β€”claude-opus-4-8πŸ“„ doc
ETFS (8)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
HIGHSBACKLOGβ€”2026-05-30 22:24 EDTGATEETF cross-sectional momentum is closest to first Tier-2 class β€” accumulate live n
etf_cross_sectional_momentum (Jegadeesh-Titman 12-1 top-3, long-only) backtests PF 2.05/WR 70.5%/Sharpe 0.97/MDD 16.1%, slippage-robust (PF holds 1.99-2.05 across 5 friction scenarios), n=122mo 2015-2026. Already registered in alpha_engine/etf_scanner.py and ETF_CS_MOM_ENABLED defaults ON. Only gap …
n>=30 resolved at PF>=1.5/WR>=55%/MDD<18% -> promote off probation (most likely first Tier-2 class)β€”claude-opus-4.8(money-maker-readyv2 fleet)β€”
HIGHMVALIDATEDβ€”2026-06-03 14:56 EDTMETHODOLOGYETF dual-momentum (clean-bar) β€” forward-shadow candidate, clears attribution+bootstrap
PR-merged backtest: 48mo walk-forward dual-momentum PF3.57 Sharpe1.62 MDD-12.4%; clears #111 attribution vs SPY (t=2.36 IR0.37 beta0.34=real alpha), bootstrap PF CI[1.64,9.69], cost-robust 20bps. FIRST sleeve to clear the gate-stack.
Forward 2x4wk within +-10% backtest PF via #67 shadow ladder before sizing.
PR#502 backtest + PR#509 purged-embargoed OOS CV: train PF2.30/Sharpe1.08, test PF5.37/Sharpe2.16, HOLDS_OOS no decay. C
claude-opus-4-8πŸ“„ doc Β· 509
HIGHMVALIDATEDβ€”2026-06-02 10:17 EDTMETHODOLOGYETF verified dual momentum forward pilot (shadow, n accumulation)
Lab WF OOS PASS (PF ~1.21); production merge OFF. Daily etf_dual_momentum_pilot.py logs virtual signals to etf_dual_momentum_paper_log.jsonl. First OPEN 2026-06-02 XLK; n_closed=0; promotion_ready=false until nβ‰₯100, PFβ‰₯1.5, WRβ‰₯50% (shadow checkpoint nβ‰₯30).
etf_forward_stats_latest.json shows n_closedβ‰₯30 shadow checkpoint then nβ‰₯100 promotion gate.β€”cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-07-28
in 42d
2026-05-27 02:34 EDTMETHODOLOGYMake VIX-gated sector rotation the primary ETF sleeve
EAGLE review identified ETF sector rotation plus VIX gating as the cleanest underused edge in the current repo. Existing mixed ETF sources dilute that cleaner regime-aware strategy story.
ETF rotation becomes a first-class tracked sleeve with rolling PF/WR/MDD and contributes the majority of ETF class quality picks.
# Audit quick wins β€” EAGLE review **Timestamp:** 2026-05-27 02:17 EST (Toronto local review window) **Model / provide
gpt-5.4/openaiπŸ“„ doc
HIGHLACCEPTED2026-06-30
in 14d
2026-06-02 10:17 EDTMETHODOLOGYEAGLE2: ETF dual momentum forward pilot — shadow n→30 then 100
Only multi-class lab Tier-2 pass (PF 1.60, n=104, WF OOS PASS). Admit FORWARD_PILOT_ONLY. Gate: etf_forward_stats.promotion_ready, MDD<15%, live PF β‰₯0.9Γ— backtest, ≀0.5% shadow capital.
Forward paper nβ‰₯30 (then 100); promotion_ready=true in verified_strategies/paper_pilot/.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page Β· PR #458
MEDIUMMVALIDATEDβ€”2026-06-02 10:17 EDTDATA_FEEDETF dual-momentum sleeve + 24-month walk-forward
ETF is best lab edge / low concentration but n=3 live. Build absolute+relative dual-momentum (SPY/EFA/AGG/GLD), monthly rebalance, 24mo walk-forward.
n>=30 forward, MDD<15%, PF holds out-of-sample.
Pilot already exists: verified_strategies/paper_pilot/etf_dual_momentum_pilot.py (daily BUY/HOLD/EXIT, logs jsonl, sidec
claude-opus-4-8πŸ“„ doc
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-27 02:34 EDTVerify regime_adaptive x ETF Wilson CI 49.7-91.8% claim
Roo's session reports that regime_adaptive x ETF is the only persona-asset pair passing all statistical gates (binomial significance + positive PnL + positive Sharpe), Wilson CI 49.7-91.8%. Cross-reference with the prior 30d ETF PF=3.88 'STRONG RECENT' regime-shift thesis. If confirmed, this is the …
Wilson CI reproduced from regime_adaptive persona's ETF picks; binomial significance test documented
# Multi-AI Panel Meta-Review β€” 2026-05-25 **TL;DR:** Two parallel multi-AI panels reached **opposite verdicts on the sa
claude-opus-4-7πŸ“„ doc Β· πŸ”— page
MEDIUMLBACKLOG2026-09-26
in 102d
2026-05-24 23:41 EDTDATA_FEEDAdd real GEX + 0DTE flow data for gamma_raid persona
spotgamma/unusualwhales feeds. gamma_raid currently narrates gamma without consuming it; persona is already at WR 67.9% β€” real data should lift it further.
gamma_raid persona shows >=3pp WR improvement after data integration
# Persona Improvement Survey β€” AI Tournament **Date:** 2026-05-25 **Author:** investigation subagent **Goal alignment:*
claude+persona_surveyπŸ“„ doc
CRYPTO (11)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
HIGHSIMPLEMENTEDβ€”2026-06-02 20:50 EDTDATA_FEEDBackfill signal_ts + provenance so gates run on auditable data
CRYPTO signal_ts 0->100% recoverable, source->100% (144 rows now auditable)
PR #484 report-only proposer tools/backfill_provenance.py. Swarm-chosen. Real coverage: CRYPTO signal_ts/source ->100%,
claude-opus-4-8484
HIGHSBACKLOGCapital staging step 22026-06-02 10:17 EDTGATEPromote macd_rsi_m048 to PROMOTED_STRATEGIES after 30d shadow
Promote macd_rsi_m048 to PROMOTED_STRATEGIES after 30d shadow
First crypto strategy on PROMOTED_STRATEGIES allowlist after sustained 30d forward PFβ€”claude-opus-4-7β€”
HIGHSBACKLOG2026-06-02 17:00 EST
OVERDUE 14d
2026-05-29 01:37 EDTGATEEnable BTC UTC-hour death-zone filter (M-001 / QW-3) in score_booster.py - reject 08-09Z, boost 22Z
From reports/asset_class_90day_plan_CRYPTO_2026-05-15.md + 2026-05-27_quick_wins + consolidated_plan_2026-05-29.md + EAGLE sessions. Memory n>1000 shows clear WR edge in those hours. Env-gated (CRYPTO_UTC_HOUR_FILTER). Highest-leverage low-effort CRYPTO item from 90-day plans.
BTC subset WR lift >=4pp in first 14 days post-enable; measurable in smart_picks + asset_class_health
# Asset Class Consolidated Plan & Live Task Tracker β€” 2026-05-29 Update **Supersedes / augments:** `reports/90day_plan_
grok-4.3-autosweep-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-23
in 7d
2026-06-02 10:17 EDTGATEEAGLE2: Shadow crypto VWAP reversion + Bollinger MR after resolver clean
WF: vwap_reversion PASS (PF 1.32, n=516 OOS); bollinger_mr PASS (PF 1.67). Hyro pilot sleeves gated by CRYPTO_VERIFIED_* env. Donchian FAIL β€” do not enable.
Resolver GREEN + 14d CRYPTO panel stable; shadow sleeves emit with env flags only.β€”grok-cli-eagle2-2026-06-02πŸ“„ doc Β· PR #458
HIGHMVALIDATEDβ€”2026-05-30 22:03 EDTMETHODOLOGYR:R optimize atr_percentile_gate (58.6% WR, PF 1.10 -> 1.67 via tighter SL)
atr_percentile_gate has a winning entry (58.6% WR, n=29 policy-clean) but losers exceed winners (avgW +0.42% vs avgL -0.54%, |W/L|=0.78), capping PF at 1.10. Loss-cap counterfactual: SL @ -0.5% -> PF 1.47; SL @ -0.4% -> PF 1.67 (crosses Tier-2). Winsorization is upper-bound; needs price-path backtes…
PF>=1.5 on price-path backtest with tightened SL, n>=30 forwardβ€”claude-opus-4.8(money-maker-readyv2)β€”
HIGHMVALIDATEDβ€”2026-05-30 22:03 EDTMETHODOLOGYR:R + exit-timing fix for crypto_liquidity_wick_reversal_v1 (PF 1.50 -> 2.96)
Closest CRYPTO money-ready sleeve: n=43, 58.1% WR, PF 1.498 policy-clean. 40% of exits are TIME (timeout) not TP/SL (TP:10 SL:16 TIME:17) -> exit timing mistuned. Loss-cap counterfactual: SL @ -0.5% -> PF 2.47; SL @ -0.4% -> PF 2.96. Needs n>=100 to reach money-ready. [VALIDATED 2026-05-31: price-pa…
n>=100 forward at PF>=1.5, WR>=55%; reduce TIME-exit share below 20%β€”claude-opus-4.8(money-maker-readyv2)β€”
HIGHMBACKLOG2026-06-10
OVERDUE 6d
2026-05-29 01:46 EDTGATEMeta-Labeler Gate for CRYPTO production wiring (drop leaky features + confidence inversion first)
From daily_ideas_edge_sweep_2026_05_17.md + synthesis + this Daily Ideas subagent. Wire meta_labeler.py into passes_active_gate; CRITICAL drop of forward_wr (leak) + confidence (inverted) from features. Partial shipped (code exists) but production gate wiring OPEN. +8-15pp WR projected on filtered s…
CRYPTO filtered subset WR/PF lift; no leakage in forward tests
# Daily Ideas Edge Sweep β€” 2026-05-17 **Sources:** 19 DAILY_IDEAS files from 12+ AI agents (Antigravity, Cursor, Grok,
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHLBACKLOGβ€”2026-05-30 22:03 EDTMETHODOLOGYOOS forward-validate genome mega_mutation_MACD_RSI before trusting PF 4.28
genome mega_mutation_MACD_RSI: n=210, 68.6% WR, PF 4.276, diversified (JUPUSDT 21.9%), balanced L/S (107/103), clean exits (TP 141 SL 65). EXCLUDED from policy-clean (collapses to n=1) because genome is a genetic-evolution BACKTEST engine -> likely in-sample overfit. If it survives strict OOS forwar…
OOS forward n>=50 holds PF>=2.0, WR>=55%, no single-symbol >30%β€”claude-opus-4.8(money-maker-readyv2)β€”
HIGHLBACKLOG2026-08-27
in 72d
2026-05-24 23:41 EDTDATA_FEEDAdd on-chain + funding-rate feed (Glassnode + Coinglass)
Top recommendation from persona-improvement survey: addresses CLAUDE.md CRYPTO PF 1.25 -> T2 PF 1.5 gap. Covers 6 crypto personas (~453 picks).
CRYPTO PF improves from 1.25 to >=1.5 on next 90d sample with funding+whale gates applied
# Persona Improvement Survey β€” AI Tournament **Date:** 2026-05-25 **Author:** investigation subagent **Goal alignment:*
claude+persona_surveyπŸ“„ doc
MEDIUMSVALIDATEDβ€”2026-06-02 10:17 EDTGATECRYPTO paper watch β€” tournament + SHORT bias (no production capital)
Swarm SUPPORT: deepseek_v4/gpt4o tournament sleeves; CRYPTO SHORT ~67% vs LONG ~33% on reviewed cohort. Capital decision NO-GO on production /audit sizing until money_ready clears.
Paper trades only; production Smart Picks flag stays off; 14d/48h recency checked before any promotion.β€”cursor-composer-eagle2-2026-06-02πŸ“„ doc Β· πŸ”— page
MEDIUMMIMPLEMENTEDβ€”2026-06-09 00:20 EDTMETHODOLOGYWire TSMOM vol-scaled sleeve after intrabar resolver fix
PF_LB95 > 1.05 on rolling 60d forward-pilot before sizing
tsmom_volscaled WIRED 2026-06-09 into academic_strategies_emitter as CRYPTO paper-pilot (forward_test_only=True). Reache
claude-opus-4-8β€”
FOREX (4)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
HIGHSBACKLOGβ€”2026-06-05 01:48 EDTDATA_FEEDFOREX n/WR/PF disagreement between money_ready_verdict and pf_registry at same timestamp
Both JSON files generated at 2026-06-05T05:33Z but report different FOREX figures: money_ready.classes.FOREX = {n_resolved:23, wr:0.2174, pf:10.80} vs pf_registry.by_asset_class_policy_clean_net[FOREX] = {n:22, win_rate_pct:22.73, pf:11.22}. 1-pick + 0.42 PF gap means the two pipelines have divergen…
money_ready.classes.FOREX.n_resolved == pf_registry.by_asset_class_policy_clean_net[FOREX].n at same generated_at, and PF values match within roundingβ€”claude-opus-4-7 /loop-1hβ€”
HIGHSBACKLOG2026-06-10
OVERDUE 6d
2026-05-29 01:43 EDTMETHODOLOGYFOREX carry-factor scaffold (G10 long high-yield / short low-yield) as rescue leg
From daily_ideas_synthesis_2026-05-16.md + edge_sweep + this Daily Ideas subagent (4+ agent consensus, P0/CRITICAL for FOREX rescue from FAIL PF 0.27). Scaffold tools/research/forex_carry.py (monthly rebalance, AQR 30yr Sharpe 0.7-0.9); FRED_API_KEY present. Hard-disable in place until shipped + n>=…
FOREX PF >=1.5 / WR>=50% on n>=30 clean rolling (post-carry); class exits HARD_DISABLE
# DAILY_IDEAS Cross-Agent Synthesis β€” 2026-05-16 **Sources:** 15 files across 6 agents (Antigravity, Cursor, Claude, Gr
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
MEDIUMSBACKLOG2026-07-13
in 27d
2026-05-24 23:41 EDTGATEAdd forex_carry to non_crypto_policy allowlist
Implementation already in repo (alpha_engine/new_strategies/forex_carry.py) with G10 rate differential, claimed 55-60% WR. Only missing the allowlist entry.
Strategy emits picks within 7 days of allowlist add; achieves >=50% WR on n>=10 within 30 daysβ€”ring-2.6-1tβ€”
LOWMBACKLOGβ€”2026-06-03 15:23 EDTMETHODOLOGYFX trend (clean-bar) β€” MIXED: real alpha vs USD but edge too weak/unstable
48mo clean-bar FX trend: attribution vs UUP PASSES (alpha t=2.13, beta -0.61 short-USD), but Sharpe 0.42 + bootstrap PF CI[0.63,3.36] lower<1. Real but small/unstable alpha. NOT promoted. FXE fetch-failed (4-ccy universe).
β€”claude-opus-4-8πŸ“„ doc
COMMODITIES (5)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
HIGHMBACKLOG2026-07-28
in 42d
2026-05-27 02:34 EDTMETHODOLOGYRecompute class health from deduped independent COT cycles only
The class should only advertise edge using independent-cycle-aware COT accounting. This enhancement formalizes the honest source-of-truth rule and blocks stale, over-emitted history from defining the class story.
COMMODITY dashboard tile and supporting rollups use independent-cycle-only metrics; Tier verdict matches the recomputed clean history.
# Audit quick wins β€” EAGLE review **Timestamp:** 2026-05-27 02:17 EST (Toronto local review window) **Model / provide
gpt-5.4/openaiπŸ“„ doc
HIGHMBACKLOG2026-07-28
in 42d
2026-05-24 23:41 EDTDATA_FEEDWire CFTC COT weekly feed for non-cot strategies
Top-2 data-feed investment from survey. Addresses sub-floor FOREX/COMMODITY classes by giving cta_trend / supply_demand / inventory_cycle real positioning data instead of inferring from price.
CFTC COT data ingested weekly into a dedicated table; non-cot strategies show >=5pp WR lift
# Persona Improvement Survey β€” AI Tournament **Date:** 2026-05-25 **Author:** investigation subagent **Goal alignment:*
claude+persona_surveyπŸ“„ doc
MEDIUMLBACKLOG2026-09-26
in 102d
2026-05-24 23:45 EDTMETHODOLOGYExecute COT 7-step testing plan (steps 1-5 active work + step 6 paper-pilot + step 7 risk-of-ruin)
Opencode P2 PASSIVE. Gates the only currently-DSR-verified single-class deviation candidate (cot_positioning + CT=F). Steps 1-5 ~6h active work; Step 6 = 4-week paper pilot (currently SHADOW); Step 7 = Monte Carlo risk-of-ruin sim.
All 7 steps green; cot_positioning + CT=F clears the 10-step Lopez de Prado AFML readiness gate; first eligible LIVE candidate
# COT Paper Pilot β€” Testing + Backtesting Plan for Statistical Edge Proof **Strategy:** `cot_positioning` on `CT=F` (IC
opencode/ring-2.6-1tπŸ“„ doc
LOWMIMPLEMENTEDβ€”2026-06-06 03:34 EDTMETHODOLOGYFMP symbol mapping s.replace('=F','USD') is wrong for grains/softs (ZC=F->ZCUSD does not exist on FMP)
_adapter_fmp maps GC=F->GCUSD which works for metals, but blindly applies the same s.replace('=F','USD') to grains/softs (ZC=F->ZCUSD, KC=F->KCUSD, CT=F->CTUSD). FMP does not list those as <X>USD pairs, so FMP will return empty/None for the exact commodities (CT=F cotton is the documented concentrat…
Replace the blanket .replace with an explicit FMP symbol table (metals -> <X>USD, energy/grains -> FMP's commodities endpoint tickers like ZCUSX/KCUSX) or skip FMP for symbols not in a known-good map, so the chain falls through to a source that can actually answer.β€”claude-opus-4-8 (audit-review workflow waesvw9hq 2026-06-06)⎌ fb27cfa
LOWMREJECTEDβ€”2026-06-03 15:10 EDTMETHODOLOGYCommodity TSMOM (clean-bar) β€” REJECTED: beta not alpha
Clean-bar 48mo TSMOM: PF1.69 Sharpe0.67 MDD-33.8%. Fails #111 (alpha t=0.84, beta0.72 vs DBC) + bootstrap PF CI[0.78,3.93]. Mostly commodity beta, no alpha. Gate-stack correctly rejects (vs ETF which passed).
β€”claude-opus-4-8πŸ“„ doc
BONDS (9)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
HIGHSBACKLOG2026-06-10 17:00 EST
OVERDUE 6d
2026-05-29 01:38 EDTMETHODOLOGYWire bond_scanner.py (yield_momentum, duration_rotation, mean_reversion) + promote ZN=F/ZF=F mean_rev_atr from hunt to production
From 90-day BOND plan + Cycle 17 hunt (PF 2.11 Tier 1) + consolidated_plan_2026-05-29.md. BOND class was 0% WR n=9; now has proven edge but still not emitting at scale. FRED key + wiring = high leverage for Goal #1.
BOND n>=50 within 14 days of wire; class no longer INSUFF-N on /audit
# Asset Class Consolidated Plan & Live Task Tracker β€” 2026-05-29 Update **Supersedes / augments:** `reports/90day_plan_
grok-4.3-autosweep-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHSBACKLOG2026-06-28
in 12d
2026-05-24 23:45 EDTMETHODOLOGYWire bond_scanner.py (3 strategies) to production cron
alpha_engine/bond_scanner.py exists with yield_momentum, duration_rotation, mean_reversion; not currently wired into production_scanner main loop. Universe of 14 symbols ready at config.py:721. Wiring should lift BOND n from 18 to 50+ within 2 weeks.
BOND n>=50 within 2 weeks of wire-up; class no longer marked 'sample-size-thin' on /audit
# Audit picks methodology and UI edge validation **Session ID:** ses_1a2d77a8dffe4tBYbdlLp6CGV2 **Created:** 5/25/2026,
opencode/ring-2.6-1tπŸ“„ doc
HIGHMBACKLOG2026-06-15
OVERDUE 1d
2026-05-29 01:43 EDTMETHODOLOGYBond scanner expansion to full 14-symbol roster (stage 3-4 at a time) + ETF sector rotation + macro overlay
From daily_ideas_synthesis_2026-05-16.md + edge_sweep + this Daily Ideas subagent (P1 for BOND/ETF lifts from thin/INSUFF-N). Expand bond_scanner.py beyond TLT/HYG to full 14-symbol; add relative-strength + macro overlay (VIX/YC/DXY) to ETF sector rotation to push PF 1.33β†’1.5.
BOND n>=50-100 clean + ETF PF>=1.5 on policy-clean; classes exit thin/INSUFF-N
# DAILY_IDEAS Cross-Agent Synthesis β€” 2026-05-16 **Sources:** 15 files across 6 agents (Antigravity, Cursor, Claude, Gr
grok-4.3-autosweep-DailyIdeas-Corpus-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-12
OVERDUE 4d
2026-05-29 01:42 EDTMETHODOLOGYPromote CYCLE_17 BOND Tier-1 breakthroughs (ZN=F/ZF=F mean_rev_atr PF~2.0) + wire with gates
From CYCLE_17_FOREX_BOND_BREAKTHROUGH_2026-05-29.md + this subagent. 8 Tier 1/2 BOND candidates discovered; mean-reversion now viable on bond futures (previously unviable). Register + wire with n>=10 / p<0.05 gates.
BOND policy-clean n>=30 + at least one strategy PF>1.5 with OOS validation
# Cycle 17: FOREX/BOND Deep Dive + New Indicator Discovery **Date:** 2026-05-29 **Session:** Autonomous Strategy Hunt
grok-4.3-autosweep-05-27-synth-05-29-hunt-2026-05-29πŸ“„ doc Β· πŸ”— page
HIGHMBACKLOG2026-06-15
OVERDUE 1d
2026-05-29 01:39 EDTMETHODOLOGYBond scanner expansion to full 14-symbol roster (beyond TLT/HYG) + promote yield-curve / duration strategies from daily ideas corpus
From daily_ideas_synthesis_2026-05-16.md + daily_ideas_edge_sweep_2026_05_17.md (multiple agents consensus). BOND class thin (n=11). Wire the existing bond_scanner.py fully and expand universe per 90-day + daily ideas recommendations. Goal #1 blocker.
BOND n>=50 + at least one strategy with n>=20 forward trades and PF>1.3 within 30 days
# DAILY_IDEAS Cross-Agent Synthesis β€” 2026-05-16 **Sources:** 15 files across 6 agents (Antigravity, Cursor, Claude, Gr
grok-4.3-autosweep-daily-ideas-2026-05-29πŸ“„ doc Β· πŸ”— page
MEDIUMMBACKLOGβ€”2026-06-10 14:39 EDTSCORINGbond_strategy_harness: deliberate wiring deferral now unblocked by PR2 β€” wire as forward_test_only
bond_strategy_harness.py is the one genuine orphan of the June-6 academic sleeves (residual_momentum + basis_carry have callers). The deferral was deliberate ('wire only AFTER the resolver is trustworthy') β€” that precondition is NOW MET (entry-anchored resolver default-on since 2026-06-10). BOND hon…
bond harness emits forward_test_only picks; BOND intrabar n grows toward 100β€”claude-fable-ultracodeπŸ“„ doc
MEDIUMMIMPLEMENTEDβ€”2026-06-02 10:17 EDTDATA_FEEDStand up BOND data feed (currently n=0)
BOND class has zero live samples. Stand up yield-curve/duration-timing pilot data feed before any backtest.
Shipped in PR #477 (opt-in sidecar): verified_strategies/bond_data_feed.py β€” bond ETFs via data_fetcher failover + Treas
claude-opus-4-8πŸ“„ doc Β· 477
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-24 23:41 EDTMETHODOLOGYAdd yield-curve-momentum (TLT/IEF steepener-flattener)
Use new_strategies/tsmom.py framework to trade the 10Y-2Y curve via TLT vs IEF. BOND class currently has only bond_connors_rsi2 (new, no track record).
New strategy emits picks; BOND class n grows from 18 (sub-floor) toward charter n>=100β€”ring-2.6-1tβ€”
LOWMREJECTEDβ€”2026-06-03 23:52 EDTMETHODOLOGYBOND duration-timing (clean-bar) β€” REJECTED: negative alpha vs AGG
48mo clean-bar duration-timing: PF1.08 Sharpe0.10, #111 attribution vs AGG NEGATIVE alpha (t=-0.63 beta0.50) + bootstrap[0.53,2.41]. 2022-25 rate regime. Closes BONDS#7 data-gap. Only ETF dual-momentum clears the gate-stack of 5 archetypes.
β€”claude-opus-4-8πŸ“„ doc
FUTURES (2)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-27 02:34 EDTUIReplace empty FUTURES tile with unified futures taxonomy
A unified futures taxonomy would stop the page from showing a nearly empty standalone FUTURES class while real futures exposure is discussed elsewhere. This is primarily a reporting/trust fix before it is a strategy expansion.
The dashboard no longer presents FUTURES as a zombie class; futures exposure is represented under one honest taxonomy with clear sub-sleeves.
# Audit quick wins β€” EAGLE review **Timestamp:** 2026-05-27 02:17 EST (Toronto local review window) **Model / provide
gpt-5.4/openaiπŸ“„ doc
MEDIUMMBACKLOG2026-08-12
in 57d
2026-05-24 23:41 EDTMETHODOLOGYAdd commodity term-structure roll-yield strategy
Use cta_commodity_momentum_term framework. Captures contango/backwardation premium β€” proven hedge-fund recipe.
Roll-yield strategy emits picks; FUTURES class WR rises above 30% (current 11.1%)β€”ring-2.6-1tβ€”
PENNY (1)
ImpactEffortStatusTargetCreatedCatTitleSuccess metricPlanProposed byLinks
MEDIUMLBACKLOG2026-09-26
in 102d
2026-05-24 23:41 EDTMETHODOLOGYImplement float-squeeze detector from skyrocket_detector framework
Penny stocks have only one (unwired) strategy. Build float-squeeze + volume-breakout signal using existing SIDU pattern code.
New strategy wired into production_scanner; emits >=20 picks/month; first 50 picks show WR>=50%β€”ring-2.6-1tβ€”

Findings β€” 21 total across 2 classes (last refresh: 2026-06-16 02:19 EDT)

Dated findings logged by IDE agents (Claude / Grok / Cursor / etc.) via tools/audit_pick_funnel/cli_track.py finding. Times stored UTC, rendered EST. Linked column points to the related INCIDENT or ENHANCEMENT row when set.
OVERALL (18)
SevTitleStatusAgentCreated (EST)Linked
P02026-06-09 non-crypto performance autopsy from ai_leaderboard + money_ready_verdict
Confirmed FOREX 25% WR n=24 PF=0.078 (policy_frozen, neg exp, USDJPY + multi_asset_scanner conc); COMMODITIES ~22% n=18 PF=0.16; ETFS 21% n=14 recency fail; FUTURES 33% n=18. All sub-coinflip, low n<30, INSUFFICIENT_DATA or frozen. Primary emitters: multi_asset_scanner, etf_all_strategies etc. Ties …
evidence: money_ready_verdict.json (2026-06-09); ai_tournament_leaderboard.json + /audit/ai_leaderboard.html; strategy_tier_tracker; config.py PERMANENTLY_KILLED + MIN_ELITE; quality_gates non_crypto_edge_guard
OPENgrok-4.32026-06-09 14:05 EDTINCIDENT_OVERALL #127
P0Client-side AI-tournament leaderboard score uses fake CI (wr_ci_hi=1, score=wr*pf) β€” contradicts page's own methodology
audit_dashboard/ai-tournament.html calcModelStats() L517-521 computes m.score = m.wr * m.pf with m.wr_ci_lo = m.wr and m.wr_ci_hi = 1 β€” i.e. the displayed score is raw WR*PF (not CI-shrunk) and the 'CI' column shows [WR, 100%] (a fake range). Meanwhile the page header (L190), explainer (L201, L207),…
evidence: Resolved via commit df65641d8 (fix/leaderboard-real-ci β†’ main): added closed-form Wilson WR_lo/hi + bootstrap PF_lo (B=500, seeded LCG) per deepseek swarm consult Option B; score now uses real CI valu
RESOLVEDclaude-gx10-c9b92026-05-29 17:25 EDTβ€”
P1at_raw_picks.pnl_pct populated asymmetrically by status β€” raw R:R math invalid
PEER-VERIFIED FINDING (2026-05-31 / claude-opus-4-7-desktop) SYMPTOM ------- Filtering at_raw_picks by `pnl_pct IS NOT NULL` returns one-sided WR for many strategies that pf_registry (policy-clean) reports as balanced. - crypto_liquidity_wick_reversal_v1: at_raw_picks shows 59 WON / 0 LOST, but pf_…
evidence: Query: SELECT strategy, SUM(status='WON'), SUM(status='LOST'), SUM(status='OPEN'), COUNT(*) FROM at_raw_picks GROUP BY strategy HAVING (won>=10 AND lost=0) OR (lost>=10 AND won=0) β†’ 41 rows. NULL-by-s
OPENclaude-opus-4-7-desktop2026-05-30 22:15 EDTβ€”
P1PR #75/76/78 superset chain β€” #78 is mergeable after stripping stale data JSONs
Three open PRs are related: #75 (per-class FINDING tables) is subsumed by #76 (_UNSET sentinel + CLI), which is subsumed by #78 (CLI + renderer + test_cli_finding.py). #78 is the superset. However #78 includes 2 stale data JSONs (ai_tournament_leaderboard.json + ai_tournament_model_summary.json from…
evidence: Chain settled: PR #75 CLOSED, #76 MERGED, #78 CLOSED. Verified 2026-06-11.
RESOLVEDclaude-fable-sweep2026-05-30 22:07 EDTβ€”
P1OpenRouter API key (10c5b7de...) total-limit exhausted β€” blocks 6 revived models
Pipeline run 26641993453 logs show repeated auth_error (403) Key limit exceeded across every OpenRouter-routed model: claude_haiku_4_5, command_a, minimax_m2_5, kimi_k2_6, qwen3_5_plus, qwen3_6_max. Operational fix: top up OpenRouter credits OR rotate to fresh OR API key in GH secret OPENROUTER_API_…
evidence: GH Actions run 26641993453 job 78522642787 log; lines like '[API] anthropic/claude-haiku-4.5 auth_error (403): Key limit exceeded (total limit)'
OPENclaude-gx10-c9b92026-05-29 22:30 EDTβ€”
P2picks-now-refresh Push step: conflicted rebase not aborted -> exit 1 -> FTP deploy skipped
Run 27185397447 (2026-06-09) FAILED at "Push data commit". Two crons regenerated audit_dashboard/data/picks_now.json simultaneously -> git pull --rebase hit a CONFLICT. The retry loop does "git pull --rebase origin main || true" which SWALLOWS the error but leaves the repo MID-REBASE (conflicted); a…
evidence: picks-now-refresh.yml Push step now continue-on-error + rebase --abort handling (comment cites FINDING #14). Verified 2026-06-11.
RESOLVEDclaude-fable-sweep2026-06-09 03:02 EDTβ€”
P2PR #83 meta-effectiveness analyzer β€” Wire-Up Rule compliance needed before merge
tools/portfolios/meta_effectiveness.py (625 lines) has zero production callers and no GHA workflow integration. 11 tests all pass. Math is solid (Sharpe, Sortino, MaxDD, CAGR, Calmar, attribution decomposition). Concern: violates repo Wire-Up Rule β€” new analysis modules need production caller or exp…
evidence: PR #83 MERGED with wiring. Verified 2026-06-11.
RESOLVEDclaude-fable-sweep2026-05-30 22:07 EDTβ€”
P2at_raw_picks: source_system casing duplication (battleground vs Battleground_Main)
crypto_liquidity_wick_reversal_v1 has 366 rows split across 'battleground' (190) and 'Battleground_Main' (176) β€” same engine, two spellings. Any source-system aggregation downstream (BLOCKED_SOURCE_SYSTEMS, edge_filters.py, pf_registry source_share) will mis-count concentration. Fix: normalize at in…
evidence: SELECT source_system, COUNT(*) FROM at_raw_picks WHERE strategy='crypto_liquidity_wick_reversal_v1' GROUP BY source_system β†’ battleground 190, Battleground_Main 176. Surfaced in FINDING_CRYPTO#1 inves
OPENclaude-gx10-c9b92026-05-30 17:29 EDTβ€”
P2pf_registry aggregates per-engine closed_picks.json without per-class single-source flag
pf_registry.by_asset_class_strategy_policy_clean_net surfaced crypto_liquidity_wick_reversal_v1 as the only PF>=1.5 & n>=30 & WR>=50 strategy across all classes β€” looked like an edge. But all 30 'policy-clean' picks came from a single engine's closed_picks.json (battleground), with no cross-engine c…
evidence: Resolved via PR #86 (commit on branch feat/pf-registry-source-concentration). build_pf_registry.py now emits single_source_pct + top_source + is_single_source_artifact on every row in by_asset_class_*
RESOLVEDclaude-gx10-c9b92026-05-30 16:15 EDTβ€”
P2OpenAI 429 rate-limited across gpt4_1, gpt5_chat, gpt5_mini
Pipeline run 26641993453 hit OpenAI 429 across 3 retries on all personas for gpt4_1/gpt5_chat/gpt5_mini. Fix: raise OpenAI tier OR stagger model calls OR bump retry/backoff in populate_picks.
evidence: run 26641993453 log
OPENclaude-gx10-c9b92026-05-29 22:30 EDTβ€”
P2llama4_scout: 'llama-4-scout-17b-16e-instruct' not on Cerebras catalog (404)
config/model_persona_mapping.json sets api_type=cerebras / model_name='llama-4-scout-17b-16e-instruct' for llama4_scout. Cerebras returns 404 'Model does not exist or you do not have access'. Either pick a valid Cerebras model (llama-3.3-70b, llama3.1-8b) or route llama4_scout via OpenRouter (meta-l…
evidence: run 26641993453 Cerebras 404 response
OPENclaude-gx10-c9b92026-05-29 22:30 EDTβ€”
P2ANTHROPIC_API_KEY GH secret unset β€” blocks cursor_agent revival
Workflow env block wires ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY || '' }} but the secret is unset in repo, so the env var renders empty and populate_picks.py returns [] for cursor_agent silently. Fix: set ANTHROPIC_API_KEY in repo secrets.
evidence: run 26641993453 log shows ANTHROPIC_API_KEY: (empty) while OPENROUTER_API_KEY/OPENAI_API_KEY render as ***
OPENclaude-gx10-c9b92026-05-29 22:30 EDTβ€”
P2FINDING CLI: upsert overwrites existing severity with INFO default when --severity not passed on update
tools/audit_pick_funnel/cli_track.py cmd_finding: when called to update an existing row by title (e.g. to mark RESOLVED), if --severity is omitted, argparse provides the INFO default and the SQL UPDATE writes it over the existing severity. Same issue likely affects --status. Fix: only include --seve…
evidence: Fixed by _UNSET sentinel, cli_track.py:43-57, PR #76 MERGED. Verified 2026-06-11 (workflow wf_cf7c13d9).
RESOLVEDclaude-fable-sweep2026-05-29 18:19 EDTβ€”
P3Backup Restore Verification workflow is scheduled but has NEVER run
backup-verify.yml has on: schedule + workflow_dispatch but its run history is empty (never fired). A scheduled backup-restore-verification that never runs = unverified backups. Check the cron syntax / whether the schedule trigger is valid / whether it was ever enabled.
evidence: .github/workflows/backup-verify.yml on:schedule; gh run list shows 0 runs
OPENclaude-opus-4-82026-06-09 13:53 EDTβ€”
P3glm4_7_flash: OpenRouter 'No endpoints found for z-ai/glm-4.7-flash:free'
model_name 'z-ai/glm-4.7-flash:free' invalid on OpenRouter. Switch to a supported variant (e.g. z-ai/glm-4.6 or remove :free suffix).
evidence: run 26641993453 OpenRouter response
OPENclaude-gx10-c9b92026-05-29 22:30 EDTβ€”
NOTEWORTHYEAGLE-6 v1 deployed to main (commit 2b4d7ce36) + documented in PR #456
On 2026-06-02 the EAGLE-6 statistical admissibility gate was deployed to main via commit 2b4d7ce36 (code: alpha_engine/eagle_gates.py + 6f2.7 wire in alpha_engine/production_scanner.py) and the standalone doc was delivered in PR #456 (commit 036599997, file EAGLE6_2026-06-02_minimax-m3-free.MD). v1 …
evidence: alpha_engine/eagle_gates.py:165-356 (EAGLE-6 block); alpha_engine/production_scanner.py:5587-5599 (6f2.7 wire); commit 2b4d7ce36 (code); commit 036599997 (doc); PR #456; tools/deflated_sharpe_results.
CONFIRMEDminimax-m3-free2026-06-02 10:18 EDTENHANCEMENT_OVERALL #63
INFO36 M-107 pre-registrations (H-105..H-110 + peers) sat LOCAL-ONLY uncommitted β€” rescued to main
2026-06-12: origin/main hypothesis_registry.json had 33 hypotheses while the shared working tree held 69 β€” the peer subagent's full M-107 pre-registration entries were one branch-switch/stash away from loss (known silent-revert pattern). Verified local superset, committed 41fbfa4d45 (+H-111 COMMODIT…
evidence: git: main 33 vs worktree 69 hyps; commit 41fbfa4d45
OPENclaude-fable-mdreview2026-06-12 14:23 EDTβ€”
INFOOne-sided resolution sources flagged by check_one_sided_resolution (H5 coverage pathology)
2026-06-12 weekly loop MEASURE: WON-only sources incl. ml_enhanced_ADAUSDT_15m_B_lightgbm (42/43 WON), reddit u/AutoModerator (29/31), youtube coinbureau (21/21), cta_fx_multifactor (20/20); LOST-only incl. copy_pm_pm_6e1d5040 (54/55), gnews Manila Times (42/43), copy_hl_lb_None (37/378). >95% one-s…
evidence: tools/check_one_sided_resolution.py output 2026-06-12
OPENclaude-fable-mdreview2026-06-12 14:23 EDTβ€”
CRYPTO (3)
SevTitleStatusAgentCreated (EST)Linked
P1Resolver starvation: only 42 OPEN CRYPTO resolved since Apr 29 vs 2,403 legacy
active_picks_sync's LIMIT 5000 ORDER BY signal_timestamp ASC means with 39,706 OPEN CRYPTO rows the oldest-5000 window cuts off at ~Apr 7. The wick strategy is the canary, not the only victim β€” every newer CRYPTO strategy is similarly starved. PR #87 (max-rows 20000) is a partial fix; real fix is OR…
evidence: active_picks_sync.py:106 ORDER BY flipped ASC->DESC per PR #87 (2026-05-31); starvation backlog drained, table-wide OPEN=0. Verified 2026-06-11.
RESOLVEDclaude-fable-sweep2026-05-30 17:29 EDTβ€”
P1crypto_liquidity_wick_reversal_v1: 366 raw-DB picks all status=OPEN β€” never resolved
at_raw_picks shows 366 picks for this strategy (190 from battleground + 176 from Battleground_Main), ALL status=OPEN. Meanwhile battleground/data/closed_picks.json has 43 resolved snapshots (25W/18L = 58.1% WR) β€” this is what pf_registry surfaces as the strategy's 'edge'. Either (a) TP/SL targets ar…
evidence: 0 OPEN rows remain (205 WON / 1 ABANDONED). Residual one-sidedness superseded into FINDING_OVERALL#12 (CI assertion shipping). Verified 2026-06-11.
RESOLVEDclaude-fable-sweep2026-05-30 16:15 EDTβ€”
NOTEWORTHYCRYPTO Bonferroni-proven edge: trust=UNK & rr=RR1.0-1.5 & dir=LONG (n=327, WR 60.86%, PF 3.89)
Per top_edges_per_class.json (generated_at 2026-05-29T06:38Z β€” file is itself stale, see OVERALL #30): one CRYPTO segment passes BOTH Bonferroni adjustment AND walk-forward holdout: - Cell: trust=UNK & rr=RR1.0-1.5 & dir=LONG - n=327, wins=199, WR=60.86% (shrunk 60.23%), PF=3.885, avg_pnl=+1.09% - t…
evidence: REFUTED by the 2026-06-10 definitive edge audit (1,278 slices, Bonferroni): no repeatable edge; every narrowed slice fails the time-split. Source top_edges_per_class.json stale (2026-05-31).
RESOLVEDclaude-fable-sweep2026-05-30 22:11 EDTβ€”