The Black Box

The Four-Link Chain

Each upstream case documents one link. The Black Box documents the chain.

The Upstream Cases

UC-085: The Alien AutopsyRoot Node · FETCH 1,260
Link 1: We don’t understand LLMs

UC-082: The Guardrail GapDiagnostic · FETCH 1,260
Link 2: Agents destroy production

UC-083: The Toxic FlowPrognostic · FETCH 1,260
Link 3: Attackers exploit the gap

UC-084: The 250 Billion LinesDiagnostic · FETCH 1,260
Link 4: Markets reprice violently

Link 1 → Link 2 → Link 3 → Link 4

Link 1: The Interpretability Deficit (UC-085). Large language models are studied like alien organisms, not engineered systems. The best microscope — Anthropic’s sparse autoencoder with 16 million latents — degrades model performance to approximately 10% of original capability. Core concepts lack rigorous definitions. The field is split on feasibility. The models self-repair when components are ablated. Chain-of-thought explanations are unfaithful. We cannot predict, at a mechanistic level, what these systems will do next. This is the root condition.[1]

Link 2: The Guardrail Gap (UC-082). Because we cannot understand the systems, we cannot build guardrails that match their velocity. AI coding agents generate code at 10× speed into delivery infrastructure designed for human velocity. Amazon’s AI tools caused 6.3 million lost orders in a single incident. An AI agent ran terraform destroy on production, wiping 1.9 million rows. Engineers stop reviewing code because the volume overwhelms them. The guardrail gap is the operational consequence of the interpretability deficit: you cannot catch what you cannot understand.[2]

Link 3: The Toxic Flow (UC-083). The same interpretability deficit that prevents guardrails also prevents security. Prompt injection succeeds at 85% because models architecturally cannot distinguish instructions from data — and nobody understands why at a mechanistic level. Twenty-four CVEs across all major AI IDEs. Twenty-two repeatable attack patterns across twelve tools. Clinejection proved the full chain: one AI tool compromised and used to silently install a second autonomous AI agent on 4,000 machines. Meta acknowledges prompt injection is a “fundamental, unsolved weakness.” The toxic flow is the security consequence of the interpretability deficit: you cannot defend what you cannot explain.[3]

Link 4: The Market Repricing (UC-084). When the implications of the black box become visible, markets reprice violently. IBM lost $31 billion in market capitalisation when the market realised that AI could dissolve a competitive moat built on complexity nobody understood. That was one company. AI-related stocks now constitute one-third of the S&P 500. AI spending drove more than 90% of US GDP growth in the first half of 2025. If the market reprices AI risk broadly — triggered by a major safety failure, a regulatory action, or a security incident at scale — the financial consequences cascade into the real economy. Deloitte warns that a drop in AI spending alone could push the US into recession.[4][5]

The chain is not hypothetical. Each link has been demonstrated independently. The question The Black Box asks is whether the chain fires end-to-end: an interpretability deficit enables a security exploit (Link 1 → Link 3) that triggers a production cascade at a major enterprise (Link 3 → Link 2) that causes market repricing of AI risk broadly (Link 2 → Link 4) — at the scale where AI is one-third of equity market capitalisation. That is the system-level event.

The Scale Problem

The individual upstream cases document failures at company scale: Amazon lost 6.3 million orders. IBM lost $31 billion. 4,000 machines were compromised. These are significant but contained. The Black Box becomes a system-level risk because of the concentration of economic activity in AI.

AI-related stocks — the so-called Magnificent 7 — constitute approximately one-third of the S&P 500 by market capitalisation. By Bank of England estimates, this concentration exceeds any prior sector concentration in the index’s history. AI investment accounted for more than 90% of US GDP growth in the first half of 2025, according to Harvard economist Jason Furman. JPMorgan Wealth Management says the biggest risk to their outlook is “not having exposure to this transformational technology.” BlackRock says AI will “keep trumping tariffs and traditional macro drivers.”[5][6]

This concentration means that the failure modes documented in the upstream cases are no longer contained within the technology sector. A major AI safety incident — a supply chain attack via prompt injection that compromises critical infrastructure, a production failure at the scale of Amazon’s March 2026 incidents but affecting financial systems, or a regulatory action that constrains AI deployment — would cascade through the equity market into consumer wealth, business investment, and the real economy. The Federal Reserve’s 2026 stress test models equity prices falling 54% in the severely adverse scenario. One-third of that decline would be AI-related stocks.[7]

The EU AI Act becomes fully applicable in August 2026. It requires transparency and explainability for high-risk AI systems. UC-085 documented that the science cannot currently deliver what the regulation requires. If the EU enforces strictly, major AI providers face compliance costs, deployment restrictions, or market withdrawal from the EU — each of which the market would reprice. If the EU retreats, the regulatory vacuum identified in UC-083 persists and the security attack surface continues to widen. Either outcome feeds the chain.[1]

WATCH Triggers

AI_SAFETY_FAILURE

A deployed model exhibits a dangerous capability undetected by pre-deployment testing AND causes measurable harm (financial ≥$1B, physical, or security) before mitigation. The incident must be traceable to the interpretability deficit — specifically, the failure was not predictable from the model’s known behaviour.

Severity: Critical · Linked to: UC-085, UC-082 · Status: INACTIVE

AI_SUPPLY_CHAIN_MASS

A prompt injection via AI coding agent compromises a software package with ≥1 million weekly downloads on npm, PyPI, or crates.io, persisting for ≥24 hours before detection. Escalated from UC-083. Clinejection affected 4,000 in 8 hours; this trigger measures orders-of-magnitude escalation.

Severity: Critical · Linked to: UC-083 · Status: INACTIVE

EU_COMPLIANCE_CRISIS

August 2026 EU AI Act deadline arrives with no major AI provider (OpenAI, Anthropic, Google, Meta) able to deliver required transparency for high-risk systems, triggering enforcement action, formal waiver requests, or market withdrawal announcements.

Severity: High · Linked to: UC-085 · Status: INACTIVE · Deadline: August 2026

AI_MARKET_REPRICING

Magnificent 7 market capitalisation declines ≥25% sustained for ≥60 days, driven by AI-specific concerns (safety incident, regulatory action, demand disappointment, or interpretability-related disclosure) rather than general market conditions.

Severity: High · Linked to: UC-084, UC-112 (AI_BUBBLE_CORRECTION) · Status: INACTIVE

INTERPRETABILITY_BREAKTHROUGH

A research team demonstrates mechanistic interpretability at scale — explaining model behaviour across a full domain with <20% performance degradation and automated analysis. Would fundamentally narrow the black box and break the chain at Link 1.

Severity: High (positive) · Linked to: UC-085 · Status: INACTIVE · Anthropic target: 2027

OPEN

Window Health: 100% · All 5 triggers inactive. Individual links firing at company scale (Amazon, IBM, Clinejection) but no end-to-end chain event. EU deadline approaching (August 2026). Anthropic interpretability target 2027. AI market concentration at historic levels. The measurement: not individual incidents but the chain — does a Link 1 failure enable a Link 3 exploit that produces a Link 2 cascade that triggers Link 4 repricing at market scale? Review: March 23, 2028.

The System-Level Analysis

The Chain Is Proven at Company Scale

Every link has fired independently. UC-085: interpretability at 10% resolution. UC-082: Amazon 6.3M orders lost, terraform destroy on production. UC-083: Clinejection 4,000 machines, 85% attack success. UC-084: IBM −$31B. These are not projections. They are documented events with hard numbers. The prognostic question is not whether the individual links work. It is whether they connect at a scale that matters to the broader economy — and the concentration of economic activity in AI (1/3 of S&P 500, 90%+ of GDP growth) means the scale threshold has been crossed.

The EU AI Act as Catalyst or Release Valve

August 2026 is the most important near-term date for this prognostic. If the EU enforces transparency requirements strictly, major AI providers face an impossible choice: deliver explainability the science cannot produce, accept deployment restrictions that the market will reprice, or withdraw from the EU and accept revenue loss. Any of these outcomes feeds Link 4 (market repricing). If the EU retreats or grants waivers, the regulatory vacuum persists, the security attack surface widens (Link 3), and the interpretability gap deepens (Link 1). The EU AI Act is either a catalyst that forces the chain to fire or a release valve that delays it while the underlying pressure builds.

Connection to UC-112 (The Convergence)

UC-112’s AI_BUBBLE_CORRECTION trigger measures Magnificent 7 decline ≥40% sustained ≥90 days. UC-113’s AI_MARKET_REPRICING trigger is calibrated lower (≥25%, ≥60 days) because it measures AI-specific causation rather than general market decline. If UC-113’s trigger fires, it becomes a contributing factor to UC-112’s system-level convergence. The Black Box is the mechanism by which AI risk enters The Convergence — the specific chain through which a technology failure becomes a financial event becomes a macro event.

The Models Grow Faster Than the Microscopes

Anthropic’s sparse autoencoder with 16 million latents achieves approximately 10% resolution on GPT-4. Each subsequent model generation is larger, more complex, and harder to study. The interpretability tools are improving — but the models are scaling faster. The gap between what we can explain and what we’ve deployed is not stable. It is widening. This is the structural condition that makes the chain increasingly likely to fire over time rather than less likely. The black box is getting blacker.

6/6

Dimensions Hit

10×–15×

Multiplier (Extreme)

1,260

FETCH Score

OriginD5 Quality/Opacity (75)·D4 Regulatory (70)

L1D3 Revenue/Market (68)·D1 Customer (65)·D6 Operational (62)

L2D2 Workforce (58)

CAL SourceCascade Analysis Language — AI safety system-level prognostic

-- The Black Box: AI Safety System-Level Prognostic
-- Capstone for UC-082, UC-083, UC-084, UC-085

FORAGE ai_safety_system_level
WHERE interpretability_resolution < 0.15
  AND prompt_injection_success_rate > 0.80
  AND production_destruction_events > 8
  AND market_repricing_documented = true
  AND ai_sp500_concentration > 0.30
  AND ai_gdp_contribution > 0.50
  AND eu_transparency_deadline_months < 6
  AND architectural_fix_exists = false
ACROSS D5, D4, D3, D1, D6, D2
DEPTH 4
SURFACE black_box

WATCH ai_safety_failure WHEN undetected_dangerous_capability AND harm_ge_1B = true
WATCH ai_supply_chain_mass WHEN compromised_package_1M_downloads AND persists_24h = true
WATCH eu_compliance_crisis WHEN august_2026_unmet AND enforcement_or_withdrawal = true
WATCH ai_market_repricing WHEN mag7_decline_ge_25pct AND sustained_60d AND ai_specific = true
WATCH interpretability_breakthrough WHEN scale_explanation_lt_20pct_degradation = true

DRIFT black_box
METHODOLOGY 75  -- sparse autoencoders, circuit tracing, CoT monitoring, SBOM, CVE tracking, market stress tests all exist and are improving
PERFORMANCE 25  -- 10% resolution, 85% attack success, no architectural fix for prompt injection, 73% no golden paths, EU deadline unmet, models growing faster than microscopes

FETCH black_box
THRESHOLD 1000
ON EXECUTE CHIRP prognostic "Four-link chain: interpretability deficit (10% resolution) enables security exploit (85% success) triggers production cascade (6.3M orders lost) causes market repricing ($31B documented). Each link proven at company scale. AI = 1/3 of S&P 500. AI spending = 90%+ GDP growth. Deloitte: AI spending drop could cause recession. EU AI Act August 2026 deadline: science can't deliver what regulation requires. The chain is proven. The scale threshold is crossed. The question is end-to-end execution. 5 WATCH triggers. 24 months."

SURFACE analysis AS json
SURFACE review ON "2028-03-23"

SENSED5+D4 dual origin — Quality/Opacity: LLMs fundamentally opaque. Best SAE at ~10% resolution on GPT-4. Core concepts undefined. Field split on feasibility. Hydra effect defeats ablation. CoT unfaithful. Anthropic targets 2027. Models growing faster than microscopes. Regulatory: EU AI Act August 2026 transparency mandate undeliverable. FARM Act, FTC right-to-repair (UC-111) show pattern of regulation outpacing industry ability to comply. No NIST/CISA formal classification of prompt injection as supply chain risk. SEC Chair called systemic concerns “overblown.” Upstream cases: UC-085 (Alien Autopsy, 1,232), UC-082 (Guardrail Gap, 2,603), UC-083 (Toxic Flow, 1,312), UC-084 (250 Billion Lines, 2,508). Aggregate upstream FETCH: 7,655. Market concentration: AI stocks ~1/3 S&P 500. AI spending >90% GDP growth H1 2025. Deloitte: AI spending drop could push economy into recession. Fed stress test: equities −54%. JPM: 35% recession probability.

ANALYZED3 Revenue/Market: IBM −$31B from single repricing. Amazon 6.3M orders lost. AI stocks 1/3 of S&P 500 creates systemic exposure. If market broadly reprices AI safety risk, the financial cascade enters the real economy through the wealth effect and business investment channel. D1 Customer: 90% of developers using AI tools they cannot inspect. Downstream package consumers of compromised AI-generated code have no visibility. 22,000 users reported Amazon outage. Retail investors in AI-concentrated index funds bear the repricing risk. D6 Operational: 73% of enterprises have no golden paths. Agents have production access without least-privilege. 10+ destruction events in 16 months. The operational manifestation of opacity is unpredictable failure. D2 Workforce: Few interpretability practitioners globally. Millions of deployers vs dozens of understanders. Amazon laying off 16,000 while spending $200B on AI — fewer humans to catch AI mistakes.

MEASUREDRIFT = 50 (Methodology 75 − Performance 25). The methodology is stronger than in any individual upstream case because this prognostic benefits from the combined defensive toolkit: sparse autoencoders and circuit tracing for interpretability (improving), CVE tracking and SBOM for supply chain security (established), stress testing and circuit breakers for financial markets (robust), and regulatory frameworks (EU AI Act) providing governance pressure. The performance at 25 reflects the reality that none of these tools are adequate for the problem they face: interpretability at 10% resolution, 85% attack success, no architectural fix for prompt injection, 73% without golden paths, EU deadline scientifically undeliverable, and models growing faster than every defence built to understand them. The DRIFT of 50 uses the default because the gap is proportional. The composition is informative: the methodology is genuinely improving (Anthropic’s 2024–2025 progress is real) but the deployment is outpacing the improvement.

DECIDEFETCH = 1,260 → EXECUTE (threshold: 1,000). Chirp: 66.33. DRIFT: 50. Confidence: 0.38. 3D Lens 8.7/10 (Sound 8, Space 9, Time 9). The confidence at 0.38 is appropriate for a system-level prognostic that requires the four-link chain to fire end-to-end. Each individual link has high confidence (0.85–0.88 in the upstream cases). The chain confidence is lower because it requires sequential activation: an interpretability failure must enable a specific security exploit, which must cause a specific production cascade, which must be visible enough to trigger market repricing, at a scale large enough to affect the broader economy. Each conditional reduces the probability. But the structural conditions that make the chain possible are strengthening, not weakening. Calibrated against: UC-106 (1,386, 0.42), UC-110 (1,317, 0.40), UC-112 (1,196, 0.35).

ACTPrognostic — 5 WATCH triggers, review March 23, 2028. UC-113 is the AI safety cluster capstone and the mechanism by which AI risk enters UC-112 (The Convergence). The AI_BUBBLE_CORRECTION trigger in UC-112 measures the financial outcome. UC-113 documents the causal chain that produces it: opacity → vulnerability → exploitation → cascade → repricing. If UC-113’s AI_MARKET_REPRICING trigger fires, it becomes an input to UC-112’s system-level convergence assessment. The positive trigger (INTERPRETABILITY_BREAKTHROUGH) is the only exit: if mechanistic interpretability achieves scale with <20% degradation and automated analysis, it breaks the chain at Link 1 and narrows the black box. Anthropic targets 2027. The race between understanding and deployment is the race this prognostic tracks.

Runtime: @stratiqx/cal-runtime · Spec: cal.cormorantforaging.dev · DOI: 10.5281/zenodo.18905193

Sources

[1]

StratIQX Case Library — UC-085 (The Alien Autopsy): Best interpretability tools ~10% resolution. Core concepts undefined. Field split. Anthropic 2027 target. EU AI Act August 2026. Root node for UC-082/083/084. FETCH 1,260
uc-085.stratiqx.com
March 19, 2026

[2]

StratIQX Case Library — UC-082 (The Guardrail Gap): Amazon 6.3M lost orders. 10+ destruction events across 6 tools. 90% devs using AI, 73% no golden paths. Code velocity 10×, safety 2019. FETCH 1,260
uc-082.stratiqx.com
March 19, 2026

[3]

StratIQX Case Library — UC-083 (The Toxic Flow): 85% attack success rate. 24+ CVEs across AI IDEs. 22 patterns, 12 tools. Clinejection 4,000 machines. Meta: prompt injection “fundamental unsolved weakness.” FETCH 1,260
uc-083.stratiqx.com
March 19, 2026

[4]

StratIQX Case Library — UC-084 (The 250 Billion Lines): IBM −$31B market repricing. AI dissolving moats built on complexity. Market violently reprices when black box implications become visible. FETCH 1,260
uc-084.stratiqx.com
March 19, 2026

[5]

Deloitte Insights, “Global Economic Outlook 2026” — US GDP 1.9%. AI spending >90% of GDP growth. Drop in AI spending could push economy into recession. Risks tilted downside
deloitte.com
January 16, 2026

[6]

Newsweek, “Countries Most at Risk of Recession in 2026” — AI stocks 1/3 of S&P 500 (Bank of England). AI 90%+ of GDP growth (Harvard/Furman). Heavy leverage in AI and crypto
newsweek.com
December 30, 2025

[7]

Federal Reserve Board, “2026 Stress Test Scenarios” — Severely adverse: unemployment 10%, equities −54%, CRE −39%, VIX 72. 32 banks tested
federalreserve.gov
February 4, 2026

[8]

Bloomberg / Fidelity, “Stock Market Predictions 2026” — Fidelity: “disconnect between positive short-term environment and broader structural instability.” JPMorgan: biggest risk is “not having exposure” to AI
bloomberg.com
January 1, 2026

[9]

StratIQX Case Library — UC-112 (The Convergence): System-level macro prognostic. AI_BUBBLE_CORRECTION trigger (Mag 7 ≥40%, 90 days). UC-113 feeds this trigger through the four-link chain. FETCH 1,260
uc-112.stratiqx.com
March 23, 2026