Evaluation mode · Experimental

M3-AC — All-cards mode

Model receives every card in a document concatenated. Tests retrieval-without-oracle.

In one paragraph

All 35 NOAA cards (or all 186 V27 cards) concatenated and served to the model in one prompt. Tests whether models can locate and answer from a document-scale card bundle without pre-selection. NOAA bundle fits in frontier-tier context; V27/V35 bundles overflow most contexts.

How the inputs are generated

Generation · 01

Generator script

evaluation_runs/harness/core.py:load_all_cards

Input sources

• All cards in a document (active variant)

AI use

No — pure deterministic transformation

OCR / re-OCR

Inherits from the upstream pipeline variant

Approximate processing time

Negligible bundling time; model inference: ~10-30 seconds per cell on bundles that fit context.

Resource intensity

Medium — model inference or moderate I/O

Determinism

Deterministic (same input → same output, byte-identical)

Introduced

Cycle 4, 2026-05-21.

Related variants

Cross-reference · 06

← Back to all variants