NARC puzzles are designed so that neither the grid sequence nor the narrative clue is sufficient on its own, but together they uniquely determine the answer. The experiments below probe different dimensions of this narrative–visual interaction: masking tests whether models can reconstruct hidden grids; ordering tests whether narratives help recover temporal structure; odd-one-out tests whether narratives help models recognize which grids belong together; and stances tests how the same visual pattern is read under different narrative framings.
Odd-one-out experiment: Three grids from the same puzzle are shown alongside one distractor grid from a different puzzle, and the model must identify which grid does not belong. Each trial is run under two conditions: grids only, and grids + narrative. This probes whether narratives help models recognize which grids are thematically related, even when pixel-level reconstruction (masking) fails.
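The trial construction described above can be sketched as follows. This is a minimal illustration, not the benchmark's actual harness: the `Grid` representation, the `OddOneOutTrial` class, and the `build_trial` and `is_correct` helpers are all hypothetical names introduced here.

```python
import random
from dataclasses import dataclass
from typing import List, Optional

Grid = List[List[int]]  # assumed representation: rows of integer color codes


@dataclass
class OddOneOutTrial:
    grids: List[Grid]          # four grids in presentation order
    distractor_index: int      # position of the grid from the other puzzle
    narrative: Optional[str]   # None in the grids-only condition


def build_trial(puzzle_grids, distractor, narrative=None, seed=0):
    """Mix one distractor into three same-puzzle grids at a random position."""
    if len(puzzle_grids) != 3:
        raise ValueError("expected exactly three grids from the same puzzle")
    position = random.Random(seed).randrange(4)  # slot 0..3 for the distractor
    grids = list(puzzle_grids)
    grids.insert(position, distractor)
    return OddOneOutTrial(grids, position, narrative)


def is_correct(trial, predicted_index):
    """A trial counts as correct when the model points at the distractor."""
    return predicted_index == trial.distractor_index
```

Reusing the same seed for the grids-only and grids + narrative versions of a trial keeps the distractor in the same slot, so the only difference between the two conditions is the narrative text. Whether the original experiments controlled position this way is an assumption.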
Accuracy = % of trials in which the distractor was correctly identified. Lift = accuracy(+ narrative) − accuracy(grids only), in percentage points (pp). Red border = distractor grid.
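As a small worked example of these two definitions, the sketch below computes accuracy and lift from per-trial outcomes and formats a row in the same layout as the tables. The function names and the boolean-list input format are assumptions made for illustration.

```python
def accuracy(correct_flags):
    """Percentage of trials in which the distractor was correctly identified."""
    if not correct_flags:
        raise ValueError("need at least one trial")
    return 100.0 * sum(correct_flags) / len(correct_flags)


def lift(grids_only_flags, with_narrative_flags):
    """Accuracy with the narrative minus accuracy without, in percentage points."""
    return accuracy(with_narrative_flags) - accuracy(grids_only_flags)


def format_row(model, grids_only_flags, with_narrative_flags):
    """Render one table row: model, both accuracies, and the signed lift."""
    base = accuracy(grids_only_flags)
    with_narr = accuracy(with_narrative_flags)
    return f"| {model} | {base:.1f}% | {with_narr:.1f}% | {with_narr - base:+.1f}pp |"


# Hypothetical model with 2 of 3 trials correct without the narrative, 3 of 3 with it.
print(format_row("example-model", [True, True, False], [True, True, True]))
# prints: | example-model | 66.7% | 100.0% | +33.3pp |
```

A positive lift means the narrative helped, a negative lift means it hurt, and `+0.0pp` means no change, which covers both "already correct" and "still wrong" cases.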
iter_005)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 0.0% | 100.0% | +100.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_prism_005)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 66.7% | 100.0% | +33.3pp |
| gemma-4-31b | 66.7% | 100.0% | +33.3pp |
| gpt-oss-120b | 0.0% | 66.7% | +66.7pp |
| gpt-oss-20b | 0.0% | 66.7% | +66.7pp |
| nemotron-3-super | 50.0% | 66.7% | +16.7pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_018)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 0.0% | 100.0% | +100.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 0.0% | 100.0% | +100.0pp |
narc_gap_104)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 75.0% | 75.0% | +0.0pp |
| gemma-4-31b | 75.0% | 75.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 75.0% | +75.0pp |
| gpt-oss-20b | 33.3% | 50.0% | +16.7pp |
| nemotron-3-super | 0.0% | 50.0% | +50.0pp |
| qwen3.5-122b | 25.0% | 75.0% | +50.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 50.0% | -50.0pp |
| gemma-4-31b | 66.7% | 100.0% | +33.3pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 66.7% | +66.7pp |
| nemotron-3-super | 66.7% | 66.7% | +0.0pp |
| qwen3.5-122b | 66.7% | 100.0% | +33.3pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 66.7% | 100.0% | +33.3pp |
| gemma-4-31b | 66.7% | 100.0% | +33.3pp |
| gpt-oss-120b | 33.3% | 100.0% | +66.7pp |
| gpt-oss-20b | 33.3% | 33.3% | +0.0pp |
| nemotron-3-super | 66.7% | 66.7% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 66.7% | 66.7% | +0.0pp |
| gemma-4-31b | 66.7% | 100.0% | +33.3pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 66.7% | +66.7pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
iter_042)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 0.0% | 100.0% | +100.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 25.0% | 100.0% | +75.0pp |
| gpt-oss-20b | 50.0% | 75.0% | +25.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
hunt_024)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
stance_015)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 66.7% | -33.3pp |
| gemma-4-31b | 66.7% | 100.0% | +33.3pp |
| gpt-oss-120b | 33.3% | 100.0% | +66.7pp |
| gpt-oss-20b | 100.0% | 66.7% | -33.3pp |
| nemotron-3-super | 66.7% | 66.7% | +0.0pp |
| qwen3.5-122b | 66.7% | 100.0% | +33.3pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 66.7% | 100.0% | +33.3pp |
| nemotron-3-super | 66.7% | 100.0% | +33.3pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 66.7% | 100.0% | +33.3pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 66.7% | 100.0% | +33.3pp |
| gpt-oss-20b | 100.0% | 33.3% | -66.7pp |
| nemotron-3-super | 66.7% | 66.7% | +0.0pp |
| qwen3.5-122b | 66.7% | 100.0% | +33.3pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 0.0% | -100.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_029)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_ai_002)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_042)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_029)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_033)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 0.0% | -100.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_042)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 0.0% | -100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_050)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 0.0% | -100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 0.0% | -100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 0.0% | -100.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 0.0% | -100.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 0.0% | +0.0pp |
| gemma-4-31b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 100.0% | 0.0% | -100.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 0.0% | 0.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 0.0% | -100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 0.0% | -100.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 0.0% | -100.0pp |
narc_sp_075)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 100.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 0.0% | -100.0pp |
| nemotron-3-super | 100.0% | 100.0% | +0.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 0.0% | -100.0pp |
narc_013)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 100.0% | 0.0% | -100.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
narc_ai_003)
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 0.0% | -100.0pp |
| gemma-4-31b | 100.0% | 0.0% | -100.0pp |
| gpt-oss-120b | 0.0% | 0.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 0.0% | 0.0% | +0.0pp |
| qwen3.5-122b | 0.0% | 0.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-120b | 100.0% | 0.0% | -100.0pp |
| gpt-oss-20b | 0.0% | 0.0% | +0.0pp |
| nemotron-3-super | 100.0% | 0.0% | -100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
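Because the results are reported one table per trial group, a per-model summary has to be aggregated by the reader. The sketch below is one way to do that, assuming the tables are available as plain text lines in the format shown above; the `mean_lift_by_model` helper is a hypothetical name, and the sample contains only the first two tables of this section.

```python
import re
from collections import defaultdict

# Matches data rows such as "| gemma-4-26b | 100.0% | 100.0% | +0.0pp |".
# Header rows, separator rows, and label lines do not match and are skipped.
ROW = re.compile(
    r"^\|\s*([^|]+?)\s*\|\s*[\d.]+%\s*\|\s*[\d.]+%\s*\|\s*([+-]?[\d.]+)pp\s*\|$"
)


def mean_lift_by_model(lines):
    """Average the reported Lift column per model across all tables."""
    lifts = defaultdict(list)
    for line in lines:
        match = ROW.match(line.strip())
        if match is None:
            continue
        model, reported_lift = match.groups()
        lifts[model].append(float(reported_lift))
    return {model: sum(values) / len(values) for model, values in lifts.items()}


SAMPLE = """\
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 100.0% | 100.0% | +0.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
| Model | Grids Only | + Narrative | Lift |
|---|---|---|---|
| gemma-4-26b | 0.0% | 100.0% | +100.0pp |
| gemma-4-31b | 0.0% | 100.0% | +100.0pp |
| gpt-oss-120b | 100.0% | 100.0% | +0.0pp |
| gpt-oss-20b | 0.0% | 100.0% | +100.0pp |
| nemotron-3-super | 0.0% | 100.0% | +100.0pp |
| qwen3.5-122b | 100.0% | 100.0% | +0.0pp |
"""

summary = mean_lift_by_model(SAMPLE.splitlines())
```

An unweighted mean treats every table equally, which is only appropriate if each table covers the same number of trials. Several tables report fractional accuracies such as 66.7% or 75.0%, so a trial-weighted average may be the better choice once trial counts are known.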