Benchmarks

Useful local AI starts with honest numbers.

These charts are the benchmark source for this page: Artificial Analysis screenshots dated May 30, 2026. No outside benchmark data is mixed in.

Top score shown

0.0

The leading AI system in the index screenshot.

Visible range

0.0–0.0

The spread across the AI systems shown in the index chart.

Output text volume

0M–0M

Tokens generated during the evaluation. It shows effort, not a score.

the spread, drawn to scaleArtificial Analysis Intelligence Index

8.5 smallest shown

≈15 small local modelsthe class that runs on a box

45.8 best cloud system

01020304050

the gap, drawn to scale: 8.5 to 45.8 on the same index

Local models live toward the left of this line, the largest cloud systems at the right. That distance is the whole honest pitch: useful, private, and not pretending to be the top bar.

Higher is better

The index and evaluation charts rank how well different AI systems perform on the test mix.

Answer volume is context

The answer-volume chart is not an intelligence score. It shows the amount of generated text behind the index.

Local has limits

Smaller local AI can be useful and private while still sitting below the largest cloud systems on hard tests.

Overall comparison

Artificial Analysis Intelligence Index

A top-line score across the Artificial Analysis evaluation mix. Higher is better, and the spread shows why Offline Base stays honest about what smaller local AI can and cannot replace.

Answer volume

Answer text produced during the index

This chart shows how much answer text was produced during the evaluation. It is useful context when comparing practical effort across larger and smaller systems.

Output text volume used to run Artificial Analysis Intelligence Index bar chart dated May 30 2026.

Per-test detail

Intelligence evaluations

The detailed panels break the index into individual tests, including planning, coding, long documents, knowledge, scientific reasoning, instruction following, and image reasoning.

The product takeaway

Offline Base is not pretending a desk box is the largest cloud system.

The benchmark story is straightforward: bigger systems win the hardest tests, but local AI is still useful for private drafts, summaries, search, and focused work where ownership matters.

That is why the product promise is local by default, clear about outside help, and honest about the limits before you buy.

See Base Router How consent works