Level
[No level filter available for this benchmark]
Type

Cost

10+ $/1m tokens

The Visual Reasoning Benchmark

We've taken our work on visual maths further, now using questions from Zambia and India to test AI models on non-verbal reasoning tasks - key for foundational numeracy which follows a “Concrete, Pictorial, Abstract” methodology. We made The Visual Reasoning Benchmark to test if AI models can answer genuine visual questions faced by end-of-primary students in LMICs. Find out more here.

Loading...
Loading Results....