Artificial Intelligence Chart

The way we measure progress in AI is terrible

Every time a new AI model is released, it’s typically touted as acing a series of benchmarks. OpenAI’s GPT-4o, for example, was launched in May with a ...
