On benchmarks, Opus 4.8 is a step up rather than a leap. It scores 88.6% on SWE-bench Verified (vs. 87.6% for Opus 4.7), 69.2% on the harder SWE-bench Pro (vs. 64.3%), and 74.6% on Terminal-Bench 2.1 ...
On Thursday, Anthropic released Opus 4.8, the newest version of its most advanced publicly available model. The model is available everywhere, with standard pricing at the same level as the previous ...
For the fastest way to join Tom's Guide Club enter your email below. We'll send you a confirmation and sign you up to our newsletter to keep you updated on all the latest news.
7-0 wipeout: I put ChatGPT-5.5 vs Claude 4.7 through 7 impossible tests — and the results shocked me
For the fastest way to join Tom's Guide Club enter your email below. We'll send you a confirmation and sign you up to our newsletter to keep you updated on all the latest news.
CNET editor Gael Fashingbauer Cooper, a journalist and pop-culture junkie, is co-author of "Whatever Happened to Pudding Pops? The Lost Toys, Tastes and Trends of the '70s and '80s," as well as "The ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results