Claude’s new model is more ‘honest’ when it messes up

The Claude logo with an overlay of a smartphone on an orange background.

Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model’s “honesty.”

According to Anthropic, it trains “all [its] models to be honest – for instance, to avoid making claims that they can’t support.” But it notes that “a general problem with AI models is that they sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence.”

The AI lab claims that early testers have found that Opus 4.8 “is more likely to flag uncertainties about its work and less likely to make unsupported claims.” In the company’s evaluations, Opus 4.8 is “around 4x less likely than its predeces …

Read the full story at The Verge.

