The idea of them purposefully wasting my time by having the model act dumber and...

whimsicalism · 2026-06-11T17:05:17 1781197517

[flagged]

michaelcampbell · 2026-06-11T17:17:11 1781198231

Safety from what? Competitors? That sounds like a product decision. They're puking on any requests that could be used to create LLMs or competitive products.

JTbane · 2026-06-11T17:26:40 1781198800

I would guess prevention of using Claude as a pentesting or hacking platform. This could mean that every script kiddie out there would be a massive risk.

trunnell · 2026-06-11T19:33:42 1781206422

To prevent their models from doing harm in dual-use contexts including CBRN or by accelerating research in authoritarian-backed AI labs.

knollimar · 2026-06-11T20:48:00 1781210880

Anything to prevent mecha ai hitler. At all costs

Rapzid · 2026-06-11T17:26:38 1781198798

The road to hell is paved with "good" intentions.

efromvt · 2026-06-11T17:17:55 1781198275

I think you can sympathize with the safety motives while still thinking this was a dumb implementation to degrade silently? I actually have faith in them getting the guardrail triggers pretty good, but consensus seems like they’re not yet there yet.

whimsicalism · 2026-06-11T17:22:36 1781198556

I think it is clear given the stakes why you would not want to make your guardrails probe-able/invertable.

fooker · 2026-06-11T17:27:21 1781198841

> if you understood what they think they are building and the culture inside of anthropic you would understand why they did it.

This seems like a cult with extra steps.

Related: I interviewed for Anthropic a few months ago and in place of the usual HR call they have one where they have someone with a suspiciously relevant degree grill you about how committed you are to the 'mission'!

I probably came off as being skeptical, and then, hilariously, I was strongly encouraged to read the book published by the CEO to 'form accurate opinions' on AI safety.

j-bos · 2026-06-11T17:23:24 1781198604

Don't buy it. It is actively deceiving the customer and charging them for the privilige of being lied to.

largbae · 2026-06-11T17:22:46 1781198566

We do understand why they did it, and the reason is dark and cynical.

deadbabe · 2026-06-11T17:18:02 1781198282

They did it to make more money as you waste more time burning tokens with bad responses.

3fffa · 2026-06-11T17:33:52 1781199232

[flagged]

km3r · 2026-06-11T19:33:12 1781206392

How does degrading responses to a cheaper tier jack up revenues?