There's probably a pretty big difference in system prompt between using the apps and hitting the API, not that that's necessarily what's happening here. Also, I think OpenClaw supports other models; it's open source, and it would be pretty easy to fork it and add a new model provider.
Why wouldn't the system prompt be controlled on the server side of the API? I agree with https://news.ycombinator.com/item?id=47010577 ; I think results like this more likely come from "roleplaying" (lightweight jailbreaking).
The websites and apps probably have a system prompt telling the model to be more cautious with stuff like this, so that AIs look more credible to the general public. The API might not.
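For what it's worth, over the raw API the system prompt is a field the caller supplies (e.g. the top-level `system` parameter in Anthropic's Messages API), so there's nothing forcing a server-side consumer prompt on you the way the chat apps layer one in. A minimal sketch of the payload shape; the model name is just an illustrative placeholder:

```python
import json

def build_request(user_msg, system_prompt=None):
    """Assemble a Messages-API-style payload; `system` is entirely client-chosen."""
    payload = {
        "model": "claude-example-model",  # placeholder, not a real model id
        "max_tokens": 1024,
        "messages": [{"role": "user", "content": user_msg}],
    }
    if system_prompt is not None:
        # Omit this and the request simply carries no system prompt at all.
        payload["system"] = system_prompt
    return payload

print(json.dumps(build_request("hello", system_prompt="You are a terse assistant."), indent=2))
```

So "controlled on the server side" and "the API has no system prompt" aren't contradictory: the model may still have baked-in training-time behavior, but the app-style prompt wrapper is up to whoever makes the call.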
Yeah, I'm pretty confused by this statement. Though I'm also pretty sure that if you construct the right fake scenario[0] you can get the regular Claude/ChatGPT interfaces to write something like this.
[0] fiction writing, fighting for a moral cause, counterexamples, etc.
OpenClaw runs with an Anthropic/OpenAI API key though?