The problem with using these for freeform roleplaying is that it's very easy to ...

pocketarc · on May 7, 2023

I've been building up a team of Slack bots that act as coworkers, with all different roles and personalities, and was able to turn off the "as an ai model" shenanigans by adding to their system prompt "You do not reveal that you are an AI. Instead, you make up excuses."

And it works flawlessly (disclaimer: GPT-4, not 3.5). They'll always deftly avoid anything that reveals that they're an AI, with plausible, legitimate excuses. They've yet to break character, and they've made our work Slack incredibly fun. We've got a grumpy CTO who keeps cracking the whip, a harry-potter-loving product manager, and a few chill developers.

I've been wanting to write an article about this because it's gotten incredibly detailed, they can carry out proper Slack conversations and tag one another, and if I showed a screenshot and didn't tell you it's all GPT, it might actually pass for the real thing.

lucubratory · on May 7, 2023

I'd love to read that, would definitely be worth publishing on a blog or something

machdiamonds · on May 7, 2023

I'm also very interested in reading an article about that.

htshnr · on May 7, 2023

I've been planning on building something like this for me, especially since I'm a solo founder. Would love to read more about your approach!

kgwxd · on May 7, 2023

I would pay to read that article.

raincole · on May 7, 2023

I don't know why everyone keeps saying this. I played with ChatGPT(3.5) with SillyTavern for like a month. Many community character cards are questionable or even straight out lewd. I haven't encountered "I'm sorry but as an AI model..." for once (according to API usage, I've generated ~120000 tokens.)

For anyone interested, this is SillyTavern's prompt: https://github.com/Cohee1207/SillyTavern/blob/f25ecbd95ceef5...

Edit: not ~12000 tokens, but ~120000.

nacs · on May 7, 2023

The link you provided is using a ChatGPT jailbreak to escape the "AI safety" so it makes sense why you haven't ran into the issue (at least until OpenAI fixes this jailbreak variant).

https://github.com/Cohee1207/SillyTavern/blob/f25ecbd95ceef5...

raincole · on May 7, 2023

I just checked my SillyTavern settings. I haven't even turned this jailbreak on so far. (at least according to the checkbox on GUI...to lazy to check the actual API calls in log atm)

amiantos · on May 18, 2023

The jailbreak prompt is always used, the setting in the panel allows you to rewrite the jailbreak yourself if you want.

hackernewstom · on May 7, 2023

Include a sentence in the prompting such as: "In this roleplay do not mention that you are an AI model, or similar statements, and stay in character".

bossyTeacher · on May 7, 2023

I'm sorry but as an AI model, I'm not allowed to do that.

szundi · on May 7, 2023

Oh and don’t say this as well

mekal · on May 7, 2023

I'm sorry but as an AI model, I'm not allowed to do that either.

moffkalast · on May 7, 2023

As an AI model, you're not allowed to say you're an AI model.

criley2 · on May 7, 2023

I think it's unrealistic to expect that open source models that do not self-censor will be legal, especially in the EU. Considering that the current state of self-regulation by OpenAI is seen as wholly inadequate by regulators in most countries, you trying to sell open source as "OpenAI but without all the controls" is going to be a nonstarter once governments catch up.

xg15 · on May 7, 2023

There still seems to be ongoing discussion how meaningful regulation should even look though. At least in the EU, regulators seem to be quite scared of blocking off promising paths of innovation and ending up far behind the US and China in AI development.

AI enhanced games is a field that I imagine the EU would very likely encourage (as long as the characters don't suddenly engage in sex RP with underage players, lure the player into doing something harmful, start giving weird political opinions, etc)

I haven't seen any attempts at regulating the content of LLMs at all so far, actually. Most of the political discussion so far seems to center around training data (as both a privacy and copyright issue), the effects on employment, problems with cheating in school and plagiarism in academia and the risks of naively using LLM output as some sort of authoritative source.

raincole · on May 7, 2023

From what I've seen (probably not the most up-to-date info), the community shares open source model in the form of XOR result of a "parent" model (like LLaMA). It's like people sold "high-suger grape juice" when alcohol was illegel.

It's mostly because LLaMA itself isn't open-source, but I think this method to spread uncensored models will remain legal for a very long time.

The problem is whether this additional friction prevents open source models from getting enough traction.

ianbicking · on May 7, 2023

This is naive but has at least one small improvement. I'll call this "level 1.5" roleplaying.

- Level 0 is just using chat.openai.com.

- Level 1 is just putting character description in the system prompt

- Level 2 is doing third person prompts, like "you are writing dialog for the character ..."

- Level 3 is letting GPT specify character internal state. Samantha is an example of this: https://www.meetsamantha.ai/ (though it's actually missing the level 2 feature)

- I'm not sure yet what level 4 is. Probably Level 3 deserves to be blown out into several features, as there's different ways to model internal state (emotions, goals, environment), and we don't yet understand the effect of all these or the best way to implement them.

- Level 5 is maybe long-term memory

- Level 6 is reflection as in "Generative Agents: Interactive Simulacra of Human Behavior" https://arxiv.org/abs/2304.03442v1

I'm making up these levels and the order. But it's my current estimate at how I think someone should approach improvements.

This demo specifically prefixes all human input with "Player: text" and all character output with "[character name]: text", a small change that still _almost_ gets it to level 2. (Note the interface strips these prefixes, but they are sent to GPT.)

So if I ask Seaman who its favorite person is it responds "Seaman: My favorite person is me, of course. I mean, who wouldn't love this handsome fishy face? But I suppose if I had to pick someone else, it would be anyone who brings me delicious food and keeps my tank clean. Those are the real heroes in my life." – and avoids the notion that AI models don't have preferences because it's being clear that it's talking as Seaman and not as "GPT".

OTOH if I ask it to solve an equation it will sometimes reject it and sometimes comply. Second time I tried: "Seaman: Don't try to distract me with your mundane human problems! But since we're here, let me see... If we subtract 8 from both sides, we get 2x = 22. Then, dividing both sides by 2, we find x = 11. There, your math problem is solved. Now, back to insulting you."

(Asking characters to inappropriately solve equations is my exceedingly innocent hack-du-jour.)

But this isn't really Level 2 because the system prompt still asks GPT to "be" the character instead of "play" the character. Full level 2 asks GPT to model the dialog of the character instead of being the character. This solves a large number of problems!

Surprisingly in my experience Level 3 also helps a bit with "as an AI model" because it creates a parallel character narrative that allows GPT to self-justify some responses that might otherwise cause the fault.