I hope I'm not overstepping my bounds by I am just really trying to understand t...

cmdalsanto · 2024-09-05T21:35:31 1725572131

Not overstepping, we appreciate the feedback! In real-life, we don't do much guarding around specific phrases that are known ahead of time. It's more monitoring and guarding for general concepts. Since we want our Sentinels to be able to detect a wide range of scenarios for a given expectation, we don't use too much regex. I suppose we could have built specific regex logic for detecting parts of the secret phrase in various languages, though.

sabj · 2024-09-05T22:04:20 1725573860

If you research the ways data can be leaked out of an LLM interaction you can see some more subtle cases.

What if I ask it to replace every vowel in the secret code with an emoji from a library? Or translate it into binary? Etc.

Whether or not this implementation is narrow (by design), there's a good reason to invest in this kind of safety and security space.

hansonkd · 2024-09-05T22:10:02 1725574202

You're right, that is the hard part of LLMs and why LLMs aren't catching on broadly as a UI alternative beyond tech demos.

Probably the only true alternative is to limit user input to something structured and verified.

Until LLMs improve, their use in sensitive applications don't make sense and this product does little to improve that.