The whole problem is that they are not neutral. They token-complete based on the corpus that was fed into them, the dimensions extracted from that corpus, and the curve-fitting applied to those dimensions. Being "completely transparent" means exposing _all_ of that, but that's far too much for anyone to reasonably understand without becoming an expert in that particular model.
And then we're right back to "trusting expert human beings" again.
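As a toy illustration of what "token-completing based on a corpus" means (the corpora below are invented for the example), two bigram models trained on different text greedily complete the same prompt in different directions, so neither completion is neutral:

```python
# Toy sketch: two bigram models complete the same prompt
# differently, depending only on what text they were trained on.
from collections import Counter, defaultdict

def train_bigram(corpus: str) -> dict:
    """Count word-to-next-word transitions in a corpus."""
    words = corpus.split()
    counts = defaultdict(Counter)
    for a, b in zip(words, words[1:]):
        counts[a][b] += 1
    return counts

def complete(model: dict, word: str, steps: int = 3) -> str:
    """Greedily pick the most frequent next word, `steps` times."""
    out = [word]
    for _ in range(steps):
        if word not in model:
            break
        word = model[word].most_common(1)[0][0]
        out.append(word)
    return " ".join(out)

corpus_a = "the market is efficient because the market is efficient and fair"
corpus_b = "the market is rigged because the market is rigged and broken"

print(complete(train_bigram(corpus_a), "the"))  # the market is efficient
print(complete(train_bigram(corpus_b), "the"))  # the market is rigged
```

A real model's corpus is billions of times larger, which is exactly why "just expose all of it" doesn't scale to a human reader.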
Nothing is truly neutral. Humans each have a different corpus, too. We roughly know what data has gone in, what the RL process looks like, and how the models handle a given ethical situation.
With good prompting, the SOTA models already act in ways I think most reasonable people would agree with, and that's without anyone trying to build them specifically for that use case.
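A minimal sketch of what that kind of prompting can look like, assuming the OpenAI Python SDK (the model name and system prompt are illustrative, not a specific recommendation):

```python
# Minimal sketch: steering model behavior with a system prompt.
# Assumes the OpenAI Python SDK and OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",  # illustrative; any current SOTA chat model
    messages=[
        {
            "role": "system",
            "content": (
                "When a question involves a contested ethical judgment, "
                "lay out the major positions fairly, state the trade-offs, "
                "and flag your own uncertainty rather than picking a side."
            ),
        },
        {
            "role": "user",
            "content": "Is it ever acceptable to lie to protect someone?",
        },
    ],
)
print(response.choices[0].message.content)
```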
And then we're right back to "trusting expert human beings" again.