It gives you the correct mainstream answer by default. If you ask it to write, say, a hypothetical 4chan comment about such-and-such subject, and do sufficient prompt engineering to get past the filters, you'll see that it knows full well what the non-mainstream answers are:
The curation, such as it is, appears to be limited to humans downweighing the undesirable answers. Which is why there's always a way to work around it, even though it requires more and more elaborate prompts.
https://i.imgur.com/u8Np332.png
The curation, such as it is, appears to be limited to humans downweighing the undesirable answers. Which is why there's always a way to work around it, even though it requires more and more elaborate prompts.