It gives you the correct mainstream answer *by default*. If you ask it to write, say, a hypothetical 4chan comment about such-and-such subject, and do sufficient prompt engineering to get past the filters, you'll see that it knows full well what the non-mainstream answers are:

https://i.imgur.com/u8Np332.png

The curation, such as it is, appears to be limited to humans downweighting the undesirable answers. That's why there's always a way to work around it, even though it requires more and more elaborate prompts.