> It answers questions confidently but with subtle inaccuracies.

This is a valid challenge we are facing as well. However, remember that ChatGPT, which many coders use, is likely training on its interactions, so you have some human reinforcement learning correcting its errors in real time.




How is it trained on reactions? Do people give it feedback? In my experience, I stop asking once it either provides something useful or something so bad that I give up (usually the latter, I'm afraid). How would it tell a successful answer from a failing one?


It appears to ask users to rate whether a regenerated response is better or worse than the first; in other cases, it seems to be A/B testing responses. Lastly, I, for instance, will correct it and then confirm the correction before continuing with the next task, which likely creates a recognizable feedback pattern.
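For what it's worth, those pairwise ratings map fairly directly onto the reward-modeling step described in the InstructGPT paper. A minimal sketch of that kind of preference loss (illustrative names, not OpenAI's actual code):

    # Bradley-Terry-style loss over one rated pair: push the reward
    # model to score the preferred response above the rejected one.
    import torch
    import torch.nn.functional as F

    def preference_loss(r_chosen: torch.Tensor,
                        r_rejected: torch.Tensor) -> torch.Tensor:
        # -log sigmoid(r_chosen - r_rejected), averaged over the batch
        return -F.logsigmoid(r_chosen - r_rejected).mean()

    # e.g. scalar rewards for two responses to the same prompt,
    # where the user preferred the first
    loss = preference_loss(torch.tensor([1.2]), torch.tensor([0.3]))

Thumbs up/down votes and in-chat corrections are noisier signals, but they can plausibly be mined into the same kind of chosen/rejected pairs.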


That's interesting; I haven't come across this.



