It's apparently really hard to objectively measure/report the "truthiness" of LLM results
Allowing an LLM to "improvise" and be a bit fast-and-lose is unfortunately a necessary ingredient in how they currently work.
It's apparently really hard to objectively measure/report the "truthiness" of LLM results
Allowing an LLM to "improvise" and be a bit fast-and-lose is unfortunately a necessary ingredient in how they currently work.