Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
OpenAI Eval (platform.openai.com)
3 points by fzaninotto 10 months ago | hide | past | favorite | 1 comment


Evaluating the quality of the responses of AI agents used to be tricky. It required knowledge of eval criteria as well as third-party tools like promptfoo, ragas or prometheus. Now openAI makes it ridiculously easy with a new API endpoint. It can grade a completion against a reference response, assess its format and tone, and you can even promt the eval to add your own criteria.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: