Gpt-3.5-turbo-instruct had something like 5(or less) illegal moves in 8205 https... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

og_kalu 38 days ago | parent | context | favorite | on: Something weird is happening with LLMs and Chess

Gpt-3.5-turbo-instruct had something like 5(or less) illegal moves in 8205

https://github.com/adamkarvonen/chess_gpt_eval

I expect the rest to be much worse if 4's performance is any indication

gs17 37 days ago [–]

And the most notable part of that:

> Most of gpt-4's losses were due to illegal moves

3.5-turbo-instruct definitely has some better chess skills.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact