my point is that LLMs are already potentially seeing solution on github, so you can't use that benchmark as metric unless there is some explanation.
reply
my point is that LLMs are already potentially seeing solution on github, so you can't use that benchmark as metric unless there is some explanation.