On HumanEval, I see 90.2 for GPT-4o and 89.0 for DeepSeek v2.5.
- https://blog.getbind.co/2024/09/19/deepseek-2-5-how-does-it-...
- https://paperswithcode.com/sota/code-generation-on-humaneval