Feels like we're about a year away from local LLMs that can debug code reliably (hooked into console error output as well), which will be quite an exciting day.
Have you tried Code Llama? How do you know it can't do it already?
In my applications, GPT-4 connected to a VM or SQL engine can and does debug code when given error messages. "Reliably" is very subjective. The main problem I have seen is that it can be stubborn about using outdated APIs, and it's not easy to give it a search result with the correct API. But with a good web search and up-to-date APIs, it can do it.
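For what it's worth, the loop being described is pretty simple to sketch: run the code, capture stderr, and feed the error plus the source back to the model. Here's a minimal, hypothetical Python version; ask_llm() is a stand-in for whatever model call you use (GPT-4, Code Llama, etc.), not any particular library's API:

    import subprocess

    def ask_llm(prompt: str) -> str:
        """Hypothetical: send the prompt to your model and return its reply."""
        raise NotImplementedError

    def debug_loop(path: str, max_attempts: int = 3) -> bool:
        for attempt in range(max_attempts):
            # Run the script and capture its console/error output.
            result = subprocess.run(
                ["python", path], capture_output=True, text=True
            )
            if result.returncode == 0:
                return True  # script ran cleanly; nothing left to fix

            # Feed the source plus the error message back to the model
            # and ask for a corrected version of the whole file.
            source = open(path).read()
            prompt = (
                "This Python script fails with the error below. "
                "Return a corrected version of the whole file.\n\n"
                f"--- script ---\n{source}\n\n--- stderr ---\n{result.stderr}"
            )
            with open(path, "w") as f:
                f.write(ask_llm(prompt))
        return False

The stubborn-about-outdated-APIs problem shows up exactly here: if the model's fix uses a deprecated call, the loop just keeps failing, which is why injecting current docs or search results into the prompt helps so much.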
I'm interested to see general coding benchmarks for Code Llama versus GPT-4.