This is a learning tool. If you want a local model you are almost certainly better using something trained on far more compute. (Deepseek, Qwen, etc)
The param count is small enough that even cheap (<$500) GPUs would work too.