i feel like there's a general misunderstanding about open source models.
most people use a frontier model, switch the api request to open source model, see poor performance, and then churn off.
this will never work. you have to spend the time to handhold these models in the tasks you're trying to accomplish. basically every coding agent that you use is tuned to output prompts in the format that these frontier models except, and perform best on.
if you invest the time to add custom prompting for these OSS models, you'll see the improvement performance, but it'll never work out of the box.