This model, released last night, is quite something:
It’s a qwen-14 distilled by deepseek r1, and then fine-tuned to code by Together AI.
Together AI claims they match o3-mini-2025-01-031 (Low) and o1-2024-12-17.
The good thing is that they opened the source AND data so people can see what they did there.
And obviously, this is now the default code model on Blablador!
It’s a reasoning model, so you will hear echoes of the system prompt (birthday party) in there.
It also have a 64k context on our hardware, which is nice!
Let’s keep barking!
Alex