Comment by strangescript

strangescript 6 months ago

Curious why you went with Phi as the default model. That seems a bit unusual compared to current trends.

codingmoh 6 months ago

I went with Phi as the default model because, after some testing, I was honestly surprised by how high the quality was relative to its size and speed. The responses felt better in some reasoning tasks, while running on way less hardware.

What really convinced me, though, was the focus on the kinds of tasks I actually care about: multi-step reasoning, math, structured data extraction, and code understanding. There's a great Microsoft paper on this, "Textbooks Are All You Need", and solid follow-ups with Phi-2 and Phi-3.

jasonjmcghee 6 months ago

Agreed. I thought qwen2.5-coder was kind of the standard line of small, non-reasoning coding models right now.

codingmoh 6 months ago

I saw pretty good reasoning quality with phi-4-mini. But alright - I’ll still run some tests with qwen2.5-coder and plan to add support for it next. Would be great to compare them side by side in practical shell tasks. Thanks so much for the pointer!
