Comment by syntaxing
You can try the latest GLM 4.6 https://z.ai/ . Their coding plan is $6 a month and performs on par with Sonnet 4 for my personal tasks. Sonnet 4.5 still has an edge though. All of Z.ai's models are also open source, so you can run them locally if you want.
I am mostly retired, but I am thinking of restarting a solo product mini-company next year. I have been looking at much less expensive options like Alibaba Cloud, GLM, Kimi K2, etc. There is a recent Stanford study showing most US startups are using less expensive Chinese models, though I think they are usually hosted in the US.
For now I am happy enough with Gemini and GPT-5 because my usage is so light that anything is cheap. For many engineering use cases, Gemini-2.5-flash-lite works well enough.
How do you use GLM? With codex --oss? Or just ‘raw’, with no agent-wrapping coding environment?