Comment by ozim
The $0.05/request calculation is valid only as long as AI companies continue to burn money while they are in the grab-the-market phase.
Once the dust settles, prices will go up. Even if running models gets cheaper, they will need to earn back all the burned cash.
I’d much rather vibe code an app and get the code to run on some server.
Not necessarily. Hardware and software gains will make tokens cheaper, so we'll see where we are once the VC money runs out (or the entire US economy does; there's a chance AI will pop the tech bubble of the last 20 years. I think tech company valuations are insanely inflated compared to the value they provide).
I can get GPT-3-level quality with Qwen 8B, even Qwen 4B in some cases.