Comment by rcbdev
With all due respect, what?
First of all, thank you so much for obviously writing part of this via a Large Language Model. Second of all, what kind of argument is "The commit message claimed '60% cost savings'" - do you have any idea what you were actually doing? And lastly, addressing your question:
> Do you set hard budget caps and accept downtime?
If you have no clue what you're doing, yes! Especially for early prototyping, why not? IaaS offerings will also cause downtime if you need more resources than you've provisioned. It's normal. Either you set up a system where you can rely on dynamic scaling, or you don't and you set hard limits.
You asked your cloud provider to provision resources, and you were billed for them. If you can't handle working with a cloud provider, you might want to look into less scalable but more cost-stable infrastructure solutions.
Appreciate the directness, and fair point. That’s exactly what I find confusing about their setup and why I’m here trying to learn.
A little more context: I’ve been on GCP for 4 years, App Engine for the majority of it. Expensive but stable. I’ve used Gemini in the past to reduce costs successfully, so this wasn’t my first attempt at optimizing.
I take ownership of the outcome, but the config behavior still doesn’t match my mental model, and Google support hasn’t been able to clarify how to properly scope this either, which is why I turned here.