Comment by Atreiden

Comment by Atreiden 12 hours ago

4 replies

Homelab =/= Production systems

The gulf between these two insofar as what approach, technologies, and due-diligences are necessary is vast.

kelnos 12 hours ago

I think we've gone a little nuts defining "production system" these days. I've worked for companies with zero-downtime deployments and quite a lot of redundancy for high availability, and for some applications it's definitely worthwhile.

But I think for many (most?) businesses, one nine is just fine. That's perfectly doable by one person, even if you want, say, >=96% uptime, which allows for 350 hours of downtime per year. Even two nines allows for ~88 hours of downtime per year, and one person could manage that without much trouble.

Most businesses aren't global. Downtime outside regular business hours for your timezone (and perhaps one or two zones to the west and east of you) is usually not much of a problem, especially if you're running a small B2B service.

For a small business that runs on 1-3 servers (probably very common!), keeping a hot spare for each server (or perhaps a single server that runs all services in a lower-supported-traffic mode) can be a simple way to keep your uptime high without having to spend too much time or money. And people don't have to completely opt out of the cloud; there are affordable options for e.g. managed RDBMS hosting that can make maintenance and incident response significantly easier and might be a good choice, depending on your needs.

(Source: I'm building a small one-person business that is going to work this way, and I've been doing my research and gaming it out.)

  • PaulKeeble 11 hours ago

    One thing that AWS, Google and Azure do that your own systems don't is release their updates whenever it suits them, often taking down your business down in the middle of the day with their own problems. You can't fix it, you can't rollback what you just did and get back up and running you just have to sit and wait.

    That is quite different to a business that turns off its boxes for an hour at 0100 Sunday morning to do updates and release new software. The downtime isn't equivalent because it really matters when it is and if that hurts your use case or not. Your own system might be down for more hours a year than AWS, but its not down Monday to Friday on an evening when you do most your sales because you refuse to touch anything during that period and do all the work outside that and schedule your updates.

    • immibis 11 hours ago

      It also feels like AWS (or Azure) isn't really that much more reliable than your own thing. But half the internet is down at the same time so you don't get blamed as much.

      • PaulKeeble 10 hours ago

        Its the "No one gets blamed for going IBM" thing in the modern era. They are making it someone elses fault and absolves the blame. The problem is if your competitor is still up you could be loosing customers on average mid day outage, even if they are down for 3x as long its not when it matters.