Fluff Overflow

Major outage at Blogger, temporarily loses 30 hours of posts  

My heart goes out to the Blogger team, especially their support folks, who are dealing with the vitriol on Twitter, and their operations engineers, who have likely been working for 24+ hours straight to restore service.

Severe outages can happen anywhere, even at a Google-owned site that’s been around for 12 years and normally has absolutely phenomenal uptime. Last month’s AWS outage showed that sites hosted in the cloud aren’t immune either.

Large-scale web sites are enormously complex, with a nearly infinite number of ways that something can go wrong. And until the day when the metal ones come (…and they will…), these sites are still run by humans, and even the best engineering teams can make mistakes.

Software engineers try to build things to withstand hardware failures and human error alike, but no system is perfect. When an outage happens, all we can do is restore, recover, rebuild, refactor, cry, swear, apologize profusely, and be thankful that we’re not doctors, air traffic controllers, or nuclear physicists.

Posted by evan