Hey everyone, so as I’m sure everyone is aware Lemmy.World has been experiencing several outages throughout the last few days.
We have been investigating the root cause of these outages but believe that they are related to our current hosting provider (Hetzner) blocking access from ClouldFlare as (we think) they believe that our CDN is a DDoS’er, and is causing these disconnects to our backend server, problematic for sure.
We’ve opened support tickets with our current provider and are awaiting a response. We have no issue with being as transparent as possible with downtime. Anyone that is curious, can feel free to check out https://status.lemmy.world and https://dash.lemmy.world for up to the minute outage information. We are also looking into other fediverse friendly methods of posting status and outage updates
In the meantime, we are evaluating alternative hosting options and solutions to provide a high level of reliability to you, our users. Really, we want to say thanks to everyone for soldiering through all our technical growing pains.
Cheers
- LW Infra Team
As always, the transparency is appreciated. Some growing pains are certainly to be expected
Whenever I get frustrated by the outages I remind myself: still better than reddit.
Thanks to lemmy I don’t doom scroll anymore because either there is an outage or I read all the new content I’m subscribed to in 10-20 minutes
That’s mostly how I redditted for years. It was mostly for those moments in between things, on the toilet, laying in bed at night. Not something I did for long periods of time.
Look at this guy with his reasonable social media use. YOU THINK YOU’RE BETTER THAN ME??
It’s your fault for not pitching in and making lemmy more stable. There, happy, you’ve been blamed :P
What is the burden if I wanted to host a node and limit access just to myself? Is it just a portal into the rest of the fediverse or is there a large maintenance burden or storage requirement?
I don’t blame you for this, but the uptime records are incomplete at best. I’ve experienced the site being down (and confirmed with Down for Everyone or Just Me), yet status.lemmy.world showed all systems operational. As I’m writing this, status.lemmy.world is missing most data up to yesterday and dash.lemmy.world shows 16 days uptime.
I have lots of respect to you for even having these. I also remember status.lemmy.world work mostly fine some time ago. But as of right now, both uptime monitors fail to serve their purpose.
You need to hover over the status bar to see if there is any down time for that day. We can enable it to log incidents every time there is a burp, but we are still tuning alerts as we only have it create a incident when we ACK it in PagerDuty. You can always check the dashboard for up to the minute stats, as well as https://lemmy-status.org/endpoints/_lemmy-world We’ll add this info to make things clearer <3
EDIT: Added more info to our status page, thanks for the feedback Machefi!
EDIT2: Also the missing data is due to us removing and adding more specific monitors for the different infra services.
Maybe its just the times I’m accessing but its seem better this week to me compared to the last few ones.
Good luck, no pressure.