Post-Mortem: The massive lemmy.world -> lemmy.dbzer0.com federation delays. (dbzer0.com)

posted 7 months ago

db0@lemmy.dbzer0.com [M]

div0@lemmy.dbzer0.com

28 comments

sunaurus@lemm.ee [A] [D]

32 points

7 months ago

Nice post, I enjoyed the storytelling. Glad it’s all sorted now 😁

Btw, regarding this point:

“All in all, this has been a fairly frustrating experience and I can’t imagine anyone who’s not doing IT Infrastructure as their day job being able to solve this. As helpful as the other lemmy admins were, they were relying a lot on me knowing my shit around Linux, networking, docker and postgresql at the same time. I had to do extended DB analysis, fork repositories, compile docker containers from scratch and deploy them ad-hoc etc. Someone who just wants to host a lemmy server would give up way earlier than this.”

I think you’re totally right, but at the same time, I think the collaborative troubleshooting that happened on Matrix (and has happened many times in the past for other issues) is pretty healthy, and not something that is always possible for other open source software.

Vilian@lemmy.ca

8 points

7 months ago

People interested in hosting their own instance are probably already interested in Linux, or already using it. I don’t think it’s that bad.

Dessalines@lemmy.ml [D]

24 points

7 months ago

Glad you were able to figure this one out. I never know whether to be mad at myself or proud of my persistence when I spend like a day trying to fix something that turned out to be really simple, and almost always unrelated to what I thought the problem was 😂

Edit: also if you found any performance-related config improvements, either to the postgres.conf, nginx.conf, or lemmy.hjson, please contribute them to lemmy-ansible so that all instances can benefit from what you’ve learned.

db0@lemmy.dbzer0.com [OP] [M]

8 points

7 months ago

Already sent a big PR for lemmy-doc 😊

henfredemars@infosec.pub

15 points

7 months ago

This reinforced in my mind that as much as I like the idea of lemmy (or any of the other threadiverse SW), this is only something experts should try hosting. Sadly, this will lead to more centralization of the lemmy community to a few big servers instead of many small ones, but given the nature of the problems one can encounter and the lack of support to fix them if they’re not experts, I don’t see an option.

This also gave me an insight about how the federation of lemmy will eventually break when a single server (say, lemmy.world) grows big enough to start overwhelming even servers that are not badly set up like mine was.

Lemmy has many scalability problems to solve, and not all of these problems are slow database queries. I believe your experience is going to become increasingly common as the community grows because that increased centralization will compound the scalability problems and continue to drive up the technical know-how required to host a successful instance. The software eventually needs to do more to detect and present operational problems to administrators in a friendly way. I2P is an example of a distributed network that’s quite good at reporting issues with the node.

With that said, not everything is doom and gloom. The community has proven itself highly resilient, and smart people like yourself are finding solutions. It’s going to be a tough road ahead.
