cross-posted from: https://lemmy.dbzer0.com/post/95652

Hey everyone, you may have noticed that some of us have been raising alarms about the amount of spam accounts being created on insufficiently protected instances.

As I wanted to get ahead of this before we’re shoulders deep in spam, I developed a small service which can be used to parse the Lemmy Fediverse Observer and retrieve instances which are suspicious enough to block.

The Overseer provides fully documented REST API which you can use to retrieve the instances in 3 different formats. One with all the info, one with just the names, and one as a csv you can copy-paste into your defederation setting. You can even adjust the level of suspicion you want to have.

Not only that, I also developed a python script which you can edit and run and it will automatically update your defederation list. You can set that baby to run on a daily schedule and it will take care that any new suspicious instances are also caught and any servers that cleared up their spam accounts will be recovered.

I plan to improve this service further. Feel free to send me ideas and PRs.

19 points

A funny result is the accumulation around server centers, here Hetzner.

permalink
report
reply

That’s a bit too ordered lol

permalink
report
parent
reply
17 points

Thank you for trying to get ahead of the spam influx. Trying to beat the spambots is a thankless task, but thank you anyhouw.

permalink
report
reply
14 points

What about people like me who host an instance just for themselves, but don’t have any communities in their own instance? What’s the risk for me for getting blacklisted? I didn’t find an clear answer by looking at the source code.

permalink
report
reply
9 points

I assume the sus score would remain low, it seems to be looking for high number of accounts with extremely low posts and no/open registration

permalink
report
parent
reply
7 points

You have one user and has many comments and post as you make yourself. This means that your suspicion score is going to be extremely low (under .5 if you post 2 times on any server).

permalink
report
parent
reply
2 points

Feels like an instance per bot would be pretty wasteful, so single user instances shouldn’t be considered suspicious. But maybe it’s more scalable than I’d think.

permalink
report
parent
reply
14 points

“Lemmy Overseer “ is a creepy, ominous name. It sounds like the job title of some super strict, rules-obsessed, joyless office drone who works for the feds or a large corporation.

permalink
report
reply
25 points

This kind of talk really displeases the Overseer. You better watch yourself.

permalink
report
parent
reply
9 points

yeah it’s great isn’t it‽

permalink
report
parent
reply
14 points

I feel like the python script is maybe a bit too extreme ? 20 times more users than posts might happen on smaller instances people use mostly to browse the big ones, I feel. I ran it with a suspicion ratio of 100 and it didn’t seem to block any “legit looking communities”. But then again, it is very hard to tell.

Thanks a lot for your work !

permalink
report
reply
14 points

Yes, that’s why I leave the exact number up to each instance admin

permalink
report
parent
reply
6 points

Indeed, thanks a million for giving the community tools at this critical juncture. Very much appreciated!

permalink
report
parent
reply

Technology

!technology@beehaw.org

Create post

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Community stats

  • 2.6K

    Monthly active users

  • 3.4K

    Posts

  • 82K

    Comments