This is my first try at creating a map of lemmy. I based it on the overlap of commentors that visited certain communities.

I only used communities that were on the top 35 active instances for the past month and limited the comments to go back to a maximum of August 1 2024 (sometimes shorter if I got an invalid response.)

I scaled it so it was based on percentage of comments made by a commentor in that community.

Here is the code for the crawler and data that was used to make the map:

https://codeberg.org/danterious/Lemmy_map

    • Danterious@lemmy.dbzer0.comOP
      link
      fedilink
      English
      arrow-up
      27
      ·
      edit-2
      7 days ago

      Either the people in !steamdeck@lemmy.world are pretty horny or its an artifact of the dimensionality reduction and means nothing.

      Edit: Actually it could also be that it just didn’t collect enough data on that community and the most recent person was also active in nsfw communities. I was only able to get back 14ish days in the data for lemmy.world. They produce way to many comments and I got kicked out early.

      Anti Commercial-AI license (CC BY-NC-SA 4.0)

    • cron@feddit.org
      link
      fedilink
      English
      arrow-up
      13
      ·
      6 days ago

      This community has only two posts and a few comments. The algorithm has very few information on such tiny communities.

      It would probably be useful to only include communities with a minimum amount of interaction to avoid such outliers.