Just started self hosting this instance. Nothing on the docs mentioned anything about storage considerations.

@GustavoM@lemmy.world
link
fedilink
English
2
edit-2
1Y

removed by mod

476M ./postgres 1.1G ./pictrs

After 3 weeks

baduhai
link
fedilink
English
31Y

How many users?

Netto Hikari
link
fedilink
English
631Y

This is my small instance with way fewer users than lemmy.world.

11G	pictrs
5.2G	postgres

Out of curiosity, how long has your instance been up? Just want to get a sense of how fast storage is increasing for you.

Netto Hikari
link
fedilink
English
41Y

23 days.

Kushan
link
fedilink
English
71Y

How has your Lemmy experience been on a self hosted instance? I’m currently using lemmy.world and it’s very error prone, would self hosting reduce those errors at the expense of anything? Does federation take long or do you find you’re getting federated content quickly enough?

@bdonvr@thelemmy.club
link
fedilink
English
71Y

You won’t get any old content, so that’s a downside. You’ll only get content after you start federating. Unless someone votes or comments on old content.

Other than that the only downside is spending time maintaining and updating it.

Netto Hikari
link
fedilink
English
131Y

The experience has been pretty good, to be honest. No instability, easy updates, etc. I find federated content quite quickly, because I use this script to populate the “All” feed.

Kushan
link
fedilink
English
51Y

Thanks for the script!

Netto Hikari
link
fedilink
English
51Y

I didn’t make it! :) I think, @fmstrat@nowsci.com made it.

Red Wizard 🪄
link
fedilink
English
41Y

Oh this is very cool

Ruud
mod
link
fedilink
English
2631Y

This is lemmy.world after 4 weeks:

58G	pictrs
34G	postgres
lightrush
link
fedilink
English
5
edit-2
1Y

Feels like this will benefit from some sort of fuzzy deduplication in the pictrs storage. I bet there are a lot of similar pics in there. E.g. if one pic or a gif is very similar to another, say just different quality or size, or compression, it should keep only one copy. It might already do this for the same files uploaded by different people as those can be compared trivially via hashing, but I doubt it does similarity based deduplication.

@mruczek@lemm.ee
link
fedilink
English
131Y

Interesting, I thought it would be waaayyy more

At the end of the day the vast majority of what needs to be saved is text. If media content is embedded, the the server just has to save the path to the file not the file itself.

Wow, that is surprisingly not bad given the size of the instance!

@HappyHam@lemm.ee
link
fedilink
English
181Y

Now I wonder how viable it would be to support video hosting. The answer is almost certainly “God no!”

GatoB
link
fedilink
English
11Y

It is viable through other hostings

Considering this is going to be around a 5 user instance at most I think I’ll be good for awhile. Thanks!

Dran
link
fedilink
English
151Y

Question if you know: does a lemmy instance have to be publically accessable to work? Like, if I make an instance on my homelab can the instance “fetch” content and serve it faster locally? Could I reply to a post and have others see it? Etc

wondering this also! wouldnt it require a domain for your account though?

manitcor
link
fedilink
English
57
edit-2
1Y

im running 50 users right now, subbed to A LOT of communities, seeing db growth of about 100mb per day.

That seems high when you extrapolate that to 10000 users, like a larger instance might have.

terribleplan
link
fedilink
English
67
edit-2
1Y

It’s all about how many communities your user(s) subscribe to since your instance basically acts as a mirror for those.

My instance has been running for 23 days, and I am pretty much the only active local user:

7.3G    pictrs
5.3G    postgres

edit: I may have a slight Reddit Lemmy problem

Alexander Kehr
link
fedilink
English
31Y

Do you also post stuff? I mean my instance is only about an hour old, but I’ve subscribed to some communities, yet I don’t see the picture service consuming the S3 storage I’ve configured

@bdonvr@thelemmy.club
link
fedilink
English
51Y

Your instance only caches thumbnails, so it won’t take much space. The full images are served from the remote instance. So you basically only store whatever your users upload.

terribleplan
link
fedilink
English
131Y

Lemmy caches every thumbnail of every post for like a month or something using Pictrs, so that storage will eventually hit a sort of equilibrium and start growing much more slowly (only reflecting post/thumbnail volume during the cache time).

Between profile images, community banners/icons, post images etc. there are probably a few dozen images that will be sticking around for the long haul at the moment.

So if you’re the only user (let’s assume for ease) then, that represents all the updates (posts, comments, votes) from each community that you are subscribed to?

terribleplan
link
fedilink
English
161Y

Yeah, and I purposely subscribe to (or sometimes have a dedicated “federation helper bot” account I run subscribe to) most of the most popular communities on the most popular instances so I can get a decent sampling of what’s going on in the fediverse on the “All” feed. So I assume my storage usage is maybe a bit higher than what an “average” single-user instance may be…

@bdonvr@thelemmy.club
link
fedilink
English
41Y

lmao same here. I have a spare account that I use to sub to everything worth subbing to. I haven’t automated it yet though.

@ipkpjersi@lemmy.one
link
fedilink
English
31Y

Ooh, that’s a really good idea, I need a federation helper bot/account when I start self-hosting a Lemmy instance!

ChickenBoo
link
fedilink
English
221Y

It won’t scale linearly. A lot of those users will be subscribed to subs the instance is already replicating. It would only be new subs that would add to the growth.

@bdonvr@thelemmy.club
link
fedilink
English
41Y

And only active subs. And even then, it’s just text and tiny thumbnails.

@bdonvr@thelemmy.club
link
fedilink
English
11
edit-2
1Y

Yeah lemmy seems to use just about nothing for data storage.

Grimr0c
link
fedilink
English
161Y

Honestly, Less than I thought!

@holycrap@lemmy.world
link
fedilink
English
111Y

Unless they changed all of the comment and post ids to bigints that’ll probably bring the site down before it runs out of storage. In defense of the lemmy developers they have been receptive to feedback, so I don’t think it’ll take long for that to be fixed if it hasn’t already.

A10@kerala.party
link
fedilink
English
261Y

Is there any way to purge old data?

@holycrap@lemmy.world
link
fedilink
English
351Y

I really hope it doesn’t get purged if lemmy is to be a Reddit replacement. A lot of the value Reddit had was obscure knowledge and making google searches actually usable.

I think as long as the original community the post is in doesn’t purge the data, it’s fine for other instances to purge if necessary.

@punio7@lemmy.world
link
fedilink
English
31Y

Exactly, when dealing with big data, you need a strategy to archive old data. You can’t just store everything in one DB. Smaller instances may not feel like keeping all the date from all the time. Even big instances should have a mechanism to move old data do different databases.

Kayn
link
fedilink
English
41Y

Are you planning on donating to instances that don’t purge old data?

lol lemmy died almost immediately after i posted this time to figure out what the hell caused that

it was because i set a damn server icon

b3nsn0w
link
fedilink
English
41Y

lmao just how powerful is your server icon?

b3nsn0w
link
fedilink
English
31Y

lol, yeah, that would crash any instance

(jokes aside, you’ll probably need to keep it somewhat low-res, and i’d also recommend cropping it to square. my instance uses a 128x128 icon)

Philip
link
fedilink
English
71Y

After hosting my own instance with just me for ca. 2 weeks:

1.99Gi pictrs

5.21Gi postgres

@JoShmoe@lemmy.zip
link
fedilink
English
-171Y

Holding onto all that data is pointless if you’re not selling it to someone.

AFK BRB Chocolate
link
fedilink
English
81Y

I disagree. One big hunk of value of a place like this is being able to look back at old threads. How many times did people say they always put “Reddit” in front of their Google searches to get the information they were looking for? This could be the same.

@JoShmoe@lemmy.zip
link
fedilink
English
11Y

That’s unsustainable. Why do you think the mainstream platforms are selling out?

AFK BRB Chocolate
link
fedilink
English
11Y

It’s really not, at least for the text part. Text posts and comments take almost nothing and storage continues to get cheaper.

Mainstream platforms are selling out because they’ve always had others and shareholders who ultimately want to make money.

NaN
link
fedilink
English
31Y

That’s a good reason for an instance to put “lemmy” in its url too, I imagine. Search engines are already returning Lemmy results for things.

Info is still useful for people doing google searches. It would be nice to be able to find common troubleshooting tips on Lemmy, etc.

@JoShmoe@lemmy.zip
link
fedilink
English
11Y

Not everything posted here holds any value.

Tugg
link
fedilink
English
161Y

Small instance with about 3 users and myself online for about 2 weeks.

pictrs   930M
postgres 1.4G
hitagi
link
fedilink
English
91Y

My instance eats up almost 100MB everyday. It mostly depends on what your users subscribe to. It was barely growing on my first few days until I invited a couple of friends over to try it out.

My instance has 13 users, and has been up for 2 months now:

1.5G    ./pictrs
3.4G    ./postgres
lightrush
link
fedilink
English
41Y

How many cans-of-beans.jpg can you store?

At least 3. Maybe 4.

Kayn
link
fedilink
English
7
edit-2
1Y

My instance dormi.zone has been running for around 3½ weeks now, has a 3-digit amount of users and hosts a community with little more than 1000 subscribers. Here’s how much storage it currently takes up:

  • 6.2 GiB postgres
  • 4.9 GiB pictrs

In the default Ansible configuration, storage will mostly be accumulated by log files that are automatically generated by Docker and deleted whenever you restart the Docker containers.

key
link
fedilink
English
15
edit-2
1Y

Depends. If you have a lot of users posting a lot of pictures and you use pictrs out of the box config, then a lot. If you are just running a few users with finite communities being synced then a lot less. The number is going to vary a lot as lemmy grows and gets older so hard to document realistic expectations. But docker images are probably going to take up more disk space than actual contents unless you get quite big. I just threw my PG volume into a tgz to move servers and it’s less than a gig.

@bdonvr@thelemmy.club
link
fedilink
English
6
edit-2
1Y

The lemmy.world admin said above that their instance currently takes up less than 100GB

Create a post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

  • 1 user online
  • 124 users / day
  • 419 users / week
  • 1.16K users / month
  • 3.85K users / 6 months
  • 1 subscriber
  • 3.68K Posts
  • 74.2K Comments
  • Modlog