Google Search will no longer make site backups while crawling the web.
Quokka
link
fedilink
English
37M

Is there any sort of way to self host a limited version of this? L

I’d love to be able to have my own Searx also cache everything I visit as I go to it, it’d at least let me refind information I’ve previously found.

You want to self host… the Internet?

Quokka
link
fedilink
English
6
edit-2
7M

No, I want to automatically cache pages I’ve searched for and visited and have them show up on my searx.

We’re talking like maybe 10 pages a week if that.

I know there’s ArchiveBox, but I’m after something less manual and more integrated.

I know what you meant. I was just messing with ya

Red
link
fedilink
English
17M

You could use something like archivebox as that saves the whole page, or you could use Waybackmachine and force it to save the page via an add-on.

You could also setup your own yacy index and everytime you find an interesting site you could add it to yacy.
But this is kind of not what you are asking for. Archivebox is probably the closest, or using squidcache and literally caching every url you go to. 😅

Create a post

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

  • 1 user online
  • 144 users / day
  • 275 users / week
  • 709 users / month
  • 2.87K users / 6 months
  • 1 subscriber
  • 3.09K Posts
  • 64.9K Comments
  • Modlog