Heyas, wondering if there’s an open sourced piece of software or the like, that could scrape media platforms for a specific topic. Platforms like YT, X, Lemmy, News Media, etc., perhaps using RSS? But, a program I can host on my server, that only I have access too, via webpage, CLI, whatever…

Thanks for any info…

Lemmy Tagginator
bot account
link
fedilink
29M

deleted by creator

november
link
fedilink
English
25
edit-2
9M

FreshRSS has been working great for me! It even has the ability for web scraping if you need it.

@Meuzzin@lemmy.world
creator
link
fedilink
English
39M

Right when I saw you reply, I saw a post about it. Digging in to it now. Thanks!

@charles@lemmy.ca
link
fedilink
English
1
edit-2
9M

Seconding the recommendation for FreshRSS, it’s the one I ended up hosting when I looked into this a while back and it’s been really great. Takes a minute to get everything setup, especially if you want to have different settings for different types of feeds, but once it’s all set it’s perfect (for my needs at least).

I’ve also got it setup with my domain so I can access the feed from anywhere and that’s been one of my favourite features.

Freshrss is really great!

@Urist@lemmy.ml
link
fedilink
English
29M

I would recommend miniflux a “minimalist and opinionated feed reader”. It is great on mobile and desktop and dead simple to set up and use.

YouTube has RSS feeds you can access without scraping, but it’s per channel so if you follow a lot of channels you’ll be following a lot of RSS feeds.

Lemmy also has RSS feeds for each community.

Are you looking for a reader instead? A reader aggregates the feeds and displays them. Usually it keeps track of which items you’ve already read.

adr1an
link
fedilink
English
16
edit-2
9M

Everyone is suggesting readers. I think you are looking for something like https://docs.rsshub.app it’s capable of generating RSS feeds from pretty much everything.

@Meuzzin@lemmy.world
creator
link
fedilink
English
79M

This is it. Exactly what I was looking for. Thanks much!

adr1an
link
fedilink
English
1
edit-2
9M

Jumping in to ask if there’d be a good reason to use a stand alone feed reader instead of using the nextcloud “news” app?

Stefano Prenna
link
fedilink
English
19M

I use rss-bridge for scraping sites that don’t offer rss feeds: https://rss-bridge.github.io/rss-bridge/index.html

seconded. the built-in custom css selection is excellent. ive been strongly considering self-hosting an RSS bridge, but i think my server has too much unpredictable downtime for it

I use Miniflux, it’s a lightweight RSS reader

And they just added Omnivore integration, which I’m so excited for.

@namelivia@lemmy.world
link
fedilink
English
19M

Oh, this looks nice! I need to try this!

@Meuzzin@lemmy.world
creator
link
fedilink
English
29M

That looks great as well. I like that I can integrate with my own domain.

Create a post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

  • 1 user online
  • 279 users / day
  • 589 users / week
  • 1.34K users / month
  • 4.55K users / 6 months
  • 1 subscriber
  • 3.5K Posts
  • 70K Comments
  • Modlog