I’ve got a lot of services, some in docker, some in LXC or a VM in proxmox. Currently I’ve got no monitoring service. Recently a service went down and I didn’t notice for quite a while so now I’ve got a bunch of missing data. What monitoring tools do you all use? Looking for something that works with docker and plain Linux CTs/VMs and can notify me if a website is down, docker container crashed, VM is offline, etc.

and as a bonus feature something that I can run on two machines so if an entire machine dies, the other will notice and I’ll still receive a notification.

notification can be anything, email, sms, push, etc.

Matt The Horwood
link
fedilink
English
11Y

a nagios user here, no pretty charts. Just is it down

nitrolife
link
fedilink
English
81Y

I use:

  1. Monitoring server - prometheus
  2. Alert manager for prometheus - alertmanager. You can write any triggers here.
  3. Web UI for prometheus - Grafana
  4. Exporters for prometheus - node-exporter, blackbox-exporter, mysql-exporter, psql-exporter etc. You can find exporter for everything you need.
  5. Some services native support pormetheus. Docker for example: https://docs.docker.com/config/daemon/prometheus/

If you whant cluster you can install thanos on prometheus.

@peregus@lemmy.world
link
fedilink
English
21Y

I’d like to explore Prometheus (I’ve never used it). Right now I use InfluxDB to store some data (ping times, temperature, servers load, etc.) can Prometheus read those values and react if something is off or should I store everything twice?

nitrolife
link
fedilink
English
31Y

prometheus use own time series database. you can connect influxdb to grafana and send alarms from grafana, but alertmanager better i think. node-explorer can collect all this data (sensors, VM/PC load etc.)

@peregus@lemmy.world
link
fedilink
English
2
edit-2
1Y

I’ve never used alerting in Grafana, how do they work? Is it possible to get alert if a ping is higher than xx for a period of time? What are alertmanager and node-explorer? Plugins or standalone tools? Sorry for all the questions! 😁 And thanks for the info!

nitrolife
link
fedilink
English
11Y

Grafana sends an email screenshot of the graph when an event is triggered on the graph. You can see alerts part on any graph for understand.

@peregus@lemmy.world
link
fedilink
English
11Y

You know that I’ve never knew about that? I’ve just set it up! Thanks!!!

ᓰᕵᕵᓍ
link
fedilink
English
5
edit-2
1Y

If you strip down monitoring, all you need is a notification if something goes down

I use monocker it monitors status changes on containers and sends a notification when one happens

Thats all

@exi@feddit.de
link
fedilink
English
31Y

For a handful of servers, try zabbix. Every distribution has a packaged zabbix agent. It has everything: web ui, a way to Auto discover things with a bit of setup, nice graphs, alerting, LDAP User Management if you need it, a way to define per person/group alerting/notification schedules. And the community is big enough that many common services (fail2ban/postfix/MySQL/etc.) have premade custom monitoring scripts. Adding your own metrics is also very easy.

@AES@lemmy.ronsmans.eu
link
fedilink
English
21Y

After years of Nagios use now on Zabbix for 2 years. It’s really really great and my favorite monitoring system once you get the hang of it.

But overkill for just some home monitoring imo. I would recommend uptimekuma.

@tychosmoose@lemm.ee
link
fedilink
English
11Y

Monit works for me. Good basic monitoring solution that can also restart a service/interface.

I also use LibreNMS to do alerting for a variety of conditions (syslog events, sensor conditions, outages and services via nagios). But this is more work to get set up.

lnxtx
link
fedilink
English
21Y

Checkmk maybe?

CheckMK is too complicated for my monkey brain. After a few days of going through docs, I can’t even get a log file monitoring going.

If all you want is uptime monitoring, Uptime Kuma.

SirMaple_
link
fedilink
English
21Y

I use LibreNMS and Healthchecks.io. I also use Grafana to display all the important data in a dashboard on a portrait mounted monitor on my desk.

Leraje
link
fedilink
English
21Y

I have Conky on my desktop and do a curl to a known page on my server to monitor if a web service is up every 60 seconds. If it’s down, I swap to a blinking animated gif as an icon and play an alert sound.

@vegetaaaaaaa@lemmy.world
link
fedilink
English
1
edit-2
1Y

I use netdata (agent only, not the cloud/SaaS stuff) for metrics/charts/health/HTTP checks/alerting, and rsyslog+graylog (or just lnav on small setups) for log analysis. Plus a bunch of other scanners (debsecan, lynis, debsums…)

UptimeKuma is what I use; it’ll watch tcp connections, docker containers, websites… whatever. And the notifications are pretty comprehensive and probably cover anything in 2023 would want to be using.

@peregus@lemmy.world
link
fedilink
English
2
edit-2
1Y

I use Uptime Kuma too in my VPS and I monitor it with Node Red at home (that is monitored by Uptime Kuma! 😁) So if anything goes down (monitoring tool too) I receive alerts. Both of them send me alerts with NTFY.

Velocidapter
link
fedilink
English
11Y

+1 for Uptime Kuma. Dead simple to set up and configure, and it has alert support for dozens of services.

I administer a large Zabbix environment in my day job, and while it’s not complicated to get set up, it’s overkill for simple up/down service monitoring.

Create a post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

  • 1 user online
  • 269 users / day
  • 660 users / week
  • 1.42K users / month
  • 3.94K users / 6 months
  • 1 subscriber
  • 3.79K Posts
  • 76.7K Comments
  • Modlog