As in, when I watched YouTube tutorials, I often see YouTubers have a small widget on their desktop giving them an overview of their ram usage, security level, etc. What apps do you all use to track this?

1 point

Grafana. Have alerts set up and get data with node exporter and cadvisor with some other containers giving some metrics.

I have alerts setup and they just ping me on a discord server I setup. High cpu and temps low disk space memory things like that. Mostly get high CPU or temp alerts and that’s usually when plex does its automated things at 4am.

permalink
report
reply
1 point

I use Home Assistant already. They have a plugin for glances. I guess all I’m interested in is cpu temp and load. Any changes =somethings up

permalink
report
reply
1 point

Alerts are much more important than fancy dashboards. You won’t be staring at your dashboard 24/7 and you probably won’t be staring at it when bad things happen.

Creating your alert set not easy. Ideally, every problem you encounter should be preceded by corresponding alert, and no alert should be false positive (require no action). So if you either have a problem without being alerted from your monitoring, or get an alert which requires no action - you should sit down and think carefully what should be changed in your alerts.

As for tools - I recommend Prometheus+Grafana. No need for separate AletrManager, as many guides recommend, recent versions of Grafana have excellent built-in alerting. Don’t use those ready-to-use dashboards, start from scratch, you need to understand PromQL to set everything up efficiently. Start with a simple dashboard (and alerts!) just for generic server health (node exporter), then add exporters for your specific services, network devices (snmp), remote hosts (blackbox), SSL certs etc. etc. Then write your own exporters for what you haven’t found :)

permalink
report
reply
1 point

Alerts are much more important than fancy dashboards.

It depends, If you have to install lot of stuff or manage a lot of thing it’s a good idea to have one but if you mainly do maintenance and you want to have something reliable yes you should have an alerts, for exemple I don’t have a lot of thing install and doesn’t rly care about reliability so I do everything in terminal, I use arch btw

permalink
report
parent
reply
1 point

One thing about using Prometheus alerting is that it’s one less link in the chain that can break, and you can also keep your alerting configs in source control. So it’s a little less “click-ops,” but easier to reproduce if you need to rebuild it at a later date.

permalink
report
parent
reply
1 point

When you have several Prometheus instances (HA or in different datacenters), setting up separate AlertManagers for each of them is a good idea. But as OP is only beginning his journey to monitoring, I guess he will be setting up a single server with both Prometheus and Grafana on it. In this scenario a separate AlertManager doesn’t add reliability, but adds complexity.

As for source control, you can write a simple script using Grafana API to export alert rules (and dashboards as well) and push them to git. Not ideal, sure, but it will work.

Anyway, it’s never too late to go further and add AlertManager, Loki, Mimir and whatever else. But to flatten the learning curve I’d recommend starting with Grafana alerts that are much more user-friendly.

permalink
report
parent
reply
1 point

Thank you for this. I think I need a deeper understanding of Prometheus. I’ll look into it. You are awesome

permalink
report
parent
reply
1 point

Good luck, if you get into it, you’ll be unable to stop. Perfecting your monitoring system is a kind of mania :)

One more advice for another kind of monitoring. When you are installing / configuring something on your server - it’s handy if you can monitor it’s resource usage in real time. And that’s why I use MobaXterm as my terminal program. It has many drawbacks, and competitors such as XShell, RoyalTS or Tabby look better in many ways… but it has one killer feature. It shows a status bar with current server load (CPU, RAM, disk usage, traffic) right below your SSH session, so that you don’t have to switch to another window to see the effect of your actions. Saved me a lot of potential headache.

permalink
report
parent
reply
1 point

I was looking at loki+grafana. is prometheus a replacement for loki in this setup and is it preferred?

permalink
report
parent
reply
1 point

No, they serve different purposes. Loki is for logs, Prometheus is for metrics. Grafana helps to visualize data from both.

permalink
report
parent
reply
1 point

What about InfluxDB? I hear that mentioned around Grafana a lot.

permalink
report
parent
reply
1 point

When you’ve got a lot of variables, especially when dealing with a distributed system, that importance leans the other way. Visualization and analytics are practically required to debug and tune large systems

permalink
report
parent
reply
1 point

Uptime Kuma and Grafana. Uptime Kuna to monitor if a service is up and running and Grafana to monitor the host like CPU, RAM, SSD usage etc.

permalink
report
reply
1 point

Same here, also have some autoscaling mechanisms set up in docker swarm to scale certain services in case the load is high

permalink
report
parent
reply
1 point

Thank you for this. I appreciate the support.

permalink
report
parent
reply
1 point

Netdata, monitoring a few thousand servers (virtual) that way.

permalink
report
reply

Self-Hosted Main

!main@selfhosted.forum

Create post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

For Example

  • Service: Dropbox - Alternative: Nextcloud
  • Service: Google Reader - Alternative: Tiny Tiny RSS
  • Service: Blogger - Alternative: WordPress

We welcome posts that include suggestions for good self-hosted alternatives to popular online services, how they are better, or how they give back control of your data. Also include hints and tips for less technical readers.

Useful Lists

Community stats

  • 23

    Monthly active users

  • 1.8K

    Posts

  • 11K

    Comments

Community moderators