| Age | Commit message | Author | |
|---|---|---|---|
| 2 days | Add f3 (192.168.2.133) to Prometheus node_exporter scrape targets | Paul Buetow | |
| f3 was previously excluded from FreeBSD host monitoring. Now that node_exporter is installed and running there, include it in the node-exporter job so CPU temperature and other host metrics are collected alongside f0/f1/f2. Also update the temperature alert comment to reflect that f3 is now covered. | |||
| 2026-04-08 | f3s/prometheus: add Garage admin scrape targets (task f) | Paul Buetow | |
| Add job_name garage for 192.168.2.130-132:3903 with os=freebsd label. Mirror config in additional-scrape-configs-secret for kube apply/ArgoCD. Made-with: Cursor | |||
| 2026-01-15 | Update monitoring and gogios configuration | Paul Buetow | |
| Add node resources multi-select dashboard for Prometheus; update gogios cron schedule and add HTML status file output; update Prometheus scrape configs; add gogios documentation. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> | |||
| 2026-01-08 | Remove invalid radicale scrape job (no metrics endpoint) | Paul Buetow | |
| 2026-01-08 | Add comprehensive ArgoCD application monitoring and alerts | Paul Buetow | |
| This implements monitoring for ALL services deployed via ArgoCD by leveraging ArgoCD's native Prometheus metrics instead of scraping individual services. Changes: created ArgoCD application alerts for health and sync status monitoring; alert when applications are unhealthy (Degraded, Missing, Unknown, Suspended); alert when applications are out of sync for >10 minutes; alert when sync operations are failing repeatedly; alert when applications are stuck in Progressing state; added recording rules for unhealthy/out-of-sync application counts; added radicale health monitoring via scrape config; added radicale to additional-scrape-configs for direct health checks; monitors radicale web interface availability. Benefits: single monitoring solution for all 21 ArgoCD-managed applications; automatic monitoring for new applications added to ArgoCD; early detection of configuration drift and deployment issues; centralized alerting with actionable remediation steps. Monitored applications include: radicale, registry, alloy, grafana, loki, prometheus, tempo, anki-sync-server, audiobookshelf, filebrowser, immich, keybr, kobo-sync-server, miniflux, opodsync, and more. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> | |||
| 2025-12-30 | Add Prometheus Pushgateway and data ingestion tool | Paul Buetow | |
| This commit adds a complete Prometheus data ingestion solution. (1) Pushgateway Helm chart (f3s/pushgateway/): standalone Helm chart for Prometheus Pushgateway; deployed to the monitoring namespace; receives pushed metrics via HTTP POST on port 9091. (2) Prometheus Pusher (f3s/prometheus-pusher/): standalone Go binary (12MB) for pushing metrics to Pushgateway; demonstrates the following Prometheus metric types: Counter (app_requests_total), Gauge (app_active_connections, app_temperature_celsius), Histogram (app_request_duration_seconds), Labeled Counter (app_jobs_processed_total); pushes metrics every 15 seconds; includes comprehensive documentation and examples. (3) Prometheus configuration: updated additional-scrape-configs.yaml to scrape Pushgateway; uses honor_labels to preserve pushed metric labels. Architecture: Go Binary → Pushgateway → Prometheus → Grafana. The pusher binary generates realistic example metrics and pushes them to Pushgateway in Prometheus text format. Prometheus then scrapes the Pushgateway and makes the metrics available for querying and alerting. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> | |||
| 2025-12-25 | revert: undo all observability changes from today | Paul Buetow | |
| Reverts hostname relabeling and etcd metrics changes | |||
| 2025-12-25 | use hosts not IPs | Paul Buetow | |
| 2025-12-06 | add openbsd node exporters | Paul Buetow | |
| 2025-12-06 | add more | Paul Buetow | |

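The 2025-12-30 commit describes a Go binary that pushes metrics to Pushgateway in Prometheus text format over HTTP POST on port 9091. The pusher's source is not shown in this log, so the following is only a minimal standard-library sketch of that flow, not the author's implementation. The function names (`renderGauge`, `pushURL`, `push`), the `localhost` base URL, and the job name `example_pusher` are assumptions made for illustration; the metric name `app_temperature_celsius` comes from the commit message.

```go
package main

import (
	"bytes"
	"fmt"
	"net/http"
	"net/url"
	"strings"
)

// renderGauge formats a single gauge sample in the Prometheus text
// exposition format (HELP line, TYPE line, then the sample itself).
func renderGauge(name, help string, value float64) string {
	var b strings.Builder
	fmt.Fprintf(&b, "# HELP %s %s\n", name, help)
	fmt.Fprintf(&b, "# TYPE %s gauge\n", name)
	fmt.Fprintf(&b, "%s %g\n", name, value)
	return b.String()
}

// pushURL builds the Pushgateway endpoint for a job. The job name is the
// grouping key and is path-escaped before being appended.
func pushURL(base, job string) string {
	return strings.TrimRight(base, "/") + "/metrics/job/" + url.PathEscape(job)
}

// push sends the rendered metrics to the Pushgateway with an HTTP POST
// and treats any non-2xx response as an error.
func push(client *http.Client, base, job, body string) error {
	req, err := http.NewRequest(http.MethodPost, pushURL(base, job), bytes.NewBufferString(body))
	if err != nil {
		return err
	}
	req.Header.Set("Content-Type", "text/plain; version=0.0.4")
	resp, err := client.Do(req)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	if resp.StatusCode/100 != 2 {
		return fmt.Errorf("pushgateway returned %s", resp.Status)
	}
	return nil
}

func main() {
	// Hypothetical values: base URL and job name are placeholders.
	body := renderGauge("app_temperature_celsius", "Example temperature reading.", 21.5)
	fmt.Print(body)
	fmt.Println(pushURL("http://localhost:9091", "example_pusher"))
	// To actually send the sample, call:
	//   push(http.DefaultClient, "http://localhost:9091", "example_pusher", body)
}
```

`main` only prints the payload and target URL so the sketch runs without a Pushgateway; calling `push` in a loop with a 15-second ticker would approximate the interval mentioned in the commit. A POST replaces only metrics with the same name in the job's group, which suits a pusher that resends the same series each cycle.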