seokjin0414 commented on issue #19:
URL: https://github.com/apache/iggy/issues/19#issuecomment-3980317609

   @hubcio @spetz 
   I'd like to work on this issue. 
   Here's my proposed metrics plan: 
   
   **Phase 1 — System (wire existing sysinfo data to Prometheus)**
   - iggy_process_cpu_usage (Gauge) — process CPU %
   - iggy_memory_usage_bytes (Gauge) — process RSS
   - iggy_messages_size_bytes (Gauge) — total messages on disk
   - iggy_consumer_groups (Gauge) — consumer group count 
   - iggy_uptime_seconds (Gauge) — server uptime
   
   **Phase 2 — Throughput & Consumer Lag**
   - iggy_messages_sent_total, iggy_messages_polled_total (Counter)
   - iggy_bytes_sent_total, iggy_bytes_polled_total (Counter)
   - iggy_consumer_lag (Gauge, labels: consumer/topic/partition) — 
current_offset - stored_offset
   
   **Phase 3 — Connections**
   - iggy_connections_active{transport=tcp|quic|http|ws} (Gauge)
   - iggy_connections_total{transport=tcp|quic|http|ws} (Counter)
   
   **Phase 4 — Auth**
   - iggy_login_attempts_total, iggy_login_failures_total (Counter)
   
   **Phase 5 — Storage & Ops**
   - iggy_segments_rotated_total, iggy_segments_expired_total, 
iggy_messages_expired_total (Counter)
   - iggy_stale_clients_evicted_total (Counter)
   - iggy_messages_deduplicated_total (Counter)
   
   Start with Phase 1-2 in the first PR then follow up with the rest
   If there are any additional metrics you'd like to see or specific priorities 
please let me know


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to