Hi All,
I am reaching out to gather some quantitative insights and experiences 
regarding the scalability of single prometheus instance. I understand that 
performance and scalability can vary significantly based on different 
aspects of infrastructure like whether the backend storage is local disk of 
NFS, network bandwidth , number of targets and metrics per targets, scrape 
interval etc.
For the following queries you can assume ideal conditions I.e. Optimal 
hardware , Maximum network bandwidth , Local disk ( say SSD) Or
 if you can share the information on the hardware and the performance 
metrics , that would help .

Here are the questions:
Q1. What is the practical limit on the number of active series which 
prometheus can handle or,
      What is the maximum number of active series to which prometheus can 
scale? 

           https://www.robustperception.io/scaling-and-federating-prometheus
        this article was written in 2015 and mentions that single instance 
can scale at minimum 1M series ( 1000 servers x 1000 metrics)
          
  
https://prometheus.io/docs/introduction/faq/#i-was-told-prometheus-doesnt-scale
          here i assume under ideal conditions, it can scale between 20 to 
90M.

Q2: What are the practical limits for storage and data retention in a 
single instance.

Q3: What is the highest number of targets and total metrics per target that 
can be efficiently scraped by single instance?

Q4: How does query performance ( latency) scale with increasing number of 
metrics and targets?

Any shared experiences , benchmarks or references to relevant documentation 
would help .

Thank you 
Regards,


-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/800c9642-1d65-4806-b634-412429766831n%40googlegroups.com.

Reply via email to