Hello

Could someone help me resolve an issue in my configuration?

I have three Prometheus jobs that use the *proxy_url* parameter, pointing 
to a PushProx deployment located within the same AWS EKS cluster as 
Prometheus.

PushProx is also externally accessible via an NGINX ingress controller to 
collect metrics from hosts that are not directly reachable by Prometheus


I am currently facing an issue where Prometheus cannot scrape metrics from 
certain hosts, even though their metrics endpoints are accessible. The 
affected hosts appear to be random.

I have already tried increasing the *scrape_interval* and *scrape_timeout* 
settings in Prometheus, as well as the *scrape.default-timeout* parameter 
for PushProx. Unfortunately, these changes have not resolved the issue.

Also I checked pod resources like CPU and memory, they also not at the max 
level

Below are logs from the PushProx proxy pod


 ts=2024-11-15T16:04:04.346Z caller=main.go:179 level=error msg="Error 
scraping:" err="Timeout reached for \"http://10.44.44.161:9000/metrics\": 
context canceled" url=http://10.44.44.161:9000/metrics

ts=2024-11-15T16:04:04.363Z caller=main.go:179 level=error msg="Error 
scraping:" err="Timeout reached for \"http://10.44.57.96:9000/metrics\": 
context canceled" url=http://10.44.57.96:9000/metrics

ts=2024-11-15T16:04:04.463Z caller=main.go:179 level=error msg="Error 
scraping:" err="Timeout reached for \"http://10.44.57.100:9000/metrics\": 
context canceled" url=http://10.44.57.100:9000/metrics

ts=2024-11-15T16:04:04.539Z caller=main.go:179 level=error msg="Error 
scraping:" err="Timeout reached for \"http://10.44.66.66:9000/metrics\": 
context canceled" url=http://10.44.66.66:9000/metrics

ts=2024-11-15T16:04:04.634Z caller=main.go:179 level=error msg="Error 
scraping:" err="Timeout reached for \"http://10.10.8.61:9000/metrics\": 
context canceled" url=http://10.10.8.61:9000/metrics

ts=2024-11-15T16:04:04.664Z caller=main.go:179 level=error msg="Error 
scraping:" err="Timeout reached for \"http://10.44.66.47:9000/metrics\": 
context canceled" url=http://10.44.66.47:9000/metrics

  

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion visit 
https://groups.google.com/d/msgid/prometheus-users/b06ba254-cfb0-4d72-b0f1-b9f82787d075n%40googlegroups.com.

Reply via email to