High load makes API unresponsive #527

Open
opened 2023-03-13 11:28:38 +00:00 by jpds · 1 comment
Contributor

I've noticed that on high traffic volume to storage nodes or a new node being added, the admin API (with of course, the metrics endpoint and presumely health) becomes unresponsive, Prometheus cannot reach it which then triggers alerts.

I've noticed that on high traffic volume to storage nodes or a new node being added, the admin API (with of course, the metrics endpoint and presumely health) becomes unresponsive, Prometheus cannot reach it which then triggers alerts.
quentin added the
kind
performance
label 2023-03-13 14:20:19 +00:00
Owner

This likely depends on the timeout that you allow for scraping, and the level of activity on the metadata... we could maybe improve things here but I would not say it's unexpected

This likely depends on the timeout that you allow for scraping, and the level of activity on the metadata... we could maybe improve things here but I would not say it's unexpected
Sign in to join this conversation.
No milestone
No project
No assignees
2 participants
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference: Deuxfleurs/garage#527
No description provided.