High load makes API unresponsive #527

New issue

Open

opened 2023-03-13 11:28:38 +00:00 by jpds · 1 comment

jpds commented

2023-03-13 11:28:38 +00:00

Contributor

I've noticed that on high traffic volume to storage nodes or a new node being added, the admin API (with of course, the metrics endpoint and presumely health) becomes unresponsive, Prometheus cannot reach it which then triggers alerts.

garage-prometheus-unreach.png

65 KiB

quentin added the

kind

performance

label 2023-03-13 14:20:19 +00:00

maximilien commented

2024-07-25 20:51:28 +00:00

Owner

This likely depends on the timeout that you allow for scraping, and the level of activity on the metadata... we could maybe improve things here but I would not say it's unexpected

No milestone

No project

No assignees

2 participants

Notifications

Due date

The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference: Deuxfleurs/garage#527

No description provided.

Rows
Columns