Garage RPC hangs after a certain amount of time #99
Labels
No labels
AdminAPI
Bug
Check AWS
CI
Correctness
Critical
Documentation
Ideas
Improvement
Low priority
Newcomer
Performance
S3 Compatibility
Testing
Usability
No milestone
No project
No assignees
2 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: Deuxfleurs/garage#99
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Months ago, we had a problem where garage instances crashed during night.
We backtracked the problem and it appeared it occured during backups that were putting an important load on the cluster. We think it is due to some ressource exhaustion linked with Hyper.rs leading to HTTP timeouts, including on our health check that was triggering a reboot. We put a workaround in Nomad, asking it to indefinetely reboot the service when it crashes but as far as I know, the root problem is not yet solved.
Might be related: hyper #2419 - Http2: Hyper client gets stuck if too many requests are spawned #2419 .
Does this still happen with Netapp?
(we will know once Deuxfleurs is migrated to 0.4)
Closing this for now. We will reopen if issues arise again with Netapp.