docs/troubleshooting/timeouts.rst
Issue: cluster is experiencing timeouts due to server overload
Possible Root Cause: aggressive retry policy
When requests fail, it is natural for users to want to retry the requests. However, if the request fails due to overload conditions, retrying the request can make the situation worse. Since the server is still processing the old request, when the new retry is attempted, this makes the server process both requests. The server which was already overloaded is now even more overloaded.
Here are a few situations that can lead to this scenario:
If your cluster is having timeouts during overload, check first if you are not making the overload situation worse through retries, and pay attention to the following:
driver documentation <https://docs.scylladb.com/stable/drivers/index.html>_ about parameters and defaults. For the server-side timeout, the /etc/scylla/scylla.yaml has request-specific timeout settings like read_request_timeout_in_ms and write_request_timeout_in_msdriver documentation <https://docs.scylladb.com/stable/drivers/index.html>_ about parameters and defaults.documentation <speculative-retry-options> for details.