Add multi-dc testcase for 90% storage utilization #9157
Last test run: https://jenkins.scylladb.com/view/staging/job/scylla-staging/job/cezar/job/byo-longevity-test/69/consoleFull
Did not redistribute data to the new DC. Trying fix 1b0e85b |
https://argus.scylladb.com/tests/scylla-cluster-tests/e70ab70a-063f-463e-a289-39a90805e597
|
https://github.com/cezarmoise/scylla-cluster-tests/tree/new-dc Trying to alter all keyspaces before adding the DC so that they already have per-DC replication; changing it afterwards is then only one change. https://argus.scylladb.com/tests/scylla-cluster-tests/4cb74447-6750-4bba-83ef-4ccad8cf6a89 |
Failed due to a timeout on a large keyspace. Updating to replicate only the small keyspaces.
|
Still timeout issues: https://argus.scylladb.com/tests/scylla-cluster-tests/5117a642-3a7a-4c9a-ba43-d1898756f556 Set the timeout on queries to 5 min and will try again |
https://argus.scylladb.com/tests/scylla-cluster-tests/6b7ea346-0ca8-42c8-a955-7f7f4f3d1922 Only added the small keyspaces to the new DC, as I got timeouts when trying to alter the large ones. Will update with a new run. |
https://argus.scylladb.com/tests/scylla-cluster-tests/5ab30315-6993-4b6e-8cca-b0b430076eca Initial Cluster: 4 x i4i.large
|
Do we know how long it took to provision the new DC?
Why different replication? Does the space occupied in DC2 correspond to 70% storage utilization, or should it be higher/lower?
Did we verify the new DC works as expected? i.e. with reads or writes? |
10 minutes to add the nodes
The RF for the new DC needs to be increased by 1 at a time, so it would take 3x the time.
Currently I get stress errors
I think the stress command needs to be updated because of the new dc. But in
without any mention of replication, and I don't know exactly what the difference is. |
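A minimal sketch of the "increase RF by 1 at a time" point above, assuming the new DC is taken from RF 1 up to RF 3 with one `ALTER KEYSPACE` per step. The keyspace name `keyspace_small1` and the DC names `dc1` and `dc2` are hypothetical placeholders, not taken from the test, and the helper only builds the CQL strings rather than executing them.

```python
def rf_step_statements(keyspace, existing_rf, new_dc, target_rf):
    """Build one ALTER KEYSPACE statement per RF step for the new DC.

    existing_rf maps already-replicated DC names to their RF (left unchanged).
    All names passed in are placeholders chosen by the caller.
    """
    statements = []
    for rf in range(1, target_rf + 1):
        replication = {"class": "NetworkTopologyStrategy", **existing_rf, new_dc: rf}
        # Quote string values, leave integer RFs bare.
        options = ", ".join(
            f"'{key}': {value!r}" if isinstance(value, str) else f"'{key}': {value}"
            for key, value in replication.items()
        )
        statements.append(
            f"ALTER KEYSPACE {keyspace} WITH replication = {{{options}}};"
        )
    return statements


if __name__ == "__main__":
    # Hypothetical names: one small keyspace, original DC "dc1", new DC "dc2".
    for statement in rf_step_statements("keyspace_small1", {"dc1": 3}, "dc2", 3):
        print(statement)
```

Each statement would be followed by whatever wait the test applies before the next step, which is why the total time scales with the target RF.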
@Lakshmipathi any idea what's wrong? |
@pehala I'm not quite sure why it stopped working with new-dc. Searching existing issues, I came across this one: scylladb/cassandra-stress#16 |
@cezarmoise, can you share the Jenkins link for this error? I got a similar one with my simple scaleout run (https://jenkins.scylladb.com/job/scylla-staging/job/LakshmipathiGanapathi/job/byo-longevity-test/268/console)
|
https://jenkins.scylladb.com/job/scylla-staging/job/cezar/job/byo-longevity-test/97/ The Jenkins link probably won't be around for very long, I run a lot of builds. |
Initial Cluster: 4 x i4i.large
At this point I get an out of space error
This happened after
When inserting data in the original DC, after keyspace_large3, it was only at 60% capacity. After that, there are a lot of errors like this
|
Isn't it expected to run out of space, if we have 4 nodes x 0.9 utilization and want to hold the same number of replicas on just 3 nodes? |
My mistake. It should say 4 new nodes. In the graph you can see there are 4 new lines |
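For reference, the arithmetic behind the question above, as a rough sketch. It assumes the new DC uses nodes of the same size as the original one and ends up holding the same total data volume (same RF); both are simplifying assumptions, not statements about the actual test setup.

```python
def projected_utilization(source_nodes, source_utilization, target_nodes):
    """Utilization of the target DC if it holds the same data on equal-sized nodes."""
    return source_nodes * source_utilization / target_nodes


if __name__ == "__main__":
    # 4 original nodes at 0.9 utilization, copied to 3 or 4 new nodes.
    for new_nodes in (3, 4):
        util = projected_utilization(4, 0.9, new_nodes)
        verdict = "does not fit" if util > 1.0 else "fits"
        print(f"{new_nodes} new nodes: {util:.0%} utilization ({verdict})")
```

Under these assumptions, 3 new nodes would need about 120% of their capacity, while 4 new nodes land at the same 90% as the original DC, so running out of space with 4 new nodes is not explained by this arithmetic alone.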
I will run this again, but with |
Between which operations? I am not sure this is the right approach: the customer is not expected to wait for balancing to finish before scaling DCs. @bhalevy could you take a look at why we can get an out of space error here? Or recommend who we should ask? |
Managed to reproduce the failures. This time, after altering each keyspace I waited for tablets balance. https://argus.scylladb.com/tests/scylla-cluster-tests/a318f810-3ae5-4912-9605-21434e3be97f https://argus.scylladb.com/tests/scylla-cluster-tests/6c7cff0a-5fab-482c-8fd7-21499fe35a0e |
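The "alter each keyspace, then wait for tablets balance" sequence described above could look roughly like the sketch below. The `alter` and `is_balanced` callables are hypothetical stand-ins for whatever the test framework uses to run the ALTER and to check tablet balance, and the timeout and poll interval are arbitrary defaults.

```python
import time


def alter_then_wait(keyspaces, alter, is_balanced, timeout_s=300.0, poll_s=5.0):
    """Alter keyspaces one by one, blocking until balance is reported after each.

    alter(keyspace) and is_balanced() are caller-supplied, hypothetical hooks.
    Raises TimeoutError if balance is not reported within timeout_s.
    """
    for keyspace in keyspaces:
        alter(keyspace)
        deadline = time.monotonic() + timeout_s
        while not is_balanced():
            if time.monotonic() >= deadline:
                raise TimeoutError(f"tablets not balanced after altering {keyspace}")
            time.sleep(poll_s)
```

Waiting per keyspace keeps the steps serialized, which makes it easier to attribute a failure to a specific ALTER.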
@cezarmoise could you create a separate issue describing the bug? |
Updated the description and name to match the changes to the test plan |
Opened scylladb/scylladb#21848 for the out of space issue |