
New feature "Enqueue-Slowdown" (or -Throttling) #69

Merged
merged 2 commits on Mar 27, 2020
Conversation


@oli-h oli-h commented Mar 25, 2020

Background: Redis memory is limited, and therefore queue sizes are limited (queue length multiplied by the average queue entry size).
So at some point, EN-queuing suddenly fails (assuming DE-queuing is not possible for a while).

For stability reasons we can now gradually build up backpressure towards the EN-queuing process (HTTP or EventBus clients) by delaying the "enqueue successful" reply more and more. Assuming that the originating EN-queuing process also works sequentially (i.e. it only requests to enqueue the next message once the previous message has been enqueued successfully), this simple mechanism helps to protect 'our' Redis memory and gives us more time (minutes or even hours) to react.

There are now two additional config options per QueueName pattern (a sketch of the presumed delay calculation follows after this list):

  • enqueueDelayMillisPerSize
  • enqueueMaxDelayMillis
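
The PR text does not spell out the exact formula, but from the option names the delay presumably grows linearly with the current queue size and is capped at a maximum. A minimal sketch of that presumed calculation in plain Java (method and parameter names are illustrative, not taken from RedisQues):

```java
/**
 * Presumed enqueue-slowdown calculation: the reply to a successful enqueue
 * is delayed proportionally to the current queue size, capped at a
 * configurable maximum. Names are illustrative only.
 */
static long enqueueDelayMillis(long queueSize,
                               long enqueueDelayMillisPerSize,
                               long enqueueMaxDelayMillis) {
    long delay = queueSize * enqueueDelayMillisPerSize;
    return Math.min(delay, enqueueMaxDelayMillis);
}
```

For example, with enqueueDelayMillisPerSize = 2 and enqueueMaxDelayMillis = 5000, a queue holding 1'000 entries would delay the "enqueue successful" reply by 2'000 ms.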

Closes #70

Additionally:

  • simplified the Config object (i.e. removed the 'Builder' pattern, which seemed to add no value while enforcing duplicate code)
  • now using a pre-compiled RegEx Pattern instead of the method String#matches (which compiles the RegEx over and over again); see the sketch after this list
  • reduced string-concat operations within RedisQues when building Redis key prefixes over and over again
  • harmonized naming: the variable name "queueName" is now used throughout the RedisQues class
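
To illustrate the second point, a small sketch (not the actual RedisQues code) of the difference between String#matches, which compiles the regex on every call, and a pre-compiled java.util.regex.Pattern, which is compiled once and reused:

```java
import java.util.regex.Pattern;

class QueueNameFilter {
    // Compiled once, e.g. when the configuration is loaded.
    private final Pattern queueNamePattern;

    QueueNameFilter(String regex) {
        this.queueNamePattern = Pattern.compile(regex);
    }

    boolean matches(String queueName) {
        // Reuses the pre-compiled pattern instead of calling
        // queueName.matches(regex), which compiles the regex on every call.
        return queueNamePattern.matcher(queueName).matches();
    }
}
```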

@mcweba
Collaborator

mcweba commented Mar 25, 2020

Could you also put the first comment describing the feature into an issue? It's easier to link to in the release notes.

@lbovet
Member

lbovet commented Mar 26, 2020

PR OK for me, and I agree with the other comments.
I find it a bit ironic that we have to implement such a thing because our system runs too fast :)

@mcweba
Collaborator

mcweba commented Mar 26, 2020

PR OK for me, and I agree with the other comments.
I find it a bit ironic that we have to implement such a thing because our system runs too fast :)

It's just ISA. NEMO likes it fast :-)

@oli-h
Author

oli-h commented Mar 27, 2020

Yes - that's exactly the problem: Our system runs too fast.
Our clients (i.e. the entry side of the overloaded queues) are also RedisQues queues.
When they have been offline for a longer time (e.g. a network problem in our data center), those client queues fill up.
As soon as we are online again, thousands and thousands of client queues start delivering to our "data center queues". We can handle 3'000 to 4'000 enqueuings per second.
But as bandwidth/throughput to/from the Redis server is also limited, we observed that the DE-queuing rate suffers heavily under the high EN-queuing rate.
This is also related to #51: we need many more roundtrips to Redis for one DE-queuing than for one EN-queuing.

An alternative would be to split RedisQues into two Verticles: one which only does EN-queuing and another which only does DE-queuing. We could then deploy a different number of each verticle (i.e. only one ENqueue verticle but 8 DEqueue verticles); a minimal deployment sketch follows below.

Still, a smoothly increasing backpressure towards EN-queuing clients seems to be a valid option.
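
A minimal Vert.x sketch of that alternative, deploying a different number of instances per verticle. The verticle class names are hypothetical; RedisQues is not actually split this way in this PR:

```java
import io.vertx.core.DeploymentOptions;
import io.vertx.core.Vertx;

public class Deployer {
    public static void main(String[] args) {
        Vertx vertx = Vertx.vertx();

        // One verticle instance handling only EN-queuing (hypothetical class).
        vertx.deployVerticle("com.example.EnqueueVerticle",
                new DeploymentOptions().setInstances(1));

        // Eight verticle instances handling only DE-queuing (hypothetical class).
        vertx.deployVerticle("com.example.DequeueVerticle",
                new DeploymentOptions().setInstances(8));
    }
}
```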

@ljucam ljucam left a comment

fine for me

@mcweba
Collaborator

mcweba commented Mar 27, 2020

Could you also put the first comment describing the feature into an issue? It's easier to link to in the release notes.

The PR is fine for me now. Could you please add the description as an issue?

@oli-h oli-h linked an issue Mar 27, 2020 that may be closed by this pull request
@oli-h oli-h merged commit 4a7e15f into swisspost:develop Mar 27, 2020