Integrate CleverMicro and CleverSwarm in shared environment and test the Backpresure scale UP/DOWN #57
Labels
No labels
Blocked
Bounty
$100
Bounty
$1000
Bounty
$10000
Bounty
$20
Bounty
$2000
Bounty
$250
Bounty
$50
Bounty
$500
Bounty
$5000
Bounty
$750
MoSCoW
Could have
MoSCoW
Must have
MoSCoW
Should have
Needs feedback
Points
1
Points
13
Points
2
Points
21
Points
3
Points
34
Points
5
Points
55
Points
8
Points
88
Priority
Backlog
Priority
Critical
Priority
High
Priority
Low
Priority
Medium
Signed-off: Owner
Signed-off: Scrum Master
Signed-off: Tech Lead
Spike
State
Completed
State
Duplicate
State
In Progress
State
In Review
State
Paused
State
Unverified
State
Verified
State
Wont Do
Type
Bug
Type
Discussion
Type
Documentation
Type
Epic
Type
Feature
Type
Legendary
Type
Support
Type
Task
Type
Testing
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Blocks
You do not have permission to read 1 dependency
Reference: clevermicro/amq-adapter-python#57
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Now when there's a shared environment (that receives CleverMicro updates from the git merge requests), we are going to test the Scalability / Backpressure management
However we need an opinion on the actual implementation provided by Clever Micro:
Current situation.
CleverMicro defines an OVERLOAD condition, that would automatically trigger SCALE-UP event as:
The IDLE condition is consequently when average CPU usage is lower than IDLE_THRESHOLD for continuous IDLE_WINDOW seconds (initially set 10% and 30 sec), or for last continuous IDLE_WINDOW the number of parallel reqeuests being served is 0.
The Challenge:
SUGGESTION:
I believe we may need to add a monitor of the inbound queue length and trigger an OVERLOAD condition when the length of the inbound queue exceeds the predefined threshold.
We have discussed this feature with @freemo before, and we rejected this monitoring option, as the Backpressure feature is meant to be pre-emptive, scaling up the instance before it runs out of the resources.
However in the context of CleverSwarm, I believe that the inbound queue is the only measure that we can get to assess how fast is the service consuming incoming requests.
We understand that this is Reactive measure, that it diverges from the original intent, but we believe that for single threaded consumer service, which is I/O heavy, this is only measure that will be able to assess the current back-pressure