
[FG:InPlacePodVerticalScaling] A proposal for CPU allocated strategy when Pod scale up and down #8


@Chunxia202410 commented Nov 7, 2024

What type of PR is this?

/kind cleanup

This PR is based on kubernetes#123319.

What this PR does / why we need it:

When a Guaranteed QoS class Pod scales up or down under the static CPU management policy with InPlacePodVerticalScaling enabled, the allocated CPUs should follow the Kubernetes CPU allocation rules.

For scaling up, keep the original CPUs and allocate only the additional CPUs. When allocating the additional CPUs, it is necessary to ensure that the combination of the original CPUs and the additional CPUs complies with the Kubernetes CPU allocation rules.
For takeByTopologyNUMAPacked, for example:
Assume the CPU topology is as follows, and the Kubernetes CPU allocation rule is packed (takeByTopologyNUMAPacked):
[CPU topology diagram]
The allocated CPUs of Pod0 are {2,3} and the allocated CPUs of Pod1 are {13}. All other CPUs are free.
- When Pod1 scales up from 1 CPU to 2 CPUs ==> CPU {12} should be allocated to Pod1, so the CPU set of Pod1 is {12,13}.
- When Pod1 scales up from 1 CPU to 4 CPUs ==> CPUs {12,14,15} should be allocated to Pod1, so the CPU set of Pod1 is {12,13,14,15}.
- When Pod1 scales up from 1 CPU to 8 CPUs ==> CPUs {8-12,14,15} should be allocated to Pod1, so the CPU set of Pod1 is {8-15}.

For scaling down, if there are mustKeepCPUsForResize CPUs, then when allocating the retained CPUs it is necessary to ensure that the combination of the mustKeepCPUsForResize CPUs and the other retained CPUs complies with the Kubernetes CPU allocation rules.

Which issue(s) this PR fixes:

The steps for allocating additional CPUs during scale up under the packed (takeByTopologyNUMAPacked) allocation rule are as follows, assuming NUMA nodes sit above sockets in the memory hierarchy:

  1. Keep the original CPUs.
  2. Take the remaining CPUs in the NUMA nodes that have CPUs allocated only to this Pod. (newly added step)
  3. Take full NUMA nodes.
  4. Take the remaining CPUs in the sockets that have CPUs allocated only to this Pod, then take full sockets in the NUMA nodes that have CPUs allocated to this Pod. (newly added step)
  5. Take full sockets.
  6. Take the remaining CPUs in the physical cores that have CPUs allocated only to this Pod, then take full cores in the sockets that have CPUs allocated to this Pod, then take the remaining full cores in the NUMA nodes that have CPUs allocated to this Pod. (newly added step)
  7. Take full physical cores.
  8. Take the remaining CPUs in the physical cores that have CPUs allocated only to this Pod, then take the remaining CPUs in the sockets that have CPUs allocated to this Pod, then take the remaining CPUs in the NUMA nodes that have CPUs allocated to this Pod. (newly added step)
  9. Take the remaining CPUs.

For takeByTopologyNUMADistributed:
The allocated CPUs should be a subset of the chosen NUMA-node combination, and the number of CPUs allocated from each node should not exceed the per-node distribution count.

Special notes for your reviewer:
UncoreCaches need to be considered.

Does this PR introduce a user-facing change?

NONE

@esotsal force-pushed the esotsal/policy_static branch 6 times, most recently from cb0c2f9 to c3cdfa2, on November 13, 2024 16:30
@esotsal force-pushed the esotsal/policy_static branch from c3cdfa2 to dd29062 on November 14, 2024 09:22
@esotsal force-pushed the esotsal/policy_static branch from dd29062 to 407dde3 on November 19, 2024 16:38
@esotsal force-pushed the esotsal/policy_static branch 2 times, most recently from b9e6355 to 8c3741d, on November 25, 2024 02:31
@Chunxia202410 changed the title from [WIP] [FG:InPlacePodVerticalScaling] A proposal for CPU allocated strategy when Pod scale up and down to [FG:InPlacePodVerticalScaling] A proposal for CPU allocated strategy when Pod scale up and down on Dec 9, 2024
@esotsal commented Jan 28, 2025

Thanks @Chunxia202410 for the proposal. Can you please update the PR using kubernetes#129719 as the base? Thanks

@Chunxia202410 (Author)
Yes~

@Chunxia202410 (Author)
This PR is replaced by esotsal#3

@esotsal commented Feb 5, 2025

Thanks !
