Feature #6805: cpu-affinity: enhance CPU affinity logic with per-interface NUMA preferences - Suricata - Open Information Security Foundation

Actions

Copy link

Feature #6805

closed

cpu-affinity: enhance CPU affinity logic with per-interface NUMA preferences

Added by Lukas Sismis over 1 year ago. Updated 2 months ago.

Status:

Closed

Priority:

Normal

Assignee:

Lukas Sismis

Target version:

8.0.0-rc1

Effort:

Difficulty:

Label:

Description

This could help with deployments where CPU cores of 1 NUMA node are interleaved with CPU cores of the other NUMA (nodes) and there are NICs on every NUMA node.
In this scenario, the user might want to use CPU cores from both CPU NUMA nodes but control the NUMA assignment per-interface basis.

Supposed we have 2 NUMA nodes and 2 interfaces, we cannot assign CPU cores to the interfaces to be NUMA-friendly:
e.g.:
NUMA CPU1: 0,2,4,6,8
NUMA CPU2: 1,3,5,7,9

iface1 on NUMA1,
iface2 on NUMA2.

The desired assignment - cores 0,2,4,6,8 are assigned to iface1 and cores 1,3,5,7,9 are assigned to iface2.
Currently, the cores are merged together and are getting picked up in order by individual NICs, so iface1 gets cores 0,1,2,3,4 and iface2 gets cores 5,6,7,8,9.

This could be solved by more granular CPU assignment - e.g. CPU mask per interface or the CPU assigning logic could prefer CPU cores from NUMA nodes of the currently configured NIC.

Subtasks 1 (0 open — 1 closed)

Related issues 4 (3 open — 1 closed)

Actions

Copy link

Updated by Lukas Sismis about 1 year ago

Subtask #7036 added

Actions

Copy link

Updated by Lukas Sismis about 1 year ago · Edited

This feature will likely need to be capture mode-specific, so e.g. DPDK and af_packet.
Or it might be generic if there is a way to obtain NUMA-id of the NIC via generic calls.

For AF_PACKET you need to support autofp mode, in that case you need to consider receive-cpu-set,
otherwise you consider worker-cpu-set. This doesn't make much sense either because then receive-cpu-set would transfer data to worker-cpu-set

Can management threads be pinned to a specific NUMA Node and only work with that, primarily?
- Not at the current moment.
Is memory allocated on both NUMA nodes for Suricata structures?
- Packetpool should be allocated by the worker, Flow memory by the main thread.
The true goal would be being NUMA-local with all memory allocations.

In DPDK it can be relatively easy to pin workers to a specific NUMA node,
- you take the first CPU core from the NUMA node from where the NIC is.
- memory for the packet mempool is allocated on the same NUMA node, because it is allocated in the initialization phase of the workers.
- Suricata packetpool should be allocated by worker and by the kernel call, hopefully on the same NUMA as where CPU is.
- it is unfortunate with the flow table and other structures etc., it will likely be allocated only on one NUMA node,

Flow table could be allocated for each interface to be NUMA-local,
- requires likely a lot of changes
- might not be so useful in the end in the larger deployments
- it is not possible to allocate memory for the flow table on a specific NUMA node.

Implementing it in a generic way - could use -
RunModeSetLiveCaptureWorkersForDevice - TmThreadCreatePacketHandler - TmThreadCreate( - TmThreadSetSlots - TmThreadsSlotVar - TmThreadSetupOptions - AffinityGetNextCPU

Design idea:
From RunModeSetLiveCaptureWorkersForDevice propagate device name to AffinityGetNextCPU or assign it to thread vars structure so it can be later queried in the AffinityGetNextCPU for the NUMA ID. In this function, individual CPUs should also be queried for the NUMA locality.

Actions

Copy link

Updated by Victor Julien about 1 year ago

Related to Task #3318: Research: NUMA awareness added

Actions

Copy link

Updated by Victor Julien about 1 year ago

I think note 2 is mostly off topic here. It should probably be added to #3318 or a related ticket. Lets focus this ticket on how to express the NIC/NUMA/cores in our yaml.

Actions

Copy link

Updated by Lukas Sismis about 1 year ago

Status changed from New to In Progress

Actions

Copy link

Updated by Victor Julien about 1 year ago

Related to Bug #7137: "invalid cpu range" when trying to use CPU affinity added

Actions

Copy link

Updated by Lukas Sismis 11 months ago

Related to Task #3695: research: libhwloc for better autoconfiguration added

Actions

Copy link

Updated by Victor Julien 9 months ago

Related to Task #7336: Suricon 2024 brainstorm added

Actions

Copy link

Updated by Lukas Sismis 6 months ago

Status changed from In Progress to In Review

Actions

Copy link

#10

Updated by Victor Julien 4 months ago

Target version changed from 8.0.0-beta1 to 8.0.0-rc1

Actions

Copy link

#11

Updated by Lukas Sismis 3 months ago

To mention other suggestions on how to define CPU affinity - e.g. in interface nodes - it would contain both flow: + management-cpu-set and receive/worker-cpu-set settings. This would only make sense if flow tables are allocated per interface.

That way, you can define managers and recycler and all threads per interface, and then you can define whole affinity in interface nodes

Actions

Copy link

#12

Updated by Lukas Sismis 2 months ago

Status changed from In Review to Closed

Addressed in https://github.com/OISF/suricata/pull/13387

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

Suricata

Custom queries

Feature #6805

cpu-affinity: enhance CPU affinity logic with per-interface NUMA preferences

Updated by Lukas Sismis about 1 year ago

Updated by Lukas Sismis about 1 year ago · Edited

Updated by Victor Julien about 1 year ago

Updated by Victor Julien about 1 year ago

Updated by Lukas Sismis about 1 year ago

Updated by Victor Julien about 1 year ago

Updated by Lukas Sismis 11 months ago

Updated by Victor Julien 9 months ago

Updated by Lukas Sismis 6 months ago

Updated by Victor Julien 4 months ago

Updated by Lukas Sismis 3 months ago

Updated by Lukas Sismis 2 months ago

Related to Suricata - Task #3318: Research: NUMA awareness	New	OISF Dev	Actions
Related to Suricata - Bug #7137: "invalid cpu range" when trying to use CPU affinity	Feedback	OISF Dev	Actions
Related to Suricata - Task #3695: research: libhwloc for better autoconfiguration	Closed	Lukas Sismis	Actions
Related to Suricata - Task #7336: Suricon 2024 brainstorm	New	Victor Julien	Actions