Helm Chart Values
Reference for all configurable values in the AIM Engine Helm chart.
Controller Manager
Controller manager configuration

| Parameter | Description | Default |
|---|---|---|
| | Number of operator replicas | |
| | Operator container image repository | |
| | Operator container image tag | |
| | Image pull policy | |
| | Secrets for pulling the operator image from private registries | |
| | Controller command-line arguments | |
| | Additional environment variables for the controller | |
| | Require non-root user | |
| | Seccomp profile type | |
| | Prevent privilege escalation | |
| | Dropped Linux capabilities | |
| | Read-only root filesystem | |
| | Memory limit | |
| | CPU request | |
| | Memory request | |

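The security and resource rows above correspond to standard Kubernetes container fields. As a rough sketch of what such settings look like once rendered into a container spec (the values are illustrative assumptions, not the chart's documented defaults, and the chart's own value keys are not shown here):

```yaml
# Container-level settings using standard Kubernetes field names.
# All values below are assumptions chosen for illustration only.
securityContext:
  runAsNonRoot: true                # require non-root user
  seccompProfile:
    type: RuntimeDefault            # seccomp profile type
  allowPrivilegeEscalation: false   # prevent privilege escalation
  capabilities:
    drop:
      - ALL                         # dropped Linux capabilities
  readOnlyRootFilesystem: true      # read-only root filesystem
resources:
  limits:
    memory: 128Mi                   # memory limit (assumed value)
  requests:
    cpu: 10m                        # CPU request (assumed value)
    memory: 64Mi                    # memory request (assumed value)
```

Leaving the CPU limit unset while setting a memory limit, as the table suggests, avoids CPU throttling of the controller while still bounding its memory use.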
RBAC Helpers
Create admin/editor/viewer ClusterRoles for each CRD

| Parameter | Description | Default |
|---|---|---|
| | Enable RBAC helper roles | |

CRDs
Custom Resource Definitions

| Parameter | Description | Default |
|---|---|---|
| | Install CRDs with the chart | |
| | Keep CRDs when uninstalling (prevents data loss) | |

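Helm's usual mechanism for keeping a resource across `helm uninstall` is the `helm.sh/resource-policy: keep` annotation. Whether this chart implements its keep option that way is an assumption; the fragment below only sketches what a retained CRD's metadata would typically carry:

```yaml
# Metadata fragment of a CRD that Helm should leave in place on uninstall.
# Assumption: the chart applies this annotation when its keep option is enabled.
metadata:
  annotations:
    helm.sh/resource-policy: keep
```

Retaining the CRDs matters because deleting a CRD also deletes every custom resource of that kind in the cluster.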
Metrics
Controller metrics endpoint

| Parameter | Description | Default |
|---|---|---|
| | Enable metrics endpoint | |
| | Metrics endpoint port | |

Cert-Manager
Cert-manager integration for TLS certificates

| Parameter | Description | Default |
|---|---|---|
| | Enable cert-manager integration | |

Prometheus
Prometheus ServiceMonitor for metrics scraping

| Parameter | Description | Default |
|---|---|---|
| | Create a Prometheus ServiceMonitor resource | |

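For orientation, a ServiceMonitor is a Prometheus Operator resource that tells Prometheus which Services to scrape. A minimal sketch of the kind of object this option would create is shown below; the resource name, selector label, and port name are hypothetical and not taken from the chart:

```yaml
# Minimal ServiceMonitor (Prometheus Operator CRD, monitoring.coreos.com/v1).
# The name, label, and port name are hypothetical placeholders.
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: aim-engine-controller-metrics      # hypothetical name
spec:
  selector:
    matchLabels:
      control-plane: controller-manager    # hypothetical Service label
  endpoints:
    - port: metrics                        # hypothetical Service port name
      path: /metrics
```

A ServiceMonitor only takes effect when the Prometheus Operator CRDs are installed and the Prometheus instance is configured to select it.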
Cluster Runtime Configuration
Cluster-wide runtime configuration for AIM resources. Creates an AIMClusterRuntimeConfig CR when enabled.

| Parameter | Description | Default |
|---|---|---|
| | Enable creation of the AIMClusterRuntimeConfig resource | |
| | Name of the AIMClusterRuntimeConfig resource | |

acceleratorDetector
AcceleratorDetector DaemonSets for hardware detection via Node Feature Discovery (NFD). They detect GPU and CPU accelerators on cluster nodes and write NFD feature files so that AIM profiles can target specific hardware. NFD must be installed on the cluster.

| Parameter | Description | Default |
|---|---|---|
| | Enable the AcceleratorDetector DaemonSets | |
| | Seconds between re-detection cycles | |
| | GPU node detection (uses the aim-base image with ROCm/amdsmi). Detects AMD Instinct GPUs and writes NFD labels such as feature.node.kubernetes.io/aim-accelerator.MI300X=8. Scheduled only on nodes with feature.node.kubernetes.io/amd-gpu=true (set by the AMD GPU Operator NFD rule). | |
| | Enable GPU accelerator detection DaemonSet | |
| | GPU detector image repository (aim-base) | |
| | GPU detector image tag | |
| | Image pull policy | |
| | Secrets for pulling the GPU detector image from private registries | |
| | Node selector to target GPU nodes (requires AMD GPU Operator NFD rule) | |
| | Tolerations for GPU nodes (defaults to tolerate all taints) | |
| | Resource limits and requests for GPU detector pods | |
| | CPU node detection (uses the lighter aim-epyc-base image, without ROCm). Detects AMD EPYC CPUs and writes NFD labels such as feature.node.kubernetes.io/aim-accelerator.EPYC_9965=128. Scheduled only on nodes without the feature.node.kubernetes.io/amd-gpu label (i.e. CPU-only nodes). | |
| | Enable CPU accelerator detection DaemonSet | |
| | CPU detector image repository (aim-epyc-base) | |
| | CPU detector image tag | |
| | Image pull policy | |
| | Secrets for pulling the CPU detector image from private registries | |
| | Node selector for CPU-only nodes (no additional selector needed; the DaemonSet uses nodeAffinity DoesNotExist on the amd-gpu label) | |
| | Tolerations for CPU detector pods (defaults to tolerate all taints) | |
| | Resource limits and requests for CPU detector pods | |
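
The scheduling rules described in the table can be sketched with standard Kubernetes pod-spec fields. The fragments below are illustrative and not copied from the chart templates; the label keys come from the descriptions above, while the final workload fragment is a hypothetical consumer of the detected labels:

```yaml
# GPU detector: schedule only on nodes labeled by the AMD GPU Operator NFD rule.
nodeSelector:
  feature.node.kubernetes.io/amd-gpu: "true"
tolerations:
  - operator: Exists   # an empty key with Exists tolerates every taint
---
# CPU detector: schedule only on nodes that lack the amd-gpu label.
affinity:
  nodeAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      nodeSelectorTerms:
        - matchExpressions:
            - key: feature.node.kubernetes.io/amd-gpu
              operator: DoesNotExist
tolerations:
  - operator: Exists   # tolerate all taints
---
# Hypothetical workload pinned to nodes that report eight MI300X GPUs.
nodeSelector:
  feature.node.kubernetes.io/aim-accelerator.MI300X: "8"
```

Because the GPU detector selects on the presence of the amd-gpu label and the CPU detector on its absence, the two DaemonSets never land on the same node.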