Reference Stacks

Reference Stacks#

AMD Enterprise AI Reference Stack#

The AMD enterprise AI reference stack is an opinionated software stack for developing, deploying, and running AI workloads on a Kubernetes platform. The platform can be utilized by system administrators, resource managers, AI researchers, and AI solution developers.

Reference Stack

The AMD enterprise AI reference stack brings together:

AMD Inference Microservices (AIMs): Portable inference microservices for serving models on AMD GPUs. AIMs provide a mechanism for automatic selection of optimized runtime parameters based on user input, hardware, and model specifications.
AIM Engine: A Kubernetes operator for deploying and managing inference workloads.
AMD AI Workbench: An easy-to-use graphical interface for running and managing AI workloads, including features such as one-click deployment of AIMs from the AIM Catalog and fine-tuning capabilities.
AMD Resource Manager: Provides administrators with tools to manage clusters, monitor resources, and maintain team access to computational resources.

Together, these components enable capabilities such as:

High GPU efficiency: Intelligent workload placement and dynamic resource sharing
Unified AI infrastructure: Standardized governance, tools, and processes enabling seamless collaboration across teams
Accelerated time to value: Built-in inference and streamlined development workflows
AI-native orchestration: Intelligent job prioritization, dynamic allocation, and consistent performance
Information security: Built-in RBAC with the possibility to federate and integrate with existing SSO and IAM solutions.

The reference stack supports bare-metal, on-premises, and public cloud deployments of AI workloads. It has been validated on:

AMD Instinct™: MI300X, MI325X, MI350X and MI355X GPUs
AMD Radeon™ Pro GPUs: W7900 and R9700
AMD EPYC™ CPUs: EPYC 9965; Zen4/Zen5

To get started, view the resources below.

Resource	Link
GitHub	amd-enterprise-ai/amd-eai-apps
Getting started (on-premises)	On-premises Installation
Getting started (DigitalOcean)	DigitalOcean Cloud Installation
Getting started (Workbench)	Quick start guide
Getting started (Resource Manager)	Quick start guide

Reference Stacks

Contents

Reference Stacks#

AMD Enterprise AI Reference Stack#