We are excited to announce a new open-source release of our GIM SR-IOV Driver.
Along with regular bug fixes and improvements, this release expands KVM SR-IOV virtualization support for MI325X on RHEL 10 and enables ROCm profiler tool support within KVM SR-IOV virtualized (Guest) GPU environments.
Hardware / Features Supported for this Release
| Hardware | Host OS | Guest OS | ROCm version | Number of VFs per GPU | Infinity Fabric (XGMI) Configuration |
|---|---|---|---|---|---|
| AMD Instinct MI355X | Ubuntu 24.04 Kernel 6.14 | RHEL 9.6, RHEL 10.0, Ubuntu 24.04 | ROCm 7.2.1 | 1 | between 8 GPUs |
| AMD Instinct MI350X | Ubuntu 24.04 Kernel 6.14 | RHEL 9.6, RHEL 10.0, Ubuntu 24.04 | ROCm 7.2.1 | 1 | between 8 GPUs |
| AMD Instinct MI325X | Ubuntu 22.04 Kernel 6.8 | Ubuntu 22.04 | ROCm 7.2.1 | 1 | between 8 GPUs |
| AMD Instinct MI325X | RHEL 10.1 Kernel 6.12.0-124 | RHEL 10.1 | ROCm 7.2.1 | 1 | between 8 GPUs |
| AMD Instinct MI300X | Ubuntu 22.04 Kernel 6.8 | Ubuntu 22.04 | ROCm 7.2.1 | 1 or 8* | between 8 GPUs |
| AMD Instinct MI300X | RHEL 9.4 Kernel 5.14.0-427 | RHEL 9.4, Ubuntu 24.04 | ROCm 7.2.1 | 1 or 8* | between 8 GPUs |
| AMD Instinct MI210X | RHEL 9.4 Kernel 5.14.0-427 | RHEL 9.4, Ubuntu 22.04 | ROCm 7.2.1 | 1 | between 4/8 GPUs |
| AMD Radeon PRO V710 | Ubuntu 24.04 Kernel 6.14 | Ubuntu 24.04 | ROCm 7.2.1 | 1 or 4 or 12 | N/A |
Partition Mode Support
| Number of VFs per GPU | Dynamic Compute Partitioning | NPS1 | NPS2 | NPS4 |
|---|---|---|---|---|
| 1 | SPX (Default) | MI300X, MI325X, MI35XX | | |
| 1 | DPX | MI35XX | | |
| 1 | CPX | MI35XX (Preview) | | |
| 8* | CPX | MI300X (Preview) | | |
For the full list of supported partition modes and the relationship between partition mode and number of VFs per GPU, see the GPU Partitioning documentation.
* Multi-VF (8 VF) support requires a compatible firmware BKC bundle and ROCm version, which will be released in the coming months.
Please refer to the ROCm Firmware Compatibility Table for the list of required firmware versions and supported configurations.
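As a rough illustration, the Partition Mode Support table above can be encoded as a small lookup for pre-flight checks in deployment scripts. This is only a sketch: the names `SUPPORTED` and `partition_status` are hypothetical and not part of GIM, ROCm, or AMD SMI, and the data is transcribed from the table in these notes (including the `MI35XX` label exactly as written there). It is not a substitute for the ROCm Firmware Compatibility Table.

```python
# Hypothetical helper: rows transcribed from the Partition Mode Support table
# in these release notes. Every listed combination is in the NPS1 column.
# Key: (VFs per GPU, compute partition mode) -> {GPU label: status}
SUPPORTED = {
    (1, "SPX"): {"MI300X": "supported", "MI325X": "supported", "MI35XX": "supported"},
    (1, "DPX"): {"MI35XX": "supported"},
    (1, "CPX"): {"MI35XX": "preview"},
    # 8 VF additionally depends on a firmware BKC bundle and ROCm version
    # that these notes describe as not yet released.
    (8, "CPX"): {"MI300X": "preview"},
}


def partition_status(gpu, vfs_per_gpu, compute_mode, memory_mode="NPS1"):
    """Return 'supported', 'preview', or None if the combination is not listed."""
    if memory_mode.upper() != "NPS1":
        # The table lists no entries under NPS2 or NPS4.
        return None
    row = SUPPORTED.get((vfs_per_gpu, compute_mode.upper()), {})
    return row.get(gpu)


if __name__ == "__main__":
    print(partition_status("MI300X", 8, "CPX"))
    print(partition_status("MI325X", 1, "DPX"))
```

A result of `None` only means the combination does not appear in the table above; the GPU Partitioning documentation remains the authoritative list.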
Host AMD SMI Versions
- Library: 51.0.0
- Tool: 34.1.1
Please check out our SMI documentation for more details.
Getting Started
Please check out our User Guide for instructions on how to set up GIM and example configurations to run SR-IOV enabled VMs.
Known Issues
- MI300: Multi-VF configuration for MI300X requires a compatible firmware BKC bundle and ROCm version, which will be released in the coming months. Please refer to the ROCm compatibility matrix for supported versions.
- MI210: Unloading the Guest driver on MI210 may produce a kernel call trace. Note that this is only a warning, not a kernel panic. A Guest VM reboot is required to restore operation.
- Navi32: The AMD SMI CLI --gpureset command on Navi32 resets only GPU 0, even though its output reports that all GPUs were reset.