Skip to content

better AMD support #1381

@titansonico10

Description

@titansonico10

Summary

The current requirements-rocm.txt uses torch==2.9.1+rocm6.3 which only has
wheels for manylinux_2_28_x86_64 (Linux). On Windows with an RDNA4 GPU (gfx1200,
e.g. RX 9070 XT), installation fails completely because no win_amd64 wheel exists
for that ROCm version.

Environment

  • GPU: AMD RDNA4 (gfx1200 architecture, e.g. RX 9070 / 9070 XT)
  • OS: Windows 11
  • ROCm build detected by ComfyUI: rocm 7.13.0a20260318 (TheRock nightly)
  • Install method: Stability Matrix (uses uv instead of pip / install.bat)

Error

× No solution found when resolving dependencies:
╰─▶ Because torch==2.9.1+rocm6.3 has no wheels with a matching platform
    tag (e.g., `win_amd64`) and you require torch==2.9.1+rocm6.3, we can
    conclude that your requirements are unsatisfiable.

hint: Wheels are available for `torch` (v2.9.1+rocm6.3) on the following
platform: `manylinux_2_28_x86_64`

Root Cause

AMD does not publish ROCm 6.x wheels for Windows. Windows ROCm support for
RDNA4 (gfx1200) is only available through TheRock nightly builds, hosted at:

https://therock-nightly-python.s3.us-east-2.amazonaws.com/gfx120X-all/index.html

Additionally, onnxruntime with ROCm EP has been removed since ORT 1.23
ROCm 7.0 was the last supported version. On ROCm 7.x only the CPU runtime is
available via standard pip.

Suggested fix for requirements-rocm.txt

# AMD ROCm - TheRock Nightly (gfx1200 / RDNA4 - Windows)
# https://github.com/ROCm/TheRock

--find-links https://therock-nightly-python.s3.us-east-2.amazonaws.com/gfx120X-all/index.html
--pre
torch
torchvision
torchaudio

# ROCm EP removed in ORT 1.23+, last supported on ROCm 7.0
onnxruntime

# optimizers
# TODO

Notes

  • For other RDNA generations, the bucket URL changes:
    • RDNA2 (gfx103X): …/gfx103X-all/index.html
    • RDNA3 (gfx110X): …/gfx110X-all/index.html
    • RDNA4 (gfx120X): …/gfx120X-all/index.html
  • When installing via Stability Matrix, the .bat files are bypassed entirely.
    SM uses uv and reads requirements-rocm.txt directly, so the fix must be
    applied to that file before reinstalling.
  • It would be ideal if OneTrainer could auto-detect the gfx architecture and
    select the correct TheRock index URL automatically.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions