AMD Publishes Open-Source Driver for GPU Virtualization, Radeon "In the Roadmap"

latchkey 5 hours ago

Hi, this is actually pretty important for my business. I've been waiting for the driver to be released for a year now. We got the binary a few months ago under NDA, but open sourcing it, is next level for us.

What I wrote about this on twitter: https://x.com/HotAisle/status/1914549886185611627

Reply View 0 replies

throwaway48476 12 hours ago

If AMD does deliver on client dGPU virtualization it would be amazing.

Reply View 2 replies

transpute 11 hours ago

Some old AMD workstation GPUs supported SR-IOV, that repo was just archived.
https://open-iov.org/index.php/GPU_Support#AMD
https://github.com/GPUOpen-LibrariesAndSDKs/MxGPU-Virtualiza...
As "AI" use cases mature, NPU/AIE-ML virtualization will also be needed.

Reply View | 1 reply
- mtillman 8 hours ago
  
  The pro vii (a wonderfully spec’d card for my purposes) had a bunch of awesome features-even remote access-but then they pulled the marketing pages from their website and stopped shipping the link bridge (non mpx) to water down the value of the card. I have no idea how AMD works.
  
  Reply View | 0 replies

AbuAssar 12 hours ago

related: (AMD 2.0 – New Sense of Urgency)

https://news.ycombinator.com/item?id=43780972

Reply View 2 replies

bryanlarsen 7 hours ago

Wow, that's a very substantive article.

Reply View | 0 replies
okucu 6 hours ago

>Dylan Patel
Yeah, not gonna read that.

Reply View | 0 replies

seanhunter 13 hours ago

It blows my mind how reliably AMD shoots itself in the foot. What we want isn’t that hard:

1) Support your graphics cards on linux using kernel drivers that you upstream. All of them. Not just a handful - all the ones you sell from say 18 months ago till today.

2) Make GPU acceleration actually work out of the box for pytorch and tensorflow. Not some special fork, patched version that you “maintain” on your website, the tip of the main branch for both of those libraries should just compile out of the box and give people gpu-accelerated ML.

This is table stakes but it blows my mind that they keep making press releases and promises like this that things are on the roadmap without doing thing one and unfucking the basic dev experience so people can actually use their GPUs for real work.

How it actually is: 1) Some cards work with rocm, some cards work with one of the other variations of BS libraries they have come up with over the years. Some cards work with amdgpu but many only work with proprietary kernel drivers which means if you don’t use precisely one of the distributions and kernel versions that they maintain you are sool.

2) Nothing whatsoever builds out of the box and when you get it to build almost nothing runs gpu accelerated. For me, pytorch requires a special downgrade, a python downgrade and a switch to a fork that AMD supposedly maintain although it doesn’t compile for me and when I managed to beat it into a shape where it compiled it wouldn’t run GPU accelerated even though games use the GPU just fine. I have a GPU that is supposedly current, so they are actively selling it, but can I use it? Can I bollocks. Ollama won’t talk to my GPU even though it supposedly works with ROCm. It only works with ROCm with some graphics cards. Tensorflow similar story when I last tried it although admittedly I didn’t try as hard as pytorch.

Just make your shit work so that people can use it. It really shouldn’t be that hard. The dev experience with NVidia is a million times better.

Reply View 39 replies

creata 6 hours ago

It doesn't diminish most of your points, but getting PyTorch to work on Arch Linux is as easy as installing the `python-pytorch-opt-rocm` package. Similar with Ollama: `ollama-rocm`. So if you just want to use PyTorch, and don't need the very latest version, I wouldn't say the dev experience with Nvidia is much better.

Reply View | 1 reply
- mindcrime 5 hours ago
  
  > Similar with Ollama: `ollama-rocm`.
  Same experience here. Installing ROCm and Ollama on my box were both dead simple and everything worked right out of the box. Using an RX 7900 XTX card, FWIW.
  
  Reply View | 0 replies
bavell 11 hours ago

Sucks that you've had so much trouble... My experience with my cheap 6750XT is that it just works OOTB on Arch with rocm, llama.cpp, ollama, whisper, etc by setting an envvar.

Reply View | 3 replies
- A4ET8a8uTh0_v2 9 hours ago
  
  Did you do any writeup/post on your experience with setting it up? I think it would have some audience ( apart from me that is ).
  
  Reply View | 1 reply
  
  dharmab 42 minutes ago
  
  It never occurred to me because it was such an easy process kn Arch.
  https://wiki.archlinux.org/title/Ollama 3 commands to install and run an LLM.
  
  Reply View | 0 replies
- rbjorklin 6 hours ago
  
  This has been my experience with a 6950XT as well using Fedora.
  
  Reply View | 0 replies
nimish 8 hours ago

AMD has never understood that software is important. It's not culturally baked into them.

Reply View | 1 reply
- latchkey 5 hours ago
  
  You aren't wrong, but the important question is: "If they decided to change, could they actually do it?"
  
  Reply View | 0 replies
pjmlp 7 hours ago

Meanwhile NVidia has embraced Python as first class programming language on CUDA, with the new cuTile format, as companion to PTX.
And given the tooling they are adding full steam ahead for Python, I even wonder if Mojo will manage to get enough mindshare, let alone what AMD and Intel are not doing.

Reply View | 0 replies
bayindirh 13 hours ago

AMD has two driver teams at this point. One of Linux/Open Source, one for Catalyst/Closed source, and they are not allowed to interact.
Because, there are tons of IP and trade secrets involved in driver development and optimization. Sometimes game related, sometimes for patching a rogue application which developers can't or don't fix, etc. etc.
GPU drivers are ought to be easy, but in reality, they are not. The open source drivers are "vanilla" drivers without all these case-dependent patching and optimization. Actually, they really work well out of the box for normal desktop applications. I don't think there are any cards which do (or will) not work with the open source kernel drivers as long as you use a sufficiently recent version.
...and you mention ROCm.
I'm not sure how ROCm's intellectual underpinnings are but, claiming lack of effort is a bit unfair to AMD. Yes, software was never their strong suit, but they're way better when compared to 20 years earlier. They have a proper open source driver which works, and a whole fleet of open source ROCm packages, which is very rigorously CI/CD tested by their maintainers now.
Do not forget that some of the world's most powerful supercomputers run on Instinct cards, and AMD is getting tons of experience from these big players. If you think the underpinnings of GPGPU libraries are easy, I can only say that the reality is very different. The simple things people do with PyTorch and other very high level libraries pull enormous tricks under the hood, and you're really pushing the boundaries of the hardware performance and capability-wise.
NVIDIA is not selling a tray full of switches and GPUs and require OEMs to integrate it as-is for no reason. On the other hand, the same NVIDIA acts very slowly to enable an open source ecosystem.
So, yes, AMD is not in an ideal position right now, but calling them incompetent doesn't help either.
P.S.: The company which fought for a completely open source HDMI 2.1 capable display driver is AMD, not NVIDIA.

Reply View | 21 replies
- spockz 12 hours ago
  
  I accept that there are two teams for reasons that include IP. However, Nvidia must have the same problem and they appear not to be hamstrung by it. So what is the difference?
  
  Reply View | 3 replies
  
  bayindirh 12 hours ago
  
  NVIDIA and AMD, from my experience, have completely different cultures.
  ATI started as a much more closed company. Then, they pivoted and started to open their parts. They were hamstrung at the HDCP at one point, and they decided to decouple HDCP block from video accelerators at silicon level to allow open source drivers to access video hardware without leaking/disabling HDCP support. So, they devoted to open what they have, but when you have tons of legacy IP, things doesn't go from 0 to 100 in a day. I want to remind that "game dependent driver optimization" started pre 2000s. This is how rooted these codebases are.
  NVIDIA took a different approach, which was being indifferent on the surface, but their hardware became a bit more hostile at every turn towards nouveau. Then they released some specs to allow "hardware enablement" by nouveau, so closed source drivers can be installed, and the card didn't blank-screened at boot.
  Then, as they fought with kernel developers, with some hard prodding by Kernel guys and some coercing by RedHat, NVIDIA accepted to completely remove "shim" shenanigans, and moved closed bits of the kernel module to card firmware by revising card architecture. It's important to keep in mind that NVIDIA's open drivers means "an open, bare bones kernel module, a full fledged and signed proprietary firmware which can be used by closed source drivers only and a closed source GLX stack", where in AMD this means "An open source kernel module, standard MESA libraries, and a closed source firmware available to all drivers".
  It was in talks with nouveau guys to allow them to use the full powered firmware with clock and power management support, but I don't know where it went.
  The CUDA environment is also the same. Yes it works very well, and it's a vast garden, but it's walled and protected by electrified fence and turrets. You're all in, or all out.
  
  Reply View | 1 reply
  
  markus_zhang 11 hours ago
  
  I'm wondering how much effort went into RE Nvidia cards and drivers. Graphics card drivers are completely a mythical beast to me, and I guess it's one of the most complicated drivers in the hardware world.
  
  Reply View | 0 replies
  
  mariusor 10 hours ago
  
  > Nvidia must have the same problem and they appear not to be hamstrung by it
  Probably because they don't have an open-source driver for linux and they can focus on the proprietary one.
  
  Reply View | 0 replies
- onli 11 hours ago
  
  Fact of the matter is that I have a Radeon RX 6600, which I can't use with ollama. First, there is no ROCm at all in my distros repository - it doesn't compile reliably and needs too many ressources. Then, when compiling it manually, it turns out that ROCm doesn't even support the card in the first place.
  I'm aware that 8GB Vram are not enough for most such workloads. But no support at all? That's ridiculous. Let me use the card and fall back to system memory for all I care.
  Nvidia, as much as I hate their usually awfully insufficient linux support, has no such restrictions for any of their modern cards, as far as I'm aware.
  
  Reply View | 7 replies
  
  JonChesterfield 2 hours ago
  
  I know it's in Debian (and thus Ubuntu), Arch, Gentoo. Pretty sure RedHat, Suse, Nix have it. What distro are you using?
  ROCm is a train wreck to compile from source but can be done with sufficient bloodymindedness.
  The RX6600 is a gfx1032. I used a gfx1010 for ages with this stuff. Seems likely it'll run for you if you ignore the "supported cards" list, which really should be renamed to something that antagonises people less.
  
  Reply View | 1 reply
  
  onli an hour ago
  
  I'm using void. https://github.com/void-linux/void-packages/issues/26415 gives an insight, though it doesn't explain the whole problem, if I remember correctly what maintainers wrote elsewhere.
  > ROCm is a train wreck to compile from source but can be done with sufficient bloodymindedness.
  Yeah, I did that myself. Not impossible, just a bit annoying and time consuming. The issue I ran into then was exactly picking the gpu model (incredible that this is even necessary) and not having the gfx1032 available, see https://github.com/void-linux/void-packages/issues/26415#iss... for what I was following back then. I tried to edit the configuration for the gfx1032 anyway, but it did not succeed.
  Side note: Already having to know which card corresponds to which code is annoying, and completely unnecessary. They could also just map the consumer facing name. But that would be too easy I assume.
  
  Reply View | 0 replies
  
  pja 6 hours ago
  
  My recent experience has been that the Vulkan support in llama.cpp is pretty good. It may lag behind Cuda / Metal for the bleeding edge models if they need a new operator.
  Try it out! Benchmarks here: https://github.com/ggml-org/llama.cpp/discussions/10879
  (ollama doesn’t support vulkan for some weird reason. I guess they never pulled the code from llama.cpp)
  
  Reply View | 1 reply
  
  onli 6 hours ago
  
  Thanks, I might indeed give this a test!
  
  Reply View | 0 replies
  
  smerrill 10 hours ago
  
  You should be able to use ollama’s Vulkan backend and in my experience the speed will be the same. (I just spent a bunch of time putting Linux on my 2025 ASUS ROG Flow Z13 to use ROCm, only to see the exact same performance as Vulkan.)
  
  Reply View | 1 reply
  
  onli 7 hours ago
  
  That would mean switching to https://github.com/whyvl/ollama-vulkan? I see no backend selection in ollama, nor anything in the faq.
  
  Reply View | 0 replies
  
  yjftsjthsd-h 7 hours ago
  
  > I'm aware that 8GB Vram are not enough for most such workloads. But no support at all? That's ridiculous. Let me use the card and fall back to system memory for all I care.
  > Nvidia, as much as I hate their usually awfully insufficient linux support, has no such restrictions for any of their modern cards, as far as I'm aware.
  In fact, I regularly run llamafile (and sometimes ollama) on an nvidia dGPU in a laptop, with 4GB of VRAM, and it works fine (ish... I mostly do the thing where some layers are on the GPU and some are CPU; it's still faster than pure CPU so whatever).
  
  Reply View | 0 replies
- rwmj 12 hours ago
  
  A laundry list of excuses ... or a list of things to work on. ("Why the hell do we have two driver teams?" - would be my #1 thing to fix if I was at AMD.)
  
  Reply View | 8 replies
  
  jeroenhd 9 hours ago
  
  The "fix" would be to make games perform like shit on Windows and disable HDR and other proprietary features, or to abolish the open Linux drivers. You can't have both, unless you do what Nvidia does and move all of the proprietary stuff to the GPU firmware and write a minimal driver to control that massive firmware blob. Which, obviously, would require reengineering the GPU hardware, which is expensive and of questionable value.
  They can't open source their proprietary drivers even if they wanted to because they don't own all of the IP and their code is full of NDA'd trade secrets. AMD isn't paying two different teams to do the same work because they like wasting money.
  
  Reply View | 3 replies
  
  bayindirh 12 hours ago
  
  I guess that you don't understand that how silicon and 3rd party IP works. It took Intel a completely new GPU from scratch to be able to open drivers. AMD did at least one revision to their silicon to enable that kind of openness.
  Yet, HDMI forum said that they can't implement an HDMI2.1 capable driver in the open, with some nasty legal letters.
  I have a couple of friends who wrote 3D engines from scratch and debugged graphics drivers for their engines for a living. It's a completely different jungle filled with completely different beasts.
  I think being able to call glxinfo on an AMD card running with completely open drivers and being able to see extensions from NVIDIA, AMD, SGI, IBM and others is a big win already.
  
  Reply View | 3 replies
pshirshov 10 hours ago

> Support your graphics cards on linux using kernel drivers that you upstream. All of them. Not just a handful - all the ones you sell from say 18 months ago till today.
All the stuff works even if it's not officially supported. It's not that hard to set a single environment variable (HSA_OVERRIDE_GFX_VERSION).
Like literally, everything works, from Vega 56/64 to ryzen 99xx iGPUs.
Also, try nixos. Everything literally works with a single config entry after recent merge of rocm 6.3. I successfully run a zoo of various radeons of different generations.

Reply View | 4 replies
- seanhunter 9 hours ago
  
  I’m using Nix and setting HSA_OVERRIDE_GFX_VERSION. It’s not working on my GPU (Radeon RX7600)
  
  Reply View | 1 reply
  
  JonChesterfield 2 hours ago
  
  This https://www.techpowerup.com/gpu-specs/radeon-rx-7600.c4153 says that's a gfx1102. I'd expect that to work out of the box. Linux kernel version vaguely near one of the nominally supported ones?
  
  Reply View | 0 replies
- iforgotpassword 7 hours ago
  
  But why do you even have to do this fucking bullshit that you randomly stumble upon, while googling error message after error message, ending up in random github repos and issues?
  And no, just because the three random cards you have work doesn't mean "everything works". Just tried an MI300A a few months ago... I just wanted to test ollama as this is one of the hottest applications for GPU acceleration now, it will surely be well supported right? First, the gfx version listed for it in the ollama docs is wrong - but OK, figured it out. Then Tried some random models with it, the only output it ever generates is GGGGGGGGGGGGG. Apparently only fp16 models work, nothing more quantized. So I pick one explicitly. Then it's slower than running on the cpu in the same system.
  Thanks but no thanks; this cost me two days when Nvidia just works first try.
  
  Reply View | 1 reply
  
  pshirshov an hour ago
  
  > But why do you even have to do this fucking bullshit
  Because it's like 2-4 times cheaper than to go nvidia?..
  > the three random cards you have
  It's more than 3 random cards. I run 6900 XT, 7900 XTX, W7900 Pro, VII, VII Pro, Vega 56, Vega 64, 6800 XT, 5700 XT plus I've experimented with a 9950 iGPU, a 5xxx series iGPU and the only thing which didn't work was 3400g iGPU.
  > Apparently only fp16 models work
  fp8 works for me
  
  Reply View | 0 replies
faust201 13 hours ago

IIRC there was only one AMD employee that was working to integrate linux based things. Often, the response was - things are stuck in Intellectual property, or project managers etc. So even specs were not available.

Reply View | 0 replies
jmward01 5 hours ago

Adding to the list above. Please, PLEASE, give me a single place I can look to see what pytorch will actually support. I would probably buy something if I could get a straight answer on what will/won't actually work.

Reply View | 0 replies
logicchains 13 hours ago

SemiAnalysis had a good article on this recently, basically the reason AMD still sucks on the ML software side is that their compensation for devs is significantly worse than competitors like NVidia, Google and OpenAI, so most of the most competent devs go elsewhere.

Reply View | 0 replies

janpmz 14 hours ago

This article is almost unreadable for me. The ads change in size and make the text jump. I'm adding it to NotebookLM now.

Reply View 2 replies

DistractionRect 6 hours ago

Seems like using a sledge hammer to pound a nail. Why not just use an ad blocker like ublock origin?

Reply View | 0 replies
Mountain_Skies 13 hours ago

The article is extremely light on details anyway. The most important thing in it is the link to the repo at https://github.com/amd/MxGPU-Virtualization

Reply View | 0 replies

404human 7 hours ago

Anyone else find it funny that AMD releases new GPU features while people still can't get basic ML stuff working? It's like building a fancy garage before fixing the broken car.

Reply View 3 replies

creata 6 hours ago

Maybe ROCm is a bit of a mess, but AMD makes amazing graphics cards.
And they open sourced their GPU driver, which is a massive plus in my book.
If Radeon got virtualization support, it would make it the perfect GPU for running video games (or other GPU applications) in virtual machines - you wouldn't need to mess with PCIe passthrough anymore.

Reply View | 0 replies
Nullabillity 7 hours ago

Graphics work just fine. Y'know, the G in the name.

Reply View | 1 reply
- Redoubts 6 hours ago
  
  Shame that's not where the money is
  
  Reply View | 0 replies