I feel like people have been misunderstanding the new AMD Ryzen AI Max chips (and relatedly, the Framework Desktop). It’s not a “laptop CPU with a GPU tacked on”, it’s a laptop GPU with a CPU tacked on. By all accounts, it’s a GPU-first system - silicon area, memory bandwidth, etc. When comparing it with other AMD laptop chips, it becomes obvious that this is a GPU first, and a CPU second.
Which is why I dislike the criticism of Framework Desktop’s soldered memory. Nobody bats an eye if GPU’s memory is soldered (when was the last time that it wasn’t?), but because people see the chip as a CPU, and thus see the soldered down ram as a regression.
You should in all regards look at the chip as a GPU first, if you’re considering to buy something with it. Sure, it has CPU chops, but you can get those way cheaper, and without the limitations that being attached to a GPU brings - if you’re not buying it for the GPU, then get something else. Comparisons with other laptop CPU-centric chips are IMO misguided, the comparison should be done with other GPU’s first, and CPU’s attached to them second.
@mikoto Yeah, much more into that direction.
It looks cool, but I’m absolutely not in the market for one even if I have the budget, because I do jack shit with GPU’s, and the CPU side is way more important to me. I don’t game, I don’t do LLMs, I don’t do any kind of HPC that could go on a GPU, so I have zero reason to buy a system that’s GPU first, and a CPU second.
@mikoto you don’t really need a huge GPU for things other than LLMs tbh, and I think most NPU’s on SoCs are better and more efficiently used for that other stuff like image recognition (at least on inference side, for training you still want a GPU). Such GPUs are good for LLMs because of decently high compute power, combined with high memory capacity and bandwidth - 128GB of memory in direct access to the GPU allows to use larger models more efficiently, as the weights don’t have to be streamed into from “main memory” through a PCIe connection, which is a bottleneck for traditional GPUs, and it’s exactly why a lot of the LLM crowd is excited for this - it’s exactly what they need for relatively cheap (in comparison to dedicated GPUs with a ton of memory)
@ignaloidas @mikoto
Yeah, it's "GPU first" but who needs a GPU with 32 GB of RAM? Mostly people who do ML.
@wolf480pl @mikoto I mean, consider that the ram is shared - IIRC you talked about 16GB being bare minimum on the CPU side, and for GPU’s, on the higher end at least 10GB for that level of GPU makes sense.
And that the same chip is intended for “mobile workstations”, where you do CAD and stuff, where all that RAM is very valuable. It makes sense beyond ML.
@ignaloidas @mikoto and I think Framework marketing it as a small gaming PC you can take to LAN parties is disingenous. If that's your usecase, get a gaming laptop, which comes with a screen.
@stefan @wolf480pl @mikoto AMD doesn’t directly sell that chip - it’s a laptop chip after all. Framework’s desktop mobo is the first one that’s publicly available in a standard format AFAICT
On the memory, I think it’s purely because of the attached CPU - while there’s no real problems with using GDDR with them, it’s quite different target to optimize, and the performance of many regular programs would suffer. You can allow that if you’re building a console, but for a more “general purpose” chip, it’s not great optics to loose a bunch of performance on CPU-focused benchmarks, even if the product is primarily GPU focused.
I’d say that it is as much GPU focused as possible for what it is - I mean, just look at the silicon area and IO space for memory that’s been used up for GPU side of things. Sure, consoles can be even more GPU focused, but those are not expected to run general purpose tasks with good performance, so latency hit on memory accesses can be eaten up.
And yeah, the value for it as a desktop really depends if you can utilize what’s unique about or not. I don’t see the value for majority of people.
@wolf480pl @mikoto 🤷 It is a fairly good small gaming PC that’s easy to take wherever. Whether it makes sense for the price is another question.
@ignaloidas @mikoto you can use a Ford Transit to drive to work everyday, but you don't see it advertised for that usecase
@wolf480pl @mikoto I don’t think there’s anything inherently disadvantageous with putting that chip on a gaming desktop. Whereas there would be disadvantages to driving a van purely for commute. I’d see it more as driving something like a Porche 911 for commute - it’s perfectly fine for that, but it doesn’t necessarily make sense financially.
@stefan @wolf480pl @mikoto yeah, the pricing is the most questionable part. I guess it miiight make sense if you’re just buying the mobo, as it’s kinda the same as just getting a high-end CPU/GPU combo, but you should really think if you’re going to utilize the unique advantages or not, because the performance doesn’t fully match the price, even if for some workloads it is very good.