I hope they keep making GPUs. They've been making really good progress lately, with pretty good performance per dollar.
If they wanna abandon discrete GPUs… OK.
But they need graphics. They should make M Pro/Max-ish integrated GPUs with wide buses, like AMD is already planning to do, instead of topping out at bottom-end configs.
They could turn around and sell them as GPU-accelerated servers too, like the market is begging for right now.
Intel sees the AI market as the way forward. NVIDIA's AI business now eclipses its graphics business by an order of magnitude, and Intel wants in. They know they rule the integrated graphics market, and they can leverage that position to drive growth with things like edge processing for Copilot.
The localllama crowd is supremely unimpressed with Intel, not just because of software issues but because Intel simply doesn't have beefy enough designs, the way Apple does and AMD soon will. Even the latest chips aren't fast enough for a “smart” model, and the A770 doesn't have enough VRAM to be worth the trouble.
They made some good contributions to runtimes, but seeing how they fired a bunch of engineers, I’m not sure that will continue.
People running LLMs aren't the target. People who use things like ChatGPT and Copilot on low-power PCs, who may benefit from edge inference acceleration, are. Every major LLM provider dreams of offloading compute onto end users. It saves them tons of money.
One can’t offload “usable” LLMs without tons of memory bandwidth and plenty of RAM. It’s just not physically possible.
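Back-of-the-envelope sketch of why (all the bandwidth and model-size figures below are my own rough assumptions, not benchmarks): generating a token means streaming essentially every weight from memory once, so tokens/sec is roughly memory bandwidth divided by model size.

    # Bandwidth-bound estimate: each generated token reads all weights once,
    # so tokens/sec ~= memory bandwidth / model size. Illustrative numbers only.
    def tokens_per_sec(bandwidth_gb_s: float, model_gb: float) -> float:
        return bandwidth_gb_s / model_gb

    MODEL_7B_Q4 = 4.0    # ~7B params at 4-bit quantization, in GB (assumed)
    MODEL_70B_Q4 = 40.0  # ~70B params at 4-bit quantization, in GB (assumed)

    for name, bw in [("typical dual-channel DDR5 IGP", 90.0),   # assumed GB/s
                     ("Apple M-series Max-class", 400.0)]:      # assumed GB/s
        print(f"{name}: {tokens_per_sec(bw, MODEL_7B_Q4):.0f} tok/s on 7B, "
              f"{tokens_per_sec(bw, MODEL_70B_Q4):.1f} tok/s on 70B")

A couple of tokens per second on anything “smart” is below what people will tolerate for interactive use, which is the whole problem.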
You can run small models like Phi pretty quickly, but I don't think people will be satisfied with that for Copilot, even as basic autocomplete.
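For concreteness, this is the kind of small-model setup I mean; a minimal sketch using Hugging Face transformers, with Phi-2 standing in as the example (the model choice and generation settings here are just illustrative):

    # Minimal local text-generation sketch with Hugging Face transformers.
    # "microsoft/phi-2" is one example of a small (~2.7B param) model that
    # runs acceptably on modest hardware; quality is the limiting factor.
    from transformers import pipeline

    pipe = pipeline("text-generation", model="microsoft/phi-2")
    out = pipe("def fibonacci(n):", max_new_tokens=64, do_sample=False)
    print(out[0]["generated_text"])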
About 2x faster than Intel’s current IGPs is the threshold where the offloading can happen, IMO. And that’s exactly what AMD/Apple are producing.
Honestly, graphics have been good enough for a long time now.
I’m currently re-playing Skyrim on an ultra-light convertible laptop attached to an external monitor.
And it looks beautiful.
After several days, I noticed it was still on power-saving + silent cooling mode.
Right, now we just need 64GB of VRAM to run our own LLM.
No, we don’t. Billions of people manage to live without LLMs.
I think you missed the point about where the discrete graphics card market is going.