AI Supercomputer on my desk! 𝗡𝗩𝗜𝗗𝗜𝗔 𝗗𝗚𝗫 𝗦𝗽𝗮𝗿𝗸 has arrived, and I finally got my hands on one of these beasts a few days ago.

A quick reminder of why it is a big deal:
🚀 NVIDIA GB10 Grace Blackwell Superchip.
💾 128 GB of unified system memory - you can run models with up to 200 billion parameters on the GPU locally!
☁️ Supports the same software stack that runs on datacenter GPUs - build on NVIDIA DGX Spark, deploy to the cloud seamlessly.

𝗙𝗶𝗿𝘀𝘁 𝗶𝗺𝗽𝗿𝗲𝘀𝘀𝗶𝗼𝗻𝘀
After some early testing, what really stood out was how smooth the 𝗗𝗲𝘃𝗘𝘅 is:
✅ Everything just works out of the box.
✅ NVIDIA Sync enables remote work on the machine as if I were running everything on my laptop.
✅ Tons of advanced examples that can be easily adjusted for your own workloads.

𝘐𝘯 𝘭𝘦𝘴𝘴 𝘵𝘩𝘢𝘯 𝘵𝘸𝘰 𝘥𝘢𝘺𝘴 𝘰𝘧 𝘵𝘦𝘴𝘵𝘪𝘯𝘨 𝘐 𝘸𝘢𝘴 𝘢𝘣𝘭𝘦 𝘵𝘰:
⚡ Run gpt-oss-120b on the device at up to ~30 tokens/s, and move one of my projects to run locally on the device using the model endpoint.
🤖 Migrate two of my multi-agent projects to run on the device using LLMs deployed on the machine. Currently fine-tuning performance with new models in the background.

𝘞𝘩𝘢𝘵’𝘴 𝘯𝘦𝘹𝘵:
🕐 I have a few projects lined up that require fine-tuning of reasonably large models; can’t wait to try the machine for this use case.

Want to get your own? 👉 https://nvda.ws/47nDdqG

Huge shout-out to NVIDIA AI for the collaboration and support!

#LLM #AI #DGXSpark
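For anyone wondering how "up to 200 billion parameters" squares with 128 GB of unified memory, here is a rough back-of-the-envelope sketch. It assumes weight-only storage at a given precision (2 bytes per parameter for FP16, 1 for FP8, 0.5 for FP4) and ignores KV cache, activations, and runtime overhead, so treat it as an estimate rather than a sizing guide. The function names are made up for illustration.

```python
# Rough weight-only memory estimate (assumption: ignores KV cache,
# activations, and framework overhead, which all need extra headroom).
BYTES_PER_PARAM = {"fp16": 2.0, "fp8": 1.0, "fp4": 0.5}


def weight_footprint_gb(num_params: float, precision: str) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


def fits(num_params: float, precision: str, memory_gb: float = 128.0) -> bool:
    """True if the weights alone fit in the given amount of memory."""
    return weight_footprint_gb(num_params, precision) <= memory_gb


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        size = weight_footprint_gb(200e9, precision)
        print(f"200B params @ {precision}: ~{size:.0f} GB, "
              f"fits in 128 GB: {fits(200e9, precision)}")
```

Under these assumptions, a 200B-parameter model only fits at roughly 4-bit precision, which leaves some room for the cache and the rest of the system.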
Incredible performance, and migrating full multi-agent projects to a local device in two days is seriously impressive, mate! For that multi-agent setup, did you consolidate the full orchestration and tool execution stack onto the DGX Spark, or is it primarily serving the model endpoints?
I want one :)
Thank you for the collaboration and support for the launch. ✨
This really showcases the power of DGX Spark for both experimentation and production workloads. Aurimas Griciūnas
Useful first impressions for anyone considering high performance AI infrastructure. Aurimas Griciūnas
Aurimas Griciūnas, conceptually, if everyone can run compute on their desktop, how does this impact data center demand?
Aurimas, Local AI beast unleashed!
Love this, Aurimas Griciūnas! Being able to migrate multi-agent projects to a local supercomputer opens huge possibilities.
Nice, mine just arrived in an oversized box. I've been contemplating whether I want to return the thing, but the more I think about it, the more I think I can get enough value out of it within 1-2 years. Compared to the Strix Halo things, this is a dedicated all-in-one setup. Unlike my desktop PC, it would be a dedicated 'work' machine for sure.