youtube.nixfred.com nixfred.com
Creator

Alex Ziskind

1video

← All videos

20:11
Alex Ziskind

I Plugged a DGX Spark and Mac Together... and Didn’t Expect This

Two machines on a desk, each brilliant at exactly half of what running a large language model needs, and each terrible at the other half. The NVIDIA DGX Spark (here a GB10 in MSI's Edge Expert clothing) chews through a long prompt at hundreds of tokens per second, then crawls when it actually has to write the answer. The Mac mini does the reverse: slow to read the prompt, fast to stream the reply. Alex Ziskind spends this video trying to bolt the good half of each onto the other, a trick the industry calls disaggregated prefill and decode, then measuring whether the Frankenstein is actually worth building.

AIHardwareMay 1, 2026