BREAKING
Gemma 4 runs 16-way on one DGX Spark
Throughput scales with parallel runs
1 session: 68 tok/s
4 parallel: 246 tok/s
8 parallel: 403 tok/s
16 demo: 300 tok/s
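To put the chart figures in perspective, here is a rough sketch that derives per-instance throughput and scaling efficiency from the four readings. It assumes each number is aggregate tok/s across all runs, which the chart does not state outright, and that the 16-run demo is comparable to the other configurations. The names `READINGS` and `scaling_table` are illustrative only.

```python
# Chart readings: (parallel runs, throughput in tok/s).
# Assumption: each figure is aggregate throughput across all runs.
READINGS = [(1, 68.0), (4, 246.0), (8, 403.0), (16, 300.0)]


def scaling_table(readings):
    """Return per-instance throughput, speedup, and efficiency per row."""
    # The single-session reading is the baseline for speedup.
    baseline = next(tput for runs, tput in readings if runs == 1)
    rows = []
    for runs, tput in readings:
        speedup = tput / baseline
        rows.append({
            "runs": runs,
            "aggregate": tput,
            "per_instance": tput / runs,   # average share per run
            "speedup": speedup,            # relative to one session
            "efficiency": speedup / runs,  # 1.0 would be perfect scaling
        })
    return rows


if __name__ == "__main__":
    for row in scaling_table(READINGS):
        print(f"{row['runs']:>2} runs: {row['aggregate']:6.1f} tok/s aggregate, "
              f"{row['per_instance']:5.1f} tok/s each, "
              f"{row['speedup']:.2f}x speedup, {row['efficiency']:.0%} efficiency")
```

Under those assumptions, efficiency comes out near 90% at 4 runs, 74% at 8, and 28% at 16, so the 16-way demo trades per-instance speed for concurrency rather than adding aggregate throughput.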
The DGX Spark setup at a glance
DGX Spark: hardware
128 GB unified memory
273 GB/s bandwidth
About $3,999 to $4,699
Caveats: limits
FP4 kernel driver issues
Efficiency drops on huge prompts
Bandwidth questioned
Open weights, ready to run
AI NEWS BLITZ
Google just showed its open Gemma 4 model running sixteen instances at once on a single desktop machine.