Comment by jychang

Comment by jychang 4 days ago

0 replies

No.

128GB vram gets you enough space for 256B sized models. But 400B is too big for the DGX Spark, unless you connect 2 of them together and use tensor parallel.