Comment by jacobgorm Comment by jacobgorm a day ago 1 reply Copy Link View on Hacker News What kind of hardware setup would be needed to replicate the paper’s results?
Copy Link deepsquirrelnet 21 hours ago Collapse Comment - I am training phi-4 (14B) using a single A6000. There’s some tricks you have to use to keep VRAM consumption down - mainly LoRA and quantization.There’s a package called “unsloth” that integrates with huggingface’s TRL library that can help. Reply View | 0 replies
I am training phi-4 (14B) using a single A6000. There’s some tricks you have to use to keep VRAM consumption down - mainly LoRA and quantization.
There’s a package called “unsloth” that integrates with huggingface’s TRL library that can help.