Comment by Tiberium Comment by Tiberium 21 hours ago 0 replies Copy Link View on Hacker News A bit interesting that they used Deepseek 3's architecture for their Large model :)