Comment by bigeagle 3 days ago

I believe so.

Grok-1 is 314B, DeepSeek-V3 is 671B, and recent open-weights models are around 70B to 300B.