daveguy a day ago

That's the size of the largest, most capable, open source models. Specifically Llama 3.1 has 405B parameters. Deepseek's largest model is 671B parameters.

  • mhitza a day ago

    Small corrections. Llama 3.1 is not an Open Source model, but a Llama 3.1 Licensed model. Neither is DeepSeek apparently https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/LIC... which I was of the false opinion that it is. Though I never considered using it, so haven't checked the license before.

    • gunalx 20 hours ago

      Both deepseek R1 and V3-0324 is mit licensed.

    • Der_Einzige 17 hours ago

      You can just ignore the license since the existence of these models is based on piracy at a scale never before seen. Aaron Swartz couldn’t have even imagined violating copyright that hard.

      If you live in a glass house, you won’t throw stones. No one in the LLM space wants to be litigious

      It’s an open secret that DeepSeek used a ton of OpenAI continuations both in pre training and in the distillation. That totally violates openAI TOS. No one cares.

      • LoganDark 16 hours ago

        > No one in the LLM space wants to be litigious

        Except for OpenAI.