Comment by omneity
Mistral Large 3 is reportedly using Deepseek V3.2 architecture with larger experts and fewer of them, and a 2B params vision module.
Mistral Large 3 is reportedly using Deepseek V3.2 architecture with larger experts and fewer of them, and a 2B params vision module.
According to whom?
I haven't seen any claims of that being the case (other than you), just that there are similar decisions made by both of them.
https://mistral.ai/news/mistral-3