Comment by omneity

Comment by omneity 2 days ago

1 reply

Mistral Large 3 is reportedly using Deepseek V3.2 architecture with larger experts and fewer of them, and a 2B params vision module.