yorwba 2 days ago

DeepSeekMath-V2 is also an LLM doing math and not a specific formal math system. What interpretation of "general purpose" were you using where one of them is "general purpose" and the other isn't?

  • simianwords 2 days ago

    This model can’t be used for, say, questions on biology or history.

    • yorwba 2 days ago

      How do you know how well OpenAI's unreleased experimental model does on biology or history questions?

      • simianwords 2 days ago

        Sam specifically says it is general purpose, and there is also this:

        > Typically for these AI results, like in Go/Dota/Poker/Diplomacy, researchers spend years making an AI that masters one narrow domain and does little else. But this isn’t an IMO-specific model. It’s a reasoning LLM that incorporates new experimental general-purpose techniques.

        https://x.com/polynoamial/status/1946478250974200272