Comment by yorwba

Comment by yorwba 2 days ago

6 replies

DeepSeekMath-V2 is also an LLM doing math and not a specific formal math system. What interpretation of "general purpose" were you using where one of them is "general purpose" and the other isn't?

simianwords 2 days ago

This model can’t be used for say questions on biology or history.

  • yorwba 2 days ago

    How do you know how well OpenAI's unreleased experimental model does on biology or history questions?

    • simianwords 2 days ago

      Sam specifically says it is general purpose and also this

      > Typically for these AI results, like in Go/Dota/Poker/Diplomacy, researchers spend years making an AI that masters one narrow domain and does little else. But this isn’t an IMO-specific model. It’s a reasoning LLM that incorporates new experimental general-purpose techniques.

      https://x.com/polynoamial/status/1946478250974200272

      • lossolo 2 days ago

        You are overinterpreting what they said again. "Go/Dota/Poker/Diplomacy" do not use LLMs, which means they are not considered "general purpose" by them. And to prove it to you, look at the OpenAI IMO solutions on GitHub, which clearly show that it's not a general purpose trained LLM because of how the words and sentences are generated there. These are models specifically fine tuned for math.