Comment by simianwords 2 days ago
This model can’t be used for, say, questions on biology or history.
Sam specifically says it is general purpose, and there's also this:
> Typically for these AI results, like in Go/Dota/Poker/Diplomacy, researchers spend years making an AI that masters one narrow domain and does little else. But this isn’t an IMO-specific model. It’s a reasoning LLM that incorporates new experimental general-purpose techniques.
You are overinterpreting what they said again. The "Go/Dota/Poker/Diplomacy" systems do not use LLMs, which is why they are not what OpenAI considers "general purpose". And for proof, look at the OpenAI IMO solutions on GitHub: the way the words and sentences are generated there clearly shows this is not a general-purpose trained LLM. These are models specifically fine-tuned for math.
They could not have been more clear - sorry, but are you even reading?
Clear about what? Do you know the difference between an LLM based on transformer attention and a Monte Carlo tree search system like the one used for Go? You do not understand what they are saying. It was a fine-tuned model, just as DeepSeekMath is an LLM fine-tuned for math - which makes it a special-purpose model. Read the OpenAI IMO submissions on GitHub for proof.
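To make the architectural distinction concrete: a toy sketch of Monte Carlo move search, in its simplest "flat" form (random playouts per candidate move, a simplified cousin of the full tree search used in Go engines). The counting game and every name here are made up for illustration - this is not either lab's code.

```python
import random

# Toy game: players alternately add 1 or 2 to a counter;
# whoever makes the counter hit TARGET exactly wins.
TARGET = 10

def legal_moves(n):
    """Moves available from counter value n."""
    return [m for m in (1, 2) if n + m <= TARGET]

def rollout(n, to_move):
    """Play random moves to the end; return the winning player (0 or 1)."""
    while True:
        n += random.choice(legal_moves(n))
        if n == TARGET:
            return to_move
        to_move = 1 - to_move

def best_move(n, player=0, sims=4000):
    """Flat Monte Carlo search: estimate each move's win rate by
    running many random playouts, then pick the best-scoring move."""
    scores = {}
    for m in legal_moves(n):
        wins = 0
        for _ in range(sims):
            nn = n + m
            if nn == TARGET:
                wins += 1  # this move wins immediately
            else:
                wins += (rollout(nn, 1 - player) == player)
        scores[m] = wins / sims
    return max(scores, key=scores.get)
```

The point of the sketch: the search loop *is* the intelligence here - nothing is learned, and the statistics are recomputed per position. A plain autoregressive LLM does the opposite: at inference time it just samples the next token from a learned distribution, with no explicit game-tree search.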
How do you know how well OpenAI's unreleased experimental model does on biology or history questions?