Comment by bckr, 14 hours ago

That's where they take their big pile of data and train the model to do next-token prediction.
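For anyone unsure what "next-token prediction" means in practice, here is a rough toy sketch. It is not how any real lab trains a model: the bigram count table stands in for a neural network, the six-word sentence stands in for the "big pile of data", and every function name here is made up for illustration. The point is only the shape of the objective: every position in the text becomes a training example whose target is the token that comes next, and the loss measures how much probability the model gave to that token.

```python
import math
from collections import Counter, defaultdict

def training_pairs(tokens):
    """Build (context, target) pairs: each prefix is asked to predict the next token."""
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]

def fit_bigram(tokens):
    """Toy stand-in for a model: count how often each token follows the previous one."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts

def next_token_loss(counts, tokens):
    """Average negative log-likelihood of the actual next token (cross-entropy)."""
    losses = []
    for prev, nxt in zip(tokens, tokens[1:]):
        total = sum(counts[prev].values())
        losses.append(-math.log(counts[prev][nxt] / total))
    return sum(losses) / len(losses)

# Hypothetical tiny corpus, already split into tokens.
tokens = "the cat sat on the mat".split()
pairs = training_pairs(tokens)
model = fit_bigram(tokens)
loss = next_token_loss(model, tokens)

for context, target in pairs:
    print(context, "->", target)
print("average loss:", loss)
```

A real model conditions on the whole context rather than just the previous token and is fit by gradient descent instead of counting, but the training signal is the same: lower loss means more probability assigned to the token that actually came next.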