Comment by imjonse a day ago

I suppose the vast majority of training data used for cutting edge models was created after 1900.

Ofc it was, because their primary goal is to be useful, and to be useful they need to stay relevant.

But considering that Special Relativity was published in 1905, which means all its building blocks were already floating in the ether by 1900, it would be a very interesting experiment to train something at Claude/Gemini scale on pre-1900 text and then, say, give it the field equations and ask it to build a theory around them.


famouswaffles a day ago

His point is that we can't train a Gemini 3/Claude 4.5-class model because we don't have the data to match the training scale of those models. There aren't trillions of tokens of digitized pre-1900 text.

p1esk a day ago

How can you train a Claude/Gemini scale model if you’re limited to <10% of the training data?


kopollo a day ago

I don't know if this is related to the topic, but GPT5 can translate the text in an 1880 Ottoman archival photograph into English without any loss of quality.


ddxv 14 hours ago

My friend works in that period of Ottoman archives. Do you have a source or something I can share?
