HN Top New Show Ask Jobs

settings

Theme

Hand Mode

Feed

Comment by andy12_

Comment by andy12_ 2 days ago

0 replies

View on Hacker News

Do note that that is a different model. The one we are talking about here, DeepSeekMath-V2, is indeed overcooked with math RL. It's so eager to solve math problems, that it even comes up with random ones if you prompt it with "Hello".

https://x.com/AlpinDale/status/1994324943559852326?s=20