Comment by mangolie Comment by mangolie 2 days ago 3 replies Copy Link View on Hacker News https://x.com/deepseek_ai/status/1995452646459858977Boom
Copy Link andy12_ 2 days ago Next Collapse Comment - Do note that that is a different model. The one we are talking about here, DeepSeekMath-V2, is indeed overcooked with math RL. It's so eager to solve math problems, that it even comes up with random ones if you prompt it with "Hello".https://x.com/AlpinDale/status/1994324943559852326?s=20 Reply View | 0 replies
Copy Link yorwba 2 days ago Prev Next Collapse Comment - That's a different model: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale Reply View | 0 replies
Copy Link simianwords 2 days ago Prev Collapse Comment - Oh you may be correct. Are these models general purpose or fine tuned for mathematics? Reply View | 0 replies
Do note that that is a different model. The one we are talking about here, DeepSeekMath-V2, is indeed overcooked with math RL. It's so eager to solve math problems, that it even comes up with random ones if you prompt it with "Hello".
https://x.com/AlpinDale/status/1994324943559852326?s=20