Comment by pseudotensor
Comment by pseudotensor 3 days ago
This is related: https://www.reddit.com/r/LocalLLaMA/comments/1fiw84a/open_st...
The idea is not silly in my view, I did something similar here: https://github.com/pseudotensor/open-strawberry
The idea is that data generation is required first, to make the reasoning traces. ToT etc. are not required.