Comment by cubefox

Comment by cubefox 2 days ago

2 replies

It should work like normal instruction tuning, except the SFT examples contain additional instructions in <|quote|> tokens which are ignored in the sample response. So more complex than ordinary SFT but not that much more.