Comment by echelon
Perhaps "alignment" is stored in the loosest of weight connections, and these are catastrophically forgotten during fine-tuning.
That is, the broad abilities of the model run deep, but the alignment bits are superficial and sparse, so they get blown away by any additional fine-tuning.
That would make sense to me.