HN Top New Show Ask Jobs

settings

Theme

Hand Mode

Feed

Comment by imtringued

Comment by imtringued 6 months ago

0 replies

View on Hacker News

I'm not an expert, but MoE models perform better at continuous learning, because they are less prone to catastrophic forgetting.