Comment by svachalek 3 months ago

This is an interesting paper. It postulates that an LLM's ability to perform tasks correlates mostly with the number of layers it has, and that reasoning creates virtual layers in the context space. https://arxiv.org/abs/2412.02975