Comment by cubefox 2 days ago

DeepSeek-v3.2 should be better for long context because it uses (near-linear) sparse attention.
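
To put rough numbers on "near linear", here is a back-of-the-envelope sketch that counts the query-key pairs scored by dense causal attention versus a generic top-k sparse scheme where each query attends to at most `k` tokens. The function names and the value of `k` are illustrative assumptions, not DeepSeek's actual configuration, and the sketch ignores whatever compute a real model spends choosing which tokens to keep.

```python
def dense_pairs(n: int) -> int:
    """Query-key pairs scored by dense causal attention over n tokens."""
    # Token i (1-based) attends to i tokens, so the total is 1 + 2 + ... + n.
    return n * (n + 1) // 2


def sparse_pairs(n: int, k: int) -> int:
    """Pairs scored when each query attends to at most k tokens (hypothetical cap)."""
    if n <= k:
        # The cap is never reached, so this matches the dense count.
        return dense_pairs(n)
    # The first k tokens behave densely; each later token scores exactly k pairs.
    return dense_pairs(k) + (n - k) * k


if __name__ == "__main__":
    k = 2048  # illustrative cap, assumed for this example only
    for n in (8_192, 32_768, 131_072):
        ratio = dense_pairs(n) / sparse_pairs(n, k)
        print(f"n={n}: dense/sparse pair ratio = {ratio:.1f}")
```

With a fixed `k`, the sparse count grows roughly as `n * k` while the dense count grows as `n^2 / 2`, so the gap widens as the context gets longer.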