Comment by cubefox Comment by cubefox 2 days ago 0 replies Copy Link View on Hacker News DeepSeek-v3.2 should be be better for long context because it is using (near linear) sparse attention.