Comment by nirw4nna 8 hours ago

I'm currently chipping away at DSC [1], a tensor library I wrote from scratch to play with large language models. Last week I rewrote flash attention from scratch in CUDA and got good performance [2].

[1]: https://github.com/nirw4nna/dsc

[2]: https://x.com/nirw4nna/status/1968812772944126329