Comment by Snuggly73

Comment by Snuggly73 18 hours ago

Congrats - there is a very small problem with the LLM - its reusing transformer blocks and you want to use different instances of them.

Its a very cool excercise, I did the same with Zig and MLX a while back, so I can get a nice foundation, but since then as I got hooked and kept adding stuff to it, switched to Pytorch/Transformers.

icemanx 18 hours ago

correction: It's a cool exercise if you write it yourself and not use GPT

Reply View 1 reply

Snuggly73 18 hours ago

well, hopefully the author did learn something or at least enjoyed the process :)
(the code looks like a very junior or a non-dev wrote it tbh).

Reply View | 0 replies