NanoChat – The best ChatGPT that $100 can buy
(github.com) | 143 points by huseyinkeles 3 hours ago
A GPU with 80GB VRAM costs around $1-3 USD an hour on commodity clouds (i.e., the non-Big-3 bare-metal providers, e.g., https://getdeploying.com/reference/cloud-gpu/nvidia-h100). I think it's accessible to most middle-class users in first-world countries.
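A quick sketch of the arithmetic behind the budget: the hourly rates below are illustrative examples in the $1-3 range quoted above, not actual price quotes from any provider.

```python
# Back-of-the-envelope: how much GPU time does $100 buy?
# Rates are illustrative ($/hr per 80GB GPU), not real quotes.
budget = 100.0
for rate in (1.0, 2.0, 3.0):
    gpu_hours = budget / rate
    # On an 8-GPU node, wall-clock time shrinks roughly 8x
    node_hours = gpu_hours / 8
    print(f"${rate:.0f}/hr -> {gpu_hours:.0f} GPU-hours "
          f"(~{node_hours:.1f} hrs on an 8-GPU node)")
```

So even at the high end of the range, $100 buys tens of GPU-hours, which is why short training runs land in hobbyist territory.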
If I have, let's say, 40GB of VRAM, does it not work at all, or does it just take twice as long to train?
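In general it depends on whether the model plus optimizer state fits at all; if it does, the usual way to trade memory for time is gradient accumulation, which keeps the effective batch size while shrinking activation memory. This is a standard PyTorch pattern sketched below, not necessarily what nanochat itself does.

```python
import torch
import torch.nn as nn

# Gradient-accumulation sketch: same effective batch, roughly
# 1/accum_steps the activation memory, accum_steps more forward passes.
model = nn.Linear(512, 512)
opt = torch.optim.AdamW(model.parameters(), lr=1e-3)
accum_steps = 4  # e.g. micro-batches of 8 instead of one batch of 32

opt.zero_grad()
for step in range(accum_steps):
    x = torch.randn(8, 512)            # micro-batch
    loss = model(x).pow(2).mean()      # stand-in loss
    (loss / accum_steps).backward()    # scale so gradients average correctly
opt.step()                             # one optimizer step per accumulated batch
```

The catch is that the frozen model weights and optimizer state don't shrink this way, so a model that genuinely needs 80GB for parameters alone still won't fit in 40GB.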
I've always thought about the best way to contribute to humanity: number of people you help x how much you help them. I think what Karpathy is doing is one of the highest leverage ways to achieve that.
Our current world is built on top of open-source projects. This is possible because there are plenty of free resources for learning to code, so anyone from anywhere in the world can learn and make a great piece of software.
I just hope the same will happen with the AI/LLM wave.
Yes agree. Other high leverage ways are to control culture. Andrew Tate comes to mind.
Not a particularly ethical guy, and I wouldn't hold him up as an example of morality, but the guy hasn't actually been found guilty YET. Multiple courts have tried. You'd think that for a guy under as much scrutiny as him, they would have SOMETHING to pin on him by now.
Innocent until PROVEN guilty is a foundational legal precedent for a reason.
He is definitely guilty of being a waste of human life, a massive asshole and a general detriment to society worldwide. Don’t need a court to prove that.
There are 6 criminal cases against him in several countries, let’s see how they pan out - but regardless he is not an innocent person.
I mean just an example. He obviously wasn't the most ethical person. Depends how you do it
Here's the announcement post [0] from Karpathy, which provides a bit of additional context.
Eureka Labs: https://github.com/EurekaLabsAI
What a prolific person Andrej is. It's been more than amazing to follow along!
Still under development; remaining work includes tuning nanochat (the current state being a solid v0.1) and finalizing the in-between projects so that students can "unlock" all the complexity that hides underneath: `torch.Tensor`, `torch.dist`, `.backward()`, `.compile()`, etc. And then the more ops-heavy aspects.
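For readers unfamiliar with the pieces listed, here is a toy illustration of what `.backward()` and `torch.compile` do in plain PyTorch; this is generic API usage, not material from the course itself.

```python
import torch

# Autograd in one line of math: y = x^2, so dy/dx = 2x.
x = torch.tensor(3.0, requires_grad=True)
y = x ** 2
y.backward()    # populates x.grad with dy/dx = 2 * 3 = 6
print(x.grad)   # tensor(6.)

# torch.compile (PyTorch >= 2.0) wraps a function for JIT optimization;
# actual compilation happens lazily on the first call.
f = torch.compile(lambda t: t * 2)
```

The course apparently builds each of these up from scratch rather than treating them as black boxes.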
Karpathy says nanochat will become the capstone project of the course LLM101n being developed by Eureka Labs.
I guess it’s still a work in progress? Couldn’t find any other information elsewhere.
A bit more info [here](https://github.com/karpathy/LLM101n)
Should be "that you can train for $100"
Curious to try it someday on a set of specialized documents. Though as I understand it, the cost of running this is whatever GPU you can rent with 80GB of VRAM, which kind of leaves hobbyists and students out, unless some cloud is donating GPU compute capacity.
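For the "specialized documents" idea, the usual recipe is continued next-token training on your own tokenized corpus. The sketch below is entirely hypothetical: `TinyLM` and the random token IDs are placeholders standing in for a real model and tokenizer, and this is not nanochat's API.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical fine-tuning sketch on domain documents.
# TinyLM is a placeholder model, not nanochat's architecture.
class TinyLM(nn.Module):
    def __init__(self, vocab=256, dim=64):
        super().__init__()
        self.emb = nn.Embedding(vocab, dim)
        self.head = nn.Linear(dim, vocab)

    def forward(self, ids):
        return self.head(self.emb(ids))  # per-position next-token logits

model = TinyLM()
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
docs = torch.randint(0, 256, (4, 32))   # stand-in for tokenized documents

for _ in range(3):                      # a few fine-tuning steps
    logits = model(docs[:, :-1])        # predict token t+1 from token t
    loss = F.cross_entropy(logits.reshape(-1, 256),
                           docs[:, 1:].reshape(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()
```

On rented 80GB hardware at the rates mentioned upthread, a short fine-tune like this would cost far less than a from-scratch training run.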