Comment by dspillett
Is there a public dump of the data anywhere that this is based upon, or have they scraped it themselves?
Such as DB might be entertaining to play with, and the threadedness of comments would be useful for beginners to practise efficient recursive queries (more so than the StackExchange dumps, for instance).
While not a dump per se, there is an API where you can get HN data programmatically, no scraping needed.
https://github.com/HackerNews/API