Comment by khafra
Recommending Gwern on any technical topic is practically cheating; he always has in-depth, impeccably referenced overviews, complete with experiments he has done.
For deep learning in particular, I will add Neel Nanda's interpretability work: https://www.neelnanda.io/mechanistic-interpretability