Comment by fiso64
I don't get his "modern" proof. Specifically the step where he says "it's easy to see geometrically that these matrices differ by a rotation" seems to be doing a lot of heavy lifting. The first matrix transforms e1 to (a,-b), the second scales e1 to (c,0). If you can see that you obtain one of these vectors by rotating the other, then you've shown that their lengths are equal (i.e. a²+b²=c²), which is what we want to show in the first place.
Draw it on graph paper:
Set B as the origin, and let BC (the 'a' side), be on the the positive side of the x-axis. Let AC be on the positive side of the y-axis.
The left matrix is a clockwise rotation and scaling. This is clearly seen if you draw the transformation applied to the two axis basis vectors. (The scaling factor isn't obvious yet.)
Then the left matrix varies (1,0) to the side AB, which has magnitude c. Z and carries (0,1) to an perpendicular line of the (importantly) same magnitude,
So it's a rotation and a scaling by c.
The right matrix obviously is a scaling by c.
> If you can see that you obtain one of these vectors by rotating the other, then you've shown that their lengths are equal (i.e. a²+b²=c²), which is what we want to show in the first place.
Yes, that's the point!