Comment by tossandthrow

Comment by tossandthrow 3 days ago

0 replies

The paper says transformers perform better than RNNs, which is not surprising.

However, they are both, theoretically, Turing complete computers. So they are equally expressive.