Comment by tossandthrow 3 days ago
> We have mathematically proven that transformers can solve any problem, provided they are allowed to generate as many intermediate reasoning tokens as needed.
This is also the case with plain, regular RNNs.
Now we just need an autoregressive transformer <==> RNN isomorphism paper and we're golden.