Comment by procaryote

Comment by procaryote 8 hours ago

0 replies

also, the redundancy means that you get a pretty good heuristic for "is this utf-8". Random data or other encodings are pretty unlikely to also be valid utf-8, at least for non-tiny strings