Comment by ofou Comment by ofou 4 hours ago 0 replies Copy Link View on Hacker News UTF-8 should be a universal tokenizer