Comment by Ekaros
Should have just gone with 32 bit characters and no combinations. Utter simplicity.
Maybe we should have just replaced ASCII, a horrible encoding where an entire 25% of it is wasted. And maybe we could have gotten a bit more efficiency by having only one case of letters instead of both lowercase and uppercase, with a modifier placed before a letter to capitalize it. That would save a lot of space, as most text could just be lowercase.
That quickly explodes if you need more than one diacritic per letter (e.g. Vietnamese often has two, and then there's https://en.wikipedia.org/wiki/International_Phonetic_Alphabe...).
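To make the diacritic point concrete, here is a rough Python sketch using the standard `unicodedata` module. The helper name `decompose` and the sample letters are my own picks for illustration, not anything from the thread. It shows a Vietnamese letter splitting into a base letter plus two combining marks, and a made-up combination that has no single precomposed code point to fall back on.

```python
import unicodedata

def decompose(ch):
    """Return the code points of ch after canonical decomposition (NFD)."""
    return [f"U+{ord(c):04X}" for c in unicodedata.normalize("NFD", ch)]

# Vietnamese "e with circumflex and dot below" is a single precomposed
# code point (U+1EC7), but it is canonically a base letter plus two marks.
vietnamese = decompose("\u1ec7")

# An arbitrary pairing (illustrative assumption): "q" with the same two marks.
# NFC cannot fold it into one code point because none is assigned.
arbitrary = unicodedata.normalize("NFC", "q\u0302\u0323")

print(vietnamese)
print(len(arbitrary))
```

Without combining marks, every base letter and mark combination that anyone needs would have to get its own code point.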
That would be extremely wasteful: every single text file would be 4x larger, and I'm sure eventually it would not be enough anyway.
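A quick way to check the 4x figure, as a sketch only: the sample string and the helper name `encoded_sizes` are mine, and the ratio is exactly 4 only for text that fits in ASCII. Text that already needs multi-byte UTF-8 sequences grows by less.

```python
def encoded_sizes(text):
    """Return (utf8_bytes, utf32_bytes) for text, without a byte order mark."""
    utf8 = len(text.encode("utf-8"))
    # "utf-32-le" is used instead of "utf-32" so no 4-byte BOM is prepended.
    utf32 = len(text.encode("utf-32-le"))
    return utf8, utf32

ascii_sizes = encoded_sizes("Hello, world")
accented_sizes = encoded_sizes("\u00e9" * 10)  # ten copies of "e with acute"

print(ascii_sizes, accented_sizes)
```

For the ASCII sample the fixed-width encoding is four times the size; for the accented sample it is only twice the size, since UTF-8 already spends two bytes per character there.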