ks2048 8 months ago

There is no single unicode character representing "Ch".

Here's a list of Unicode digraphs: DZ, Dz, dz, DŽ, Dž, dž, IJ, ij, LJ, Lj, lj, NJ, Nj, nj, ᵺ

https://en.wikipedia.org/wiki/Digraph_(orthography)#In_Unico...

  • notpushkin 8 months ago

    Yeah, but why does Unicode have those and not ch?

    • ks2048 8 months ago

      According to [1], these particular ones exist because of legacy encodings of Serbo-Croatian,

          Digraphs ⟨dž⟩, ⟨lj⟩ and ⟨nj⟩ in their upper case, title case and lower case forms have dedicated Unicode code points as shown in the table below, However, these are included chiefly for backwards compatibility with legacy encodings which kept a one-to-one correspondence with Cyrillic; modern texts use a sequence of characters. 
      
      [1] https://en.wikipedia.org/wiki/Gaj%27s_Latin_alphabet#Computi...
TRiG_Ireland 8 months ago

Ch may be a digraph in many languages, but is it implemented in Unicode as a single character?