Comment by CorrectHorseBat

Comment by CorrectHorseBat a day ago

11 replies

In German you have the same, only within one language. ß can be written as ss if it isn't available in a font, and only in 2017 they added a capital version. So depending the font and the unicode version the number of letters can differ.

kbelder a day ago

"Traditionally, ⟨ß⟩ did not have a capital form, and was capitalized as ⟨SS⟩. Some type designers introduced capitalized variants. In 2017, the Council for German Orthography officially adopted a capital form ⟨ẞ⟩ as an acceptable variant, ending a long debate."

Thanks, that is interesting!

guappa a day ago

should "ß" == "ss" evaluate as true?

  • birn559 a day ago

    I don't see why it should. I also believe parent is wrong as there are unambiguous rules about when to use ß or ss.

    Never thought of it but maybe there are rules that allow to visually present the code point for ß as ss? At least (from experience as a user) there seem to be a singular "ss" codepoint.

    • CorrectHorseBat a day ago

      >also believe parent is wrong as there are unambiguous rules about when to use ß or ss.

      I never said it was ambiguous, I said it depends on the unicode version and the font you are using. How is that wrong? (Seems like the capital of ß is still SS in the latest unicode but since ẞ is the preferred capital version now this should change in the future)

      • birn559 a day ago

        > How is that wrong? Not sure where, how or if it's defined as part of Unicode, but so far I assumed that for a Unicode grapheme there exists a notion of what the visual representation should look like. If Unicode still defines capital of ß as SS that's an error in Unicode due to slow adaption of the changes in the German language.

      • weinzierl 20 hours ago

        ẞ is not the preferred capital version, it is an acceptable variant (according to the Council for German Orthography).

    • guappa a day ago

      well I don't speak german, I was asking

      • birn559 a day ago

        I see, wasn't clear to me on what level you were asking. The letter ß has never been generally equivalent to ss in the German language.

        From a user experience perspective though it might be beneficial to pretend that "ß" == "ss" holds when parsing user input.