Comment by bstsb

Comment by bstsb a day ago

18 replies

ironic that unicode is stripped out the post's title here, making it very much wrong ;)

for context, the actual post features an emoji with multiple unicode codepoints in between the quotes

dang a day ago

Ok, we've put Man Facepalming with Light Skin Tone back up there. I failed to find a way to avoid it.

Is there a way to represent this string with escaped codepoints? It would be both amusing and in HN's plaintext spirit to do it that way in the title above, but my Unicode is weak.

  • NobodyNada a day ago

    That would be "\U0001F926\U0001F3FC\u200D\u2642\uFE0F" in Python's syntax, or "\u{1F926}\u{1F3FC}\u{200D}\u{2642}\u{FE0F}" in Rust or JavaScript.

    Might be a little long for a title :)

    • dang a day ago

      Thanks! Your second option is almost identical to Mlller's (https://news.ycombinator.com/item?id=44988801) but the extra curly braces make it not fit. Seems like they're droppable for characters below U+FFFF, so I've squeezed it in above.

      • NobodyNada 19 hours ago

        That works! (The braces are droppable for 16-bit codepoints in JS, but required in Rust.)

  • Mlller a day ago

    That would be …

      "\u{1F926}\u{1F3FC}\u200D\u2642\uFE0F".length == 7
    
    … for Javascript.
    • dang a day ago

      I can actually fit that within HN's 80 char limit without having to drop the "(2019)" bit at the end, so let's give it a try and see what happens... thanks!

cmeacham98 a day ago

Funny enough I clicked on the post wondering how it could possibly be that a single space was length 7.

  • ale42 a day ago

    Maybe it isn't a space, but a list of invisible Unicode chars...

    • yread a day ago

      It could also be a byte length of a 3 byte UTF-8 BOM and then some stupid space character like f09d85b3

  • c12 a day ago

    I did exactly the same, thinking that maybe it was invisible unicode characters or something I didn't know about.

  • eastbound a day ago

    It can be many Zero-Width Space, or a few Hair-Width Space.

    You never know, when you don’t know CSS and try to align your pixels with spaces. Some programers should start a trend where 1 tab = 3 hairline-width spaces (smaller than 1 char width).

    Next up: The <half-br/> tag.

    • Moru a day ago

      You laugh but my typewriter could do half-br 40 years ago. Was used for typing super/subscript.

Phelinofist 19 hours ago

Before it wasn't, about 1h ago it was showing me a proper emoji