Comment by visarga 2 months ago

6 replies

> Seems like a non trivial problem to solve.

Took me 5 minutes to land this GPT prompt.

https://chatgpt.com/share/66e84c0c-a92c-800a-b452-255d6fe942...

Results:

- Chinese (Simplified) 四四 (sì sì) – sounds like "four-four", which can be associated with bad luck due to the number four in Chinese culture

- Arabic "Sisi" is a common nickname, also associated with Egypt's President Abdel Fattah el-Sisi

- Russian Сиси (sisi) – slang for breasts

- Bulgarian Сиси (sisi) – slang for breasts

- Serbian Сиси (sisi) – slang for breasts

- Croatian Sisi – slang for breasts

You should probably complement this with a web search and a Wiktionary search, because Wiktionary lists every language's entry for a word on a single page.
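
A rough sketch of the Wiktionary part, assuming the English Wiktionary's MediaWiki `action=parse` endpoint and the convention that each language on an entry page gets its own level-2 heading (`==Language==`). The helper names `fetch_wikitext` and `language_sections` and the User-Agent string are made up for illustration. A language section only shows that the word exists in that language, not that it is offensive, so the definitions still need reading:

```python
import json
import re
import urllib.parse
import urllib.request

API = "https://en.wiktionary.org/w/api.php"

def fetch_wikitext(word):
    """Fetch the raw wikitext of a Wiktionary entry (needs network access)."""
    query = urllib.parse.urlencode({
        "action": "parse",
        "page": word,
        "prop": "wikitext",
        "format": "json",
        "formatversion": "2",
    })
    # Hypothetical User-Agent; replace with something identifying your tool.
    req = urllib.request.Request(
        f"{API}?{query}", headers={"User-Agent": "name-check-sketch/0.1"}
    )
    with urllib.request.urlopen(req) as resp:
        data = json.load(resp)
    if "error" in data:  # e.g. the page does not exist
        return ""
    return data["parse"]["wikitext"]

def language_sections(wikitext):
    """Return the level-2 headings, which on Wiktionary are language names."""
    headings = re.findall(r"^==([^=\n]+)==\s*$", wikitext, flags=re.MULTILINE)
    return [h.strip() for h in headings]
```

Something like `language_sections(fetch_wikitext("sisi"))` would then give the list of languages whose entries are worth reading by hand.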

pdimitar 2 months ago

Does ChatGPT get anything right, ever?

In Bulgarian the slang is Цици (tsi tsi). I imagine it's near-identical for many other Slavic languages.

  • visarga 2 months ago

    Yeah, I noticed it was pretty shaky: change the prompt a bit and the result changes a lot. It's not very reliable by itself, but it can be used in conjunction with other methods.

sureIy 2 months ago

It’s not that straightforward due to spelling. Does that catch køk? Tihts? P. Nus? For a non-English swear word, I had to ask 3 times, and about a specific language, before it finally made that connection.
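
One way to handle the spelling problem outside the LLM is to normalize the candidate and fuzzy-match it against a word list. The following is only a sketch using Python's standard `unicodedata` and `difflib` modules. The three-entry `BLOCKLIST`, the `EXTRA_FOLDS` table, the function names and the 0.6 cutoff are all assumptions chosen to cover the examples above, not a vetted list or threshold:

```python
import difflib
import unicodedata

# Hypothetical blocklist; a real one would be per-language and far larger.
BLOCKLIST = ["kok", "tits", "penis"]

# Letters that Unicode NFKD does not split into base letter + accent.
EXTRA_FOLDS = str.maketrans({"ø": "o", "æ": "ae", "đ": "d", "ł": "l"})

def normalize(word):
    """Lowercase, fold accented letters, drop spaces and punctuation."""
    word = word.casefold().translate(EXTRA_FOLDS)
    decomposed = unicodedata.normalize("NFKD", word)
    # Combining accents, digits, dots and spaces are not alphabetic.
    return "".join(ch for ch in decomposed if ch.isalpha())

def closest_blocked(word, cutoff=0.6):
    """Return the most similar blocklist entry, or None if nothing is close."""
    matches = difflib.get_close_matches(
        normalize(word), BLOCKLIST, n=1, cutoff=cutoff
    )
    return matches[0] if matches else None
```

The cutoff is a trade-off: a lower value catches more creative respellings but also flags more innocent words, and none of this handles words that only sound alike across different scripts.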

Lockal 2 months ago

Tried with "hui" - for ChatGPT this word "has no specific meaning and is safe for use in any language".

fedeb95 2 months ago

That's a nice start and maybe does 99% of the job, but to be 100% sure you still need additional (manual?) checks.