Comment by visarga

Comment by visarga 10 months ago

6 replies

> Seems like a non trivial problem to solve.

Took me 5 minutes to land this GPT prompt.

https://chatgpt.com/share/66e84c0c-a92c-800a-b452-255d6fe942...

Results:

- Chinese (Simplified) 四四 (sì sì) – sounds like "four-four", which can be associated with bad luck due to the number four in Chinese culture

- Arabic "Sisi" is a common nickname, also associated with Egypt's President Abdel Fattah el-Sisi

- Russian Сиси (sisi) – slang for breasts

- Bulgarian Сиси (sisi) – slang for breasts

- Serbian Сиси (sisi) – slang for breasts

- Croatian Sisi – slang for breasts

You should probably complement with a web search and a wiktionary search because they have all languages on a single page.

pdimitar 10 months ago

Does ChatGPT get anything right, ever?

In Bulgarian the slang is Цици (tsi tsi). I imagine it's near-identical for many other Slavic languages.

  • visarga 10 months ago

    Yeah I noticed it was pretty shaky, change the prompt a bit and the result changes a lot. Not very reliable after all by itself, but used in conjunction with other methods.

sureIy 10 months ago

It’s not that straightforward due to spelling. Does that catch køk? Tihts? P. Nus? For a non English swear word, I had to ask 3 times and about a specific language to finally make that connection.

Lockal 10 months ago

Tried with "hui" - for ChatGPT this word "has no specific meaning and is safe for use in any language".

fedeb95 10 months ago

that's a nice start, maybe does 99% of the job, but to be 100% sure, you still need additional (manual?) checks.