somat 7 hours ago

I know Japanese does not have a th sound, and I don't think chinese or most other asian languages have it, but am less sure about that. Unfortunately I lack the data needed to substantiate my claim.

    with
    lang_sounds as (
    select
        lang,
        unnest(string_to_array(ipa, null) ) as sound
    from world_dictionary
    ),
    totals as (
    select
        lang,
        count(sound) as sound_count
    from lang_sounds
    group by lang
    )

    select
        lang,
        totals.sound,
        count(sound) / totals.sound_count
    from
        lang_sounds join
        totals on
        lang_sounds.lang = totals.lang
    where sound = 'θ' or sound = 'ð' or sound = 'θ̠' or sound = 'z'
    group by lang, sound
    order by count(sound) / totals.sound_count
  • yorwba 4 hours ago

    https://phoible.org/parameters has the data you seek: 5% of languages in the database have eth (ð) and 4% have theta (θ). Z is not a 'th' sound and fairly common at 30% of languages, though.

  • eru 5 hours ago

    > I know Japanese does not have a th sound, and I don't think chinese or most other asian languages have it, [...]

    There's no single th sound in English. There's a few different sounds you get from that letter combination in different words (and in different dialects).

  • inkyoto 2 hours ago

    Out of all Asian languages (East and South East) I can think of, only Burmese has ð and θ.

jenzig 6 hours ago

This is just a feature of Castilian Spanish: https://en.wikipedia.org/wiki/Phonological_history_of_Spanis... \th\ only occurs naturally in like 5% of the thousands of human languages that have ever existed. Just because those languages are some of the most widely-spoken ones worldwide does not make the sound a commonly-occurring one in a meaningful phonological sense.

jkaplowitz 7 hours ago

It’s true. English and the main Spain version of Spanish are two of the few languages in the world which have the sound. Even most Latin American versions of Spanish (maybe all?) do not have it.

  • jacquesm 7 hours ago

    Can you give an example of a common Spanish word that has it?

    • jaggederest 6 hours ago

      In "distinción" spanish, the classic pair is the word for house and for hunt - "casa" and "caza" respectively. If you pronounce them the same (with an S sound), you're a Seseo speaker like (most) latin america. If you pronounce them with different sounds, one an S sound, the other a TH sound, you're a "distinción" speaker, and if you pronounce them both with a TH sound, it's the more uncommon ceceo accent, usually largely Andalusian.

    • mejutoco 5 hours ago

      Any c+e/i (cena, cine) or z+a/o/u (zarza, zorro, zurrar) is a good heuristic.

      c+a/o/u sounds like k (casa, cosa, cuchara) and z+e/i does not exist.

      • pezezin 3 hours ago

        Z+e/i does exist, but it is not very common. A few examples:

        - Words that are only written with Z: zepelín, zigurat, zigzag.

        - Words that can be written with either Z or C: zénit, zinc, zirconio, azimut.

    • pezezin 6 hours ago

      My favourite word to troll people who are learning the language is "cerrojo" /θe'roxo/, meaning "latch" or "lock", as it contains the three most difficult consonants in the language in sequence xD

      • jacquesm 5 hours ago

        In Polish there is 'szczoteczka', which took me just about forever to learn how to pronounce. I just could not hear what I was doing wrong.

    • [removed] 7 hours ago
      [deleted]
    • [removed] 6 hours ago
      [deleted]