Comment by hammadmlk 6 days ago

I think the Voice Models market will be like eCommerce. There will be no global winner; instead there will be a few regional winners, each of them really big.

We plan to be one of those winners.

chirau 6 days ago

What does it take to build such a model? As in, what are the key steps? And how expensive does it get? I might be interested in being a regional player and winner as well, lol, in my own corner of the world in Africa.

  • hammadmlk 6 days ago

    Not much... Just the willingness to work hard on this problem instead of other problems where large revenue is perhaps quicker :)

    Ingredients: decent audio scraping skills, great voice actors hired for each language, algos to gather text/audio with diverse phonetics, decent ML skills (enough to merge the best features of a few different papers), lots and lots of data labels (and your own tools to get the data labeled efficiently). And finally GPUs!!!!

    None of this is technically hard... the hardest thing is working with Voice Actors (oh man!!!)
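
For anyone wondering what "algos to gather text/audio with diverse phonetics" could look like, here is a minimal sketch, not the method described above. It greedily picks the sentences that add the most not-yet-covered sound units. The names `char_bigrams` and `select_diverse` are hypothetical, and letter pairs within a word are only a crude stand-in for phonemes; a real pipeline would presumably plug in a grapheme-to-phoneme step for each language.

```python
def char_bigrams(sentence):
    """Return the set of adjacent letter pairs inside each word.

    Assumption: letter pairs are a rough proxy for phonetic units.
    Swap this function for a real phoneme extractor if one is available.
    """
    units = set()
    for word in sentence.lower().split():
        letters = [c for c in word if c.isalpha()]
        units.update(a + b for a, b in zip(letters, letters[1:]))
    return units


def select_diverse(sentences, budget, units=char_bigrams):
    """Greedily choose up to `budget` sentences that maximize unit coverage.

    At each step, take the sentence that contributes the most units not
    covered yet; stop early once no remaining sentence adds anything new.
    """
    covered = set()
    chosen = []
    remaining = list(sentences)
    while remaining and len(chosen) < budget:
        best = max(remaining, key=lambda s: len(units(s) - covered))
        gain = units(best) - covered
        if not gain:
            break  # every remaining sentence is redundant
        chosen.append(best)
        covered |= gain
        remaining.remove(best)
    return chosen, covered


if __name__ == "__main__":
    corpus = ["aa aa aa", "ab cd", "ab"]
    picked, coverage = select_diverse(corpus, budget=2)
    print(picked, sorted(coverage))
```

The greedy choice is the usual approximation for this kind of coverage problem: it keeps the recording script short (fewer voice actor hours) while still touching as many distinct units as the budget allows.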