Comment by ks2048
Nice work.
Have you looked at the MMS models from Meta and how do they compare?
By publicly release, does that mean offering an API or have you considered huggingface model release? I understand why that might not be best for your business model - but what would be your goal from a business perspective?
Yes we read the paper when it came out and reviewed the audios. We didnt find it good enough for adoption. We didnt compare results with MMS in a systematic way coz it seems irrelevant.