Comment by zaidqureshi 6 days ago
The model itself can work well for new languages. What we still have to figure out as we scale across languages is the process of gathering data and keeping its quality high.
Currently, the model is only given data for these languages, so it doesn't know any others.
> just the process of data gathering and maintaining high quality of data is what we have to figure out as we scale across languages.
Wouldn't a crawler and data ingestion pipeline help with that?