Comment by bbminner

Comment by bbminner 4 days ago

0 replies

I suppose it means per speaker. And it is based on a simplified style tts 2 which from my small dive into the subject seems one of the smaller models achieving great quality.