iagooar 4 days ago

I wonder if AI could create a "commentary" script that instructs the TTS how to read certain words or chapters. The commentary would be like an additional meta-track to help the TTS make the best reading.

That should actually be possible to do already with existing tech. I haven't seen if you can instruct Kokoro to read in a certain way, does anyone know if this is possible?

lyu07282 4 days ago

Like with almost everything, its an active area of research:

https://emosphere-tts.github.io/

We are getting there

croes 4 days ago

Emotion is the acting part of voice acting. Hard to copy with AI