Generate audiobooks from E-books with Kokoro-82M

417 points by csantini 4 days ago

On the one hand, this is very convenient. Probably cool for some non-fiction.

On the other, some of my favorite audio books all stood out because the narrator was interpreting the text really well, for example by changing the pacing during chaotic moments. Or those audiobooks with multiple narrators and different voices for each character. Not to mention that sometimes the only cue you get for who's speaking during dialogue is how the voice actor changes their tone. I have mixed feelings about using this and losing some of that quality.

I would totally use this over amateur ebooks or public domain audiobooks like the ones on Project Gutenberg. As cool as it is/was for someone to contribute to free books... as a listener it was always jarring to switch to a new chapter and hear a completely different voice and microphone quality for no reason.

Reply View 93 replies

stavros 4 days ago

> On the other, some of my favorite audio books all stood out because the narrator was interpreting the text really well
This (and everything else with AI) isn't saying "you don't need good actors any more". It's saying "if you don't have an audiobook, you can make a mediocre one automatically".
AI (text, images, videos, whatever) doesn't replace the top end, it replaces the entire bottom-to-middle end.

Reply View | 56 replies
- j4coh 4 days ago
  
  RIP to future top-enders that would normally have started out on the bottom to middle end.
  
  Reply View | 52 replies
  
  aredox 4 days ago
  
  Bingo. AI is going to destroy any pathway for training and accruing experience.
  An embalming tech for our dying civilization.
  
  Reply View | 37 replies
  
  sam_lowry_ 3 days ago
  
  > RIP to future top-enders that would normally have started out on the bottom to middle end.
  This stance always reminds me of Profession, a 1957 novella by Isaac Asimov that depicts pretty much that future: one with only top performers and an ignorant crowd.
  
  Reply View | 1 reply
  
  xyproto 3 days ago
  
  He was a clear thinker.
  
  Reply View | 0 replies
  
  anothermathbozo 3 days ago
  
  Virtually every book I want this for has been around for 70+ years and still no high or low quality audiobook has been produced. How long do I have to wait for those aspiring top-enders before an audiobook can be made available?
  
  Reply View | 2 replies
  
  gosub100 3 days ago
  
  I'm super opposed to AI, but I see this as a rare positive. As someone already said, the win here is to have an audiobook where one doesn't yet exist. Hell, maybe the tables will turn and the scrubs will do the hard work of discovering which titles are popular with an audience, then the ebook industry can capitalize on AI by hiring voice actors to produce proper titles?
  
  Reply View | 1 reply
  
  DidYaWipe 3 days ago
  
  Not gonna happen. Once the AI shit is out there, people will have consumed it by the time a real actor can create (and edit) the audiobook.
  
  Reply View | 0 replies
  
  CuriouslyC 3 days ago
  
  It's common for shows to use big name actors as voices because they draw an audience, nothing will change. Just means a smaller pool of voice actors and they'll mostly be good looking.
  
  Reply View | 0 replies
  
  cmdtab 3 days ago
  
  The value of distribution is increasing while the value of content and product is decreasing for all but the top end.
  
  Reply View | 0 replies
  
  Der_Einzige 3 days ago
  
  Not RIP at all. "Meritocracy" was coined in a book literally warning us about how terrible such a society would be: https://en.wikipedia.org/wiki/The_Rise_of_the_Meritocracy
  The "top-enders" are the privileged who need to have some of their gains for their intelligence redistributed to others. The alternative is "survival of the smartest", which is de-facto what we have today and what Young was trying to warn us about.
  
  Reply View | 0 replies
  
  credit_guy 4 days ago
  
  By that time, AI will beat the toppest of the top enders. Remember the time Deep Blue barely beat Kasparov? Now no human, or group of humans can beat a chess engine, even one that runs on an iPhone.
  
  Reply View | 2 replies
- numpad0 4 days ago
  
  AI TTS has been available for quite some time. Tacotron V1 is about 8 years old. I don't think we saw much bottom end replacement.
  IMGO(gut opinion), generative AI is a consumption aid, like a strong antacid. It lets us be done with $content quicker, for content = {book, art, noisy_email, coding_task}. There's obvious preconceptions forming among us all from "generative" nomenclature, but lots of surviving usages are rather reductive in relevant useful manners.
  
  Reply View | 1 reply
  
  sam_lowry_ 3 days ago
  
  Yeah, let us not blame AI. Audible damaged the quality of audiobooks more than AI.
  
  Reply View | 0 replies
- no_wizard 3 days ago
  
  Bottom end, really. The middle end is still superior to this AI drivel.
  
  Reply View | 0 replies
felixhummel 4 days ago

I wholeheartedly agree. https://en.m.wikipedia.org/wiki/Stephen_Briggs got me hooked on Terry Pratchett's Discworld series. I loved "Going Postal".

Reply View | 1 reply
- IndrekR 4 days ago
  
  I know someone who listened to Terry Pratchett's "Wachen! Wachen!" audiobook on Spotify while living in Germany for a few years. It was so well narrated that he also acquired some peculiarities of the local dialects used by specific characters in the book. Locals in Bavaria were quite surprised by a foreigner speaking such language.
  
  Reply View | 0 replies
dmazin 4 days ago

Absolutely.
Even on the non-fiction side, the narration for Gleick's The Information adds something.
While I want this tool for all the stuff with no narration, NYT/New Yorker/etc replacing human narrators with AI ones has been so shitty. The human narrators sound good, not just average. They add something. The AI narrators are simply bad.

Reply View | 0 replies
ldoughty 3 days ago

I agree with you, but also want to point out:
New authors, self-publishers, can't afford tens of thousands of dollars to get an audiobook recorded professionally... This can limit their distribution.
Authors might even choose not to make such a version (or lack the confidence to record it themselves), so AI capable of making a decently passable version would be nice -- something more than reading text blandly. AI in theory could attempt to track the scene and adjust.

Reply View | 2 replies
- plorg 3 days ago
  
  By observation, the current approach is for authors to narrate the book themselves if they think their readers will want it and if they feel reasonably confident in their own narration.
  
  Reply View | 0 replies
- DidYaWipe 3 days ago
  
  You can get narrators to work on a royalty basis.
  
  Reply View | 0 replies
WillAdams 3 days ago

Yes, but if the alternative is not having a book, or having to listen to one poorly read (I love Librivox, but there are some books which I just haven't been able to finish because of readers, and many more which were nixed for family vacation travel listening on that account), this may be workable.

Reply View | 0 replies
micw 4 days ago

With this technology, one could produce high quality audio books without having access to high quality narrators by annotating the books with the voice, speed and such things.
I wonder if a standardized markup exists to do so.

Reply View | 7 replies
- albert_e 4 days ago
  
  There is SSML for speech markup to indicate various characteristics of speech like whispers, pronunciation, pace, emphasis, etc.
  With LLMs proving to be very good at generating code, it may be reasonable to assume they can get good at generating SSML as well.
  Not sure if there is a more direct way to channel the interpretation of the tone/context/emotion etc from prose into generated voice qualities.
  If we train some models on ebooks along with their professionally produced human-narrated audiobooks, with enough variety and volume of training data, the models might capture the essence of that human-interpretation of written text? Just maybe?
  Amazon with its huge collection of Audible + Kindle library -- if it can do this without violating any rights -- has a huge corpus for this. They already have "whispersync" which is a feature that syncs text in a kindle ebook with words in corresponding audible audiobook.
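
  To make the SSML idea concrete, here is a minimal Python sketch of what generated markup might look like. It uses only elements from the W3C SSML 1.1 vocabulary (`speak`, `prosody`, `emphasis`, `break`); the segment dictionary format and the `build_ssml` helper are assumptions made up for illustration, and individual TTS engines support different subsets of SSML (whispering, for instance, is typically a vendor extension).

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml(segments):
    """Build an SSML string from a list of segment dicts.

    Each segment is a hypothetical dict with a required "text" key and
    optional "rate" (e.g. "slow", "fast"), "emphasis" (e.g. "strong")
    and "pause_ms" (pause inserted after the segment) keys.
    """
    speak = ET.Element(
        "speak", {"version": "1.1", "xmlns": SSML_NS, "xml:lang": "en-US"}
    )
    for seg in segments:
        # <prosody rate="..."> controls the pace of the enclosed text.
        prosody = ET.SubElement(speak, "prosody", {"rate": seg.get("rate", "medium")})
        if seg.get("emphasis"):
            node = ET.SubElement(prosody, "emphasis", {"level": seg["emphasis"]})
        else:
            node = prosody
        node.text = seg["text"]
        if seg.get("pause_ms"):
            # <break time="..."/> inserts an explicit pause.
            ET.SubElement(speak, "break", {"time": f"{seg['pause_ms']}ms"})
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml([
        {"text": "Run!", "rate": "fast", "emphasis": "strong", "pause_ms": 400},
        {"text": "They waited."},
    ]))
```

  An LLM would only need to emit the segment list (or the markup directly); a deterministic builder like this keeps the output well-formed either way.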
  
  Reply View | 1 reply
  
  micw 4 days ago
  
  Good points, thank you! I just tested it. While ChatGPT was very good at adding generic (textual) annotations, the results for generating SSML were very poor (lack of voice names, lack of distinction between narrator and character, etc).
  Probably the results with a model trained for this plus human audit could lead to very good results.
  
  Reply View | 0 replies
- pegasus 4 days ago
  
  They still wouldn't be high quality. It's just not possible to capture the precise tone of voice in an annotation, and that precision I believe really makes a difference. My experience is that the deeper the narrator understands the text and conveys that understanding, the easier it becomes for me to absorb that information.
  
  Reply View | 1 reply
  
  vasco 4 days ago
  
  Have you tried those "podcast from a paper" models? They do some of the things you are saying they don't, although it's not 100% it's also miles ahead of for example human Polish TV lectors, or other monotone style narrations.
  
  Reply View | 0 replies
- KeplerBoy 4 days ago
  
  Don't end to end trained models already do this to some extent? Like raising the pitch towards a question mark, like a human would.
  TortoiseTTS has a few examples under prompt engineering on their demo site: https://nonint.com/static/tortoise_v2_examples.html
  
  Reply View | 2 replies
  
  micw 4 days ago
  
  That's a bit basic and random. Some models have the features you describe. From the better models you get a slightly different voice for text in quotes.
  But the difference to good audio books is that you have:
  * different voices for the narrator and each character
  * different emotions and/or speed in certain situations.
  I guess you could use an LLM to "understand" and annotate an existing book if there's a markup, and then use TTS to create an audio book from it and so automate most of the process.
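
  As a rough sketch of the annotate-then-synthesize pipeline, the Python below parses an annotated script into per-speaker segments that a TTS step could consume. The `[Speaker|emotion] text` line format is purely an assumption for illustration; no standard format is named in this thread, and an actual LLM's output would need its own parser.

```python
import re
from dataclasses import dataclass

# Hypothetical annotation format: "[Speaker|emotion] spoken text"
LINE_RE = re.compile(
    r"^\[(?P<speaker>[^|\]]+)\|(?P<emotion>[^\]]+)\]\s*(?P<text>.+)$"
)


@dataclass
class Segment:
    speaker: str
    emotion: str
    text: str


def parse_annotated(script):
    """Turn an annotated script into a list of Segment objects.

    Lines without an annotation are treated as neutral narration.
    Blank lines are skipped.
    """
    segments = []
    for raw in script.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = LINE_RE.match(line)
        if match:
            segments.append(Segment(
                match["speaker"].strip(),
                match["emotion"].strip(),
                match["text"].strip(),
            ))
        else:
            segments.append(Segment("Narrator", "neutral", line))
    return segments


if __name__ == "__main__":
    sample = """
    The door creaked open.
    [Anna|nervous] Is anyone there?
    [Ben|cheerful] Only me!
    """
    for seg in parse_annotated(sample):
        print(seg)
```

  Each segment could then be mapped to a voice (by speaker) and a speaking style (by emotion) in whatever TTS engine is used.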
  
  Reply View | 1 reply
  
  micw 4 days ago
  
  Edit: I actually tried this. I prompted in ChatGPT:
  "Annotate the following text with speakers and emotions so that it can be turned into an audiobook via TTS", followed by a short text from "The Hobbit" (The "Good morning scene"). The result is very good.
  
  Reply View | 0 replies
ahoka 4 days ago

I guess this is still very useful if you are blind.

Reply View | 12 replies
- loktarogar 4 days ago
  
  Yeah, for accessibility purposes on things that aren't already narrated, this kind of thing is huge.
  
  Reply View | 11 replies
  
  em-bee 4 days ago
  
  that's the thing. it's not just for accessibility. anything not already narrated is a fair target for TTS. i don't have time to sit down and read books. all reading is done on the go, while getting around or doing daily routines at home. i have a small book that i am reading now, which should take a few hours to finish, but in the time i manage to get done reading it i will probably have listened to two or three audio books.
  oh, and it's also a boon for those who can't afford to buy audiobooks.
  
  Reply View | 7 replies
  
  flir 3 days ago
  
  I was just thinking about automatically slapping an mp3 on every blog post, just an accessibility nicety.
  Can someone with low vision tell me if this would be useful to them? It may be that specialist tools already do this better.
  
  Reply View | 2 replies
taude 3 days ago

Agree with you on this.
My example: I was never a Wheel of Time fan, but the new audio editions done by Rosamund Pike are quite the performance, and make me like the story. She brings all the characters to life in a way that's different than just reading. It's a true performance.

Reply View | 0 replies
Oneunscripted 3 days ago

I guess using different narrators is essential for both fiction and non-fiction books if you want the full experience. Personally, I love it when audiobooks have narrators who stick to the characters’ personalities—it just feels right. Some of the audiobooks I’ve listened to have narrators who switch up their voices for each character, and others even use a different narrator for every character, which gets really good. Narration Box has been doing a really great job with this lately

Reply View | 0 replies
stevenwoo 3 days ago

A couple of my favorite audiobooks are Stranger in a Strange Land and Flowers for Algernon where the performer changes the intonation and enunciation of main character with the character’s journey and it was a revelation and made me appreciate the stories in a way I did not get reading the printed books the first time. Just the consistency of the performance is sometimes difficult to do in my imagination perhaps.

Reply View | 0 replies
whazor 3 days ago

A GenAI model that reads audiobooks with such dramatisation is really my dream. There are so many books that I would want to listen to, but still lack such an adaptation. Also it takes months after the book release before the audiobook gets released.
Just imagine what this would do for writers. They can get instant feedback and adjust their book for the audiobook.

Reply View | 0 replies
rd11235 3 days ago

I agree but the opposite can be true too. Sometimes the narrator seems to target some general audience that doesn’t fit me at all, in a way that makes me cringe when I listen, until I stop listening altogether. In these cases I’d rather listen to a relatively flat narration from a tool like this.

Reply View | 0 replies
gmuslera 3 days ago

Would a "better" AI do a "better" narration with a better understanding of the text? Of course, that would imply a different (and far bigger?) model.
Anyway, even if in theory it might, in practice things may end even worse than doing it with a monotone voice.

Reply View | 0 replies
lern_too_spel a day ago

On the other hand, there are a lot of narrators who are just bad, and the publisher is not going to pay for an alternate narration. These tools are a good way to re-narrate Wil Wheaton narrated books with correct pronunciation and inflection, for example.
Computer chess took a long time to get better than the best players in the world, but it was better than most chess players for many years before that. We're seeing that a lot with these generative models.

Reply View | 0 replies
Melomomololo 4 days ago

I like one speaker in one particular book.
He also narrates another scifi book series and honestly I dislike this a lot.
He became the voice of one particular character for me.
I would love variety

Reply View | 0 replies

delegate 3 days ago

The quality is great (amazing even), but I can't listen to AI generated voices for more than 1 minute. I don't know why, I just don't like it. I immediately skip the video on youtube if the voice is AI generated.

Might be because our brains try to 'feel' the speaker, the emotion, the pauses, the invisible smile, etc.

No doubt models will improve and will be harder to identify as AI generated, but for now, as with diffusion images, I still notice it and react by just moving on..

Reply View 7 replies

rockemsockem 3 days ago

That kinda means the quality isn't great or amazing. Good TTS should be nearly or entirely indistinguishable from a human speaker and should include emoting, natural pauses, etc.

Reply View | 0 replies
CMay 3 days ago

Haven't really been following the latest in TTS ML, but I expected this to be better or at least as good-bad as the stuff you hear on YouTube. Somehow it sounds worse. It really is jarring to listen to any of these ML voices and can't really stand it. Nope out of every video that uses them and can't tell if YouTube never recommends them to me for that reason, or just because the recommendations around what I watch are just so rarely going to be from some low reputation channel.
Take a moment here for a second though and think about it. Even if these voices got to be really good, indistinguishable almost... would I want to listen to it even then? If it was an NPC's generated voice and generated dialogue in a game to help enrich the world building, maybe in that context. On YouTube or with newscasters? Probably not. Audio books? Think I would still rather have it be a real person, because it's like they're reading a story to me and it feels better if it's coming from someone. There's also the unknown factor, where if it's ML generated it's so sterile that the unknowns are kind of gone.
Think about it like this, in the movie industry we had practical effects that were charming in a way. You could think about the physical things that had to occur to make that happen. Movie magic. Now, everything is so CG it's like the magic is gone. Even though you know people put serious hard work into it, there's a kind of inauthenticity and just lack of relevance to the real world that takes something away from it.
It's like a real magician has interesting tricks, while an artificial magician is most likely just a liar.
Still, I grant that it makes some cool things possible and there is potential if things are done right. Some positive mixture of real humans and machine generated stuff so it isn't devoid of anything connected to real life effort.

Reply View | 0 replies
_DeadFred_ 3 days ago

For new generations/those coming up now this will be the norm and will not generate the negative reaction it does for us. It will just be part of how the world is and has always been, and eventually we will be the minority.
Future generations will never know a world where you don't watch a 2 hour AI generated orientation video about the wonders of working for Generic Corp when you start a new job.

Reply View | 0 replies
yjftsjthsd-h 3 days ago

> I immediately skip the video on youtube if the voice is AI generated.
I mean, I do that because it's correlated with the content being garbage. If I'm intentionally using it on content I want to consume I expect it to be different, though I haven't gotten around to trying it properly yet so I guess we'll see. (OTOH I already listen to ebooks via pre-AI TTS, so I'm optimistic)

Reply View | 0 replies
xdennis 3 days ago

Among other things, what I don't like is the hallucinated stress. Take the classic example of:
> I never said she stole my money
It can have 7 different meanings based on which word you stress.
The new AI voices sound very natural at a shallow level, but overall pronounce things in odd ways. Not quite wrong, but subtly unnatural, which introduces some cognitive load.
Old TTS systems with their monotone voices are less confusing, but sound very robotic.
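
A small Python sketch of the stress example: it generates the seven variants of the sentence, each with a different word wrapped in an SSML `<emphasis>` element. The `stress_variants` helper is made up for illustration, and whether a given TTS engine actually honors `<emphasis>` is engine-specific.

```python
from xml.sax.saxutils import escape


def stress_variants(sentence):
    """Return one SSML string per word, each stressing a different word."""
    words = sentence.split()
    variants = []
    for i in range(len(words)):
        # Escape every word so characters like & or < stay valid XML.
        parts = [escape(word) for word in words]
        parts[i] = f'<emphasis level="strong">{parts[i]}</emphasis>'
        variants.append("<speak>" + " ".join(parts) + "</speak>")
    return variants


if __name__ == "__main__":
    for variant in stress_variants("I never said she stole my money"):
        print(variant)
```

Each variant implies something different (who said it, who stole it, what was stolen), which is exactly the information plain text does not carry and a narrator has to infer from context.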

Reply View | 1 reply
- DidYaWipe 3 days ago
  
  erroneous or inappropriate ≠ hallucinated
  
  Reply View | 0 replies
karmasimida 3 days ago

Yeah same.
Doesn't mean the quality is bad. In fact I think Kokoro's quality is amazing.
But it is not the right tool for narration: the kind of training data they use makes the sound too flat, if that makes sense.

Reply View | 0 replies

swores 4 days ago

Can anyone recommend an open source option that would allow training on a custom voice (my own, so I'd be able to record as many snippets as it needed to train on) to allow me to use it for TTS generation without sharing it off my machine?

Edit: I'll wait to see if any recommendations get made here, if not I might give this one a go: https://github.com/coqui-ai/TTS

Reply View 7 replies

hm64 3 days ago

Coqui is great, but in practice, I found Piper easier to set up, train, and deploy as an ONNX file. Big thanks to the Sherpa development team for their helpful resources: https://k2-fsa.github.io/sherpa/onnx/tts/piper.html and to the Rhasspy team for their training guide: https://github.com/rhasspy/piper/blob/master/TRAINING.md.
I also found DEMUCS + Whisper + pydub to be a super helpful combo for creating quality datasets.
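
For the dataset step, the end product is usually a folder of short clips plus a transcript index. The Python sketch below assumes an LJSpeech-style layout (a pipe-delimited `metadata.csv` with one `clip_id|transcript` row per clip), which is my understanding of what the Piper training guide linked above expects; check the guide before relying on it. The clip ids and transcripts are placeholders standing in for the output of the separation, slicing, and transcription steps.

```python
from pathlib import Path


def build_metadata(clips):
    """Build LJSpeech-style rows ("clip_id|transcript") from (id, text) pairs.

    Whitespace is normalized, pipe characters are removed from transcripts
    (they would break the delimiter), and empty or duplicate clips are dropped.
    """
    rows = []
    seen = set()
    for clip_id, transcript in clips:
        text = " ".join(transcript.replace("|", " ").split())
        if not text or clip_id in seen:
            continue
        seen.add(clip_id)
        rows.append(f"{clip_id}|{text}")
    return rows


def write_metadata(dataset_dir, rows):
    """Write the rows to <dataset_dir>/metadata.csv and return the path."""
    path = Path(dataset_dir) / "metadata.csv"
    path.write_text("\n".join(rows) + "\n", encoding="utf-8")
    return path


if __name__ == "__main__":
    example = [
        ("clip_0001", "  Hello   there. "),
        ("clip_0002", ""),  # failed transcription, dropped
        ("clip_0003", "This is my own voice."),
    ]
    for row in build_metadata(example):
        print(row)
```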

Reply View | 0 replies
phrotoma 4 days ago

https://github.com/DrewThomasson/ebook2audiobook

Reply View | 0 replies
drewbitt 3 days ago

There is a fork here https://github.com/idiap/coqui-ai-TTS 'coqui-tts'
Though according to the TTS leaderboard, Fish Speech https://github.com/fishaudio/fish-speech and Kokoro are higher.
https://huggingface.co/hexgrad/Kokoro-82M
https://huggingface.co/fishaudio/fish-speech-1.5

Reply View | 1 reply
- xnx 3 days ago
  
  AFAIK Kokoro can't be fine tuned
  
  Reply View | 0 replies
numpad0 4 days ago

I think you can probably generate TTS audio by classical means, and voice2voice that audio through RVC or Beatrice V2. Haven't looked into it in a while but Beatrice is apparently super fast and CPU only.

Reply View | 0 replies
jsemrau 3 days ago

I wrote this a while ago about xTTSv2 mixed with Nvidia's Nemo. Maybe it kicks off your journey.
https://jdsemrau.substack.com/p/teaching-your-agent-to-speak...

Reply View | 0 replies
esskay 3 days ago

If I recall Coqui is very much a dead project, just one to be aware of.

Reply View | 0 replies

pprotas 4 days ago

I would love to have an e-reader that allows me to switch between text and audio at the press of a button. Imagine reading your book on the couch and then switching into audio mode while doing the dishes seamlessly, by connecting bluetooth headphones.

Reply View 15 replies

InsideOutSanta 4 days ago

Kindles used to provide this feature, but publishers and/or the Authors Guild stopped it, because audio rights and text rights are handled differently. In other words, when Amazon sells you a text book, it does not have the right to then also do TTS on that text and let you listen to it.
There's some contemporary discussion of what happened here: https://tidbits.com/2009/03/02/why-the-kindle-2-should-speak...
I think there is still integration with Audible, though. If you buy a book on the Kindle and on Audible, the position will sync, and you can switch between listening and reading without losing your place in the book.

Reply View | 4 replies
- albert_e 4 days ago
  
  Yes the feature is called WhisperSync -- I used it many years ago and it was pretty good.
  I tried it while on a treadmill so it allowed me to follow the book with more focus without sacrificing much else.
  
  Reply View | 1 reply
  
  thfuran 4 days ago
  
  Isn't whisper sync the current version that relies on owning both the ebook and audiobook?
  
  Reply View | 0 replies
- hamzakc 3 days ago
  
  I am not sure if this still works, but 2-3 years ago I listened to a kindle book that I bought through my Echo show device. It was pretty good. I listened to it while I was cooking. It even allowed you to carry on where you left off. But I did notice that a few pages were skipped as I had read the book before. I have since packed away my echo show so I can't verify if they have removed this feature or not.
  
  Reply View | 0 replies
- Brybry 4 days ago
  
  I used that TTS feature semi-regularly on a Kindle 2.
  It wasn't a good experience but it was nice to be able to keep 'reading' a book while I was exercising.
  It worked for me for over a decade, until I broke the device. I don't know if I never updated the firmware or if the fact I used Calibre to convert books bypassed the feature gate.
  
  Reply View | 0 replies
dsign 4 days ago

It is a supported feature in the epub 3.0 standard. It's possible to distribute an epub with audio, and have the audio sync to the HTML elements that form the ebook's text. And there is an e-reader that actually supports this feature, I can't remember which one now but it should be possible to find it with Google.
It's more of an open problem how to create those epubs. I have some code that can do it using Elevenlabs audio, but I imagine it way harder to have something similar for a human narrator.... who's going to do the sync? Maybe we need a sync AI.
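
For reference, the sync data in an EPUB 3 media overlay is a SMIL file in which each `<par>` pairs a text fragment with an audio clip range. Below is a minimal Python sketch that generates one; the file names, fragment ids, and timings are hypothetical, and a complete EPUB also needs the overlay declared in the package manifest, which is not shown here.

```python
import xml.etree.ElementTree as ET


def clock(seconds):
    """Format seconds as an h:mm:ss.mmm clock value."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours}:{minutes:02d}:{secs:02d}.{ms:03d}"


def build_overlay(text_doc, audio_file, cues):
    """Build a SMIL media overlay string.

    cues is a list of (fragment_id, start_seconds, end_seconds) tuples,
    one per sentence or paragraph element in the XHTML document.
    """
    smil = ET.Element(
        "smil", {"xmlns": "http://www.w3.org/ns/SMIL", "version": "3.0"}
    )
    body = ET.SubElement(smil, "body")
    for n, (fragment_id, start, end) in enumerate(cues, start=1):
        par = ET.SubElement(body, "par", {"id": f"par{n}"})
        # The text element points at an id inside the ebook's XHTML.
        ET.SubElement(par, "text", {"src": f"{text_doc}#{fragment_id}"})
        # The audio element selects the matching slice of the recording.
        ET.SubElement(par, "audio", {
            "src": audio_file,
            "clipBegin": clock(start),
            "clipEnd": clock(end),
        })
    return ET.tostring(smil, encoding="unicode")


if __name__ == "__main__":
    print(build_overlay("ch1.xhtml", "audio/ch1.mp3",
                        [("s1", 0.0, 3.5), ("s2", 3.5, 7.25)]))
```

With synthesized audio the timings come for free, since each sentence is generated separately; for a human narrator they would have to come from a forced-alignment step.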

Reply View | 0 replies
freefaler 4 days ago

You can do it easily with non-DRM books (or DRM stripped books):
For Android:
- Moon+ Reader Pro - some paid high-quality TTS voices (like Acapela)
For iOS:
- Kybook reader and internal iOS voices (no external TTS voices for the walled garden)
This works well enough to listen to a book while you walk and when you get back home read on the WC from the place you stopped.
Additionally, if you buy a tablet or an Android ebook reader, you install the app there and you can continue on your bigger/better device seamlessly.
Whisper-sync for the masses! Ahoy...

Reply View | 4 replies
- basedrum 4 days ago
  
  But you need an Android phone, and can't use a Kobo or similar e-ink reader?
  
  Reply View | 3 replies
  
  freefaler 3 days ago
  
  For iOS you use Kybook on your iPhone and your iPad. It syncs positions between the devices. When you go for a walk, open Kybook and start TTS. When back home, open your tablet and you'll see the page where TTS stopped reading.
  
  Reply View | 2 replies
monkeydust 3 days ago

Literally started doing that this week with Amazon Audible. I gave in and started the three month 99c trial and downloaded the app.
What surprised me in a good way was that my Kindle app was aware of this and asked if I wanted to download the Audible version of the book I am currently reading.
Been listening on the way to work and then reading on the way back. Enjoying it so far.

Reply View | 1 reply
- mmahemoff 3 days ago
  
  Some Kindle books also have a checkbox to add the audio (for a fee) when you buy it. Sometimes I’ve seen books discounted to e.g. £0.99, but adding the audio might be £5.99. The upsell seems to be a good hack for adding some revenue when there’s a deep discount being used to drive interest.
  
  Reply View | 0 replies
llamaimperative 3 days ago

Boox Ultra Tab whatever the fuck (their product naming sucks) + Readwise Reader = amazing for this
Not quite seamless but it works. It has a cursor that follows the words as they’re spoken, which allows you to read and hear (“immersive reading”), which I find to be extremely helpful for maintaining focus.

Reply View | 0 replies
leobg 2 days ago

iOS Voice Dream Reader. First app I install on a new iPhone since 2010 I believe. I will even cut and scan physical books just so I can read them in the app. The story of the guy who made it is also interesting!

Reply View | 0 replies

qurashee 4 days ago

This looks incredible! I’ve had an idea simmering in the back of my mind for a while now: creating an audiobook from an ebook for my commute using the voice of a specific audiobook narrator I really enjoy. The concept struck me after coming across the Infinite Conversation project here on HN. Unfortunately, I just haven’t found the time to bring it to life yet. :(

Reply View 10 replies

leobg 2 days ago

Made this for my kids for Christmas:
- take an ebook in any language
- AI translates it to German
- AI speaks it using the voice of their fav narrator
- a UI showing the text as it is being read
Now they can read Asimov, Kurlansky, Bryson, regardless of whether a translation or audio version exists. :)

Reply View | 0 replies
vinni2 4 days ago

What about the copyright issue? You can’t mimic the voice of a narrator without their consent. OpenAI landed in trouble after using Scarlett Johansson’s voice in a demo.
https://www.theverge.com/2024/5/20/24161253/scarlett-johanss...

Reply View | 8 replies
- notachatbot123 4 days ago
  
  No limitations on this kind of thing if you are in private use.
  
  Reply View | 2 replies
  
  vinni2 4 days ago
  
  Forgive me for not knowing it was for personal use.
  
  Reply View | 0 replies
  
  qurashee 4 days ago
  
  Indeed I was thinking about private use only.
  
  Reply View | 0 replies
- benatkin 4 days ago
  
  She only won in that OpenAI decided it wasn’t worth the trouble.
  
  Reply View | 3 replies
  
  K0balt 3 days ago
  
  Yeah, by my ear it was pretty clearly not SJ’s voice-likeness, although there were some superficial similarities.
  But some people could have mistaken it due to some regional accent similarities, though it would be akin to interpreting any light southern drawl with a similar timbre as being SJ.
  
  Reply View | 2 replies
- amrrs 4 days ago
  
  Kokoro explicitly mentions that they used only permissively licensed voice data.
  
  Reply View | 0 replies

cwmoore 4 days ago

The word “kokoro” means “heart” in Japanese, which I learned making the (heart shaped and paperback) puzzle books at https://www.kakurokokoro.com/

Reply View 2 replies

tkgally 4 days ago

Note that kokoro (心) means “heart” in the sense of “spirit,” “soul,” “mind,” “emotions,” etc. It doesn’t mean “heart” in the sense of “internal organ that pumps blood.” That is shinzō (心臓).
I once heard an American friend with so-so Japanese ability ask a Japanese woman who had recently had a heart operation how her kokoro was doing, and she looked surprised and taken aback.
Side note: After I started reading HN in 2019, I was struck by how many tech products mentioned here have Japanese names. I compiled a list for a few years and eventually posted it:
https://news.ycombinator.com/item?id=31310370

Reply View | 0 replies
terhechte 4 days ago

It's also the name of the AI in Terminator Zero https://villains.fandom.com/wiki/Kokoro
I'm not sure if that is related here.

Reply View | 0 replies

Dowwie 3 days ago

2025 may be the year where we can generate a dramatic audiobook with ambient music, sound effects, and theatrical narration using neural networks. Many of the parts already exist.