Comment by jmrm
Some time ago I was toying around with a library called [MuPDF](https://www.mupdf.com/) for something related, and with that library and a small Python script you can programmatically OCR any book you want.
That library is free for personal or open source projects, but paid for commercial ones