Comment by victorbjorklund
Comment by victorbjorklund 13 hours ago
PDF:s are not great on mobile. And you can’t easily translate them (I often translate restaurant menus when they are on a website with just 2 clicks)
Comment by victorbjorklund 13 hours ago
PDF:s are not great on mobile. And you can’t easily translate them (I often translate restaurant menus when they are on a website with just 2 clicks)
There's been a translate button for years which hooks deep into every nook and cranny of the website's HTML. It works great, it's built in and many restaurants even advertise it for tourists, because it's a zero-effort translation of their existing menu. Plus, it's low-data when you're inside a 1-bar basement restaurant.
Using an LLM to translate the visible part of a PDF on a mobile... seems like the worst possible solution to the problem.
Translating PDFs is more complicated than that because the strcture of a PDF document doesn't lend itself well to this kind of thing.
For example: if there's a dish name with a 2 line description below it and some allergy symbols below that, in HTML you can imagine the document structure that produces that. In PDF terms that might be 4 separate objects and, in particular, the eyes can see the two lines are adjacent so they fit together but the document structure doesn't really represent it taht way, necessarily.
This might also not work with translation because the lines are set for the size of the text they contain. Same for resizing the font.
Put another waay, PDF should be viewed as a typeset and layout format, not a document format.
I think you're misunderstanding what I'm describing. It's getting a screenshot of the visible portion of the rendered document, not the document itself with all the tags and nastiness inside. The same feature works with a photo of handwritten text, where obviously no digital document exists. It's not perfect, but usually adequate for menu translation.
Translating anything that renders on my screen is the same two clicks to open an LLM with the screen contents. I expect that will become an increasingly universal experience as LLM features get shoved into every nook and cranny of tech.