Comment by tough

Comment by tough 4 days ago

8 replies

I had edited prior message but would be great if supported PDF rendering as good as docx! (for research mostly, not a lawyer)

piker 4 days ago

One thing it does is cast the PDF to grayscale to render for speed (lawyers rarely care about the color of a legal doc in PDF), so perhaps I'll make that an adjustable setting which will trade off speed for clarity. Otherwise, it uses PDFium for PDF rendering so it should come out just as well as what you see in a Chromium-based browser.

  • tough 4 days ago

    might be an issue with DPI / Retina screens on macOS?

    it's really more blurry than any pdf on chrome https://imgur.com/a/AElOuaA

    • piker 4 days ago

      Oh, yes, that is also a cheat code to render them quicker -- it downsamples the render to save time. That will improve in future versions as Tritium uses spare cycles to increase the resolution.

      • piker 4 days ago

        ^ yes it does greedily ingest the PDFs to allow for instantaneous search. What it needs (and doesn't yet do) is at least multi-thread that parsing step to get a 2-4x speedup.

      • tough 4 days ago

        I also noticed when i opened a folder with hundreds of big pdf's it choked on it

        seems you're doing work on all files when loading, you should be more like lazy-loading?

yauneyz 4 days ago

If this is Electron, try pdf.js - really good rendering, you can create a text layer (for text selection, etc). Probably the best result per effort you can get