Comment by 6510

Comment by 6510 4 days ago

4 replies

I've often wanted a bulk tool that takes the title or some other easy to find value from a pdf and renames the file to that.

matthewshere 4 days ago

Appreciate you sharing that requirement!

The need for batch processing to pull out targeted data points from PDFs (rather than converting the whole document) is a valuable insight.

While the current tool focuses on full conversion to Markdown, enhancing https://pdftomarkdown.pro/ to handle specific data extraction tasks like yours is definitely something I'll consider carefully for the future roadmap. Thanks for highlighting it!

voidUpdate 4 days ago

Unfortunately, PDFs are right buggers to work with and there often isn't an "easy to find value" for anything

  • matthewshere 4 days ago

    You're absolutely right, PDFs can be incredibly tricky. That lack of a consistent, easily parsable structure for arbitrary data is the core challenge.

  • 6510 4 days ago

    I mean easy in the PDF sense. I have folders full of randomString.pdf and name(15).pdf but those that share a folder all have the same layout.