Every time I try to convert a PDF to epub or something, or OCR one that doesn’t actually have selectable text, it turns out shit. I assume the real reason people would want to get LLMs involved is that there is actually a lot of ambiguity in what a correct conversion would be, and there are a lot of PDFs out there.
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !programmerhumor@lemmy.ml
Post funny things about programming here! (Or just rant about your favourite programming language.)
Rules:
Posts must be relevant to programming, programmers, or computer science.
No NSFW content.
Jokes must be in good taste. No hate speech, bigotry, etc.
Every time I try to convert a PDF to epub or something, or OCR one that doesn’t actually have selectable text, it turns out shit. I assume the real reason people would want to get LLMs involved is that there is actually a lot of ambiguity in what a correct conversion would be, and there are a lot of PDFs out there.
I self host sterling-pdf and I haven’t had an issue with file conversion in… When did I set this thing up?
To be truthful, the machine I had it running on has been sent to the grave (I sold it) so I don’t actually have this service running right now