richard reasoned be aware that you will lose one the most valuable things about a PDF document: the embedded text layer. Oh, no, there’s something more valuable, which you also lose: anything over 72 dpi! I played with a pdf library for python for a bit, as well as raw pdf, without much luck.