PDF text extraction?
-hh
hh at livecode.org
Fri Apr 1 13:08:54 EDT 2016
My first choice for that is Ghostscript ('gs' on the command line), which is free and runs on Win, Mac, and Linux.
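
A rough sketch of how a batch run might look, written in Python only for illustration. It assumes Ghostscript is installed and reachable as 'gs' (on Windows the console executable is usually named 'gswin64c' or 'gswin32c'), and it relies on Ghostscript's 'txtwrite' output device. The function names build_gs_command and extract_folder are made up for this example, not part of any library.

```python
import shutil
import subprocess
from pathlib import Path


def build_gs_command(pdf_path, txt_path, gs_executable="gs"):
    """Build the argument list for one Ghostscript text-extraction run.

    gs_executable is an assumption: 'gs' on Mac/Linux, typically
    'gswin64c' on Windows.
    """
    return [
        gs_executable,
        "-q",                        # suppress the startup banner
        "-dNOPAUSE",                 # do not pause between pages
        "-dBATCH",                   # exit when the file has been processed
        "-sDEVICE=txtwrite",         # Ghostscript's plain-text output device
        f"-sOutputFile={txt_path}",  # where the extracted text is written
        str(pdf_path),
    ]


def extract_folder(folder, gs_executable="gs"):
    """Run Ghostscript on every *.pdf in folder, writing a .txt beside each."""
    if shutil.which(gs_executable) is None:
        raise FileNotFoundError(f"{gs_executable} not found on PATH")
    written = []
    for pdf in sorted(Path(folder).glob("*.pdf")):
        txt = pdf.with_suffix(".txt")
        # check=True raises CalledProcessError if Ghostscript reports a failure
        subprocess.run(build_gs_command(pdf, txt, gs_executable), check=True)
        written.append(txt)
    return written
```

The same argument list can be handed to any environment that is able to launch an external process, so the step could sit alongside other processing of the files. Output quality depends on how each PDF was produced; scanned pages without a text layer will yield little or nothing.
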
Richard Gaskin wrote:
> I may need to extract text from a fair number of PDFs (hundreds). I can
> find all sorts of third-party tools to do that, many of them free and
> easy to use, but I'd prefer to integrate this step into some other
> things I need to do with the files.
>
> The format isn't as simple as Word or docx, though. I'm not even sure
> if we have support in LC for the compression used in the text streams.
> Lots of parts there.
>
> Anyone here have a library or external for extracting text from PDFs?
> Ideally a good solution would be available for Win, Mac, and Linux.
--
View this message in context: http://runtime-revolution.278305.n4.nabble.com/PDF-text-extraction-tp4702906p4702937.html
Sent from the Revolution - User mailing list archive at Nabble.com.