PDF text extraction?

-hh hh at livecode.org
Fri Apr 1 13:08:54 EDT 2016


My first choice for that is 'ghostscript'.

Richard Gaskin wrote
> I may need to extract text from a fair number of PDFs (hundreds).  I can 
> find all sorts of third-party tools to do that, many of them free and 
> easy to use, but I'd prefer to integrate this step into some other 
> things I need to do with the files.
> 
> The format isn't as simple as Word or docx, though.  I'm not even sure 
> if we have support in LC for the compression used in the text streams. 
> Lots of parts there.
> 
> Anyone here have a library or external for extracting text from PDFs? 
> Ideally a good solution would be available for Win, Mac, and Linux.





--
View this message in context: http://runtime-revolution.278305.n4.nabble.com/PDF-text-extraction-tp4702906p4702937.html
Sent from the Revolution - User mailing list archive at Nabble.com.




More information about the Use-livecode mailing list