PDF
Paul Dupuis
paul at researchware.com
Sat May 12 15:36:01 EDT 2018
I hear you. Too bad, though, because with XPDF the function to fetch all
the text from a PDF is just:
function pdfText pFile
   local tUnicodePageText, tUnicodeDocumentText, tPageCount, tPageNumber
   -- create a hidden group to host the XPDF viewer
   create invisible group "pdfViewer" in this card
   XPDFViewer_Open "pdfViewer", the windowID of this stack
   XPDFViewer_Set "pdfViewer", "filename", pFile
   put XPDFViewer_Get("pdfViewer", "pageCount") into tPageCount
   -- append the text of each page to the document text
   repeat with tPageNumber = 1 to tPageCount
      XPDFViewer_Unicode "pdfViewer", tPageNumber, tUnicodePageText
      put tUnicodePageText after tUnicodeDocumentText
   end repeat
   -- clean up the viewer and its temporary group
   XPDFViewer_Close "pdfViewer"
   delete group "pdfViewer"
   return textDecode(tUnicodeDocumentText, "UTF16")
end pdfText
Maybe LiveCode will have a sale some day in the future that will meet
your price point.
For others who may have a Business License, or who are just interested in
what you COULD do with XPDF, I am speaking about it at this Thursday's
LiveCode Global.
On 5/12/2018 2:30 PM, Mike Bonner via use-livecode wrote:
> Ty Paul. Unfortunately, a business license is way outside my level of
> affordability. Looking into alternative methods to generate the index now,
> though I'd still love to find an lc way to do it.
>
> On Sat, May 12, 2018 at 11:30 AM, Paul Dupuis via use-livecode <
> use-livecode at lists.runrev.com> wrote:
>
>> If you have a Business License, you can use the XPDF external available
>> with those editions for doing that.
>>
>> On 5/12/2018 12:58 PM, Mike Bonner via use-livecode wrote:
>>> I haven't needed to do this before, but is there a (relatively) easy way to
>>> extract the text from a bunch of pdf files? I'm hoping I can build some
>>> indexes for the boatload of files I want to go through. (Though, I guess I
>>> could bypass LC and just grep my heart out)
>>>
>>> Any suggestions?
>>> _______________________________________________
>>> use-livecode mailing list
>>> use-livecode at lists.runrev.com
>>> Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
>>> http://lists.runrev.com/mailman/listinfo/use-livecode
>>>
>>