PDF

Paul Dupuis paul at researchware.com
Sat May 12 15:36:01 EDT 2018


I hear you. Too bad though because with XPDF, the function to fetch all
the text from a PDF is just:

function pdfText pFile
  local tUnicodePageText, tUnicodeDocumentText
  create inv group "pdfViewer" in this card
  XPDFViewer_Open "pdfViewer",the windowID of this stack
  XPDFViewer_Set "pdfViewer","filename",pFile
  put XPDFViewer_Get("pdfViewer","pageCount") into pageCount
  repeat with pageNumber=1 to pageCount
    XPDFViewer_Unicode "pdfViewer",pageNumber,tUnicodeText
    put tUnicodeText after tUnicodeDocumentText
  end repeat
  XPDFViewer_Close "pdfViewer"
  delete group "pdfViewer"
  return textDecode(tUnicodeDocumentText,"UTF16")
end pdfText

Maybe LiveCode will have a sale some day in the future that will meet
your price point.

For other who may have a Business License or just interested in what you
COULD do with XPDF, I am speaking about it at this Thursdays's LiveCode
Global.

On 5/12/2018 2:30 PM, Mike Bonner via use-livecode wrote:
> Ty Paul.  Unfortunately, a business license is way outside my level of
> affordability.  Looking into alternative methods to generate the index now,
> though I'd still love to find an lc way to do it.
>
> On Sat, May 12, 2018 at 11:30 AM, Paul Dupuis via use-livecode <
> use-livecode at lists.runrev.com> wrote:
>
>> If you have a Business License, you can use the XPDF external available
>> with those editions for doing that.
>>
>> On 5/12/2018 12:58 PM, Mike Bonner via use-livecode wrote:
>>> I haven't needed to do this before, but is there a (relatively) easy way
>> to
>>> extract the text from a bunch of pdf files?  I'm hoping I can build some
>>> indexes for the boatload of files I want to go through.  (THough, I
>> guess I
>>> could bipass LC and just grep my heart out)
>>>
>>> Any suggestions?
>>> _______________________________________________
>>> use-livecode mailing list
>>> use-livecode at lists.runrev.com
>>> Please visit this url to subscribe, unsubscribe and manage your
>> subscription preferences:
>>> http://lists.runrev.com/mailman/listinfo/use-livecode
>>>
>>
>> _______________________________________________
>> use-livecode mailing list
>> use-livecode at lists.runrev.com
>> Please visit this url to subscribe, unsubscribe and manage your
>> subscription preferences:
>> http://lists.runrev.com/mailman/listinfo/use-livecode
>>
> _______________________________________________
> use-livecode mailing list
> use-livecode at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
> http://lists.runrev.com/mailman/listinfo/use-livecode
>





More information about the use-livecode mailing list