How to structure HTML text (tags and attributes) for processing in LiveCode?

stephen barncard stephenREVOLUTION2 at barncard.com
Sun Jun 12 05:57:41 EDT 2011


Jerry Daniels has an excellent series of video lessons on screen
scraping.

http://revmentor.com/business-logic-screen-scraping-1
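
As a rough illustration of the "tag = line, attribute = item" layout asked
about in the quoted message below, here is a minimal sketch. It is written
in Python rather than LiveCode, using the standard html.parser module. The
names TagLineBuilder, tags_as_lines and ITEM_DELIM are made up for this
example, and the tab delimiter is an assumption (tab rarely occurs inside
attribute values). It is not taken from the video series above.

```python
from html.parser import HTMLParser

# Assumed item delimiter: tab rarely appears inside attribute values.
ITEM_DELIM = "\t"


class TagLineBuilder(HTMLParser):
    """Collects one line per start tag: the tag name, then name=value items."""

    def __init__(self):
        super().__init__()
        self.lines = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value is None for
        # attributes written without a value (e.g. "disabled").
        items = [tag]
        for name, value in attrs:
            items.append(name if value is None else f"{name}={value}")
        self.lines.append(ITEM_DELIM.join(items))


def tags_as_lines(html_text):
    """Return a list of delimited lines, one per start tag in html_text."""
    builder = TagLineBuilder()
    builder.feed(html_text)
    builder.close()
    return builder.lines


if __name__ == "__main__":
    sample = '<p class="intro"><a href="http://example.com/" title="Example">link</a></p>'
    for line in tags_as_lines(sample):
        print(line)
```

Text shaped this way should be easy to walk in LiveCode with nested
"repeat for each line" and "repeat for each item" loops once the
itemDelimiter is set to tab.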


On 12 June 2011 02:27, Keith Clarke <keith.clarke at clarkeandclarke.co.uk> wrote:

> Hi folks,
> Local rainy Saturday night broadband load prevented me from seeing the
> whole of Colin Holgate's fascinating LiveCode Live presentation on working
> with web page source HTML text - so I can't wait for the recording!
>
> Meanwhile, I'm trying to extract various html tags and specific attributes
> from a page's source code - you know, this and that, where
> <tag stuff="this" other_stuff="that"></tag>
>
> I'm trying to create the situation where I can iterate through the text
> using something like 'repeat for each tag' and within that loop, 'repeat for
> each attribute' - the question is, how to get the source HTML text
> structured and delimited so that 'HTML tag = line' and 'HTML tag attribute =
> Item'
>
> Given there are no obvious single character itemDelimiters in HTML and the
> inefficiency of building-up an algorithm from scratch with chunk functions,
> are any specialised resources, techniques or tricks available - maybe I
> missed something in the libURL feature-set?
> Best,
> Keith..



-- 



Stephen Barncard
San Francisco Ca. USA

more about sqb  <http://www.google.com/profiles/sbarncar>


