htmlText, xHTML and revXML

David Bovill david at openpartnership.net
Fri Dec 22 19:23:08 EST 2006


I made a brief start at using Rev's XML external to parse htmlText, thinking it
would be easy as htmlText is HTML - but not so. Has anyone done this?

It seems that the problem arises with the xHTML entities. At least, these are
what choke the parser.
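
A minimal sketch of the suspected entity problem, written in Python rather than
Rev script and using Python's standard-library XML parser, not revXML, so it is
only an analogy. XML predefines just five entities (amp, lt, gt, quot, apos);
anything else, such as &nbsp;, is undefined unless a DTD declares it, and a
strict parser rejects it. The helper names and the <root> wrapper below are
hypothetical. The wrapper assumes the htmlText is a run of <p> elements with no
single root element.

```python
import re
import xml.etree.ElementTree as ET
from html.entities import name2codepoint

# The only entity names an XML parser knows without a DTD.
XML_PREDEFINED = {"amp", "lt", "gt", "quot", "apos"}


def to_numeric_entities(text):
    """Rewrite HTML named entities (e.g. &nbsp;) as numeric references."""
    def swap(match):
        name = match.group(1)
        if name in XML_PREDEFINED or name not in name2codepoint:
            return match.group(0)  # leave XML's own (or unknown) names alone
        return "&#%d;" % name2codepoint[name]

    return re.sub(r"&([A-Za-z][A-Za-z0-9]*);", swap, text)


def parse_fragment(html_text):
    """Parse an htmlText-like fragment after making its entities XML-safe."""
    # Hypothetical wrapper element: XML requires exactly one root.
    return ET.fromstring("<root>" + to_numeric_entities(html_text) + "</root>")


sample = "<p>caf&eacute;&nbsp;&amp; bar</p>"

try:
    ET.fromstring(sample)          # strict XML parse of the raw text
    raw_parse_failed = False
except ET.ParseError:
    raw_parse_failed = True        # undefined entity: the parser gives up

root = parse_fragment(sample)      # the same text after entity rewriting
```

If the same cause applies to revXML, a similar pass of replacing named entities
with numeric ones before handing the text to the parser may help, but that is
untested here.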

As an aside - despite searching the archives for a very long time, I
cannot find the previous post on how to parse HTML to extract all image
links or href links... I remember some clever replacing and filtering going
on... but I forget the sequence...

Does anyone have some scripts for extracting all anchors (i.e. <a name="http:....">)
or href/image links from htmlText?
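
For illustration only, here is a rough sketch of the kind of extraction being
asked about. It is in Python, not Rev script, and is not the replace-and-filter
approach from the earlier post. It uses the tolerant HTMLParser class from
Python's standard library, which does not mind undeclared entities or unclosed
tags. The names LinkCollector and collect_links are hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href targets, named anchors and image sources."""

    def __init__(self):
        super().__init__()
        self.hrefs = []    # values of <a href="...">
        self.anchors = []  # values of <a name="...">
        self.images = []   # values of <img src="...">

    def handle_starttag(self, tag, attrs):
        # The parser lowercases tag and attribute names before this call.
        attrs = dict(attrs)
        if tag == "a":
            if attrs.get("href"):
                self.hrefs.append(attrs["href"])
            if attrs.get("name"):
                self.anchors.append(attrs["name"])
        elif tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])


def collect_links(html_text):
    """Feed the text through the parser and return the filled collector."""
    collector = LinkCollector()
    collector.feed(html_text)
    collector.close()
    return collector


found = collect_links(
    '<p><a name="top"></a>See <a href="http://example.com/page">this</a>'
    ' and <img src="pic.png"></p>'
)
```

After the call, found.hrefs, found.anchors and found.images each hold a list of
the matching attribute values in document order.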


