remove html tags from text
Brian Yennie
briany at qldlearning.com
Sun Sep 10 00:11:27 EDT 2006
FWIW:
http://www.amk.ca/python/howto/regex/
Check out section 6.3, which presents the same regex that Ken
suggested. Also not the parenthetical statement, which I've come
across in so many words before. Basically, if you want industrial
strength HTML tag processing, you should really consider an actual
parser and not a single expression. For most purposes, RegEx is
probably fine:
"(Note that parsing HTML or XML with regular expressions is painful.
Quick-and-dirty patterns will handle common cases, but HTML and XML
have special cases that will break the obvious regular expression; by
the time you've written a regular expression that handles all of the
possible cases, the patterns will be very complicated. Use an HTML or
XML parser module for such tasks.)"
- Brian
More information about the use-livecode
mailing list