Problem with Latin 1 (I think)
Ken Corey
ken at kencorey.com
Mon Feb 6 07:21:03 EST 2012
I expect you're running up against UTF8 characters in your html (just
like the thread "Japanese characters in HTML result"). What's the
encoding of the page?
Here's what I wrote in that thread. Try it and see if it solves the
problem:
On 04/02/2012 05:14, Bob Sneidar wrote:
> So the trick is to programmatically determine what is double byte and
what is not?
Well, if you take a look at the headers from that page, the
Content-Type: "text/html; charset=UTF-8"
Which is specifies exactly how the data will fall:
http://en.wikipedia.org/wiki/UTF-8
For regular ascii characters, nothing changes. For the higher ranges,
you could have 2 to 4 bytes per character.
http://runrev.com/newsletter/december/issue62/newsletter2.php
I've not tried it, but I think your solution is something like this:
put "https://www.googleapis.com/books/ETC" into tURL
get url tURL
set the unicodeText of field X to uniencode(it,"UTF8")
As I understand it, you've got to get the text from UTF8 into the
internal representation of a string. Then, for the field to understand
it and display it properly, you must put the string into the unicodeText
of it.
Hope that helps.
-Ken
On 06/02/2012 09:43, Geoff Canyon Rev wrote:
> I'm retrieving a url and parsing the HTML. If I view the URL in Safari (all
> of this on a mac) there are places where safari shows "isnt" but livecode
> shows "isnt"
>
> I'm using 5.0.2, and isotomac doesn't seem to fix this. Any suggestions?
More information about the use-livecode
mailing list