formattedText and Unicode

Phil Davis revdev at pdslabs.net
Sun Aug 9 20:32:22 EDT 2009


Hi Richard,

Richard Gaskin wrote:
> I've been avoiding using Unicode because of all the difficulties noted 
> here with Rev, and I just haven't needed it.  But now I have a bunch 
> of files in which I don't know whether they're UTF-8 or UTF-16, and I 
> don't know which language's character set they use.
>
> First, I need to just display these in a field. That would seem simple 
> enough, but I haven't figured out a reliable way to do this without 
> knowing some external information about the files.  Use the wrong 
> character set and they look like junk.

I don't know if this will help, but Klaus posted a response to Ken Ray in 
"Re: Detecting UTF-8 Encoded Files" on 7 Aug. It contains helpful hints 
about detecting which Unicode encoding a file uses. I don't know whether 
the tips work universally, but it may be a starting place.
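
For what it's worth, here is a minimal sketch of the usual byte-order-mark 
(BOM) check, written in Python rather than Rev just to show the idea. I'm 
not claiming this is what Klaus suggested; the function name 
guess_unicode_encoding and the "unknown" label are made up for 
illustration, and the fallback is only a heuristic.

```python
import codecs

def guess_unicode_encoding(data: bytes) -> str:
    """Guess the encoding of raw file bytes (hypothetical helper).

    Returns a Python codec name, or "unknown" if nothing matches.
    UTF-32 is deliberately ignored to keep the sketch short.
    """
    # A UTF-8 BOM is the three bytes EF BB BF.
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"
    # A UTF-16 BOM is FF FE (little-endian) or FE FF (big-endian).
    # Python's "utf-16" codec reads the BOM and picks the byte order.
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "utf-16"
    # No BOM: see whether the bytes are valid UTF-8. This cannot
    # recognise UTF-16 saved without a BOM, so treat it as a guess.
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        return "unknown"
```

You would read the file as binary, pass the bytes to the function, and 
decode with the returned name only if it isn't "unknown". Files with no 
BOM still need some outside knowledge or a better heuristic.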

>
> Then I need to get the formattedText of the field so I can measure 
> distances to line breaks.  So far it seems that the formattedText 
> property is limited to ASCII only -- is that correct?
>
> Am I just missing something obvious here?
>
> I'll continue with my experiments, but if you have any pointers they 
> would be much appreciated.
>
> TIA -
>
> -- 
>  Richard Gaskin
>  Fourth World
>  Revolution training and consulting: http://www.fourthworld.com
>  Webzine for Rev developers: http://www.revjournal.com 

-- 
Phil Davis

PDS Labs
Professional Software Development
http://pdslabs.net



