Unicode Chinese Mac
Dar Scott
dsc at swcp.com
Wed May 11 14:04:49 EDT 2005
On May 11, 2005, at 9:27 AM, Lynn Fredricks wrote:
>> So, for some types of processing, using UTF8 might be better
>> than host UTF16.
>
> My understanding is that UTF16 is preferable to UTF8 if you plan on
> sorting your data. Valentina 2 supports UTF16 for this reason.
Good point. I should have qualified that: my comment applies within
Transcript, and especially to work with chunks.
Even so, if the sort uses, say, the Unicode Collation Algorithm with the
DUCET table (UTS #10), then conversion would be a small part of the
total time. If the sort is a plain code-point sort, then UTF16 could be
better.
Alas, I would guess that a code-point sort in Transcript would need
something like UTF16BE, with a BOM or some other prefix so that strings
are never compared as numbers. (I'm ignoring surrogates.)
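To make the byte-order point concrete, here is a minimal sketch in Python rather than Transcript. It assumes the comparison is a plain byte-by-byte one, which may not match what Transcript actually does, and the helper name and sample strings are made up for illustration. It shows UTF16BE bytes sorting in code-point order for BMP characters, UTF16LE bytes not doing so, and the surrogate case I am ignoring above.

```python
def sort_by_encoded_bytes(strings, encoding):
    """Sort strings by their encoded bytes, as a byte-wise comparison would.

    Hypothetical helper for illustration; not a Transcript or Revolution API.
    """
    return sorted(strings, key=lambda s: s.encode(encoding))


# Sample data (made up): ASCII, Latin-1, Latin Extended-A, and CJK characters.
words = ["\u4e2d\u6587", "\u0100bc", "abc", "\u6587\u4e2d", "\u00e9t\u00e9"]

# Python compares str values by code point, so sorted() is the reference order.
by_code_point = sorted(words)

# Big-endian puts the high byte first, so byte order follows code-point order
# for characters in the Basic Multilingual Plane.
by_utf16be = sort_by_encoded_bytes(words, "utf-16-be")

# Little-endian puts the low byte first, which scrambles the order.
by_utf16le = sort_by_encoded_bytes(words, "utf-16-le")

# The surrogate caveat: U+1F600 is stored as the pair D83D DE00, which sorts
# before U+FF5E even though its code point is higher.
with_surrogates = sort_by_encoded_bytes(["\uff5e", "\U0001f600"], "utf-16-be")

print(by_utf16be == by_code_point)
print(by_utf16le == by_code_point)
print(with_surrogates)
```

The "utf-16-be" and "utf-16-le" codecs in Python write no BOM, so the prefix mentioned above would have to be added separately.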
Even so, in UTF8, like characters will sort by code point, so
Transcript comparison and sorting can be useful in a rough sort of way.
ASCII code-point sorting will be exact, and automatic numeric
comparisons in Transcript will still apply.
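As a rough check of the UTF8 claim, here is a small Python sketch, again assuming a strict byte-by-byte comparison (Transcript's own comparison may fold case or differ in other ways, which is why I only call it rough there). The function name and sample strings are hypothetical.

```python
def utf8_byte_sort(strings):
    """Sort strings by their UTF-8 bytes (hypothetical helper for illustration)."""
    return sorted(strings, key=lambda s: s.encode("utf-8"))


# Sample data (made up): ASCII, accented Latin, CJK, a fullwidth tilde,
# and one character outside the BMP.
samples = ["\uff5e", "\U0001f600", "abc", "\u4e2d\u6587", "\u00e9", "Zebra"]

# Python compares str values by code point, so sorted() is the reference order.
code_point_order = sorted(samples)

# UTF-8 was designed so that byte order matches code-point order,
# including for characters that need surrogates in UTF-16.
byte_order = utf8_byte_sort(samples)

print(byte_order == code_point_order)
```

Under that assumption the byte order and the code-point order agree for every sample, including the one that would need a surrogate pair in UTF16.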
Both in today's ad hoc solutions and in future Revolution technology,
the form used should not matter much, as long as we can convert
quickly across interface boundaries.
Dar
--
**********************************************
DSC (Dar Scott Consulting & Dar's Lab)
http://www.swcp.com/dsc/
Programming and software
**********************************************