Unicode sorting

Devin Asay devin_asay at byu.edu
Fri Jun 2 11:56:15 EDT 2006


AAAAARRRRGH! Disregard the previous post. I neglected to change the  
function call from the old sort function to the renamed new sort  
function. No wonder it was working!

I'll fix it and let you know how it REALLY works.

Devin
On Jun 1, 2006, at 5:21 PM, Dar Scott wrote:

> Wow!  Great news for sorting Unicode!
>
> On May 30, 2006, at 5:08 PM, Devin Asay wrote:
>
>> I got your code to work by making some simple changes in the  
>> sortCodeFromRussian function:
>
> Deven, I've been processing some bits of UTF-8, and something  
> dawned on me that is probably known by the Unicode experts.
>
>  **** A lexical byte sort of well-formed UTF-8 will result in a  
> Unicode code point sort!  *****
>
> That avoids the NUL problem in sort.  That means that russianLex()  
> can return the UTF-8 of the string with your character conversions.
>
> I think the replace command will work with UTF-8, so you can even  
> avoid a character loop.  All you need is 34 replaces and then a  
> return.  OK, that might actually be slower than a character loop.
>
> Dar
> Unicode Sophomore
>
>
> _______________________________________________
> use-revolution mailing list
> use-revolution at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your  
> subscription preferences:
> http://lists.runrev.com/mailman/listinfo/use-revolution
>

Devin Asay
Humanities Technology and Research Support Center
Brigham Young University




More information about the use-livecode mailing list