Double-Value Unicode?

Dar Scott dsc at swcp.com
Sun May 18 04:45:19 EDT 2014


There is certainly a hole in the codes left for surrogates that starts at 0xD8000.  And yeah, one doesn’t want to represent a surrogate code using surrogates.  

But, the surrogates are in the range 0xD800-0xDFFF.  The characters from 0xE000-0xFFFF must also be represented as single 16-bit values, that is, without surrogates.  That includes the Private Use Area and the Compatibility Area.  

So, the test should be 0xFFFF.  If the code is above that, then use surrogates, otherwise use itself.  That is, if it can fit in 16 bits, do that.  

Dar Scott


On May 17, 2014, at 1:03 PM, Richmond <richmondmathewson at gmail.com> wrote:

> On 17/05/14 20:54, Scott Rossi wrote:
>> <snip>
>> 
>> 
>> One item that would be helpful to know is at what value does a unicode
>> character start being represented as pairs?  Is 65536 the upper limit for
>> single value characters?
>> 
>> 
> 
> <snip>
> 
> Representing as pairs starts at Hex D800 = Decimal 55296
> 
> Richmond.
> 
> 
> 
> _______________________________________________
> use-livecode mailing list
> use-livecode at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
> http://lists.runrev.com/mailman/listinfo/use-livecode





More information about the use-livecode mailing list