Merge and unicode
Paul Dupuis
paul at researchware.com
Mon Sep 9 19:54:41 EDT 2019
Doesn't any Unicode in htmlText of a field need to be in HTML form (i.e.
#<hexnumber>;?
I thought htmlText turns any non ASCII into either hex encoded html or,
where html entity names exists, uses html entity names.
On 9/9/2019 6:35 PM, J. Landman Gay via use-livecode wrote:
> On 9/9/19 2:39 PM, Paul Dupuis via use-livecode wrote:
>> On 9/9/2019 2:13 PM, J. Landman Gay via use-livecode wrote:
>>> On 9/9/19 1:08 PM, J. Landman Gay via use-livecode wrote:
>>>> It seems that the merge command doesn't respect unicode. Does
>>>> anyone have a workaround? The text I'm inserting is already decoded
>>>> to UTF16.
>>>>
>>>
>>> I misspoke, sorry. It's the metadata that doesn't respect unicode.
>>>
>>
>> Can you clarify what you mean when you say the "metadata" doesn't
>> respect Unicode?
>>
>> I'm in the middle of a big Unicode problem and have found and
>> reported a ton of bugs where Unicode is not yet everywhere. I'm
>> keenly interested in any I don't know about.
>
> Actually I just double-checked and both merge and metadata may be wrong.
>
> I get UTF8 text from a server that is then textDecoded to UTF16, and
> an html template that I merge with parts of the UTF16 text. In the
> variable watcher, the merged template looks correct but when a field
> is set to the htmltext the result is wrong, diacriticals and curly
> quotes are question marks. My solution for that was to urlEncode the
> content before merging, and urlDecoding when extracting it for
> display. That works.
>
> In another part of the app I use the same (UTF16) text to set the
> metadata of a line in a field. When the script gets the metadata
> later, diacriticals and curly quotes are strange characters with very
> high UTF numbers.
>
More information about the use-livecode
mailing list