Unicode languages and text selection...
Paul Dupuis
paul at researchware.com
Mon Jul 22 10:02:06 EDT 2019
In moving a commercial application that allows researchers to perform a
lot of textual analysis from LC6.7.11 to LC9.0.x, we have been trying
to make sure we take advantage of "Unicode Everywhere".
In do in so, we test with text in as many different languages
(representative of different glyphs/alphabets and sets of Right-To-Left
(RTL) and LTR languages). We certainly do NOT have staff that know even
a fraction of the languages. We craft test content using Google
Translate. In doing this, we have discovered and reported a few long
standing Unicode related bugs in the current release of LiveCode, some
going back to LC7.Some examples:
https://quality.livecode.com/show_bug.cgi?id=22249
https://quality.livecode.com/show_bug.cgi?id=22213
https://quality.livecode.com/show_bug.cgi?id=22202
I trust that all will eventually get fixed, hopefully sooner rather
than later. We have now seen another oddity that we can not figure out
if it is an intentional behavior of certain languages in Unicode or a bug.
In certain languages in fields, Arabic for example, when we select
portions of the text in a field, certain glyphs and even words seem to
change. Not knowing the language, we can't be sure. Has anyone else seen
this odd behavior in any language (seems to appear more in RTL
languages) and is it a characteristic of Unicode for some languages or
some sort of bug?
More information about the use-livecode
mailing list