LiveCode 7 codepoint question
dsc at swcp.com
Fri Apr 18 18:24:44 EDT 2014
Looking a little closer, I think daduken and handaduken are going to be the only combining marks you will see in Japanese. The half-width versions are not called combining marks in the Unicode database, but they do count as part of the character to the left in LiveCode.
Would loan words include tone marks? Then maybe tone marks might apply. Also, general combining marks such as keycap might apply: ト⃣
Oh, that looks ugly on my computer.
So, look at U+3099, U+309A, U+FF9E, and U+FF9F.
On Apr 18, 2014, at 3:47 PM, Dar Scott <dsc at swcp.com> wrote:
> Here is my experiment to look at characters that are multiple codepoints in Japanese. (This experiment is limited to Katakana.)
> (I don’t know Japanese, so I apologize for anything goofy.)
> This shows the two-codepoint versions of ド as one character.
> On my OS X system, the latter two did not render as one character in the message box, though. I might be doing something wrong. When I pasted the output string into mail, one of those combined but the second didn’t—maybe it is intended for use with half-width Katakana.
> The output is this:
> ト ド ド 1 トﾞ 1
> on mouseUp
> put numToCodepoint( 0x30C8 ) into kto
> put numToCodepoint( 0x30C9 ) into kdo
> put numToCodepoint( 0x3099 ) into kVoiceMark
> put numToCodepoint( 0xFF9E ) into kHalfVoiceMark
> put kto & kVoiceMark into kdoAlt1
> put length(kdoAlt1) into kdoAlt1N
> put kto & kHalfVoiceMark into kdoAlt2
> put length(kdoAlt2) into kdoAlt2N
> put kto & tab & kdo & tab & kdoAlt1 & tab & kdoAlt1N & tab & kdoAlt2 & tab & kdoAlt2N
> end mouseUp
> On Apr 18, 2014, at 2:59 PM, Kenji Kojima <index at kenjikojima.com> wrote:
>> What is the actual single unicode character which is composed of two or more code points?
>> I could not find it in Japanese characters. I could use same “char” and “code point” in Japanese.
>> Are there it in other languages?
>> There is a comment of “codepoint" on the dictionary.
>> "A codepoint is an integer identifier associted witha a Unicode character.
>> A single character is composed of one or more code points.”
>> Kenji Kojima / 小島健治
>> use-livecode mailing list
>> use-livecode at lists.runrev.com
>> Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
> use-livecode mailing list
> use-livecode at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
More information about the Use-livecode