New chunks

Richmond richmondmathewson at gmail.com
Wed Mar 12 03:59:21 EDT 2014


I think this might be useful for anyone who wants to detect word breaks 
in Unicode,
although, personally, it makes me feel very queasy indeed:

http://www.unicode.org/Public/6.3.0/ucd/auxiliary/WordBreakTest.html

This comment; "The material here is informative, not normative." doesn't
exactly inspire confidence either. But, rather than some of the RunRev team
taking a year to have a world tour interviewing speakers of the
100 most-spoken languages to find out how they type their stuff, this is all
there is.

"If your browser handles titles (tool tips), then hovering the mouse 
over the row header will show a sample character of that type."

Well, Firefox handles them; and in some ways the tooltips are the most 
useful things on that page.

[ Mind you, if I were the person at RunRev having to deal with implementing
Unicode I would jump at the chance to have a world tour interviewing
people . . . LOL . . . however, I might just "softly and silently vanish 
away"
somewhere near Singapore! ]

This might, as well, be a "right bu**er" when one comes to typing 
languages that go from right to left
[ Arabic, Manda, Hebrew, et al ].

And here is a bit about sentences:

http://www.unicode.org/reports/tr29/#Sentence_Boundaries

Richmond.




More information about the use-livecode mailing list