Line Breaks Dropped on Importing Unicode Text
Joe F.
joef1 at mac.com
Fri Sep 4 15:23:17 EDT 2009
The trick is to use uniencode/unidecode for everything.
Three separate examples:
ask file "Name new file:" with "NewFile.xml"
put "binfile:" & it into theNewFileName
get the unicodetext of cd fld 1
put unidecode(it,"utf8") into url (theNewFileName)
------------------------------------------
set the unicodetext of cd fld 1 to uniencode(tMyUnicode,"utf8")
------------------------------------------
put URL (theFTPRequest) into theDownLoadedText
put uniencode(theDownLoadedText,"utf8") into theDownLoadedText
put unidecode(theDownLoadedText,"ANSI") into cd fld 1 of cd id 4630
On Sep 4, 2009, at 1:39 AM, Sivakatirswami wrote:
> I have some UTF-16 unicode raw text. If I import this into Pages, it
> displays the font correctly and also the line breaks between
> paragraphs correctly
>
> But if I use this function:
>
> on mouseUp
> answer file "Choose a unicode file to read in."
> if it is empty then exit mouseUp
> put "binfile:" & it into urlName
> replace numtochar(13) with numtochar(10) in urlName
> set the unicodeText of fld "display" to url urlName
> end mouseUp
>
> the line breaks are not appearing in the field in revolution.
>
> Also if I try to analyze what chars are there, where the line break
> should be, by selecting across a missing line break and then use
> this test:
>
> on mouseUp
> set the useUnicode to true
> if the selection is empty then
> answer "No Selection" with "ok"
> end if
> put the selection into tUnicode
> put tUnicode
> repeat for each char x in tUnicode
> put chartonum(x) & cr after tOutput end repeat
> put tOutput # returns empty
> end mouseUp
>
> I get nothing in the msg box. If I switch from Tamil Inaimathi (Mac
> unicode font) to Anjal Chittu unicode, The Tamil displays correctly
> and now I can clearly see a two byte block
>
> space+square-box-symbol (somewhat transparent) where the line breaks
> should be. But it still returns empty on an empty to determine what
> the bytes are....
>
> How do we deal with/import correctly, line breaks in unicode text in
> Revolution?
>
> I plan to create an editing environment as a revlet for online
> work... and unicode will be imported and exported freely for later
> use in InDesign. Obviously CRLF (or whatever it is in Unicode) needs
> to be preserved.
>
> Thanks!
> Sivakatirswami
> "completely Unicode newbie"
>
>
>
>
> _______________________________________________
> use-revolution mailing list
> use-revolution at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your
> subscription preferences:
> http://lists.runrev.com/mailman/listinfo/use-revolution
More information about the use-livecode
mailing list