Converting from unicode to ASCII
J. Landman Gay
jacque at hyperactivesw.com
Thu Sep 24 11:57:33 EDT 2020
It's all automated already except for the uploading. The file organization
on AWS is complex and the stacks don't all go in the same place.
--
Jacqueline Landman Gay | jacque at hyperactivesw.com
HyperActive Software | http://www.hyperactivesw.com
On September 23, 2020 4:53:36 PM Richard Gaskin via use-livecode
<use-livecode at lists.runrev.com> wrote:
> For an ongoing need like that on a substantial project, I'd automate it:
>
> She works on her master copy, then presses a button. Done.
>
> The button saves the stack, copies it to the munged name, and uploads it
> for her, even verifying the integrity of the upload afterward (machines
> don't mind the extra work).
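
The one-button flow described above (copy to the munged name, upload, verify) can be sketched roughly as follows. This is an illustration in Python rather than LiveCode, and it is not the actual build tool: `publish`, `upload` and `download` are hypothetical names, and the two callables stand in for whatever client really talks to the server.

```python
import hashlib
import shutil
from pathlib import Path
from typing import Callable


def sha256_of(data: bytes) -> str:
    """Hex SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def publish(master: Path, munged_name: str,
            upload: Callable[[str, bytes], None],
            download: Callable[[str], bytes]) -> bool:
    """Copy the master under its munged name, upload it, and verify the result.

    `upload` and `download` are assumed callables supplied by the caller.
    """
    # Copy the master file next to itself under the munged name.
    copy = master.with_name(munged_name)
    shutil.copyfile(master, copy)
    data = copy.read_bytes()
    upload(munged_name, data)
    # Integrity check: fetch the uploaded bytes back and compare digests.
    return sha256_of(download(munged_name)) == sha256_of(data)
```

The function returns False when the bytes read back differ from what was sent, so the button can report a failed upload instead of silently leaving a damaged file on the server.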
>
> --
> Richard Gaskin
> Fourth World Systems
>
>
> J. Landman Gay wrote:
>> On 9/23/20 1:26 PM, Richard Gaskin via use-livecode wrote:
>>> My only suggestion was to change how the existing munger works to
>>> satisfy the two problem areas identified: that names not be too long,
>>> and that any munger not remove so many characters as to make the file
>>> name non-unique or empty.
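
One way to guard a munger against the two problem areas named above (names that are too long, and names that come out empty or collide) is sketched below in Python. Everything here is an assumption made for illustration: the thread states no length limit, so `MAX_LEN` is arbitrary, and `safe_name` is a hypothetical helper, not the existing munger.

```python
import hashlib

MAX_LEN = 64  # assumed limit; the thread does not state one


def safe_name(original: str, munged: str, max_len: int = MAX_LEN) -> str:
    """Guard a munged file name against being empty or too long."""
    # A short digest of the ORIGINAL name, so that different originals
    # still get different results after munging.
    tag = hashlib.sha256(original.encode("utf-8")).hexdigest()[:8]
    if not munged:
        # Every character was removed: fall back to a generated name.
        return "file-" + tag
    if len(munged) > max_len:
        # Keep the readable start and replace the tail with "-" plus the tag
        # (9 characters in total), so the result is exactly max_len long.
        return munged[: max_len - 9] + "-" + tag
    return munged
```

Names that are already short and non-empty pass through unchanged, so the tag only appears in the problem cases. This sketch treats the whole name as one string and does not preserve a file extension when truncating.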
>>
>> There's one more consideration though. The file name must be
>> recognizable so that it can be replaced or updated on the server easily
>> by a human. Hashes, UUIDs, HTML entity numbers, hex, etc. would all be
>> workable if that weren't the case.
>>
>> But consider the case where my client has made a few edits to the text
>> in a stack and wants to replace the existing one. With descriptive
>> names, the file is easy to find in the AWS bucket. But comparing long
>> sequences of indecipherable text is cumbersome.
>>
>> I'm drifting toward the idea of removing non-ASCII characters. That
>> might satisfy all requirements, at least for now. We don't do Sanskrit
>> or Chinese yet. Or alternately I could bite the bullet and convert my
>> build tool to insert metadata into the clickable lists. That isn't
>> terribly difficult; I was just wondering if there was a different way
>> using what we already have.
>>
>> Devin, Paul and Scott suggested variations on the "removal" approach. I
>> haven't tested much, but it looks like converting to UTF8 will quickly
>> remove any non-ASCII characters. Duplication of file names is unlikely
>> given the way various product files are separately stored on AWS.
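
The "removal" approach can be sketched as below, in Python rather than LiveCode, with `to_ascii_name` as a hypothetical helper name. Note that UTF-8 encoding on its own keeps non-ASCII characters as multi-byte sequences, so this sketch filters against ASCII explicitly. It also decomposes accented letters first, an added assumption that goes slightly beyond plain removal, so that "é" survives as "e" and the name stays recognizable.

```python
import unicodedata


def to_ascii_name(name: str) -> str:
    """Return `name` with non-ASCII characters removed."""
    # NFKD splits an accented letter into its base letter plus combining
    # marks, e.g. "é" becomes "e" followed by U+0301.
    decomposed = unicodedata.normalize("NFKD", name)
    # Encoding to ASCII with errors="ignore" drops everything outside
    # 7-bit ASCII, including the combining marks produced above.
    return decomposed.encode("ascii", "ignore").decode("ascii")
```

A name written entirely in a non-Latin script comes back empty from this function, which is the non-unique-or-empty case raised earlier in the thread.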
>>
>> But I'm still pondering. When I first asked the question, I wondered if
>> there was a quick way to do what I want, though I didn't expect much.
>> What I got back from this amazing list is a wealth of ideas and a very
>> interesting discussion.
>>
>> --
>> Jacqueline Landman Gay | jacque at hyperactivesw.com
>> HyperActive Software | http://www.hyperactivesw.com