New chunks
Jim Hurley
jhurley0305 at sbcglobal.net
Thu Mar 13 00:48:13 EDT 2014
Hi Bob,
I have an app that I use for manuscript analysis and have done some of this "prep."
For example, to deal with decimal points, I rely on the fact that the character following a decimal point is always a digit (0-9):
repeat with i = 0 to 9
replace "." & i with "\" & i in tText
end repeat
So that 3.14 becomes 3\14 and 2.3456 becomes 2\3456
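To make the round trip concrete, here is a minimal sketch of the protect and restore steps as a pair of functions. The handler names (protectDecimals, restoreDecimals, demoDecimals) are made up for illustration, and the sketch assumes that "\" never occurs in the original text; if it can, a different placeholder character would be needed.

```livecode
-- Sketch only: handler names are hypothetical, and "\" is assumed
-- not to appear anywhere in the source text.
function protectDecimals pText
   -- hide every period that is immediately followed by a digit
   repeat with i = 0 to 9
      replace "." & i with "\" & i in pText
   end repeat
   return pText
end protectDecimals

function restoreDecimals pText
   -- put the hidden periods back once the sentences have been processed
   replace "\" with "." in pText
   return pText
end restoreDecimals

command demoDecimals
   local tSafe
   put protectDecimals("Pi is 3.14. Next sentence.") into tSafe
   -- tSafe is now: Pi is 3\14. Next sentence.
   put restoreDecimals(tSafe)
end demoDecimals
```

Only the periods that really end a sentence are left as "." while the text is being split, and the restore step brings the numbers back afterwards.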
But that's only the beginning. There are a massive number of abbreviations: Mr. Smith, etc., Jan., A.D., Dr.
I have a list of about 70.
But then there are names: P. G. Wodehouse and on and on.
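The abbreviation list can be handled in the same spirit, reusing the "\" placeholder from the decimal-point loop above. What follows is only a rough sketch, not my actual script: protectAbbreviations is a made-up name, and pAbbrevs is assumed to be a return-delimited list such as "Mr.", "Dr.", "A.D.", one per line.

```livecode
-- Sketch only: protectAbbreviations and pAbbrevs are hypothetical.
-- pAbbrevs is assumed to hold one abbreviation per line.
function protectAbbreviations pText, pAbbrevs
   local tSafe
   -- match case exactly, so "Jan." does not also match "jan."
   set the caseSensitive to true
   repeat for each line tAbbr in pAbbrevs
      if tAbbr is empty then next repeat
      -- build the protected form, e.g. "A.D." becomes "A\D\"
      put tAbbr into tSafe
      replace "." with "\" in tSafe
      replace tAbbr with tSafe in pText
   end repeat
   return pText
end protectAbbreviations
```

This is a plain substring replace, so it has the weaknesses you would expect: an abbreviation that happens to end a sentence hides the sentence's own period too, and an entry like "Dr." will also match the tail of a longer word that ends the same way. Initials such as P. G. Wodehouse are not covered at all.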
I find my script useful for my own needs, but I don't see any commercial application. Just too many unexpected places where a period will show up.
So I really can't see the purpose of RR's "sentence chunk". I wish they would explain.
Jim
>
> Message: 25
> Date: Wed, 12 Mar 2014 23:58:46 +0000
> From: Bob Sneidar <bobsneidar at iotecdigital.com>
> To: How to use LiveCode <use-livecode at lists.runrev.com>
> Subject: Re: New chunks
> Message-ID: <E3AB1833-C8FC-47F0-AF8E-358E6EC802CD at iotecdigital.com>
> Content-Type: text/plain; charset="Windows-1252"
>
> Pretty sure LiveCode is simply going to delimit on the period. You would have to prep the data first by replacing the periods in any word that is a number with a placeholder, processing your sentences, then restoring the periods (if you need to).
>
> You could get fancy by setting the lineDelimiter to space, then finding every line that ends in a period and processing everything in-between. It's doubtful a number would end in a period without it being the end of a sentence.
>
> Bob
>
>
> On Mar 11, 2014, at 15:34 , Jim Hurley <jhurley0305 at sbcglobal.net> wrote:
>
>> Can someone explain how the "sentence" chunk would work?
>> How are decimal points, and points in an abbreviation, distinguished from the "period" that delineates the end of a "sentence"?
>> Does it presume that the existing text has special embedded "periods"?
>>
>> I've written my own, but it is very cumbersome and not flawless. I use it to do manuscript analysis.
>> Like: Find all sentences in which "time" and "party" occur anywhere in the same sentence.
>>
>> My ignorance of Unicode is profound.
>> Jim