a regular expression question, or at least a text manipulation question
Jim Ault
JimAultWins at yahoo.com
Wed Aug 27 18:36:05 EDT 2008
Just thinking, and here is a bit more compact code to do the job:
It should run about the same speed.
> repeat for each line LNN in incomingList
> put word 1 of LNN & cr after prefixList
> end repeat
>
> split prefixList using cr and tab
> put the keys of prefixList into prefixList
could be replaced by the three-liner
get incomingList
split it using cr and space
put the keys of it into prefixList
Jim Ault
Las Vegas
On 8/27/08 2:22 PM, "Jim Ault" <JimAultWins at yahoo.com> wrote:
> I may be solving the wrong problem for you but see if this works
>
> The prefix will always be word 1 of each line
> You do not need case-sensitive
> the delimiter is a tab
>
> ------------- start copy
> on test
> put the clipboarddata into incomingList
> filter incomingList without empty
> repeat for each line LNN in incomingList
> put word 1 of LNN & cr after prefixList
> end repeat
>
> split prefixList using cr and tab
> put the keys of prefixList into prefixList
> set the itemdel to tab
>
> repeat for each line LNN in prefixList
> get incomingList
> filter it with LNN & "*"
> replace cr with tab in it
> put it & cr after packedList
> end repeat
>
> put packedList into msg
>
> end test
>
> ------ end copy
>
> Jim Ault
> Las Vegas
>
>
>
> On 8/27/08 1:35 PM, "Peter Alcibiades" <palcibiades-first at yahoo.co.uk>
> wrote:
>
>> How do you do the following?
>>
>> I have a series of lines which go like this
>>
>> | [record separator, new record starts]
>> AAA consectetur adipisicing elit, sed
>> BBB lorem ipsum
>> CCC consectetur adipisicing elit, sed
>> CCC laboris nisi ut aliquip ex ea
>> DDD ut aliquip ex ea commodo
>> | [record separator]
>> AAA adipisicing elit, sed [new record starts]
>>
>> | is the record separator.
>>
>> In the above, its CCC that is repeated, but it could be any prefix. Also CCC
>> is next to its repetition. This will always be the case.
>>
>> I want to go through the file. When I find a single prefix (like AAA) this
>> should be written to the output file. when the next line starts with the
>> same prefix (as in the CCC cases, I want to put both occurences on the same
>> line. So the desired output would be
>>
>> AAA consectetur adipisicing elit, sed
>> BBB lorem ipsum
>> CCC consectetur adipisicing elit, sed CCC laboris nisi ut aliquip ex ea
>> DDD ut aliquip ex ea commodo
>> EOR
>> AAA adipisicing elit, sed
>>
>> How do I detect a repetition of that sort and do this?
>>
>> A similar question, if the line is
>>
>> CCC adipisicing elit, sed TAB CCC adipisicing elit, sed
>>
>> How do you detect the multiple occurence (I can do this with regex) and then
>> write out in place of thie above expression (this I don't see how to do) the
>> following:
>>
>> CCC adipisicing elit, sed CCC adipisicing elit, sed
>>
>> Obviously, the pseudo latin is different in each case, so no way to check
>> using that.
>>
>> Peter
>> _______________________________________________
>> use-revolution mailing list
>> use-revolution at lists.runrev.com
>> Please visit this url to subscribe, unsubscribe and manage your subscription
>> preferences:
>> http://lists.runrev.com/mailman/listinfo/use-revolution
>
>
> _______________________________________________
> use-revolution mailing list
> use-revolution at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your subscription
> preferences:
> http://lists.runrev.com/mailman/listinfo/use-revolution
More information about the use-livecode
mailing list