how to compare 2 very large textfiles

Matthias Rebbe matthias_livecode_150811 at m-r-d.de
Thu Oct 6 15:17:47 EDT 2011


Hi Pete,

thank you very much. It´s so much faster.

It seems, i should look closer to arrays.


Regards,

Matthias
Am 06.10.2011 um 01:13 schrieb Pete:

> I've used an array to do this type of operation in the past.  Haven't tried
> this code but it might work better.
> 
> repeat for each line i in tTextA
> put true into myArray[i]["A"]
> end repeat
> 
> repeat for each line i in tTextB
> put true into myArray[i]["B"]
> end repeat
> 
> repeat for each line k in the keys of myArray
> if myArray[k]["A"] is not true then put k & return after after tMissingInA
> if myArray[k]["B"] is not true then put k & return after after tMissingInB
> end repeat
> 
> Pete
> Molly's Revenge <http://www.mollysrevenge.com>
> 
> 
> 
> 
> On Wed, Oct 5, 2011 at 3:00 PM, Matthias Rebbe <
> matthias_livecode_150811 at m-r-d.de> wrote:
> 
>> Hi,
>> 
>> i need to compare two very large text files with about 5000 - 7000 lines
>> each with a lines size of up to 256 chars.
>> 
>> I need to find out if there are lines missing in either file a or file b.
>> 
>> What is the best way to do this with good speed?
>> 
>> I tried to check each line in file a and if the line is in file b.
>> And after that, i check for each line in file b and try to find out
>> if the line is in file a.
>> 
>> With large files it takes about 10 to 15 minutes to do the complete check.
>> 
>> My script looks like this
>> 
>> repeat for each line i in tTextA
>> if i is not among the lines of tTextB then put i &return after tMissingInB
>> end repeat
>> 
>> repeat for each line i in tTextB
>> if i is not among the lines of tTextA then put i &retrurn after tMissingInA
>> end repeat
>> 
>> Is there a better (faster) way?
>> 
>> Regards,
>> 
>> Matthias
>> 
>> 
>> 
>> 
>> 
>> _______________________________________________
>> use-livecode mailing list
>> use-livecode at lists.runrev.com
>> Please visit this url to subscribe, unsubscribe and manage your
>> subscription preferences:
>> http://lists.runrev.com/mailman/listinfo/use-livecode
>> 
>> 
> _______________________________________________
> use-livecode mailing list
> use-livecode at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
> http://lists.runrev.com/mailman/listinfo/use-livecode





More information about the use-livecode mailing list