how to compare 2 very large textfiles

Michael Kann mikekann at yahoo.com
Thu Oct 6 15:15:23 EDT 2011


Matthias,

I noticed a typo in the second loop:

repeat for each line i in tTextB
if i is not among the lines of tTextA then put i &retrurn after tMissingInA TYPO RETRURN
end repeat

This probably isn't the actual script or anyway it wouldn't work even after ten minutes. Perhaps there is a type in the script that you used?

Mike


--- On Wed, 10/5/11, Matthias Rebbe <matthias_livecode_150811 at m-r-d.de> wrote:

From: Matthias Rebbe <matthias_livecode_150811 at m-r-d.de>
Subject: how to compare 2 very large textfiles
To: "How to use LiveCode" <use-livecode at lists.runrev.com>
Date: Wednesday, October 5, 2011, 5:00 PM

Hi,

i need to compare two very large text files with about 5000 - 7000 lines each with a lines size of up to 256 chars.

I need to find out if there are lines missing in either file a or file b.

What is the best way to do this with good speed?

I tried to check each line in file a and if the line is in file b.
And after that, i check for each line in file b and try to find out
if the line is in file a.

With large files it takes about 10 to 15 minutes to do the complete check.

My script looks like this

repeat for each line i in tTextA
if i is not among the lines of tTextB then put i &return after tMissingInB
end repeat

repeat for each line i in tTextB
if i is not among the lines of tTextA then put i &retrurn after tMissingInA
end repeat

Is there a better (faster) way?

Regards,

Matthias





_______________________________________________
use-livecode mailing list
use-livecode at lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode



More information about the use-livecode mailing list