how to compare 2 very large textfiles

Matthias Rebbe matthias_livecode_150811 at m-r-d.de
Wed Oct 5 18:00:04 EDT 2011


Hi,

i need to compare two very large text files with about 5000 - 7000 lines each with a lines size of up to 256 chars.

I need to find out if there are lines missing in either file a or file b.

What is the best way to do this with good speed?

I tried to check each line in file a and if the line is in file b.
And after that, i check for each line in file b and try to find out
if the line is in file a.

With large files it takes about 10 to 15 minutes to do the complete check.

My script looks like this

repeat for each line i in tTextA
if i is not among the lines of tTextB then put i &return after tMissingInB
end repeat

repeat for each line i in tTextB
if i is not among the lines of tTextA then put i &retrurn after tMissingInA
end repeat

Is there a better (faster) way?

Regards,

Matthias








More information about the use-livecode mailing list