how to compare 2 very large textfiles

Pete pete at mollysrevenge.com
Thu Oct 6 15:32:43 EDT 2011


Glad it worked Matthias.  Could you give us an idea of the new timing using
the arrays?
Pete
Molly's Revenge <http://www.mollysrevenge.com>




On Thu, Oct 6, 2011 at 12:17 PM, Matthias Rebbe <
matthias_livecode_150811 at m-r-d.de> wrote:

> Hi Pete,
>
> thank you very much. It´s so much faster.
>
> It seems, i should look closer to arrays.
>
>
> Regards,
>
> Matthias
> Am 06.10.2011 um 01:13 schrieb Pete:
>
> > I've used an array to do this type of operation in the past.  Haven't
> tried
> > this code but it might work better.
> >
> > repeat for each line i in tTextA
> > put true into myArray[i]["A"]
> > end repeat
> >
> > repeat for each line i in tTextB
> > put true into myArray[i]["B"]
> > end repeat
> >
> > repeat for each line k in the keys of myArray
> > if myArray[k]["A"] is not true then put k & return after after
> tMissingInA
> > if myArray[k]["B"] is not true then put k & return after after
> tMissingInB
> > end repeat
> >
> > Pete
> > Molly's Revenge <http://www.mollysrevenge.com>
> >
> >
> >
> >
> > On Wed, Oct 5, 2011 at 3:00 PM, Matthias Rebbe <
> > matthias_livecode_150811 at m-r-d.de> wrote:
> >
> >> Hi,
> >>
> >> i need to compare two very large text files with about 5000 - 7000 lines
> >> each with a lines size of up to 256 chars.
> >>
> >> I need to find out if there are lines missing in either file a or file
> b.
> >>
> >> What is the best way to do this with good speed?
> >>
> >> I tried to check each line in file a and if the line is in file b.
> >> And after that, i check for each line in file b and try to find out
> >> if the line is in file a.
> >>
> >> With large files it takes about 10 to 15 minutes to do the complete
> check.
> >>
> >> My script looks like this
> >>
> >> repeat for each line i in tTextA
> >> if i is not among the lines of tTextB then put i &return after
> tMissingInB
> >> end repeat
> >>
> >> repeat for each line i in tTextB
> >> if i is not among the lines of tTextA then put i &retrurn after
> tMissingInA
> >> end repeat
> >>
> >> Is there a better (faster) way?
> >>
> >> Regards,
> >>
> >> Matthias
> >>
> >>
> >>
> >>
> >>
> >> _______________________________________________
> >> use-livecode mailing list
> >> use-livecode at lists.runrev.com
> >> Please visit this url to subscribe, unsubscribe and manage your
> >> subscription preferences:
> >> http://lists.runrev.com/mailman/listinfo/use-livecode
> >>
> >>
> > _______________________________________________
> > use-livecode mailing list
> > use-livecode at lists.runrev.com
> > Please visit this url to subscribe, unsubscribe and manage your
> subscription preferences:
> > http://lists.runrev.com/mailman/listinfo/use-livecode
>
>
> _______________________________________________
> use-livecode mailing list
> use-livecode at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your
> subscription preferences:
> http://lists.runrev.com/mailman/listinfo/use-livecode
>
>



More information about the use-livecode mailing list