2011-05-07 02:54, Robert Bonomi skrev:
 From [email protected]  Fri May  6 19:27:54 2011
Date: Sat, 07 May 2011 02:09:26 +0200
From: Rolf Nielsen<[email protected]>
To: FreeBSD<[email protected]>
Subject: Comparing two lists

Hello all,

I have two text files, quite extensive ones. They have some lines in
common and some lines are unique to one of the files. The lines that do
exist in both files are not necessarily in the same location. Now I need
to compare the files and output a list of lines that exist in both
files. Is there a simple way to do this? diff? awk? sed? cmp? Or a
combination of two or more of them?


If the files have only 'minor' differences -- i.e. no long runs of lines
that are in only one fie -- *and* the common lines are  in the same order
in each file, you can use diff(1), without any other shennigans.

If the above is -not- true, and If you need _only_ the common lines, AND
order is not important, then sort(1) both files, and use diff(1) on the
two sorted versions.


Beyond that it depends on what you mean by 'extensive' ones.  megabytes?
Gigabytes? or what??




Some 10,000 to 20,000 lines each. I do need only the common lines. Order is not essential, but would make life easier. I've tried a little with uniq, as suggested by Polyptron, but I guess 3am is not quite the right time to do these things. Anyway, thanks.
_______________________________________________
[email protected] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "[email protected]"

Reply via email to