Joining lines with same field

Question

I've two files in this form:

File1: id:0.0260509118455
File2: id:X:Y

I'd like to get a third file having all lines of file1 joined with the lines of second file containing the same id. i.e. :

File3: id:0.0260509118455:X:Y

(file1 has 100 lines, file2 has 666 lines). There are not unpairable lines

Community · Accepted Answer · 2017-04-13 12:36:48Z

2

To join files containing database tables, use the join command after sorting the tables into key order:

sort -b -t : file1 > sorted-file1
sort -b -t : file2 > sorted-file2
join -t : sorted-file1 sorted-file2

admstg · Accepted Answer · 2017-02-27 16:16:30Z

0

You should able to do this with the "paste" command. It reads the columns instead of lines.

awk -F: '{ print $2}' File2 > File4

To remove the id: tag

Then

paste  File1 File4 > File3

Should do the job.

answered Feb 27, 2017 at 16:16

admstg

8315 silver badges15 bronze badges

George Vasiliou · Accepted Answer · 2017-02-28 11:23:41Z

0

You can also do it with awk, checking the id, without the need to sort or to pre-process the files in any way:

awk -F: 'NR==FNR{a[$1]=$0;next}$1 in a {print a[$1],$2,$3}' OFS=: file1 file2 >file3

PS: To gain performance small file (file1 100 lines) is loaded first in memory, and big file is compared against memory.

answered Feb 28, 2017 at 11:23

George Vasiliou

8,1013 gold badges24 silver badges43 bronze badges

3 Answers 3