parse line output into table in bash

Nice. Can you explain the y/;/,/ instead of s/;/,/g?

NeronLeVelu Over a year ago

y mean transform like TR where s is substitute. y works per peer where s take the whole pattern. So s/12/34/ change any 12by 34 where y/12/34/ change any 1 to 3 AND any 2 to 4. y is always for all occurance and a bit faster.

Avinash Raj · Accepted Answer · 2014-08-25 10:59:35Z

1

You could try the below sed command if the contents are in the format you mentioned,

$ sed 's/^[^(]*(\([^)]*\))\s*;\s*\S*\s*=\s*\(\S\+\)\s*;\s*\S*\s*=\s*\(\S\+\)\s*;\s*\S*\s*=\s*\(\S\+\)$/\1,\2,\3,\4/' file
7,59,0.876,0.000433344,0.00003

edited Aug 25, 2014 at 10:59

answered Aug 25, 2014 at 10:21

Avinash Raj

175k32 gold badges247 silver badges289 bronze badges

3 Comments

Note your final version (quite similar to mine, by the way) does not need the /g in sed, because it is executed just once.

Avinash Raj Over a year ago

@fedorqui yep , you're right but your grep command will match incorrect numbers also like 99...9. So i posted this. I'll remove if you insist.

No, no need to remove the full approach... it was just that last part (piped sed to format output) was quite similar. Regarding my solution, yes, it is right that won't match 99..., etc, but should be fine for sample numbers are described by the OP.

perreal · Accepted Answer · 2014-08-25 11:52:42Z

1

Using sed:

sed 's/[^0-9,.][^0-9,.]*/ /g' input

for better formatting:

 sed 's/[^0-9,.][^0-9,.]*/ /g' input | column -to,

Gives:

7,59,0.876,0.000433344,0.00003

edited Aug 25, 2014 at 11:52

answered Aug 25, 2014 at 10:35

perreal

98.7k23 gold badges159 silver badges187 bronze badges

4 Comments

Post the output of your commands. I get 7,59 0 876 0 000433344 0 00003. So all . is gone and only one , left.

perreal Over a year ago

@Jotne, ah missed the dots, fixed now

But it's still not: 7,59,0.876,0.000433344,0.00003. This sed 's/[^0-9,.][^0-9,.]*/ /g;s/ /,/g' will help some, but gives an extra , at the beginning. Sorry to be picky :)

perreal Over a year ago

@Jotne, thanks again, modified the column command to add the commas.

fedorqui · Accepted Answer · 2014-08-25 10:21:03Z

0

You can grep for numbers:

$ grep -o '[0-9.]*' file
7
59
0.876
0.000433344
0.00003

With the -o flag we indicate grep just to print the matched results. This way, you have all your values but not the surrounding text.

If you want it comma-separated, pipe to tr to replace every new line with comma, and finally to sed to replace last comma with a new line:

$ grep -o '[0-9.]*' a | tr -s '\n' ',' | sed 's/,$/\n/'
7,59,0.876,0.000433344,0.00003

answered Aug 25, 2014 at 10:21

fedorqui

293k113 gold badges592 silver badges640 bronze badges

7 Comments

what if there is multiple lines in the file ?

@pomeh it will still work. Test it with dummy data :)

This does not work for me with multiple lines, I got: 7,59,0.876,0.000433344,0.00003,7,59,0.876,0.000433344,0.00003,[...]

@pomeh it is not quite clear what you mean. If you refer to the trailing comma, it is handled by the sed at the end.

Here's what I get: sebsauvage.net/paste/…

|

Kent · Accepted Answer · 2014-08-25 10:29:04Z

0

also gnu awk with FPAT:

awk -v FPAT="[0-9.]+" '{for(i=1;i<=NF;i++)printf "%s%s", $i,(i!=NF?",":"\n")}'

test:

$ echo "new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003"|awk -v FPAT="[0-9.]+" '{for(i=1;i<=NF;i++)printf "%s%s", $i,(i!=NF?",":"\n")}'      
7,59,0.876,0.000433344,0.00003

The FPAT could be made better.

answered Aug 25, 2014 at 10:29

Kent

197k36 gold badges248 silver badges317 bronze badges

1 Comment

It may be worth to mention that you need gnu awk 4.00 or newer to use FPAT

clt60 · Accepted Answer · 2014-08-25 12:07:19Z

0

Many solutions, only perl misisng ;)

perl -nlE '$,=",";say m/[\d.]+/g'

set the "list separator" to ,
match only numbers (returns a list)
print the list

or (ofc) @neronlevelu's solution

perl -plE 's/[^\d,;.]//g;y/;/,/'

remove anything what isn't an digit,;.
change ; to ',' (the y transliterates all occurrences of the characters found in the search list with the corresponding character in the replacement list ) - aka tr.

edited Aug 25, 2014 at 12:07

answered Aug 25, 2014 at 11:59

clt60

64.3k17 gold badges114 silver badges206 bronze badges

Comments

anubhava · Accepted Answer · 2014-08-25 13:51:44Z

0

Using gnu awk:

cat file

new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003
new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003
new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003
new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003
new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003
new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003
new file (7,59) ; lim = 0.876 ; dim = 0.000433344 ; r_d = 0.00003

awk -F ' *[=()] *' -v RS=' ; |\n' -v OFS= -v ORS= 'NF{print $2, (NR%4==0)? "\n":","}' file
7,59,0.876,0.000433344,0.00003
7,59,0.876,0.000433344,0.00003
7,59,0.876,0.000433344,0.00003
7,59,0.876,0.000433344,0.00003
7,59,0.876,0.000433344,0.00003
7,59,0.876,0.000433344,0.00003
7,59,0.876,0.000433344,0.00003

edited Aug 25, 2014 at 13:51

answered Aug 25, 2014 at 10:24

anubhava

790k67 gold badges603 silver badges671 bronze badges

5 Comments

does not work for me, output has a trailing comma in a new line. Also, this does not work with multiple lines.

anubhava Over a year ago

@pomeh It does work with multiple lines also. Also what is your awk version? Are you using gnu awk?

Here is the output I get: sebsauvage.net/paste/… My awk version ig 3.1.7 running on CentOS 6.5