BASH: Find a string in file after finding a first string

Question

Please bear with me...

I have a large XML file. I need to find the JOBNAME line that ends in 0927, then find the first line after it that contains "TASKTYPE" and change that line.

So I have to change the TASKTYPE line that comes after the 0927 JOBNAME line. There are several hundred JOBNAME and TASKTYPE lines, separated by varying numbers of lines.

I have tried sed, awk and bash to no avail. I am sure there is a way to do it, but it is escaping me.

EXAMPLE:

JOBNAME="MYSAP#SDOR-SG-D-LATECODED-0927"
            JUL="1"
            JUN="1"
            MAR="1"
            MAXDAYS="0"
            MAXRERUN="0"
            MAXRUNS="0"
            MAXWAIT="0"
            MAY="1"
            MULTY_AGENT="N"
            NODEID="sappr2"
            NOV="1"
            OCT="1"
            PARENT_FOLDER="MYSAP#SSDOR-D-SG-LATECODED-0927"
            PRIORITY="10"
            RETRO="0"
            RULE_BASED_CALENDAR_RELATIONSHIP="O"
            RUN_AS="MYSAP"
            SEP="1"
            SHIFT="Ignore Job"
            SHIFTNUM="+00"
            SUB_APPLICATION="MYSAP"
            SYSDB="0"
            TASKTYPE="Job"

Here goes a chunk; remember there will be multiples of similar data.
– Vonedaddy, Commented Jun 1, 2016 at 1:16
Possibly related: stackoverflow.com/questions/893585/how-to-parse-xml-in-bash or stackoverflow.com/questions/4680143/…
– Eric Renouf, Commented Jun 1, 2016 at 1:17

John1024 · Accepted Answer · 2016-06-01 01:52:28Z

7

Using sed

Try:

sed '/JOBNAME.*0927/,/TASKTYPE/ {s/TASKTYPE.*/TASKTYPE="NewJob"/}' largefile

This produces as output:

JOBNAME="MYSAP#SDOR-SG-D-LATECODED-0927"
            JUL="1"
            JUN="1"
            MAR="1"
            MAXDAYS="0"
            MAXRERUN="0"
            MAXRUNS="0"
            MAXWAIT="0"
            MAY="1"
            MULTY_AGENT="N"
            NODEID="sappr2"
            NOV="1"
            OCT="1"
            PARENT_FOLDER="MYSAP#SSDOR-D-SG-LATECODED-0927"
            PRIORITY="10"
            RETRO="0"
            RULE_BASED_CALENDAR_RELATIONSHIP="O"
            RUN_AS="MYSAP"
            SEP="1"
            SHIFT="Ignore Job"
            SHIFTNUM="+00"
            SUB_APPLICATION="MYSAP"
            SYSDB="0"
            TASKTYPE="NewJob"

How it works:

/JOBNAME.*0927/,/TASKTYPE/ {...} executes the commands in curly braces only for each group of lines that starts with a line matching the regex JOBNAME.*0927 and ends with the first subsequent line that matches TASKTYPE.
s/TASKTYPE.*/TASKTYPE="NewJob"/ replaces TASKTYPE and everything after it on the line with TASKTYPE="NewJob".
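To see that the range only touches the job you asked for, here is a small sketch you can run. The two-block sample is made up and shortened for illustration, and the names sample and result are just placeholders. I also put a ; before the closing brace, which GNU sed does not need but some other sed implementations insist on.

```shell
# Build a shortened, made-up sample with two job blocks.
sample=$(mktemp)
cat > "$sample" <<'EOF'
JOBNAME="MYSAP#SDOR-SG-D-LATECODED-0927"
            SYSDB="0"
            TASKTYPE="Job"
JOBNAME="MYSAP#SDOR-SG-D-OTHER-1234"
            SYSDB="0"
            TASKTYPE="Job"
EOF

# The range starts at the 0927 JOBNAME line and ends at the next TASKTYPE line;
# the substitution runs only on lines inside that range.
result=$(sed '/JOBNAME.*0927/,/TASKTYPE/ {s/TASKTYPE.*/TASKTYPE="NewJob"/;}' "$sample")
printf '%s\n' "$result"

# sed prints to standard output and leaves the input alone, so redirect
# to a new file if you want to keep the changed version.
sed '/JOBNAME.*0927/,/TASKTYPE/ {s/TASKTYPE.*/TASKTYPE="NewJob"/;}' "$sample" > "$sample.new"

rm -f "$sample" "$sample.new"
```

Only the TASKTYPE line of the first block should change; the second block is outside the range and passes through as-is.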

Using awk

This awk script uses the same logic:

awk '/JOBNAME.*0927/,/TASKTYPE/ {sub(/TASKTYPE.*/, "TASKTYPE=\"NewJob\"")} 1' largefile

How it works:

/JOBNAME.*0927/,/TASKTYPE/ {...}

This executes the commands in curly braces only for each group of lines that starts with a line matching the regex JOBNAME.*0927 and ends with the first subsequent line that matches TASKTYPE.

sub(/TASKTYPE.*/, "TASKTYPE=\"NewJob\"")

This performs the substitution.

1

Unlike sed, awk does not print anything by default. This 1 is awk's cryptic shorthand for print-the-whole-line.

In more detail, 1 is a logical condition that always evaluates to true. We specified no action to go along with that condition, so awk performs its default action, which is to print the line: print $0.
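If the range pattern and the lone 1 feel too terse, the same idea can be spelled out with an explicit flag and an explicit print. This is only a sketch of equivalent logic, not a different method: found is a variable name I made up, and the sample data is again shortened and invented.

```shell
# Build a shortened, made-up sample with two job blocks.
sample=$(mktemp)
cat > "$sample" <<'EOF'
JOBNAME="MYSAP#SDOR-SG-D-LATECODED-0927"
            SYSDB="0"
            TASKTYPE="Job"
JOBNAME="MYSAP#SDOR-SG-D-OTHER-1234"
            SYSDB="0"
            TASKTYPE="Job"
EOF

# found is a hypothetical flag: it is 1 from the target JOBNAME line
# until the first TASKTYPE line that follows it.
result=$(awk '
  /JOBNAME.*0927/ { found = 1 }            # start of the target block
  found && /TASKTYPE/ {                    # first TASKTYPE line after it
      sub(/TASKTYPE.*/, "TASKTYPE=\"NewJob\"")
      found = 0                            # later TASKTYPE lines stay untouched
  }
  { print $0 }                             # what the lone 1 abbreviates
' "$sample")
printf '%s\n' "$result"

rm -f "$sample"
```

The range form is shorter; the flag form is easier to extend if you later need extra conditions inside the block.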

edited Jun 1, 2016 at 1:52

answered Jun 1, 2016 at 1:25


4 Comments

Vonedaddy Over a year ago

That seems to work, can you give me a little explanation of the code so I may learn?

John1024 Over a year ago

@Vonedaddy I just added some explanation.

Vonedaddy Over a year ago

One last comment, what is the "1" for at the end of the line?

John1024 Over a year ago

@Vonedaddy I have added an explanation for the lone 1 also.
