sed: jump to next input file

Question

Is there an option or command similar to grep's -m1 or awk's nextfile for sed, which would allow sed to immediately stop processing the current input file when a match is found (while continuing to process subsequent input files)?

For example:

find . -type f -exec sed -ns '/pattern/{<do stuff>; p; <next>}' {} +

where <next> would be a command to cease reading the current input file. The quit command (q) is not suitable, since it simply causes sed to exit (abandoning subsequent input files), and would therefore find at most one match per batch of input files.

how about using \; instead of + so that sed sees only one file input at a time? — Sundeep
– Sundeep, Commented Jan 21, 2020 at 4:54
@Sundeep: I had thought about that, but I prefer to use + for performance reasons. — user001
– user001, Commented Jan 21, 2020 at 17:35
in that case perhaps use awk or perl as they do have ability to skip rest of the lines... if your input is ASCII, using LC_ALL=C will give you speed boost.. perhaps use xargs to parallelize.. and do at least try \; with q.. it isn't always easy to know speed results without actually performing the tests — Sundeep
– Sundeep, Commented Jan 22, 2020 at 3:33
@Sundeep: Thanks, LC_ALL=C is a good point in terms of performance. — user001
– user001, Commented Jan 22, 2020 at 7:28

Jakub Jindra · Accepted Answer · 2020-01-21 12:01:59Z

1

Example wit search and replace:

GNU sed

stops processing file/input (thanks to sed's -s option and find's +) after first occurrence of pattern.

find . -type f -exec sed -ns '0,/pattern/s/patter/replacement/p' "{}" +

BSD sed

BSD sed seems to lack -s option. So I'm using Sundeep's suggestion.

Quit sed after first occurrence of pattern and find will execute sed with next file.

find . -type f -exec sed -n '0,/pattern/p;s/pattern/replacement/p;q' "{}" \;

edited Jan 21, 2020 at 12:01

answered Jan 21, 2020 at 8:43

Jakub Jindra

1,5371 gold badge14 silver badges26 bronze badges

Thanks. The address range (0,/pattern/) does work, though it requires searching for the pattern twice (in the general case, <do stuff> might not involve a substitution on the pattern).

user001
– user001

2020-01-21 17:59:53 +00:00
Commented Jan 21, 2020 at 17:59
It should work with different things than substitution as well.

Jakub Jindra
– Jakub Jindra

2020-01-21 19:06:10 +00:00
Commented Jan 21, 2020 at 19:06
To see that it does not generalize, compare the output of seq 10 | sed -n '0,/9/ s/9/A/p' and seq 10 | sed -n '0,/9/ ='. Also, the address range only restricts execution of those commands, but processing will continue until EOF, which may be undesirable for large input files.

user001
– user001

2020-01-21 22:51:55 +00:00
Commented Jan 21, 2020 at 22:51
1

There's one more option how to possibly speed it up. You can find . -type f -print 0 | xargs -0 -P XX -L YY sed … this will run max XX parallel seds each with up to YY arguments from find. But it's possible that your <do stuff> isn't a good case for this.

Jakub Jindra
– Jakub Jindra

2020-01-22 13:32:32 +00:00
Commented Jan 22, 2020 at 13:32
Thanks, that's a good point as well. In this case, unmixed output is preferred, so parallel could be used.

user001
– user001

2020-01-22 17:51:34 +00:00
Commented Jan 22, 2020 at 17:51

| Show 2 more comments

Stack Exchange Network

sed: jump to next input file

1 Answer 1

You must log in to answer this question.

Hot Network Questions

sed: jump to next input file

1 Answer 1

You must log in to answer this question.

Related

Hot Network Questions