regex select multilines in powershell

Question

I created a file like this

echo "test 1", Hello, foo, bar, "a better world", "test 2" > test.txt

and the result is this:

test 1
Hello
foo
bar
a better world
test 2

I need to remove all the text starting with the keyword "Hello" and ending with "world", including both keywords.

Something like this

test 1
test 2

I tried

$pattern='(?s)(?<=/Hello/\r?\n).*?(?=world)'
(Get-Content -Path .\test.txt -Raw) -replace $pattern, "" | Set-Content -Path .\test.txt

but nothing happened. What can I try?

It seems like you could do it with -replace '(?s)\s*Hello.*world'
– Santiago Squarzon, Commented Jan 11, 2023 at 16:36
@Leo Your post says "the text between the keywords"; please update your question to reflect what you actually want.
– Mathias R. Jessen, Commented Jan 11, 2023 at 16:38
Nicely done, @Santiago - I suggest posting that as an answer (the only consideration worth mentioning is whether the .* should be greedy or not).
– mklement0, Commented Jan 11, 2023 at 16:40
Thanks @mklement0, but I'm honestly still unclear on what OP wants.
– Santiago Squarzon, Commented Jan 11, 2023 at 16:40
@MathiasR.Jessen, sorry, I got confused: yes, my answer removes the keywords, because I believe that to be the OP's intent ("including both keywords").
– mklement0, Commented Jan 11, 2023 at 16:44

Santiago Squarzon · Accepted Answer · 2023-01-11 16:51:19Z

3

Assuming you want to remove the starting and ending keywords, you could use either (?s)\s*Hello.*world or (?s)\s*Hello.*?world, depending on whether you want .* to be greedy (match through the last "world") or lazy (stop at the first "world").

(Get-Content path\to\file.txt -Raw) -replace '(?s)\s*Hello.*world' |
    Set-Content path\to\result.txt

Use -creplace for case sensitive matching of the keywords.
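To make the greedy/lazy and -replace/-creplace differences concrete, here is a minimal sketch on an in-memory string. The sample text (with two "world" lines) and the variable names are assumptions made up for illustration; they are not from the question.

```powershell
# Illustrative sample: note the TWO "world" lines. Joined with LF for simplicity.
$text = "test 1", "Hello", "foo", "world", "bar", "world", "test 2" -join "`n"

# Greedy: .* runs to the LAST "world", so everything from Hello onward is removed.
$greedy = $text -replace '(?s)\s*Hello.*world'

# Lazy: .*? stops at the FIRST "world", so "bar" and the second "world" survive.
$lazy = $text -replace '(?s)\s*Hello.*?world'

# -replace ignores case, so a lowercase "hello" in the pattern still matches ...
$insensitive = $text -replace '(?s)\s*hello.*world'

# ... whereas -creplace is case-sensitive and leaves the text untouched here.
$sensitive = $text -creplace '(?s)\s*hello.*world'

$greedy   # the two lines "test 1" and "test 2"
$lazy     # "test 1", "bar", "world", "test 2"
```

With the file from the question there is only one "world", so both variants give the same result; the choice only matters when the end keyword can occur more than once.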

answered Jan 11, 2023 at 16:51


1 Comment

Leo Over a year ago

I also like the -NoNewLine option after the Set-Content command, just to avoid the new empty line at the end of the file
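A rough sketch of that idea, assuming a scratch file in the temp directory (the file name below is hypothetical): Get-Content -Raw already returns the file's trailing newline as part of the string, so -NoNewline keeps Set-Content from appending a second one.

```powershell
# Hypothetical scratch file, used only for this illustration.
$file = Join-Path ([System.IO.Path]::GetTempPath()) 'nonewline-demo.txt'
Set-Content -Path $file -Value 'test 1', 'Hello', 'foo', 'world', 'test 2'

# -Raw reads the whole file as one string, trailing newline included.
$new = (Get-Content -Path $file -Raw) -replace '(?s)\s*Hello.*world'

# Without -NoNewline, Set-Content would add another newline after $new.
Set-Content -Path $file -Value $new -NoNewline

$final = Get-Content -Path $file -Raw
Remove-Item -Path $file

# True if the file ends with exactly one newline after "test 2".
$final -match 'test 2\r?\n\z'
```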

mklement0 · Accepted Answer · 2023-01-11 18:44:28Z

3

Leaving aside that there are extraneous / characters in your regex, reformulate it as follows (tip of the hat to Santiago Squarzon):

$pattern = '(?sm)^Hello\r?\n.*?world\r?\n'

(Get-Content -Path .\test.txt -Raw) -replace $pattern | 
  Set-Content -Path .\test.txt

This removes the line starting with Hello all the way through the (first) subsequent line that ends in world, including the next newline. This yields the desired output, as shown in your question.
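As a quick sanity check, the pattern can be tried on in-memory strings rather than a file. The sketch below assumes the sample lines from the question and builds them once with LF and once with CRLF newlines, since \r?\n is meant to cover both; the variable names are illustrative.

```powershell
$pattern = '(?sm)^Hello\r?\n.*?world\r?\n'
$lines   = 'test 1', 'Hello', 'foo', 'bar', 'a better world', 'test 2'

# Try both Unix-style (LF) and Windows-style (CRLF) line endings.
$checks = foreach ($nl in "`n", "`r`n") {
    # Rebuild the file content as one multi-line string with a trailing newline.
    $text   = ($lines -join $nl) + $nl
    $result = $text -replace $pattern

    # Emit $true if only the two "test" lines and their newlines are left.
    $result -ceq ('test 1' + $nl + 'test 2' + $nl)
}

$checks
```

Here (?m) lets ^ match at the start of each line, and (?s) lets . match across newlines, which is what allows .*? to span several lines.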

As for what you tried:

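Aside from the extraneous / chars., your primary problem is that you are using look-around assertions ((?<=...), (?=...)). Whatever a look-around assertion matches does not become part of the overall match, and is therefore not replaced by -replace: even with the slashes removed, your pattern would remove only the text between the keywords, not the keywords themselves.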
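The following sketch illustrates both points on an in-memory string. It assumes a simplified sample with a plain "world" line and LF newlines, and the variable names are only for illustration.

```powershell
$text = "test 1`nHello`nfoo`nbar`nworld`ntest 2`n"

# The pattern as posted: the literal slashes never occur in the text,
# so nothing matches and the text comes back unchanged.
$original  = '(?s)(?<=/Hello/\r?\n).*?(?=world)'
$unchanged = $text -replace $original

# With the slashes removed, the look-arounds only ASSERT that Hello and world
# are there; they are not part of the match, so both keywords are kept.
$lookaround = '(?s)(?<=Hello\r?\n).*?(?=world)'
$between    = $text -replace $lookaround

$unchanged -ceq $text   # True: the posted pattern matches nothing
$between                # test 1, Hello, world, test 2 (only foo and bar are gone)
```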

edited Jan 11, 2023 at 18:44

answered Jan 11, 2023 at 16:36


iRon · Accepted Answer · 2023-01-11 17:31:00Z

0

I think this is a duplicate of "How can I deleted lines from a certain position?" or any of the other duplicates linked from it:

'test1', 'Hello', 'foo', 'bar', 'world', 'test2' | SelectString -From '(?=Hello)' -To '(?<=world)'
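Note that SelectString is a custom function, not the built-in Select-String cmdlet. For comparison, here is a minimal line-by-line sketch that uses only built-in language features. It is not the implementation of SelectString, just an illustration of the same idea with a simple on/off flag, and the variable names are made up.

```powershell
$lines = 'test1', 'Hello', 'foo', 'bar', 'world', 'test2'

$skipping = $false
$kept = foreach ($line in $lines) {
    # Start skipping at a line that begins with the start keyword.
    if (-not $skipping -and $line -match '^Hello') { $skipping = $true }

    # Emit only lines outside the Hello..world block.
    if (-not $skipping) { $line }

    # Stop skipping after a line that ends with the end keyword.
    if ($skipping -and $line -match 'world$') { $skipping = $false }
}

$kept   # test1, test2
```

Unlike the -Raw approaches, this processes the input one line at a time, so it also works on pipeline input without reading the whole file into a single string.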

edited Jan 11, 2023 at 17:31

answered Jan 11, 2023 at 17:06


1 Comment

mklement0 Over a year ago

Note that you're doing line-by-line processing, whereas the OP's attempt uses single-string, multi-line processing. I'm sure there are plenty of posts here that are variations of the same theme, though the specifics often warrant separate answers. Your custom SelectString function is a nice alternative approach, but, given that its name looks like Select-String, I suggest making it clear (here too) that a custom function is being used.
