Regex to read URL from ASPX File PowerShell

Question

I'm writing a PowerShell Script which extracts URL's from ASPX files and test if their HTTP Statuscode is equal to 200.

I found the following Regex to get the URL:

$regex = "(http[s]?|[s]?ftp[s]?)(:\/\/)([^\s,]+)"
select-string -Path $path -Pattern $regex -AllMatches | % { $_.Matches } | % { $_.Value }

But the return looks like this:

https://code.jquery.com/ui/1.9.0/themes/base/jquery-ui.css"/>
https://code.jquery.com/ui/1.11.4/jquery-ui.min.js"></script>

as you can see, it doesn't really trim the end of the HTML Tags.

How can I edit my regex to get the URL without the HTML Tags in the end?

Replace [^\s,] with [^\s,<>"]

Wiktor Stribiżew
– Wiktor Stribiżew

2017-08-18 07:15:20 +00:00
Commented Aug 18, 2017 at 7:15 — Wiktor Stribiżew
– Wiktor Stribiżew, Commented Aug 18, 2017 at 7:15
@WiktorStribiżew Perfect, thanks!

SimonS
– SimonS

2017-08-18 07:17:19 +00:00
Commented Aug 18, 2017 at 7:17 — SimonS
– SimonS, Commented Aug 18, 2017 at 7:17

Wiktor Stribiżew · Accepted Answer · 2017-08-18 07:21:55Z

2

If you have a look at the [^\s,] negated character class, you will see it matches any char but whitespace and ,. If you look at the input you have, you will notice that " and < and > can all be matched with [^\s,].

A fix for the current situation is to add <>" chars into the negated character class to make the regex engine "stop" when it comes across the >, < and " chars.

Note that since you extract whole matches, you may refactor the pattern a bit and remove unnecessary groupings and turn the first one into a non-capturing group:

$regex = '(?:http|s?ftp)s?://[^\s,<>"]+'

Mind that in .NET patterns, / does not need to be escaped (it is not a special regex metacharacter/operator).

answered Aug 18, 2017 at 7:21

Wiktor Stribiżew

631k41 gold badges502 silver badges632 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Regex to read URL from ASPX File PowerShell

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related