C# Regex Problem

Question

I want to extract all table rows from an HTML page, but the pattern @"<tr>([\w\W]*)</tr>" is not working. It gives a single result that spans from the first occurrence of <tr> to the last occurrence of </tr>, whereas I want every individual <tr>...</tr> match. Can anyone please tell me how I can do this?

carla · Accepted Answer · 2017-11-27 00:04:31Z

5

[\w\W]* matches greedily so it will match from the first <tr> to the last </tr>.

A regex approach won't work well because HTML is not a regular language. If you really want to use a regex, you could try a lazy quantifier such as "<tr>(.*?)</tr>" with the RegexOptions.Singleline flag; however, this isn't guaranteed to work in all cases.
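To make the greedy/lazy difference concrete, here is a minimal sketch. The class name, method names, and sample HTML are made up for illustration; only Regex.Matches and RegexOptions.Singleline come from the framework.

```csharp
using System;
using System.Text.RegularExpressions;

public static class GreedyVsLazyDemo
{
    // Hypothetical sample input: two rows on separate lines.
    public const string SampleHtml =
        "<table>\n<tr><td>1</td></tr>\n<tr><td>2</td></tr>\n</table>";

    // Greedy: [\w\W]* runs on to the LAST </tr>, so both rows end up in one match.
    public static int CountGreedy(string html) =>
        Regex.Matches(html, @"<tr>([\w\W]*)</tr>").Count;

    // Lazy: .*? stops at the NEAREST </tr>; Singleline lets '.' match newlines too.
    public static int CountLazy(string html) =>
        Regex.Matches(html, "<tr>(.*?)</tr>", RegexOptions.Singleline).Count;
}
```

On the sample input, CountGreedy should report a single match covering both rows, while CountLazy should report one match per row.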

For parsing HTML you need an HTML parser. Try HTML Agility Pack.
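A rough sketch of what that could look like, assuming the HtmlAgilityPack NuGet package is referenced. RowExtractor and GetRows are hypothetical names chosen for this example, not part of the library.

```csharp
using System.Collections.Generic;
using HtmlAgilityPack; // assumption: the HtmlAgilityPack NuGet package is installed

public static class RowExtractor
{
    // Returns the inner HTML of every <tr> element found in the document.
    public static List<string> GetRows(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        var rows = new List<string>();

        // SelectNodes returns null (not an empty collection) when nothing matches.
        var nodes = doc.DocumentNode.SelectNodes("//tr");
        if (nodes == null)
        {
            return rows;
        }

        foreach (HtmlNode tr in nodes)
        {
            rows.Add(tr.InnerHtml);
        }
        return rows;
    }
}
```

Because the XPath query selects elements rather than matching text, rows that carry attributes such as <tr class="odd"> are found as well.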

edited Nov 27, 2017 at 0:04

carla


answered Feb 4, 2011 at 22:55

Mark Byers



3 Comments

Zach Johnson Over a year ago

And we all know what happens when you try to parse html with a regex... stackoverflow.com/questions/1732348/…

Barun Over a year ago

Another question: is there any way I can do it using regex?

Mark Byers Over a year ago

This page shows a quick example of how the HTML Agility Pack library can be used: htmlagilitypack.codeplex.com/wikipage?title=Examples

Rubens Farias · Accepted Answer · 2011-02-04 23:00:10Z

2

I agree with Mark: you should use the HTML Agility Pack library.

As for your regex, you should go with something like:

@"<tr>([\s\S]*?)</tr>"

That's a non-greedy pattern, and you should get one match for every TR.
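Here is a small sketch of pulling out each row's contents with that pattern. LazyRowMatcher and ExtractRowBodies are hypothetical names used only for this example.

```csharp
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class LazyRowMatcher
{
    // [\s\S] matches any character, newlines included, so no RegexOptions are needed.
    private static readonly Regex RowPattern = new Regex(@"<tr>([\s\S]*?)</tr>");

    // Returns the text between <tr> and </tr> for every match.
    public static List<string> ExtractRowBodies(string html)
    {
        var bodies = new List<string>();
        foreach (Match m in RowPattern.Matches(html))
        {
            bodies.Add(m.Groups[1].Value); // group 1 = contents of the row
        }
        return bodies;
    }
}
```

Note that the pattern only matches a literal <tr>, so a row written as <tr class="x"> is skipped entirely. That is one of the cases where the regex route falls short.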

answered Feb 4, 2011 at 23:00

Rubens Farias


1 Comment

Barun Over a year ago

Another question... Can you point me to a link or a book where I can learn C# regex properly?
