javascript regex for xml/html attributes

Question

I can't seem to build a good regular expression (in JavaScript) that extracts each attribute from an XML node. For example,

<Node attribute="one" attribute2="two" n="nth"></node>

I need an expression that gives me an array of

['attribute="one"', 'attribute2="two"', 'n="nth"']

... Any help would be appreciated. Thank you

@jfriend00 - probably because browsers have a built-in XML parser and suitable DOM methods already.
– RobG, Commented Jul 25, 2011 at 3:13
I'm not sure I want the overhead of an XML parser library, plus I'm rarely ever going to have well-formed XML. I'm actually parsing the diff generated by git.
– James, Commented Jul 26, 2011 at 1:30

Community · Accepted Answer · 2020-06-20 09:12:55Z

4

You can't parse XML with a regular expression.

And the link: RegEx match open tags except XHTML self-contained tags

You can get the attributes of a node by iterating over its attributes property:

function getAttributes(el) {
  var r = [];
  var a, atts = el.attributes;

  for (var i=0, iLen=atts.length; i<iLen; i++) {
    a = atts[i];
    r.push(a.name + ': ' + a.value);
  }
  alert(r.join('\n'));
}

Of course, you probably want to do something other than just put them in an alert.
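As a sketch of one alternative to the alert, the same loop can return the attributes in the 'name="value"' form the question asks for. The helper name attributesToStrings is hypothetical and not part of the answer above. It only assumes that el.attributes is array-like and that each entry has name and value properties, as DOM attribute nodes do.

```javascript
// Hypothetical helper (an illustration, not the answer's own code):
// collects each attribute as a 'name="value"' string and returns the list.
function attributesToStrings(el) {
  var result = [];
  var atts = el.attributes; // array-like; each entry has .name and .value

  for (var i = 0; i < atts.length; i++) {
    result.push(atts[i].name + '="' + atts[i].value + '"');
  }
  return result;
}

// In a browser, an element could be obtained from a string like this:
//   var doc = new DOMParser().parseFromString(xmlString, 'application/xml');
//   var list = attributesToStrings(doc.documentElement);
```

This relies on the input being well formed enough for the parser, which may not hold for fragments taken from a git diff.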

Here is an article on MDN that includes links to relevant standards:

https://developer.mozilla.org/En/DOM/Node.attributes

edited Jun 20, 2020 at 9:12

CommunityBot

answered Jul 25, 2011 at 3:01

RobG

1 Comment

Pablo Fernandez Over a year ago

I'd definitely use this instead of a regex if possible +1

Monday · Accepted Answer · 2011-07-25 02:59:54Z

3

Try this:

  <script type="text/javascript">
    // Capture the whole run of name="value" pairs between <node and >.
    var myregexp = /<node((\s+\w+="[^"]+")+)><\/node>/im;
    var match = myregexp.exec('<Node attribute="one" attribute2="two" n="nth"></node>');
    if (match != null) {
      // Split the captured run on whitespace, one entry per attribute.
      var result = match[1].trim();
      var arrayAttrs = result.split(/\s+/);
      alert(arrayAttrs);
    }
  </script>

answered Jul 25, 2011 at 2:59

Monday

1 Comment

James Over a year ago

I got about this far as well. Unfortunately, a space in the attribute value breaks this. Perhaps I need to first replace the spaces between the quotes with underscores, then, after I split the array, change them back to spaces?
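One possible way around the space problem, without the underscore round trip, is to match each quoted pair in the captured text instead of splitting on whitespace. The sketch below is an assumption-laden variant of the answer's approach, not the answerer's code: the function name splitAttributes is made up, and it only handles double-quoted values and plain \w+ names.

```javascript
// Hypothetical variant: match each name="value" pair rather than
// splitting on whitespace, so spaces inside a value are kept.
function splitAttributes(tagSource) {
  // Group 1 captures the run of attributes in the opening tag.
  var tagMatch = /<\w+((?:\s+\w+="[^"]*")*)\s*\/?>/.exec(tagSource);
  if (tagMatch === null) {
    return [];
  }
  // match() with the g flag returns null when nothing matches.
  return tagMatch[1].match(/\w+="[^"]*"/g) || [];
}

var attrs = splitAttributes('<Node attribute="one two" n="nth"></node>');
// attrs is ['attribute="one two"', 'n="nth"']
```

It still fails on single-quoted values and on names containing characters outside \w.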

Ryan Gross · Accepted Answer · 2011-07-25 02:50:45Z

0

I think you could get it using the following. You would want the second and third matching group.

<[\w\d\-_]+\s+(([\w\d\-_]+)="(.*?)")*>

answered Jul 25, 2011 at 2:50

Ryan Gross

1 Comment

RobG Over a year ago

That won't work in a number of cases, such as if there's a namespace, e.g. <ns1:tagname .... >, or an attribute name contains a colon (:) or a period (.) character (not included in the appropriate part of the regular expression) or the value contains a double quote character.
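Some of the cases listed in this comment can be covered by widening the name pattern and accepting either quote style. The following is only a sketch under those assumptions. The function name parseAttributes and the character class [\w:.-] are illustrative choices, not a faithful implementation of the XML name rules, and the pattern will also match attribute-like text that appears outside a tag.

```javascript
// Hypothetical sketch: extract name/value pairs, allowing colons, periods
// and hyphens in names, and either double- or single-quoted values.
function parseAttributes(tagSource) {
  var pattern = /([\w:.-]+)\s*=\s*(?:"([^"]*)"|'([^']*)')/g;
  var pairs = [];
  var m;

  // exec() with the g flag advances through the string on each call.
  while ((m = pattern.exec(tagSource)) !== null) {
    pairs.push({
      name: m[1],
      // Group 2 is set for double quotes, group 3 for single quotes.
      value: m[2] !== undefined ? m[2] : m[3]
    });
  }
  return pairs;
}

var pairs = parseAttributes('<ns1:tag xml:lang="en" data.id=\'say "hi"\'>');
// pairs[0] is { name: 'xml:lang', value: 'en' }
// pairs[1] is { name: 'data.id', value: 'say "hi"' }
```

Using exec in a loop, rather than one repeated capturing group, matters here: a repeated group only retains its last match.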

Pablo Fernandez · Accepted Answer · 2011-07-25 17:23:23Z

0

The regex is /\w+=".+"/g (note the g of global).

You can try it right now in the Firebug or Chrome console by running:

var matches = '<Node attribute="one" attribute2="two" n="nth"></node>'.match(/\w+="\w+"/g)
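The two patterns shown in this answer behave differently, and the comments on it turn on that difference. As a rough comparison, the sketch below runs both of them, plus a third pattern using a negated character class ([^"]*) that does not appear in the answer and is added here only for contrast. The function name compareAttributeRegexes is hypothetical.

```javascript
// Hypothetical comparison helper: applies three candidate patterns
// to the same input and returns what each one matches.
function compareAttributeRegexes(tag) {
  return {
    greedy: tag.match(/\w+=".+"/g) || [],       // .+ runs to the last quote
    wordOnly: tag.match(/\w+="\w+"/g) || [],    // values limited to \w chars
    negated: tag.match(/\w+="[^"]*"/g) || []    // value stops at next quote
  };
}

var simple = compareAttributeRegexes(
  '<Node attribute="one" attribute2="two" n="nth"></node>');
// simple.greedy   is ['attribute="one" attribute2="two" n="nth"']
// simple.wordOnly is ['attribute="one"', 'attribute2="two"', 'n="nth"']
// simple.negated  is ['attribute="one"', 'attribute2="two"', 'n="nth"']

var spaced = compareAttributeRegexes('<Node title="hello world" n="nth">');
// spaced.wordOnly is ['n="nth"']
// spaced.negated  is ['title="hello world"', 'n="nth"']
```

The greedy version returns a single match spanning all three attributes, and the \w+ version silently drops any attribute whose value contains a space.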

edited Jul 25, 2011 at 17:23

answered Jul 25, 2011 at 2:49

Pablo Fernandez

6 Comments

RobG Over a year ago

And if an attribute value has a space it fails. See the link in the first comment.

RobG Over a year ago

No, it doesn't. The question was "a good regex expression ... that extracts each attribute from an xml node", not one for the very limited example.

Qtax Over a year ago

@Pablo, maybe you should try it before saying that you fixed anything. ;-)

Pablo Fernandez Over a year ago

@Qtax, oops, left the escape there. Thanks for the correction man :)

Qtax Over a year ago

@Pablo, you should still try it, even on your limited example. ;-)
