I have one XML file as "sample.xml" and has 4 records .
<?xml version='1.0' encoding='UTF-8'?>
<hello xmlns:show="http://www.example.com" xmlns:css="http://www.example.com" xml_version="2.0">
<entry id="2008-0001">
<show:id>2008-0001</show:id>
<show:published-datetime>2008-01-15T15:00:00.000-05:00</show:published-datetime>
<show:last-modified-datetime>2012-03-19T00:00:00.000-04:00</show:last-modified-datetime>
<show:css>
<css:metrics>
<css:score>3.6</css:score>
<css:access-vector>LOCAL</css:access-vector>
<css:authentication>NONE</css:authentication>
<css:generated-on-datetime>2008-01-15T15:22:00.000-05:00</css:generated-on-datetime>
</css:metrics>
</show:css>
<show:summary>This is first entry.</show:summary>
</entry>
<entry id="2008-0002">
<show:id>2008-0002</show:id>
<show:published-datetime>2008-02-11T20:00:00.000-05:00</show:published-datetime>
<show:last-modified-datetime>2014-03-15T23:22:37.303-04:00</show:last-modified-datetime>
<show:css>
<css:metrics>
<css:score>5.8</css:score>
<css:access-vector>NETWORK</css:access-vector>
<css:authentication>NONE</css:authentication>
<css:generated-on-datetime>2008-02-12T10:12:00.000-05:00</css:generated-on-datetime>
</css:metrics>
</show:css>
<show:summary>This is second entry.</show:summary>
</entry>
<entry id="2008-0003">
<show:id>2008-0003</show:id>
<show:published-datetime>2009-03-26T06:12:08.780-04:00</show:published-datetime>
<show:last-modified-datetime>2009-03-26T06:12:09.313-04:00</show:last-modified-datetime>
<show:summary>This is 3rd entry with missing "css" tag and their metrics.</show:summary>
</entry>
<entry id="2008-0004">
<show:id>CVE-2008-0004</show:id>
<show:published-datetime>2008-01-11T19:46:00.000-05:00</show:published-datetime>
<show:last-modified-datetime>2011-09-06T22:41:45.753-04:00</show:last-modified-datetime>
<show:css>
<css:metrics>
<css:score>4.3</css:score>
<css:access-vector>NETWORK</css:access-vector>
<css:authentication>NONE</css:authentication>
<css:generated-on-datetime>2008-01-14T09:37:00.000-05:00</css:generated-on-datetime>
</css:metrics>
</show:css>
<show:summary>This is 4th entry.</show:summary>
</entry>
</hello>
and 1 Java file as "Test.java" -
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpression;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
public class Test {
public static void main(String[] args) {
List<String> list = new ArrayList<String>();
File fXmlFile = new File("/home/ankit/sample.xml");
DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
try
{
DocumentBuilder dBuilder = factory.newDocumentBuilder();
Document doc = dBuilder.parse(fXmlFile);
doc.getDocumentElement().normalize();
NodeList nList = doc.getElementsByTagName("entry");
XPathFactory xPathfactory = XPathFactory.newInstance();
XPath xpath = xPathfactory.newXPath();
for (int i = 0; i < nList.getLength(); i++)
{
XPathExpression expr1 = xpath.compile("//hello/entry/css/metrics/score");
NodeList nodeList1 = (NodeList) expr1.evaluate(doc, XPathConstants.NODESET);
if(nodeList1.item(i)!=null)
{
Node currentItem = nodeList1.item(i);
if(!currentItem.getTextContent().isEmpty())
{
list.add(currentItem.getTextContent());
}
}
}
}
catch(Exception e)
{
e.printStackTrace();
}
System.out.println("size----"+list.size());
for(int i=0;i<list.size();i++)
{
System.out.println("list----"+list.get(i));
}
}
}
I need to read the <entry> tag from the XML and for that I am using XPath . In the XML file there are 4 entry tags and inside entry tag there is <show:css> tag, but in 3rd <entry> tag this <show:css> tag is missing and putting those css tag's score values in the list. So when I am running this java code first 2 values got stored in the list and at the 3rd place it stores 4th tag's css's score value.
I want a list as output which will have first, second and forth element as "3.6", “4.8” and “5.3” and 3rd element should be empty string or nill. But I am getting only 3 elements in the list with values of 1,2 and 4.
I need to put empty string “” at 3rd place and original value at 4th. Means If that tag is not present then put blank value in the list.
Current output - [“3.6” , “4.8” , “5.3”]
I expect - [“3.6” , “4.8” , “” , “5.3”]
Could anyone please help me with this that how to do this.