I am using spark 2.2 and java 1.8
sample XML format -
<?xml version="1.0" encoding="UTF-8" ?>
<!-- Generated by Oracle DVM Editor version 1.0 at [9/6/11 5:14 PM].
-->
<dvm name="CIHSubscriptionTypeMapping" xmlns="http://xmlns.oracle.com/dvm">
<description>
</description>
<columns>
<column name="SSPMW"/>
<column name="CIH"/>
</columns>
<rows>
<row>
<cell>ute.recordClass</cell>
<cell>sku_type</cell>
</row>
<row>
<cell>ute.name.en</cell>
<cell>name_en</cell>
</row>
</rows>
</dvm>
Reading XML file in spark with java API -
Dataset<Row> xmlDF =spark.read()
.format("com.databricks.spark.xml")
.options("rowTag","row")
.load("sample.xml");
xmlDF.printSchema()
root
|-- cell: array (nullable = true)
| |-- element: string (containsNull = true)
xmlDF.show(false)
cell
================
[ute.recordClass, sku_type]
[ute.name.en, name_en]
I want convert above "cell" column to Lookup Map(String,String) and later will use it for broadcast.
Example - (ute.sku.price,list_price)......
Can some one help on this?.Thanks.