I have one xml file like this:
<dbReference type="PM" id="17224074"/>
<dbReference type="DOI" id="10.1186/bcr1637"/>
</citation>
<scope>VARIANTS ILE-282 AND ASN-777</scope>
</reference>
<comment type="function">
<text evidence="24">Calcium.</text>
</comment>
<comment type="function">
<text evidence="24">Has a strong inhibitory effect on APP C99 and C83 production.</text>
</comment>
<comment type="subunit">
<text evidence="5 13">Homodimer; disulfide-linked.</text>
</comment>
<comment type="interaction">
<interactant intactId="EBI-727477"/>
<interactant intactId="EBI-7644904">
<id>Q9JIY2</id>
<label>Cbll1</label>
</interactant>
<organismsDiffer>true</organismsDiffer>
<experiments>21</experiments>
</comment>
I want to extract only the information in
<comment type="function">...</comment>
In this example being: 'Calcium.' AND 'as a strong inhibitory effect on APP C99 and C83 production.'
I have this table where I want to save the data:
CREATE TABLE IF NOT EXISTS INFORMATION (id varchar(255) NOT NULL, name varchar(255), entry varchar(255), comment longtext, PRIMARY KEY (id));
where I would save 'Calcium.' AND 'as a strong inhibitory effect on APP C99 and C83 production.' in column named 'comment'. I thought that I could insert this info directly from the xml to the table with LOAD XML but my xml file has too many different fields. How do I do this? Do I have to first extract the data from the xml and then insert into the table?