So I'm trying to parse following data into a CSV. From my reading it sounds like the best way to go about it is using HAP since it has a robust parser.
As of right now, the WPF WebBrowser control content is being accessed by:
dynamic doc = this.wbControl.Document;
Content
<div class="content">
<fieldset>
<ul class="fieldsetr">
<li class="row medium">
<div class="field">
<div class="shell">
<em class="disable">Sender:</em>
</div>
</div>
<div>
<div class="clip">
<em>[email protected]</em>
</div>
</div>
</li>
<li class="row medium alt">
<div class="field">
<div class="shell">
<em class="disable">Recipient:</em>
</div>
</div>
<div>
<div class="clip">
<em>[email protected]</em>
</div>
</div>
</li>
<li class="row medium">
<div class="field">
<div class="shell">
<em class="disable">Message ID:</em>
</div>
</div>
<div>
<div class="clip">
<em>2342342345235</em>
</div>
</div>
</li>
<li class="row medium alt">
<div class="field">
<div class="shell">
<em class="disable">Message size:</em>
</div>
</div>
<div>
<div class="clip">
<em>18.74 KB
</em>
</div>
</div>
</li>
<li class="row medium">
<div class="field">
<div class="shell">
<em class="disable">Date and time received:</em>
</div>
</div>
<div>
<div class="clip">
<em>11/27/2012 6:17:22 AM</em>
</div>
</div>
</li>
<li class="row medium alt">
<div class="field">
<div class="shell">
<em class="disable">Date and time filtered:</em>
</div>
</div>
<div>
<div class="clip">
<em>11/27/2012 6:17:22 AM</em>
</div>
</div>
</li>
<li class="row medium">
<!-- Connector Details -->
</li>
<li class="row medium alt">
<div class="field">
<div class="shell">
<em class="disable">First delivery attempt:</em>
</div>
</div>
<div>
<div class="clip">
<em>11/27/2012 6:17:23 AM</em>
</div>
</div>
</li>
<li class="row medium">
<div class="field">
<div class="shell">
<em class="disable">Final delivery attempt:</em>
</div>
</div>
<div>
<div class="clip">
<em>11/27/2012 6:17:23 AM</em>
</div>
</div>
</li>
<li class="row medium alt">
<div class="field">
<div class="shell">
<em class="disable">From IP address:</em>
</div>
</div>
<div>
<div class="clip">
<em>1.2.3.4 <unknown></em>
</div>
</div>
</li>
<li class="row medium">
<div class="field">
<div class="shell">
<em class="disable">To IP address:</em>
</div>
</div>
<div>
<div class="clip">
<em>4.3.2.1 <mail.example2.com> </em>
</div>
</div>
</li>
<li class="row medium alt">
<div class="field">
<div class="shell">
<em class="disable">Filtering results:</em>
</div>
</div>
<div>
<div class="clip">
<em>Passed Filtering</em>
</div>
</div>
</li>
<li class="row medium">
<div class="field">
<div class="shell">
<em class="disable">Delivery result:</em>
</div>
</div>
<div>
<div class="clip">
<span><em>Delivered: 470 2.4.0 <2342342345235> [InternalId=2321233] Queued mail for delivery</em></span>
</div>
</div>
</li>
</ul>
</fieldset>
</div>
What is the best way for me to convert this data? This is only one record, but more records would be added.
Edit
Ended up using the following code to test it out:
HtmlAgilityPack.HtmlDocument docHAP = new HtmlAgilityPack.HtmlDocument();
docHAP.LoadHtml(doc.Body.InnerHtml.ToString());
foreach(HtmlNode emNode in docHAP.DocumentNode.SelectNodes("//em"))
{
MessageBox.Show(emNode.InnerText.ToString());
}
If anyone has a more efficient solution, please feel free to let me know.