<html>
<body>
<div>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"></p>
</div>
</div>
</div>
</body>
</html>
我上面有html代码。没有<html><body>
标签,如何获取此内容并删除<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
?
这是我想要的结果是:
<div>
<div>
<p class="MsoNormal"></p>
</div>
</div>
var doc = new HtmlDocument();
doc.Load("..\\..\\h.html");
HtmlNode body = doc.DocumentNode.SelectNodes("//body")[0];
HtmlNode div1 = body.FirstChild;
while (div1.NodeType != HtmlNodeType.Element)
div1 = div1.NextSibling;
HtmlNode div2 = div1.FirstChild;
while (div2.NodeType != HtmlNodeType.Element)
div2 = div2.NextSibling;
HtmlNode div3 = div2.FirstChild;
while (div3.NodeType != HtmlNodeType.Element)
div3 = div3.NextSibling;
var div3_p = div3.SelectNodes("//p")[0];
div2.ReplaceChild(div3_p, div3);
doc.DocumentNode.InnerHtml = body.InnerHtml;
doc.Save("..\\..\\h2.html");