Retrieving Inner Text of an HTML Tag in C#

John Dougherty · Aug 30, 2011 · Viewed 9.4k times

I have a string that contains HTML. Inside this string there is an HTML tag whose inner text I want to retrieve. How can I do that in C#?

Here is the html tag whose inner text I want to retrieve:

<td width="100%" class="container">

Answer

Matt Ball · Aug 30, 2011

Use the Html Agility Pack.


Edit: something like this (not tested):

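using HtmlAgilityPack; // from the HtmlAgilityPack NuGet package

string html = /* whatever */;
HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(html);
// The root is exposed as DocumentNode; SelectNodes returns null when nothing matches.
HtmlNodeCollection cells = doc.DocumentNode.SelectNodes("//td[@class='container']");
if (cells != null)
{
    foreach (HtmlNode td in cells)
    {
        string text = td.InnerText;
        // do whatever with text
    }
}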

You can also select the text directly with a different XPath selector.
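
As a rough illustration of that XPath variant, the sketch below appends `/text()` to the selector so that the query returns the text nodes themselves instead of the `<td>` elements. The class name `ContainerText` and the method `GetDirectText` are hypothetical names invented for this example, and the entity decoding and trimming are extra assumptions that the answer does not mention.

```csharp
using System;
using System.Collections.Generic;
using HtmlAgilityPack; // from the HtmlAgilityPack NuGet package

public static class ContainerText
{
    // Hypothetical helper (not part of the original answer): collects the text
    // that sits directly inside each <td class="container">.
    public static List<string> GetDirectText(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        var results = new List<string>();

        // text() selects the child text nodes rather than the <td> elements.
        HtmlNodeCollection nodes =
            doc.DocumentNode.SelectNodes("//td[@class='container']/text()");

        // SelectNodes returns null (not an empty collection) when nothing matches.
        if (nodes == null)
        {
            return results;
        }

        foreach (HtmlNode node in nodes)
        {
            // Assumption: callers want entities such as &amp; decoded
            // and surrounding whitespace removed.
            string text = HtmlEntity.DeEntitize(node.InnerText).Trim();
            if (text.Length > 0)
            {
                results.Add(text);
            }
        }

        return results;
    }
}
```

Note the difference from `td.InnerText`: `InnerText` on the element also includes the text of nested tags, while `/text()` matches only text nodes that are direct children of the `<td>`. Which one fits depends on whether the cell contains further markup.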

