Parsing HTML and extracting articles of interest
static HtmlAgilityPack.HtmlNode | ExtractYahooConversationsHtml (string yahooHtml)
| Extracts the HTML node containing all the comments
static string | AdjustStrForTex (string s)
static YahooComment | GetYahooComment (HtmlAgilityPack.HtmlNode commentNode, bool parseForTex=false)
static HtmlDocument | MkHtmlDoc (string htmlStr)
| Make HTML document from htmlStr
static HtmlAgilityPack.HtmlNode | GetShowButton (HtmlAgilityPack.HtmlNode yahooCommentNodes)
static List< YahooComment > | ExtractComments (HtmlAgilityPack.HtmlNode yahooCommentNodes)
◆ ExtractYahooConversationsHtml()
static HtmlAgilityPack.HtmlNode ConScrap.Parse.ExtractYahooConversationsHtml (string yahooHtml)
inline static
Extracts the HTML node containing all the comments.
- Parameters
- yahooHtml | Yahoo HTML as a string
Replaces TeX-sensitive characters (see AdjustStrForTex).
- Todo:
- Mark URLs in TeX using \url{} after removing characters
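The source for AdjustStrForTex is not shown here, so the following is only a minimal sketch of what "replacing TeX-sensitive characters" could look like. The class name TexEscapeSketch, the method name EscapeForTex, and the exact set of replacements are assumptions made for illustration, not the ConScrap implementation. It does not attempt the \url{} marking mentioned in the Todo.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical helper, not part of ConScrap.Parse.
public static class TexEscapeSketch
{
    // Assumed replacement table: the ten characters LaTeX treats specially
    // in ordinary text. The real AdjustStrForTex may use a different set.
    private static readonly Dictionary<char, string> Replacements =
        new Dictionary<char, string>
        {
            { '\\', @"\textbackslash{}" },
            { '&',  @"\&" },
            { '%',  @"\%" },
            { '$',  @"\$" },
            { '#',  @"\#" },
            { '_',  @"\_" },
            { '{',  @"\{" },
            { '}',  @"\}" },
            { '~',  @"\textasciitilde{}" },
            { '^',  @"\textasciicircum{}" },
        };

    // Escapes each special character in a single pass over the input,
    // so braces introduced by a replacement are never escaped again.
    public static string EscapeForTex(string s)
    {
        if (string.IsNullOrEmpty(s))
        {
            return string.Empty;
        }

        var sb = new StringBuilder(s.Length);
        foreach (char c in s)
        {
            if (Replacements.TryGetValue(c, out string replacement))
            {
                sb.Append(replacement);
            }
            else
            {
                sb.Append(c);
            }
        }
        return sb.ToString();
    }
}
```

Under these assumptions, TexEscapeSketch.EscapeForTex("50% of $5 & a_b") returns 50\% of \$5 \& a\_b. Walking the string once, rather than chaining string.Replace calls, avoids re-escaping the backslashes and braces that earlier replacements introduce.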
◆ GetYahooComment()
static YahooComment ConScrap.Parse.GetYahooComment (HtmlAgilityPack.HtmlNode commentNode, bool parseForTex = false)
inline static
- Todo:
- Ignore Yahoo comment replies for now
The documentation for this class was generated from the following file: