Re: Parsing HTML pages
- From: "Nick Hounsome" <nh002@xxxxxxxxxxxxxxxxxx>
- Date: Sat, 11 Mar 2006 10:33:47 GMT
"MisterKen" <MisterKen@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:538B602D-D2A2-417D-B777-278C67C6BCDA@xxxxxxxxxxxxxxxx
If I have the html from webpage loaded into a string. How would I use
regex
to return sections from within that html string?
I want to be able to get the "text" back between two different tags.
Basically I want to scrape some web pages and populate a database.
Does anybody have a snippet of code that could me out get the "text"?
Is it XHTML? If so you can just read it as an XmlDocument.
.
- Follow-Ups:
- Re: Parsing HTML pages
- From: Alvin Bruney - ASP.NET MVP
- Re: Parsing HTML pages
- Prev by Date: Re: ComboBox like Address bar in IE or in Run Dialog box
- Next by Date: Re: Sockets, to be continued...
- Previous by thread: ComboBox like Address bar in IE or in Run Dialog box
- Next by thread: Re: Parsing HTML pages
- Index(es):
Relevant Pages
|