Re: HTML parsing
- From: David Connet <stuff@xxxxxxxxxxxxxxxxxxxxx>
- Date: Tue, 18 Mar 2008 14:24:11 GMT
Carmen Sei <fatwallet951@xxxxxxxxx> wrote in
news:gjvot31p124cmnnr6fae2lb1ujgikhp9og@xxxxxxx:
looks like HTMLTidy is a good solution
http://sourceforge.net/projects/tidy
You could, or you could use a 3rd party library to do that. I
recommend http://www.pcre.org/ , which does Perl Compatible Regular
Expressions, and is used by a lot of well-known open source
projects. If you use the C++ wrappers which are freely downloadable
from that site, doing matching and/or replacing is pretty easy.
Nathan Mates
I use this in my project - it works quite nice.
After using tidy to parse into XML, I then use xerces to parse the xml. (I
have these compiled statically into my executable)
Dave Connet
.
- References:
- HTML parsing
- From: worlman385
- Re: HTML parsing
- From: Nathan Mates
- Re: HTML parsing
- From: Carmen Sei
- HTML parsing
- Prev by Date: Re: IsWindow assertion error communicating between windows
- Next by Date: Re: call base class function or derived class function
- Previous by thread: Re: HTML parsing
- Next by thread: Re: HTML parsing
- Index(es):
Relevant Pages
|