Re: DOM text extraction
- From: "stephane" <stephane.gamard@xxxxxxxxx>
- Date: 8 Mar 2006 00:52:25 -0800
I got to this using createRange and then pulling the metrics out of the
split strings. While this gives me a good "planar" textual surface, I
am more interested in representing all my text within a tree.
I tried to recurse through IHTMLElement.children, but I face the following
problem: when do I actually ask for text? Given the following example:
<p>
some text<a href="...">toto</a>
</p>
If I asked for text at each level I would get something like this:
p (some text toto)
|__ a (toto)
I would thus have redundant information.
How can I avoid this?
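
A minimal sketch of one possible way around the redundancy: store at each
element only the text held in its direct text-node children, and recurse
into child elements for everything else. This is not MSHTML code. It uses
Python's standard html.parser to stand in for the browser DOM, and the names
Node, TreeBuilder, build_tree and dump are hypothetical helpers made up for
this illustration. Mapping the idea back to MSHTML would presumably mean
walking child nodes (text nodes included) rather than IHTMLElement.children,
which lists elements only; that mapping is an assumption, not something
tested here.

```python
from html.parser import HTMLParser


class Node:
    """One element plus only the text that sits directly inside it."""

    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.own_text = []   # text from direct text-node children only
        self.children = []   # child elements, in document order


class TreeBuilder(HTMLParser):
    """Builds a Node tree; assumes well-formed, properly nested markup."""

    def __init__(self):
        super().__init__()
        self.root = Node("#root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        self.current = node

    def handle_endtag(self, tag):
        # Assumption: every end tag closes the current element.
        # Unclosed void elements such as <br> would need extra handling.
        if self.current.parent is not None:
            self.current = self.current.parent

    def handle_data(self, data):
        # Text is attached to the innermost open element only,
        # so it is never repeated in an ancestor.
        text = data.strip()
        if text:
            self.current.own_text.append(text)


def build_tree(html):
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def dump(node, depth=0):
    """Return one indented line per element: tag (own text)."""
    lines = ["%s%s (%s)" % ("  " * depth, node.tag, " ".join(node.own_text))]
    for child in node.children:
        lines.extend(dump(child, depth + 1))
    return lines


if __name__ == "__main__":
    tree = build_tree('<p>\nsome text<a href="...">toto</a>\n</p>')
    print("\n".join(dump(tree.children[0])))
```

For the example above this prints "p (some text)" followed by an indented
"a (toto)", so each piece of text appears exactly once. The full text of any
subtree can still be recovered by concatenating own_text over that subtree.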
_Stephane