Re: Html to Text Convertor?



On Feb 18, 6:59 pm, Joseph M. Newcomer <newco...@xxxxxxxxxxxx> wrote:
If you; don't care about rendering, it shouldn't take more than an afternoon to write a
piece of code that removes all tags from an HTML file. The result might be confusing
because inserts, tables, frames, etc. will all be mushed together in some random order,
but it would give you the text.
joe

On 17 Feb 2007 15:24:18 -0800, halmura...@xxxxxxxxx wrote:



On Feb 15, 3:32 pm, David Wilkinson <no-re...@xxxxxxxxxxxx> wrote:
halmura...@xxxxxxxxx wrote:
i mean by text is the text which we can see on web browser, not with
html tags

halmuratov:

I think you mean not an "Html to Text Converter" but rather a means of
displaying HTML. Take a look at the Web Browser Control. It uses the
Internet Explorer engine to display the page.

David Wilkinson

hi,
i know thaat web browser control displays the html file, but i want
only text withour tags? is there any class that extracts texts without
tags ?

thanks

Joseph M. Newcomer [MVP]
email: newco...@xxxxxxxxxxxx
Web:http://www.flounder.com
MVP Tips:http://www.flounder.com/mvp_tips.htm

thanks, i am going to write the code

.



Relevant Pages

  • Re: Html to Text Convertor?
    ... piece of code that removes all tags from an HTML file. ... Take a look at the Web Browser Control. ... MVP Tips:http://www.flounder.com/mvp_tips.htm ...
    (microsoft.public.vc.mfc)
  • Re: Extracting bolds and italics from HTML
    ... I have to make some calculations on the contents of url before ... > I had found a very useful program of Word Count from sun java forum, ... > but its problem is that it also includes the HTML tags in calculation. ... > i) A program which counts words in HTML file but doesnt include HTML ...
    (comp.lang.java.programmer)
  • Re: Problem page IE clear float problem, Opera/FF header problem and N4
    ... > caps (which doesn't work so well with css). ... > be missing quotes or tags but those quotes and tags are already there, ... Yes but you also have to change your CSS file, an id is prefixed with a # ... to id="mainimage" in your HTML file but that they are still in your CSS ...
    (comp.infosystems.www.authoring.stylesheets)
  • Re: how to capture multiple lines?
    ... > that as I parse through an html file and find the first line of the ... If you want to grab ... anything between and (including other tags) you must extend ...
    (comp.lang.perl.misc)
  • Re: Html to Text Convertor?
    ... piece of code that removes all tags from an HTML file. ... Take a look at the Web Browser Control. ...
    (microsoft.public.vc.mfc)