Re: Convert HTML to XML or Paser HTML
From: David Elliott (DavidElliott_at_BellSouth.net.nospam)
Date: 02/11/04
- Next message: Jerry: "Re: Website Login"
- Previous message: Dilip Krishnan: "Re: mspx?"
- Next in thread: Maxim Kazitov: "Re: Convert HTML to XML or Paser HTML"
- Reply: Maxim Kazitov: "Re: Convert HTML to XML or Paser HTML"
- Reply: George Ter-Saakov: "Re: Convert HTML to XML or Paser HTML"
Date: Wed, 11 Feb 2004 11:02:11 -0500
I have tried the SgmlReader but am having difficulty with some sites, such as www.msn.com.
If I could find a way to parse HTML using C/C++/C#, I would be happy. All I really
need is an array of <tag> and <data> entries. Finer granularity is not necessary, just
the raw information. I do need the entire page, though, from the opening <html> to the closing </html>.
I would prefer an HTML-to-XML conversion, but as time is limited, any solution would be
appreciated.
Thanks,
Dave
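
For reference, the flat array of <tag> and <data> entries described above can be sketched in C++ roughly as follows. This is only a minimal illustration, not a real HTML parser: the names Token and tokenizeHtml are made up for this example, and it assumes the page is already in memory as a string. It does not handle comments, <script> or <style> bodies, or a '>' character inside an attribute value, all of which real-world pages may contain.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// One entry in the flat output: either a tag or the text between tags.
// (Hypothetical type, introduced only for this sketch.)
struct Token {
    bool isTag;        // true for <...>, false for character data
    std::string text;  // tag contents without the angle brackets, or raw text
};

// Split an HTML string into a flat list of tag and data tokens.
// Assumption: every '<' starts a tag and the next '>' ends it.
std::vector<Token> tokenizeHtml(const std::string& html) {
    std::vector<Token> tokens;
    std::size_t pos = 0;
    while (pos < html.size()) {
        if (html[pos] == '<') {
            std::size_t end = html.find('>', pos + 1);
            if (end == std::string::npos) {
                // Unterminated tag: keep the rest as data rather than dropping it.
                tokens.push_back({false, html.substr(pos)});
                break;
            }
            tokens.push_back({true, html.substr(pos + 1, end - pos - 1)});
            pos = end + 1;
        } else {
            // Character data runs up to the next '<' or the end of input.
            std::size_t next = html.find('<', pos);
            if (next == std::string::npos) next = html.size();
            tokens.push_back({false, html.substr(pos, next - pos)});
            pos = next;
        }
    }
    return tokens;
}
```

Calling tokenizeHtml on the full page text, from the opening <html> to the closing </html>, yields the tokens in document order. Whitespace between tags comes back as data tokens, so a caller may want to skip entries that are empty after trimming.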
On Fri, 09 Jan 2004 03:23:29 GMT, v-schang@online.microsoft.com (Steven Cheng[MSFT]) wrote:
>Hi Q.Z,
>
>
>Thank you for using Microsoft Newsgroup Service. Based on your description,
>you are looking for some COM or dotnet components which can convert the
>html document into XML (XHTML) style document. Is my understanding correct?
>
>If so, I think Ken Cox has provided some good sites on this topic; they
>show two COM components. You may want to try them to see whether they
>help.
>
>Steven Cheng
>Microsoft Online Support
>
>Get Secure! www.microsoft.com/security
>(This posting is provided "AS IS", with no warranties, and confers no
>rights.)