Re: Get Webpage Source Code

Tech-Archive recommends: Repair Windows Errors & Optimize Windows Performance



"Mike" <mbaith@xxxxxxxxx> schrieb:
I am trying to find a way to better analyze the meta tags on my website. I would like to get the source code and then parse the code for the meta tags, links, etc.

Can someone give me an example of how to read the source code of a webpage from VB.net?

<URL:http://dotnet.mvps.org/dotnet/code/net/#InternetLoadFile>

Parsing an HTML file:

MSHTML Reference
<URL:http://msdn.microsoft.com/library/default.asp?url=/workshop/browser/mshtml/reference/reference.asp>

- or -

..NET Html Agility Pack: How to use malformed HTML just like it was
well-formed XML...
<URL:http://blogs.msdn.com/smourier/archive/2003/06/04/8265.aspx>

Download:

<URL:http://www.codefluent.com/smourier/download/htmlagilitypack.zip>

- or -

SgmlReader 1.4
<URL:http://www.gotdotnet.com/Community/UserSamples/Details.aspx?SampleGuid=B90FDDCE-E60D-43F8-A5C4-C3BD760564BC>

If the file read is in XHTML format, you can use the classes contained in
the 'System.Xml' namespace for reading information from the file.

--
M S Herfried K. Wagner
M V P <URL:http://dotnet.mvps.org/>
V B <URL:http://dotnet.mvps.org/dotnet/faqs/>

.



Relevant Pages

  • Re: Writing an XML document via Javascript.
    ...  Saving it as .xml instead of .html might help, ... as HTML by the browser, even though it was not showing it in the ... least post the relevant lines of your source code. ... Sorry if I didn't give any deeper information; ...
    (comp.lang.javascript)
  • Re: problems with creating both HTML and plaintext emails
    ... I also looked around at more source code of my successfully received ... the plain text doc and not have the size, ... $subject="My first HTML E-mail"; ... echo "Message Sent!"; ...
    (comp.lang.php)
  • Re: Using sendkeys in a web form
    ... One of the ways of getting at fields without a name or ID is to navigate ... is MUCH harder than finding it in the raw HTML ... statement that "You will need to look at the source code of the page" is ... Just take the Post URL, add a question mark and then the ...
    (microsoft.public.scripting.vbscript)
  • RE: www.readnotify.com
    ... Also from viewing the source code for the HTML I've found calls to the ... This one was the non-silent tracking method: ... Computer Emergency Response Teams, and Digital Investigations. ...
    (Security-Basics)
  • Re: Saving txt files with IE 6
    ... Do you have wordwrap turned on in your text viewing program (notepad?, ... source code to make it harder for novices to read and to protect their ... it seems to delete all the HTML ... > same appearance in text format in Notepad as it had in HTML format in Word ...
    (microsoft.public.windows.inetexplorer.ie6.browser)