Re: Generated Site Map - not catching internal broken links to PHP

Tech-Archive recommends: Repair Windows Errors & Optimize Windows Performance



On Wed, 28 Mar 2007 14:12:01 -0700, cmcmorrow
<cmcmorrow@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote:



"Paul Herber" wrote:

On Wed, 28 Mar 2007 13:20:01 -0700, cmcmorrow
<cmcmorrow@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote:



"Paul Herber" wrote:

On Wed, 28 Mar 2007 12:16:00 -0700, cmcmorrow
<cmcmorrow@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote:

Hello,

I am using Visio's site mapping tool to generate a site map automatically of
my company's web site. For the most part, it is working extremely well, and
is very useful. However, I'm having one major problem, and that is that the
reports are not picking up some internal links that are broken. The site is
built in PHP, and the broken links are to other php pages within the site. I
do have PHP as one of the extensions to include in my drawing settings.

Visio has no problem catching broken paths to images, pdfs and other
documents on the site, as well as external links that are broken.

At first I thought it was because our .htaccess file on the server had a
redirect set up for 404 errors, to go to the home page. We removed that, and
re-ran the site maps, but Visio is still not catching the broken link. (could
there be a caching issue going on here, that's not letting us see the effects
of the change?)

To illustrate the issue . . . I ran the report for this directory:
http://greencampus.harvard.edu/rep/index.php.

I know that there is a broken link on this page:
http://greencampus.harvard.edu/rep/faq.php
If you do a find on the word "recognition", you'll find it's hyperlinked to
a nonexistent script: http://greencampus.harvard.edu/rep/limelight.php

However, Visio shows is as a normal functioning page on the site map.

Any ideas on what could be going on here?

Your server is returning a text/html document containing the text "404
Page Not Found on http://greencampus.harvard.edu/"; rather than an
error 404 status.



--
Regards, Paul Herber, Sandrila Ltd.
Visio Utilities http://www.visio-utilities.sandrila.co.uk/


Hi, thanks for your response. I wondered if something like this might be the
case, but I don't think so . . . The server is returning a 404 response. I
checked it out by displaying the HTTP request/response info using an
extension on my browser. Also the W3C link checker and other link checkers
I've tried return 404 status on this link.

But not Visio . . . Any thoughts?

Right, I've checked in 4 different browsers, Mozilla, Opera, Netscape
and IE and a proper status 404 is only returned for IE, the others get
a text/html response.
So, I'll take a guess that the server is doing a browser check:

if IE then return 404
else wibble and break other browsers

so, as Visio is not IE then it gets back a text/html file, hence it's
a valid link.


--
Regards, Paul Herber, Sandrila Ltd.
Electronics for Visio http://www.electronics.sandrila.co.uk/


That's odd - I was using the firebug extension in Firefox when I got the
404. So this is what we have in our htaccess file on the server . . .

ErrorDocument 404 "404 Page Not Found on http://greencampus.harvard.edu/

I would think that this is not overriding the 404 error response, but what
do you think?

well, it's now returning "404 Page Not Found" as a text/html file so
that is probably the default for the whole server rather than your
virtual server for your subdomain.
It's still a text response.
It all boils down to what response Visio's site mapping tool gets and
what it does with it.


--
Regards, Paul Herber, Sandrila Ltd.
Electrical for Visio http://www.electrical.sandrila.co.uk/
.



Relevant Pages

  • Re: Generated Site Map - not catching internal broken links to PHP
    ... but Visio is still not catching the broken link. ... Visio shows is as a normal functioning page on the site map. ... I'll take a guess that the server is doing a browser check: ...
    (microsoft.public.visio.general)
  • Re: Generated Site Map - not catching internal broken links to PHP
    ... but Visio is still not catching the broken link. ... Visio shows is as a normal functioning page on the site map. ... I'll take a guess that the server is doing a browser check: ...
    (microsoft.public.visio.general)
  • Re: [u]filename.htm
    ... Or if working locally are you running a Disc Based Web or a Server Based web ... Reopen the site in FP & run a Tools Recalculate Hyperlinks ... To find the best Newsgroup for FrontPage support see: ... |>> Frontpage is telling me I have a broken link on one of my pages referring ...
    (microsoft.public.frontpage.client)
  • Re: Generated Site Map - not catching internal broken links to PHP pag
    ... built in PHP, and the broken links are to other php pages within the site. ... but Visio is still not catching the broken link. ... Visio shows is as a normal functioning page on the site map. ...
    (microsoft.public.visio.general)
  • Re: Is the paradoxcommunity news-server alive these days?
    ... > The requested URL /newsreaders.htm was not found on this server. ... Oops. ... Broken link. ... Larry DiGiovanni ...
    (comp.databases.paradox)