Incremental indexing efficiency



I am indexing my dotText blog content by construct pages that contain
the URLs to all the individual posts and articles on my dotText blog.
I set my source of the index to the directory in my SBS2003 intranet
site, a straight IIS/FrontPage site, and set the index to read all the
linked-to pages in this site. So it picks up all the pages in that
directory. I also have it set to include the linked content but not
the source pages. The Content Source crawl configuration is set to
custom with 1 hop.

When I publish more content, I go to the blog, look in the categories
page of the categories to which I have added material, and copy the
permalinks of the new entries. I then edit this a bit, and add them to
the pages that contain the URLs. I want to minimize load on the blog
server.

The questions are 1) should I put the new URLs on new pages (to leave
all the old items on clean, undirtied pages) and 2) which kind of index
should I start: Incremental update, Incremental (inclusive) update, or
Adaptive update.

As an aside, I have already throttled the spider by forcing it to
request one file at a time.

Hollis D. Paul [MVP - Outlook]
Hollis@xxxxxxxxxxxxxxxxxxxxxx
Mukilteo, WA USA


.


Loading