Incremental indexing efficiency
- From: "Hollis D. Paul" <Hollis@xxxxxxxxxxxxxxxxxxxxxx>
- Date: Sun, 30 Oct 2005 10:07:25 PST
I am indexing my dotText blog content by construct pages that contain
the URLs to all the individual posts and articles on my dotText blog.
I set my source of the index to the directory in my SBS2003 intranet
site, a straight IIS/FrontPage site, and set the index to read all the
linked-to pages in this site. So it picks up all the pages in that
directory. I also have it set to include the linked content but not
the source pages. The Content Source crawl configuration is set to
custom with 1 hop.
When I publish more content, I go to the blog, look in the categories
page of the categories to which I have added material, and copy the
permalinks of the new entries. I then edit this a bit, and add them to
the pages that contain the URLs. I want to minimize load on the blog
server.
The questions are 1) should I put the new URLs on new pages (to leave
all the old items on clean, undirtied pages) and 2) which kind of index
should I start: Incremental update, Incremental (inclusive) update, or
Adaptive update.
As an aside, I have already throttled the spider by forcing it to
request one file at a time.
Hollis D. Paul [MVP - Outlook]
Hollis@xxxxxxxxxxxxxxxxxxxxxx
Mukilteo, WA USA
.
- Prev by Date: IFilter for a dotText blog
- Next by Date: Search Cache
- Previous by thread: IFilter for a dotText blog
- Next by thread: Search Cache
- Index(es):
Loading