Re: unexpected behavior with certain words "i"
- From: "Daniel Crichton" <msnews@xxxxxxxxxxxxxxxx>
- Date: Mon, 9 Jul 2007 13:48:49 +0100
Thorsten wrote on Mon, 9 Jul 2007 02:38:02 -0700:
Hello,
I have started to test fulltext indexing. I have run into a problem that I
don't know how to solve.
I have this data in one fulltext-column "Landsorganisationen i Sverige".
When I perform this search I get 1 hit:
select top 100 * from CA_CATALOG_New_SEARCH where
CONTAINS(FWD_ALL,'"landsorganisationen"') AND
CONTAINS(FWD_ALL,'"sverige"')
But, when I do this search I get 0 hits (should be 1 hit):
select top 100 * from CA_CATALOG_New_SEARCH where
CONTAINS(FWD_ALL,'"landsorganisationen"') AND
CONTAINS(FWD_ALL,'"sverige"') AND CONTAINS(FWD_ALL,'"i"')
There is something with the word "i" that confuses the search engine. I
suspect that it is considered a special word and not indexed. But since I
want a hit, it seems like I need to know all words that won't work and
exclude them from searches. But it sounds like unnessecery work. It must
be some better way to work around this.
And if I need to know which words, were is the list?
Sincerely,
Thorsten
There is a list of words held in the noise.<languagecode> files (eg.
noise.enu for Neutral English) which are not indexed. You can remove all of
these (but leave a single line with a space on it), and then repopulate your
index. Just remember to edit the correct file related to your database
locale.
The other alternative is to parse the noise word list and create a routine
to strip out these ignored words when searching - the benefit of this is
that you keep the keep the index catalog small and strip out common words
from the searches. However, it might be more work than the simple clearing
of the noise file.
Personally, I cleared all my noise word files - the performance hit was
negligible for the tests I performed on my FT searching before and after.
Dan
.
- Prev by Date: Re: Update to SP4 causes starnge Server behavior
- Next by Date: Re: Too many full-text columns or the full-text query is too compl
- Previous by thread: Re: Update to SP4 causes starnge Server behavior
- Next by thread: Re: unexpected behavior with certain words "i"
- Index(es):
Relevant Pages
|