Re: Where can I find a list of word breakers?
- From: "Dave Poole" <dp00le@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
- Date: Tue, 3 Jan 2006 15:43:49 -0800
Single character words are treated as noise by default so something like
j@xxxxxxx would first be broken into "j foo com" (since @ and . are both
breaking characters), of these only foo and com would be indexed.
So only foo com was indexed but when you query for j@xxxxxxx you should
still find a match since query and index time noiseword behaviour should be
the same.
Can you reproduce it? If so I can take a look and try to explain it.
--
Dave Poole
SQL Server Fulltext Team
This posting is provided "AS IS" with no warranties, and confers no rights.
Use of included script samples are subject to the terms specified at
http://www.microsoft.com/info/cpyright.htm.
"Jim Sneeringer" <JimSneeringer@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:6C685B68-03E0-46C6-B244-99533E2F9DC8@xxxxxxxxxxxxxxxx
> After further testing, I think @ is a word breaker. Apparently something
> else
> in my first test caused me to think it wasn't.
>
> Maybe it has to do with noise words. The search that filed was for an
> address with only one letter (the letter "j") in front of the @. When I
> put
> some other e-mail addresses in and searched for them, it found them just
> fine.
>
> Could it be that noise words must be removed from the query, or the result
> is not found. In other words, did SQL possibly ignore teh "j" in the
> database, but not ignore the "j" in the query?
>
> Or maybe there is some other anomoly that caused that e-mail address not
> to
> be found.
.
- Prev by Date: Re: mssearch.exe CPU utilization increased with SP4
- Next by Date: Re: Zero Ranking
- Previous by thread: mssearch.exe CPU utilization increased with SP4
- Next by thread: Re: Where can I find a list of word breakers?
- Index(es):
Relevant Pages
|
Loading