Re: Different wildcard/hyphen behaviour in Windows 2003 Server?
- From: "David Lee" <dlee@xxxxxxxxxxxxxxxxxxxxxx>
- Date: Fri, 6 May 2005 09:32:53 -0700
Yes, the English wordbreaker in Windows 2003 Server is different from the
one in Windows 2000. In most cases, query results are better now, but this
particular case looks somewhat broken. Win2k actually thinks 1-2-3 is a
date, but it works to its advantage!
Here is the output from the two wordbreakers. (the clsid is from
hklm\system\currentcontrolset\control\contentindex\language\english_us\wbreakerclass)
w2k3:
R:\>lrtest /b /c:{80A3E9B0-A246-11D3-BB8C-0090272FA362} "1-2-3"
dll loaded: U:\WINDOWS\system32\LangWrbk.dll
dll version 5.2.3790.0
file create time: 11-03-2003 11:35a
file last write time: 3-25-2003 5:00a
FileVersion: '5.2.3790.0 (srv03_rtm.030324-2048)'
FileDescription: 'English wordbreaker'
CompanyName: 'Microsoft Corporation'
ProductName: 'Microsoft« Windows« Operating System'
Wordbreaker requires license: No
Wordbreaker license: 'Copyright Microsoft Inc.'
Original text: '1-2-3'
IWordSink::PutAltWord: cwcSrcLen 1, cwcSrcPos 0, cwc 1, '1'
IWordSink::PutWord: cwcSrcLen 1, cwcSrcPos 0, cwc 3, 'NN1'
IWordSink::PutAltWord: cwcSrcLen 1, cwcSrcPos 2, cwc 1, '2'
IWordSink::PutWord: cwcSrcLen 1, cwcSrcPos 2, cwc 3, 'NN2'
IWordSink::PutAltWord: cwcSrcLen 1, cwcSrcPos 4, cwc 1, '3'
IWordSink::PutWord: cwcSrcLen 1, cwcSrcPos 4, cwc 3, 'NN3'
w2k:
C:\>lrtest /b /c:{59e09780-8099-101b-8df3-00000b65c3b5} "1-2-3"
dll loaded: C:\WINNT\system32\infosoft.dll
dll version 5.0.2195.6601
file create time: 7-09-2003 1:40p
file last write time: 6-19-2003 12:05p
FileVersion: '5.00.2195.6601'
FileDescription: 'Wordbreaker and stemmer dll'
CompanyName: 'Microsoft Corporation'
ProductName: 'Microsoft(R) Windows (R) 2000 Operating System'
Wordbreaker requires license: Yes
Wordbreaker license: 'Copyright Infosoft International Inc.'
Original text: '1-2-3'
IWordSink::PutAltWord: cwcSrcLen 5, cwcSrcPos 0, cwc 5, '1-2-3'
IWordSink::PutWord: cwcSrcLen 5, cwcSrcPos 0, cwc 10, 'DD19030102'
"Jeremy Scrivener" <jscriv@xxxxxxxxxxxx> wrote in message
news:1115328842.075310.281390@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
> Hi All,
>
> The company I work for has a product based on Indexing Service.
>
> Since several of our clients have migrated to Windows 2003 Server, they
> are experiencing different results with queries based on wildcards and
> hyphens to that experienced on Windows 2000. The query itself resembles
>
> "@propertyname 1-2-3*".
>
> [NB - This is not a dialect issue as far as I can tell as the query is
> an "@propertyname" query]
>
> I have tried removing all the numbers from the noise.enu file,
> rescanned, but this did not help - in fact, this stopped results being
> returned on an exact match query! (& doubled the size of the catalog
> ;-)
>
> Can anyone confirm that behaviour has changed with 2003 Server, and if
> so provide any details? We're considering opening a PSS with MS on
> this, but you would hope that such details would be documented
> somewhere!
>
> Thanks in advance,
>
> Jeremy.
>
.
- Follow-Ups:
- Re: Different wildcard/hyphen behaviour in Windows 2003 Server?
- From: Jeremy Scrivener
- Re: Different wildcard/hyphen behaviour in Windows 2003 Server?
- References:
- Different wildcard/hyphen behaviour in Windows 2003 Server?
- From: Jeremy Scrivener
- Different wildcard/hyphen behaviour in Windows 2003 Server?
- Prev by Date: Re: Register custom IFilter to FilterFilesWithUnknown Extensions?
- Next by Date: Re: Different wildcard/hyphen behaviour in Windows 2003 Server?
- Previous by thread: Different wildcard/hyphen behaviour in Windows 2003 Server?
- Next by thread: Re: Different wildcard/hyphen behaviour in Windows 2003 Server?
- Index(es):
Relevant Pages
|