Re: Portuguese language support for full-text?
From: John Kane (jt-kane_at_comcast.net)
Date: 02/18/04
- Next message: John Kane: "Re: what is this error means"
- Previous message: John Kane: "Re: Imp Technology index engine."
- In reply to: Desconfiado: "Portuguese language support for full-text?"
- Messages sorted by: [ date ] [ thread ]
Date: Wed, 18 Feb 2004 12:15:18 -0800
Desconfiado,
While SQL Server (7.0 and 2000) supports the Portuguese language, Full-Text
Search (FTS) only supports a smaller sub-set of these languages and
unfortunately Portuguese is not one of these languages. See SQL Server 2000
BOL title "Column-Level Linguistic Analysis" for more details. To the best
of my knowledge there is no 3rd party package that would allow Microsoft's
full-text search engine to support Portuguese words and forms, currently
available. However, this may change or may not change in the future version
of SQL Server, codenamed Yukon or may or may not change in future service
packs...
What you can do today is drop and re-create your existing FT Catalog and use
the Neutral "Language for Word Breaker" and then run a Full Population and
re-test your FTS query . Note, with the Neutral word breaker, you lose the
ability to use the INFLECTIONAL predicate for your search terms. This is
because the Neutral word breaker "breaks" the words during the FT
Indexing/Population process using the "white space" between the words.
You might also want to consider using SharePoint Portal Server (SPPS), see
this TechNet article - SharePoint Products and Technologies in Multilingual
Scenarios at:
http://www.microsoft.com/technet/treeview/default.asp?url=/technet/prodtechnol/office/sps2003/maintain/SPMultiL.asp
Note - "Word Breakers - The following table provides information about word
breakers provided for the SharePoint Portal Server Search Service."
Language Word breaker
Japanese Native word breaker
Traditional Chinese Native word breaker
Simplified Chinese Native word breaker
Korean Native word breaker
Pan-Chinese Traditional Chinese word breaker
German Native word breaker
French Native word breaker
Italian Native word breaker
Spanish Native word breaker
Brazilian Portuguese word breaker
It also references a "Custom Dictionary", however, "The custom dictionary
feature is available for certain languages only." and I'm not sure if
Portuguese is one of the languages available.
There was also some discussion on the possibility of incorporating
non-supported languages that already had existing word breaker support in
either the Indexing Service &/or Sharepoint into SQL Server 2000 FTS, and
while this is a non-trivial effort, it might be possible....
Best Regards,
John
"Desconfiado" <anonymous@discussions.microsoft.com> wrote in message
news:08B932A3-122D-42EB-9C53-48645976BFA8@microsoft.com...
> Hi,
>
> Can anyone tell me if there is some package wich allows full-text search
engine to support portuguese words and forms?
> We portuguese have some special characters like 'ç' wich are can be used
in a word but when searching the net we usually don't use the 'ç' but the
'c' so full-text engine doesn't recognize the word.
> If we search with the 'ç' and the word were are looking for is with 'c' it
doesn't show up.
> And of course the form like in english 'do' and 'doing' we also have those
forms.
> Or if anyone knows how to build a dictionary?! don't what they call to the
file wich contains the language support for full-text search engine..
>
> Help apreciated.
- Next message: John Kane: "Re: what is this error means"
- Previous message: John Kane: "Re: Imp Technology index engine."
- In reply to: Desconfiado: "Portuguese language support for full-text?"
- Messages sorted by: [ date ] [ thread ]
Relevant Pages
|