How to add new extensions "correctly"



I have come across a problem. I cannot find the answer here, though
I'm surprised nobody else has asked.

The question is: how should one get SQL FTS to index the content of
files whose suffix/extension is not among those supplied by default
BUT whose content is perfectly searchable by existing filters? I do
*not* mean content which requires a new IFilter.

I have an IMAGE column with content taken from files, and I have
populated the "extension mapping" column with the original file's
extension. All is working fine. However....

Users could add *any* type of file. Take an example: if they add
"fred.txt" it will be indexed (with text filter), but if the file
happens to be, say, "fred.cs", it will not be indexed.

So far as I can see via the Citeknet IFilter Explorer, for SQL
2000/2005 purposes (but not for other MSSearch apps) essentially only
extensions with subkeys under the registry key
HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\ContentIndexCommon\Filters\Extension
(about 30 entries) get indexed. This seems to be a list installed by
Microsoft.

Now, ".cs" files themselves are registered as "text/plain" and have
the right "PersistentHandler", but because they do not appear in that
list they are not indexed by SQL FTS. The same goes for many other
file types which are plain text.

That gives me two possibilities:

1. Tell my users to (ask their admins to) add extensions to this list
(somehow, e.g. directly in registry).

2. Invent a "pseudo-extension" column in my SQL table, which does not
hold the true file extension but rather the extension that tells SQL
FTS which filter to use for indexing, e.g. ".txt" for a ".cs" file.
Then allow a "mappings" table (or whatever) to be set up, and use that
to decide, for each file added to the IMAGE column, what to put in the
"Extension" column.
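
To make option 2 concrete, here is a rough sketch of the lookup I have
in mind, written in Python purely for illustration. The names
INDEX_AS and index_extension, and the sample mappings other than
".cs", are made up for this example; in practice the mappings would
live in a table that users or admins can edit.

```python
from pathlib import PurePath

# Hypothetical "mappings" table: true file extension -> the extension to
# store in the "Extension" column so that SQL FTS uses an existing filter.
# Only the ".cs" entry comes from the example above; the rest are assumptions.
INDEX_AS = {
    ".cs": ".txt",
    ".sql": ".txt",
    ".log": ".txt",
}


def index_extension(filename, mappings=INDEX_AS):
    """Return the pseudo-extension to store alongside the file's content.

    Extensions without a mapping are passed through unchanged, so types
    that are already indexed (e.g. ".txt", ".doc") keep their own filter.
    """
    true_ext = PurePath(filename).suffix.lower()
    return mappings.get(true_ext, true_ext)
```

With this, "fred.cs" would be stored with ".txt" in the "Extension"
column, while "fred.txt" is left alone. The true extension would still
need to be kept somewhere else if the application wants to show it.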

Which is the "correct"/"recommended" way to do this? Have others
chosen either path?
