Re: Benchmark for Full Text Search

Tech-Archive recommends: Repair Windows Errors & Optimize Windows Performance



We ended up breaking the database into smaller databases of about 50 million rows. Eventually the databases became monthly (again holding about 50 million rows).

Searches were directed against each database - each db had a single catalog and a single sql fts index.

Our queries did not have a real time component to them so we were more tolerant of the search latencies. The long and short of it is we eventually moved to Lucene:)

Contact me offline if you want more info.

Hilary

"sql411@xxxxxxxxxx" <sql411nospamcom@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message news:576B160F-BCC3-4431-B414-9BF6800FAEAD@xxxxxxxxxxxxxxxx
Hillary, I am interested in how you optimized the fts for a multi-terabyte
catalog. I am in the same situation, a 7TB table with 350+ million rows.
The FT index for this table is over 1.5 TB. Some searches with very unique
words (hence, small # of rows coming back) finish fast – but very common
words can take as long as 40 minutes before we decide to time things out.
What kind of optimizations have you done outside the norm of creating a
catalog/fulltext index of the table? With billions of rows – your key must
be a bigint – did that affect performance on building/searching? Did you add
a unique value to help further filter the data (equivalent to a batch ID or
something that minimizes hits)? Is the underlying data partitioned with SQL
2005, or old school partitioning with an updateable view?

I am sure SQL 2008 will help quite a bit being able to utilize the regular
table filters to the full-text query, but we still have ~3 months before it
releases.

I’ve considered breaking out the table to multiple ones across multiple
servers, and creating multiple full-text catalogs to go against. Anyone who
has done this before on a large scale? I am considering breaking the large
table up to possibly several thousand tables – say 5,000. If I did this and
created a full-text index on each table – what is the optimal relationship to
catalog/full-text index? For example, how many full-text indexes per
full-text catalog are OK before you start having performance issues (my guess
is it probably has more to do with record size/full-text index size than
actual full-text index counts). Has anyone created several thousand
full-text catalogs or full-text indexes on one server in production and it
went well?

Anyone who has helped SQL FT search scale well on many terabytes – I’d
greatly appreciate your hard earned lessons.
Many thanks,
Robert Towne


"Hilary Cotter" wrote:

There are no published benchmarks as they would vary on your content (text,
image, char), what is being indexed (a lot of the same words will perform
differently than a lot of unique words), your machines and table design.

However I was working on a system where we sql fts multi-terabytes and
performance was subsecond for most queryies (over a billion rows IIRC),
however queries which were complex or for which there were a large number of
rows could be very lengthy.

"Ankur Rawat" <AnkurRawat@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:33271B68-1CE5-4DBE-8797-427008625EA8@xxxxxxxxxxxxxxxx
> Hi,
> We are currently using a 3rd party Search Engine to search our > application
> content in SQL server.
>
> Could someone please point me to Benchmark data for the FTS service? I > am
> interested in switching to FTS even if the Performance numbers are
> comparable
> or better.
>
> thanks for any help!
>
> regards,
> Ankur.
>




.



Relevant Pages

  • Re: Locks & FT Catalogs
    ... Director of Text Mining and Database Strategy ... Looking for a SQL Server replication book? ... in the 6 hours after that either) leaving that catalog completely unusable ... synchronise their changes from staging to live (or vice versa to roll ...
    (microsoft.public.sqlserver.fulltext)
  • Full-Text options disabled in management console
    ... If I open a SQL 2000 database using the management console the Full-Text ... of SQL 2000 and on server editions installed on servers. ... right click on the database "New Full-Text Catalog is grayed out/disabled. ...
    (microsoft.public.sqlserver.fulltext)
  • Re: Unique key count = 1
    ... Right click on your database, select full-text index and make sure you have ... If it is can you post your gatherer log here. ... Looking for a SQL Server replication book? ... I have created a new fulltext catalog and a done a full population. ...
    (microsoft.public.sqlserver.fulltext)
  • Re: Not the usual Error 7635: The Microsoft Search Service Cannot . (l
    ... Catalog and possibly other system tables are not set correctly. ... it is SQL Server version specific. ... > Search Service cannot be administered under the present user account". ... > as its Default database, and is a member of sysadmin. ...
    (microsoft.public.sqlserver.fulltext)
  • Fixed
    ... I also found this article that gives the proper way to move system dbs in sql 2008: ... The model exists where the master states it exists. ... This is the error log prior to detaching the model database. ...
    (microsoft.public.sqlserver.server)