Re: newbie: Full Text Search Against PDF Blobs



No, but I know of a site that uses SQL FTS to index over 2 terabytes.

--
relevantNoise - dedicated to mining blogs for business intelligence.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Des" <support@xxxxxxxxxxxxxx> wrote in message
news:e0GSPlj3HHA.4712@xxxxxxxxxxxxxxxxxxxxxxx
Thanks Hilary,

That was very useful indeed.
I would much prefer to stay inside the environments I am familiar with, if
possible.

Do you know of a site that uses SQL FTS to search PDFs stored as blobs?
It would be nice to know of any "known" shortcomings before I attempt
to implement it; otherwise I will think it was something I did.

Thanks,
Des

Hilary Cotter wrote:
With Lucene you really have to roll your own solution; it is only a
full-text search engine. You have to write the code to query it and to
feed it the documents to index. Lucene is designed for the 5-10
million document range, but can be scaled much higher. It is optimized to
return results in batches of 10, 20, 25 or 100. If you return all
results, its performance is much worse than SQL FTS.
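
As a rough illustration of why returning one batch is cheaper than returning
every hit, here is a toy Python sketch. It is not Lucene's API or SQL FTS; the
function names and the sample hits are made up for the example.

```python
import heapq

def top_batch(scored_docs, batch_size=10):
    """Return only the best `batch_size` (score, doc_id) pairs, highest first."""
    # heapq.nlargest tracks at most batch_size candidates while scanning,
    # so the work grows with the batch size rather than the total hit count.
    return heapq.nlargest(batch_size, scored_docs)

def all_results(scored_docs):
    """Return every hit, fully sorted, highest score first."""
    return sorted(scored_docs, reverse=True)

# Hypothetical hits: (relevance score, document id)
hits = [(0.2, "a.pdf"), (0.9, "b.pdf"), (0.5, "c.pdf"), (0.7, "d.pdf")]
first_page = top_batch(hits, batch_size=2)
```

Both functions agree on the first page; the difference is that the batched
version never has to order the hits the user will not look at.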

Lucene allows you to do true property-based searches.

SQL FTS is highly scalable but you really have to think about
partitioning after you hit 50 million rows.

You really have to test to see what works best in your environment.


