Re: newbie: Full Text Search Against PDF Blobs
- From: "Hilary Cotter" <hilary.cotter@xxxxxxxxx>
- Date: Tue, 14 Aug 2007 06:56:44 -0400
No, but I know a site that use sql fts to index over 2 terrabytes.
--
relevantNoise - dedicated to mining blogs for business intelligence.
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Des" <support@xxxxxxxxxxxxxx> wrote in message
news:e0GSPlj3HHA.4712@xxxxxxxxxxxxxxxxxxxxxxx
Thanks Hillary,
That was very useful indeed.
I would much prefer to stay inside the environments I am familiar with, if
possible.
Do you know of a site that uses SQL FTS to Search PDF's Stored as Blobs?
It would be nice to know that any "known" shortcomings, before I attempt
to implement it, otherwise I will be thinking it was something I did.
Thanks,
Des
Hilary Cotter wrote:
With Lucene you really have to roll your own solution, all it is, is a
full-text search engine. You have to write code to query it and to feed
documents to it to index these documents. Lucene is designed for the 5-10
million document range, but can be scaled much higher. It is optimized to
return results in batches to 10, 20, 25 or 100 results. If you return all
results its performance is much worse than SQL FTS.
Lucene allows you to so true property based searches.
SQL FTS is highly scalable but you really have to think about
partitioning after you hit 50 million rows.
You really have to test to see what works best in your environment.
.
- References:
- newbie: Full Text Search Against PDF Blobs
- From: Des
- Re: newbie: Full Text Search Against PDF Blobs
- From: Hilary Cotter
- Re: newbie: Full Text Search Against PDF Blobs
- From: Des
- newbie: Full Text Search Against PDF Blobs
- Prev by Date: Re: newbie: Full Text Search Against PDF Blobs
- Next by Date: Re: incremental change tracking
- Previous by thread: Re: newbie: Full Text Search Against PDF Blobs
- Next by thread: Re: newbie: Full Text Search Against PDF Blobs
- Index(es):
Relevant Pages
|
Loading