Re: Text searching in word documents




Russell, most of the PDFs like this that I have seen can still be
indexed. The real problem with PDFs is that sometimes the text is stored as
a scanned image, which cannot be indexed.

--
http://www.zetainteractive.com - Shift Happens!

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"Russell Fields" <russellfields@xxxxxxxxxx> wrote in message
news:%23JlNss1MIHA.4712@xxxxxxxxxxxxxxxxxxxxxxx
Raf,

You will need to install a PDF iFilter. Adobe has a 32-bit one and
it works fine for indexing PDFs. If you ever go to 64-bit, Foxit has a
64-bit version.

You should also know that some PDFs are 'locked' in some sense so that
they cannot be indexed. You also cannot copy the text out of them for
pasting into a Word document, etc.

RLF

"Raf" <Raf@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:C608A2AC-B523-4FA0-A2BD-A2EC1E3F709E@xxxxxxxxxxxxxxxx
Thanks Hilary,

It seems to work now.
One more question:
Does full-text indexing work for PDF files?
I'm working with SQL Server 2000 at the moment.

Regards,

Raf

"Hilary Cotter" wrote:

In Word, go to Tools, Options, Save and ensure that the option
"Allow fast saves" is unchecked.

You need to verify this on each Word installation that created your content.

"Raf" <Raf@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:F35A1624-1E85-437D-B42F-30F0AD6D676B@xxxxxxxxxxxxxxxx
Hello Hilary,

What exactly do you mean by saving documents in the fast save format?
Is this something you do in Word, or is it the way you save documents to
the database? I save the documents as BLOBs.
I'll insert some VB code showing how I save the documents to the
database.
Maybe this helps you to locate the problem.

Regards,

Raf

Dim arr() As Byte
' Read the whole file into a byte array
FileOpen(1, txtFileName.Text, OpenMode.Binary, OpenAccess.Read, OpenShare.Shared)
ReDim arr(FileLen(txtFileName.Text) - 1)
FileGet(1, arr)
FileClose(1)

' Add the document as a new row and push it to the database
Row = BlobSet1.Tables("Documents").NewRow
Row.Item("filetype") = ".doc" ' filetype is a varchar(50) column
Row.Item("content") = arr ' content is an image column
BlobSet1.Tables("Documents").Rows.Add(Row)
OdbcBlob.Update(BlobSet1)
MessageBox.Show("File added to database")

"Hilary Cotter" wrote:

Check to make sure that your file type column contains doc or .doc. Also
verify that the Word documents were not saved in the fast save format.
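
One way to check the first point is to list the values actually stored in the type column. This is only a sketch: the table name Documents is an assumption (the thread never names the table), and FileType is the column described in the original question.

```sql
-- Hypothetical table name: Documents.
-- Lists each distinct stored file type and how many rows use it.
SELECT FileType, COUNT(*) AS DocumentCount
FROM Documents
GROUP BY FileType
```

Any row whose value is something other than doc or .doc would be worth a closer look.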

"Raf" <Raf@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:57C4705E-8B4B-4889-9F67-0C95B63C39EE@xxxxxxxxxxxxxxxx
Hello,

I'm trying to do a full-text search on Word documents.
The Word documents are stored in an image field in the SQL database.
The table has three fields:
- FileNumber --> unique key
- FileType --> varchar containing the extension of the file (e.g. .doc)
- FileContent --> image field containing the contents of the Word document

I created a catalog as described in the help files and started the
full population.
When I do a search with CONTAINS or FREETEXT, I don't get any results.
I don't know what I'm doing wrong.

Can anyone give me some advice on possible errors that I could have made?

Regards,

Raf
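
With this layout, the image column has to be bound to its type column when it is added to the full-text index, so that the indexer can pick the Word filter. The following is a rough sketch for SQL Server 2000, not the poster's actual setup: the table name Documents, the catalog name DocCatalog, the index name PK_Documents and the search term are all assumptions. It also assumes that full-text indexing is already enabled in the database and that the table has not yet been registered for full-text indexing.

```sql
-- Hypothetical names: Documents, DocCatalog, PK_Documents.

-- Create the catalog.
EXEC sp_fulltext_catalog @ftcat = 'DocCatalog', @action = 'create'

-- Register the table. @keyname is the name of a unique, single-column,
-- non-nullable index (assumed here to be on FileNumber).
EXEC sp_fulltext_table @tabname = 'Documents', @action = 'create',
     @ftcat = 'DocCatalog', @keyname = 'PK_Documents'

-- Add the image column and name FileType as its type column.
EXEC sp_fulltext_column @tabname = 'Documents', @colname = 'FileContent',
     @action = 'add', @type_colname = 'FileType'

-- Activate the index and start a full population.
EXEC sp_fulltext_table @tabname = 'Documents', @action = 'activate'
EXEC sp_fulltext_catalog @ftcat = 'DocCatalog', @action = 'start_full'

-- Once the population has finished, query the indexed column.
SELECT FileNumber
FROM Documents
WHERE CONTAINS(FileContent, '"invoice"')
```

The population runs in the background, so a query issued immediately after start_full may still return nothing.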









