Re: Scanned Doc to OCR to Indexed SQL DB

From: Mary Chipman (mchip_at_online.microsoft.com)
Date: 07/15/04


Date: Thu, 15 Jul 2004 14:05:39 -0400

I'd suggest looking into SQL Server's full-text search capabilities.
There's quite decent documentation in SQL Books Online.
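
To make the "large text field vs. index" question concrete: a full-text index is, roughly, an inverted index that maps each word to the documents containing it, so a search looks up words instead of scanning every document's text. In SQL Server you would query such an index with the CONTAINS or FREETEXT predicates. The sketch below is only a toy illustration in Python, not SQL Server's implementation; the sample documents and the names tokenize, build_index, and search are made up for the example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Build an inverted index: word -> set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    # Start from the first word's postings and intersect with the rest.
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical OCR output keyed by document id.
docs = {
    1: "Invoice for scanned council minutes",
    2: "Council agenda, July 2004",
    3: "Scanned invoice, unpaid",
}
index = build_index(docs)
matches = search(index, "scanned invoice")
```

The index is built once when a document is uploaded; each search then touches only the entries for the query words, which is why it scales better than a LIKE '%word%' scan over a large text column.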

--Mary

On Wed, 14 Jul 2004 16:56:35 -0700, "Ryan Ternier"
<rternier@icompasstech.com.spamproof> wrote:

>Hello,
>
>We're about to start a new web application that deals with users
>scanning in printed documents and uploading them to their website.
>When these docs hit the website, they will be run through an online OCR
>utility to get the raw text from them.
>From this point, they will be stripped down and shoved (maybe even stomped)
>into a SQL DB.
>
>I've never attempted this before, and was looking for some advice with the
>DB section.
>Once the document is uploaded and stripped into its text form, what is the
>best way to index it so users can search it? We could just put the
>whole thing into a large text field, but indexing seems faster.
>
>If anyone could shed light, or point me in a direction it'd be appreciated.
>
>Thanks.
>
>Ryan Ternier
>Code Monkey
>


