Re: Finding duplicates that might differ slightly
- From: "Jeff Boyce" <nonsense@xxxxxxxxxxxx>
- Date: Wed, 4 Jan 2006 11:19:14 -0800
Access is quite literal ... "duplicate" means an exact match, character for
character. Unless you can tell Access exactly how to decide what is/isn't a
duplicate, you may have to rely on USB (using someone's brain).
You and I (?!) can look at a list of CompanyNames and easily spot those that
are (probable) duplicates. For example, would you consider the following to
be duplicates?
IBM
International Business Machines
IBM, Inc.
International Business Machines Corporation
If you or I weren't quite sure, we'd probably look at one or more of the
other fields to help confirm or reject them as duplicates. Unless you tell
it how, Access can't do that.
If you have 10 million records, you probably won't want to try this. If you
have 10,000 records or fewer, you might want to use a query to sort the
field(s) in alphabetical order, then eyeball the list.
Or you could try to define explicitly each "acceptable" difference (a stray
period or apostrophe, say) that should still count as a "duplicate".
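For the two differences you mentioned (a . or a '), one way to spell that
rule out is to strip both characters before grouping. What follows is only a
rough sketch, not a tested solution. It reuses the table and field names from
your query; CleanName and Copies are aliases I made up. It assumes your
version of Access lets you call Replace() in a query (I believe Access 2002
and later do). Access SQL view does not accept comments, so delete the --
lines before pasting.

```sql
-- Sketch only: CleanName and Copies are made-up aliases.
-- Assumes Replace() is available in queries in your version of Access.
SELECT Replace(Replace(BrandName.BrandName, ".", ""), "'", "") AS CleanName,
       BrandName.Suffix,
       Count(*) AS Copies
FROM BrandName
-- Skip Nulls, since Replace() errors on them.
WHERE BrandName.BrandName Is Not Null
GROUP BY Replace(Replace(BrandName.BrandName, ".", ""), "'", ""),
         BrandName.Suffix
HAVING Count(*) > 1;
```

Each row returned is a cleaned-up name and suffix that occurs more than once.
It only covers the differences you explicitly list. Something like "IBM"
versus "International Business Machines" will still slip through.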
By the way, if all four of those business names I listed are the same
business, which one is the "correct" name? Why does that matter? Because
any records in related tables will also have to be updated to point to the
record ID of the correct name!
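To make that concrete, here is a hypothetical sketch of the clean-up. Every
name and number in it is invented for illustration: a Company table keyed on
CompanyID, a related Product table with a CompanyID foreign key, record 1 as
the name you decide to keep, and records 2, 3 and 4 as its duplicates. Access
runs one statement per query, so these would be two separate queries, and
SQL view does not accept the -- comment lines, so remove them first. Back up
the database before trying anything like this.

```sql
-- Query 1 (hypothetical names): repoint related records to the keeper.
UPDATE Product
SET CompanyID = 1
WHERE CompanyID IN (2, 3, 4);

-- Query 2, run only after Query 1: delete the now-unreferenced duplicates.
DELETE FROM Company
WHERE CompanyID IN (2, 3, 4);
```

Doing it in this order keeps the related records from being orphaned, or
from blocking the delete if you enforce referential integrity.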
Regards
Jeff Boyce
<Office/Access MVP>
"KBuser" <sakred@xxxxxxxxxxxxxxxxx> wrote in message
news:1136401296.365244.167230@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
> I'm trying to create a query to find duplicate records, but some
> entries might contain a . or a '. How can I account for this in my
> query?
>
> SELECT BrandName.BrandName, BrandName.Suffix, BrandName.ProductClass,
> BrandName.Company, BrandName.updateDate
> FROM BrandName
> WHERE (((BrandName.BrandName) In (SELECT [BrandName] FROM [BrandName]
> As Tmp GROUP BY [BrandName],[Suffix] HAVING Count(*)>1 And [Suffix] =
> [BrandName].[Suffix])))
> ORDER BY BrandName.BrandName, BrandName.Suffix;
>
>
> Thanks in advance.
>