Re: Containstable weighting question



Ranking is based on the relative rarity of a word. So if a word occurs rarely
in your documents it has higher resolving power and will be ranked higher than
words which occur relatively frequently. Likewise a word which occurs more
densely in a document will be weighted lower than a word which occurs less
often. So it's like a bell curve, with the greatest contribution to overall
weight being around 1000 words IIRC.
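If the built-in weighting still lets "video" dominate, one possible workaround is to rank each term with its own CONTAINSTABLE call and blend the ranks yourself. This is only a sketch against the table and columns from the quoted query (products, product_id, product_name); the aliases b and v, the weighted_rank column and the 0.9/0.1 multipliers are made up for illustration. Rank values are only relative within a single full-text query, so treat the blend as a heuristic to tune, not as an exact score.

```sql
-- Sketch: rank each term separately, then apply our own weights.
-- Assumes the full-text index on products(product_name) from the quoted post.
SELECT p.product_name,
       COALESCE(b.[RANK], 0) * 0.9
     + COALESCE(v.[RANK], 0) * 0.1 AS weighted_rank   -- illustrative weights
FROM products AS p
LEFT JOIN CONTAINSTABLE(products, product_name, '"Bob&Ray"') AS b
       ON p.product_id = b.[KEY]
LEFT JOIN CONTAINSTABLE(products, product_name, '"video"') AS v
       ON p.product_id = v.[KEY]
-- keep only rows that matched at least one of the two terms
WHERE b.[KEY] IS NOT NULL OR v.[KEY] IS NOT NULL
ORDER BY weighted_rank DESC;
```

Because the multiplication happens outside the full-text engine, a row that matches only "video" can contribute at most a tenth of its rank, whatever the engine decided about the word's rarity.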
--
Hilary Cotter
Director of Text Mining and Database Strategy
RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.

This posting is my own and doesn't necessarily represent RelevantNoise's
positions, strategies or opinions.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com



"geek-y-guy" <noone@xxxxxxxxxxx> wrote in message
news:OBgBq4C0GHA.1536@xxxxxxxxxxxxxxxxxxxxxxx
I have an FT Index on product names, and I'm trying to devalue common words
that would appear in many names.

I'm using a query like:

select p.product_name, k.rank
from products p
inner join containstable(products, product_name,
    'isabout ("Bob&Ray" weight(.9), "video" weight(.1))') as k
    on p.product_id = k.[key]
order by k.rank desc

When I run that query, the first result is

Jam Video with a rank of 952

followed by

Bob & Ray 2-volume video box set (256)
Bob & Ray first season (252)
Bob & Ray second season (252)
Crimestoppers Season 1 video (11)

etc.

Obviously I would like "Bob & Ray" to rank higher than "Jam Video". Is the
ranking based on the number of words in the field, as well as the matches
against those words?

Is there any way to structure the query to devalue the word "video" so
that "Bob & Ray" would have the highest relevance?


