Re: Question about decision tree algorithm in sqlserver2000
- From: "anonymous_user@xxxxxxxxxxxxxxxxxxxxxxx" <anonymous_user@xxxxxxxxxxxxxxxxxxxxxxx>
- Date: Sun, 25 Feb 2007 03:00:09 GMT
Thanks, it really helps. According to your explanation,
if my target is discrete, the algorithm will
build a classification tree. How can I force
the algorithm to build a regression tree for
me even if my target is discrete (true, false)? Is there a parameter for this, or does the algorithm
do the job automatically?
The algorithm (like some others in SQL Server 2005) is actually a family of algorithms. Depending on how you model the target, it builds either a classification tree or a regression tree. If you model your target as discrete or discretized, the algorithm will build a classification tree (histograms in nodes/leaves).
If your target, on the other hand, is continuous (numeric or dateTime), the algorithm will build regression trees that have regression formulae in nodes and leaves.
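To make the distinction concrete, here is a minimal Python sketch of what a leaf might hold in each case: a histogram of target states for a discrete target, and a fitted formula for a continuous one. This is not the SQL Server implementation. The function names and the single-regressor least-squares form are assumptions chosen for illustration.

```python
from collections import Counter


def leaf_histogram(targets):
    """Classification leaf: relative frequency of each discrete target state."""
    counts = Counter(targets)
    total = len(targets)
    return {state: n / total for state, n in counts.items()}


def leaf_regression(xs, ys):
    """Regression leaf: least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0  # constant regressor: fall back to the mean
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Discrete target (e.g. bought / did not buy): the leaf stores a histogram.
hist = leaf_histogram([True, False, False, False])

# Continuous target: the leaf stores a regression formula.
formula = leaf_regression([1.0, 2.0, 3.0], [3.0, 5.0, 7.0])
```

With a discrete target the prediction at a leaf is read off the histogram. With a continuous target it is computed from the formula.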
The algorithm actually does a better job of providing high lift than a high true positive rate. It is optimized for accuracy across the whole range of target values (i.e. high lift) and not for a single target state (which would favor high true positives). You can measure the lift using the Accuracy Chart view in the mining model viewer.
Note that if your target is continuous (i.e. a regression tree), then the lift chart is replaced by a scatter plot.
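As a rough illustration of what a lift chart measures, the following Python sketch computes lift for the top-scored fraction of cases: the share of all positives captured in that fraction, divided by the share that random selection would capture. It is a generic calculation, not the code behind the Accuracy Chart. `lift_at` and its arguments are hypothetical names, and the scores and labels are made-up sample data.

```python
def lift_at(scores, labels, fraction):
    """Lift in the top `fraction` of cases ranked by predicted score.

    scores: predicted probability of the positive state, one per case
    labels: actual outcomes (True for positive), in the same order
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top_n = max(1, int(len(ranked) * fraction))
    positives_in_top = sum(1 for _, label in ranked[:top_n] if label)
    total_positives = sum(1 for label in labels if label)
    if total_positives == 0:
        raise ValueError("no positive cases to measure lift against")
    captured = positives_in_top / total_positives  # share of positives found
    return captured / (top_n / len(ranked))        # versus random selection


# Made-up example: 10 cases, 2 actual buyers, both ranked at the top.
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.1, 0.05, 0.04, 0.01]
labels = [True, True, False, False, False, False, False, False, False, False]
top20 = lift_at(scores, labels, 0.2)
whole = lift_at(scores, labels, 1.0)
```

A lift of 1 means the model does no better than picking cases at random. Larger values mean the positives are concentrated among the highest-scored cases.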
bogdan
Thanks for your kind reply.
I still have some other questions about the details of the dtree algorithm. Is it a classification tree or a regression tree? If my goal is high lift instead
of high true rate, is it suitable to use the dtree
algorithm?
Hello
1. If there're only 5% or even lower positive samples in my train/test
data, will it affect the performance of decision tree algorithm?
Yes, it will most likely affect the performance of the algorithm. To
improve the accuracy of the model, I'd suggest sampling your data so that
the distributions are more balanced. An article on performing stratified
sampling is available here:
http://www.sqlserverdatamining.com/DMCommunity/TipsNTricks/2615.aspx
2. Can I set some parameter to affect the penalty of FP (false positive)
and FN (false negative)? Because the business impact of FP and FN are
quite different.
There is no algorithm parameter that handles this.
However, the model viewers in SQL Server 2005 allow an analysis of FP vs FN
by using the Classification Matrix view.
Hope this helps
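The two suggestions above can be sketched in Python. This is a minimal illustration and not the method from the linked article: it balances the training data by simple random downsampling of the majority class, then tallies a classification matrix so that FP and FN can be weighted by business cost after the fact. All function names, the `ratio` knob, and the cost values are hypothetical.

```python
import random


def downsample_majority(rows, is_positive, ratio=1.0, seed=0):
    """Keep every positive row plus a random subset of the negatives.

    ratio: desired number of negatives per positive (hypothetical knob).
    """
    positives = [r for r in rows if is_positive(r)]
    negatives = [r for r in rows if not is_positive(r)]
    keep = min(len(negatives), int(len(positives) * ratio))
    rng = random.Random(seed)
    sample = positives + rng.sample(negatives, keep)
    rng.shuffle(sample)
    return sample


def classification_matrix(actual, predicted):
    """Counts of TP, FP, FN and TN for a boolean target."""
    matrix = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for a, p in zip(actual, predicted):
        if p:
            matrix["TP" if a else "FP"] += 1
        else:
            matrix["FN" if a else "TN"] += 1
    return matrix


def total_cost(matrix, fp_cost, fn_cost):
    """Weigh the two error types by their (assumed) business cost."""
    return matrix["FP"] * fp_cost + matrix["FN"] * fn_cost


# 5% positives, as in the question: 5 buyers out of 100 customers.
rows = [{"id": i, "bought": i < 5} for i in range(100)]
balanced = downsample_majority(rows, lambda r: r["bought"])

# Made-up predictions, scored with FN assumed ten times as costly as FP.
matrix = classification_matrix(
    [True, True, False, False, False],
    [True, False, True, False, False],
)
cost = total_cost(matrix, fp_cost=1, fn_cost=10)
```

A model trained on a balanced sample sees far more positives than occur in practice, so it is safer to measure accuracy on data that keeps the original distribution.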
--
This posting is provided "AS IS" with no warranties, and confers no rights.
Please do not send email directly to this alias. It is for newsgroup
purposes only.
thanks,
bogdan
<anonymous_user@xxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:fde7cb95-3517-466d-8dec-014602d9f9cd@xxxxxxxxxxxxxxxxxxxxxxx
I've no idea about the implementation of this decision tree algorithm, so
my questions may be silly.
My task is to create a decision tree to predict whether a customer will buy
our product given some information.
1. If there're only 5% or even lower positive samples in my train/test
data, will it affect the performance of decision tree algorithm?
2. Can I set some parameter to affect the penalty of FP (false positive)
and FN (false negative)? Because the business impact of FP and FN are
quite different.
Thanks in advance!