RE: Outliers
- From: "Bogdan Crivat" <bogdanc@xxxxxxxxxxxxxxxxxxxx>
- Date: Thu, 26 Jan 2006 17:20:14 GMT
I replied in the original thread
thanks
bogdan
>
> I'm a newbie to both data mining generally and Sql Server BI in particular.
>
> I've played with the Linear Regression and Decision Tree algorithms a fair amount, but I have a question.
>
> I've created a test database with both a discrete and continuous attribute. The continuous attribute runs linearly within one of the discrete attributes and the nwill change and run linearly in a different direction.
>
> My whole point was to try and get the DT engine to create new nodes when it detects a change in the 'line'. This works great and it is way cool. The machine predicts everything perfectly and it works just as I would have expected.
>
> So now the caveat.
>
> Any regression based analysis, or so it seems to me, has major problems with outliers. If I change even just a few records of data so that they do not regress well, then the predictions, as you would expect, degrade badly.
>
> I see nothing in the BI stuff about outlier removal or analysis of any kind.
>
> I am presuming that the Model Accuracy tab will be of limited use here.
>
> What can I do to detect outliers?
>
> Hope this isn't too dumb...
>
.
- References:
- Outliers
- From: Paul Jacobs
- Outliers
- Prev by Date: RE: Outliers
- Next by Date: Re: building a web site
- Previous by thread: Outliers
- Next by thread: Outliers
- Index(es):
Relevant Pages
|