Newbie question on EQUAL_AREAS and cases
- From: "Bob" <robert_harford@xxxxxxxxxxx>
- Date: 20 Sep 2005 10:53:14 -0700
I've created a view joining (in essence) customers with sales. I am
then trying to predict sales amount (continuous) by customer attribute
inputs. I selected DISCRETIZED(EQUAL_AREAS,4) for the predicted
variable (sales amount).
What I don't understand is what "value" is telling me in the model. I
can understand the number of cases associated with "node total" under
value, as well as the number of cases associated with "missing".
However, the other values are numbers like "832.125", with an
associated case total of 593. To get that case total, the sales
amount has to be between 175 and 1,490. So in short, what the heck are
the actual sales ranges associated with these "equal buckets"?
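
For readers unfamiliar with the idea behind "equal buckets", here is a minimal sketch of generic equal-frequency binning: sort the values and cut them into groups holding roughly the same number of cases. This is only an illustration of the concept, not the algorithm the data mining engine actually uses for EQUAL_AREAS, and the function name `equal_area_buckets` and the sales figures are hypothetical.

```python
def equal_area_buckets(values, n_buckets):
    """Split values into n_buckets groups of (nearly) equal case count.

    Returns a list of (low, high, count) tuples, one per non-empty bucket.
    Generic equal-frequency binning; NOT the engine's exact method.
    """
    ordered = sorted(values)
    total = len(ordered)
    buckets = []
    for i in range(n_buckets):
        # Integer arithmetic spreads any remainder across the buckets.
        start = i * total // n_buckets
        end = (i + 1) * total // n_buckets
        chunk = ordered[start:end]
        if chunk:
            buckets.append((chunk[0], chunk[-1], len(chunk)))
    return buckets


# Hypothetical sales amounts, one per case.
sales = [20, 55, 90, 130, 175, 400, 900, 1490, 1600, 2500, 4000, 9000]

for low, high, count in equal_area_buckets(sales, 4):
    print(f"{low} to {high}: {count} cases")
```

Note that the bucket widths come out very uneven (20 to 90 versus 2500 to 9000) while the case counts stay equal, which is why a range such as 175 to 1,490 can be a plausible single bucket.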
Also, as to the definition of "case": I understand that in my example
the case is a "customer" with a key of customer_id. But in the data
mining model, the number associated with "cases" for the "all" category
is a distinct count across all the input variables and the predicted
variable. So even though I define the case to be customer_id, the case
count in the data mining output is much higher, and again equals the
distinct count across all the variables in the model.
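
To make the two counts concrete, the sketch below compares a distinct count of customer_id with a distinct count of whole rows on a small joined customers-and-sales result. The rows and column layout are hypothetical, and this only shows how the two numbers can diverge on a joined view; it makes no claim about how the mining engine counts cases internally.

```python
# Hypothetical rows from a customers-joined-to-sales view:
# (customer_id, region, sales_amount)
rows = [
    (1, "East", 100.0),
    (1, "East", 250.0),   # same customer, second sale
    (2, "West", 100.0),
    (2, "West", 100.0),   # exact duplicate row
    (3, "East", 75.0),
]

# Count of cases if the case is the customer.
distinct_customers = len({customer_id for customer_id, _, _ in rows})

# Count of distinct combinations of every column in the view.
distinct_combinations = len(set(rows))

print("distinct customer_id:", distinct_customers)
print("distinct combinations of all columns:", distinct_combinations)
```

Because the join produces one row per sale rather than one per customer, the second count exceeds the first whenever a customer has more than one distinct sale.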
Thanks.
--Bob Harford
Data Warehousing Architect
AlphaInsight Corp. / Falls Church, VA